阮晓钢, 颜文静, 黄静, 郭佩远. 基于双鉴别器生成对抗网络的单目深度估计方法[J]. 北京工业大学学报, 2022, 48(9): 928-934. DOI: 10.11936/bjutxb2021050001
    引用本文: 阮晓钢, 颜文静, 黄静, 郭佩远. 基于双鉴别器生成对抗网络的单目深度估计方法[J]. 北京工业大学学报, 2022, 48(9): 928-934. DOI: 10.11936/bjutxb2021050001
    RUAN Xiaogang, YAN Wenjing, HUANG Jing, GUO Peiyuan. Monocular Depth Estimation Method Based on Dual-discriminator Generative Adversarial Networks[J]. Journal of Beijing University of Technology, 2022, 48(9): 928-934. DOI: 10.11936/bjutxb2021050001
    Citation: RUAN Xiaogang, YAN Wenjing, HUANG Jing, GUO Peiyuan. Monocular Depth Estimation Method Based on Dual-discriminator Generative Adversarial Networks[J]. Journal of Beijing University of Technology, 2022, 48(9): 928-934. DOI: 10.11936/bjutxb2021050001

    基于双鉴别器生成对抗网络的单目深度估计方法

    Monocular Depth Estimation Method Based on Dual-discriminator Generative Adversarial Networks

    • 摘要: 针对自监督单目深度估计精度不高的问题, 提出一种基于双鉴别器生成对抗网络的自监督单目深度估计方法. 该方法利用生成对抗网络在合成视觉上可信的图像方面的优势, 进一步提高了自监督单目深度估计的精度. 首先, 为充分利用重建图像, 在Wasserstein生成对抗网络的基础上进行改进, 构建了2个鉴别器的结构. 双鉴别器对生成器的要求和训练目标更加严苛, 避免了由于只在左图像或右图像上引入鉴别器而造成的信息损失. 其次, 针对该网络的结构, 提出了一种局部-全局一致的损失函数, 保证了像素的真实性和局部-全局内容的一致性. 在KITTI基准测试集中与单目深度估计的相关代表方法进行了比较, 实验结果表明, 该方法有效地提高了单目深度估计的精度, 具有较好的深度估计的性能.

       

      Abstract: To solve the problem of the low accuracy of self-supervised monocular depth estimation, a self-supervised monocular depth estimation method based on dual-discriminator generative adversarial networks were proposed in this paper. The advantages of generative adversarial networks were adopted to synthesize visually credible images, and further improve the accuracy of self-supervised monocular depth estimation. First, to make full use of the reconstructed image, the Wasserstein generative adversarial networks were improved and the structure of two discriminators was constructed. The dual-discriminator had more stringent requirements for the generator and training objectives, avoiding the information loss caused by the introduction of discriminator only on the left image or right image. Second, according to the structure of the network, a local-global consistent loss function was proposed to ensure the authenticity of pixels and the consistency of local-global content. Results show that the proposed method effectively improves the accuracy of monocular depth estimation and has better performance of depth estimation.

       

    /

    返回文章
    返回