深度学习在细粒度图像识别中的应用综述

    Deep Learning for Fine-grained Image Recognition: A Survey

    • 摘要: 深度学习技术在多种视觉任务中表现出优异的性能,特别是深度学习技术的发展大大促进了细粒度图像识别任务的进步.细粒度图像识别的目的在于正确识别子对象类别,例如鸟类中的不同子类别.由于细粒度图像数据通常需要具有专家知识才能够进行有效识别与标注,获取难度比较高,同时,由于细粒度类别直接具有小的类间差异性和大的类内差异性特点,需要模型能够捕捉到细微的有区分性的局部特征,这两方面原因导致这项任务极具挑战性.首先,介绍了深度学习技术的重要发展历程、细粒度图像识别任务的特点和挑战.随后,介绍了基于深度学习的细粒度识别方法的3种类型,包括基于定位-分类子网络的方法、基于端到端的特征编码方法和利用外部辅助信息的细粒度图像识别方法,并选择有代表性的工作给予了详细的介绍.最后,在常用数据集上比较了相关工作的性能,对细粒度图像识别任务进行了总结和展望.

       

      Abstract: Deep learning technology shows excellent performance in a variety of visual tasks. In particular, the development of deep learning technology has significantly promoted the progress of fine-grained image recognition tasks. The purpose of fine-grained image recognition is to correctly recognize sub-object categories, such as different sub categories in birds. Because fine-grained image data usually need expert knowledge to effectively identify and annotate, it is difficult to obtain. At the same time, because fine-grained categories directly have small inter-class differences and large intra-class differences, the model needs to be able to capture subtle distinguishing local features. The two reasons make this task very challenging. This paper first introduced the important development process of deep learning technology, the characteristics and challenges of fine-grained image recognition tasks, then introduced three types of fine-grained recognition methods based on deep learning, including methods by localization-classification subnetworks, and by end-to-end feature encoding methods and the fine-grained image recognition methods using external auxiliary information, and selected representative works to give a detailed introduction. Finally, the performance of related work was compared on commonly used data sets, and the task of fine-grained image recognition was summarized and prospected.

       

    /

    返回文章
    返回