深度学习在图像识别中的应用研究综述

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

深度学习在图像识别中的应用研究综述

详细信息查看全文 | 推荐本文 |

英文篇名：Survey of Application of Deep Learning in Image Recognition
作者：郑远攀 ; 李广阳 ; 李晔
英文作者：ZHENG Yuanpan;LI Guangyang;LI Ye;School of Computer and Communication Engineering,Zhengzhou University of Light Industry;Henan Engineering Laboratory of Emergency Platform Information Technology;
关键词：深度学习 ; 图像识别 ; 卷积神经网络 ; 胶囊网络 ; 迁移学习 ; 非监督学习
英文关键词：deep learning;;image recognition;;convolutional neural network;;capsule network;;transfer learning;;unsupervised learning
中文刊名：JSGG
英文刊名：Computer Engineering and Applications
机构：郑州轻工业大学计算机与通信工程学院;应急平台信息技术河南省工程实验室;
出版日期：2019-04-19 14:43
出版单位：计算机工程与应用
年：2019
期：v.55;No.931
基金：国家自然科学基金(No.51404216);; 河南省高等学校青年骨干教师资助计划(No.2015GGJS-184);; 河南省科技攻关项目(No.152102310374);; 郑州轻工业大学青年骨干教师资助计划(No.2013XGGJS001);; 应急平台信息技术河南省工程实验室开放基金(No.YJ2013005);; 郑州轻工业大学校内科研基金(No.2015XJJY010)
语种：中文;
页：JSGG201912004
页数：17
CN：12
分类号：25-41

摘要

深度学习作为图像识别领域重要的技术手段,有着广阔的应用前景,开展图像识别技术研究对推动计算机视觉及人工智能的发展具有重要的理论价值和现实意义,文中对深度学习在图像识别中的应用给予综述。介绍了深度学习的由来,具体分析了深度信念网络、卷积神经网络、循环神经网络、生成式对抗网络以及胶囊网络等深度学习模型,对各个深度学习模型的改进型模型逐一对比分析。总结近年来深度学习在人脸识别、医学图像识别、遥感图像分类等图像识别应用领域取得的研究成果并探讨了已有研究值得商榷之处,对深度学习在图像识别领域中的发展趋势进行探讨,指出有效使用迁移学习技术识别小样本数据,使用非监督与半监督学习对图像进行识别,如何对视频图像进行有效识别以及强化模型的理论性等是该领域研究的进一步方向。
As an important technical means in the field of image recognition, deep learning has broad application prospects. Carrying out image recognition technology research has important theoretical and practical significance for promoting the development of computer vision and artificial intelligence. The application of deep learning in image recognition gives a review. The origin of deep learning is introduced. Deep learning models such as deep belief network, convolutional neural network, cyclic neural network, generated confrontation network and capsule network are analyzed. The improved models of each deep learning model are compared and analyzed one by one. In this paper, the research results of deep learning in image recognition applications such as face recognition, medical image recognition and remote sensing image classification are summarized. The existing researches are worth discussing. The development trend of deep learning in the field of image recognition is carried out. The discussion points out that the effective use of migration learning technology to identify small sample data, the use of unsupervised learning and semi-supervised learning to identify images, how to effectively identify video images and the theoretical significance of the model are further directions in this field.

引文

[1]郭丽丽,丁世飞.深度学习研究进展[J].计算机科学,2015,42(5):28-33.
    [2]Hinton G E.A practical guide to training restricted boltzmann machines[J].Momentum,2012,9(1):599-619.
    [3]Hinton G E,Salakhutdinov R R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504-507.
    [4]Hinton G E,Osindero S,Teh Y W.A fast learning algorithm for deep belief nets[J].Neural Computation,2006,7(18):1527-1554.
    [5]Babri H A,Tong Y.Deep feedforward networks:application to pattern recognition[C]//Proceedings of International Conference on Neural Networks(ICNN’96),1996:1422-1426.
    [6]Lecun Y,Bottou L,Bengio Y,et al.Gradient-based learning applied to document recognition[J].Proceedings of IEEE,1998,86(11):2278-2324.
    [7]Shao L,Wu D,Li X.Learning deep and wide:a spectral method for learning deep networks[J].IEEE Transactions on Neural Networks and Learning Systems,2014,25(12):2303-2308.
    [8]胡清华,张道强,张长水.复杂环境下的机器学习研究专刊前言[J].软件学报,2017(11):2811-2813.
    [9]Cortes C,Vapnik V.Support-vector networks[J].Machine Learning,1995,20(3):273-297.
    [10]Schapire R E.Theoretical views of boosting and applications[C]//Proceedings of 10th International Conference on Algorithmic Learning Theory(ALT 99).Berlin:Springer,1999:13-25.
    [11]Krishnapuram B,Carin L,Figueiredo M A T,et al.Sparse multinomial logistic regression:fast algorithms and generalization bounds[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(6):957-968.
    [12]Hinton G E,Salakhutdinov R R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504.
    [13]孙志军,薛磊,许阳明,等.深度学习研究综述[J].计算机应用研究,2012,29(8):2806-2810.
    [14]刘方园,王水花,张煜东.深度置信网络模型及应用研究综述[J].计算机工程与应用,2018,54(1):11-18.
    [15]Zhong P,Gong Z,Li S,et al.Learning to diversify deep belief networks for hyperspectral image classification[J].IEEE Transactions on Geoscience and Remote Sensing,2017,55(6):3516-3530.
    [16]Huang G,Liu Z,Maaten L V D,et al.Densely connected convolutional networks[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:2261-2269.
    [17]Zhang K,Guo Y,Wang X,et al.Multiple feature reweight DenseNet for image classification[J].IEEE Access,2019,7:9872-9880.
    [18]周俊宇,赵艳明.卷积神经网络在图像分类和目标检测应用综述[J].计算机工程与应用,2017,53(13):34-41.
    [19]Lipton Z C.A critical review of recurrent neural networks for sequence learning[EB/OL].(2015)[2019].https://arxiv.org/abs/1506.00019v1.
    [20]朱小燕,王昱,徐伟.基于循环神经网络的语音识别模型[J].计算机学报,2001(2):213-218.
    [21]张剑,屈丹,李真.基于词向量特征的循环神经网络语言模型[J].模式识别与人工智能,2015(4):299-305.
    [22]Mirza A H,Cosan S.Computer network intrusion detection using sequential LSTM Neural Networks autoencoders[C]//Proceedings of 26th Signal Processing and Communications Applications Conference,2018:1-4.
    [23]Cho K,van Merrienboer B,Gulcehre C,et al.Learning phrase representations using RNN encoder-decoder for statistical machine translation[EB/OL].(2014)[2019].https://arxiv.org/abs/1406.1078.
    [24]郝志峰,黄浩,蔡瑞初,等.基于多特征融合与双向RNN的细粒度意见分析[J].计算机工程,2018(7):199-204.
    [25]Mou L,Ghamisi P,Zhu X.Deep recurrent neural networks for hyperspectral image classification[J].IEEETransactions on Geoscience and Remote Sensing,2017,55(7):3639-3655.
    [26]陈欣于,俊洋,赵媛媛.基于CNN和B-LSTM的文本处理模型研究[J].轻工学报,2018,33(5):103-108.
    [27]Goodfellow I J,Pouget-Abadie J,Mirza M,et al.Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems.Montreal,Canada:MIT Press,2014:2672-2680.
    [28]Zhu L,Chen Y,Ghamisi P,et al.Generative adversarial networks for hyperspectral image classification[J].IEEETransactions on Geoscience and Remote Sensing,2018,56(9):5046-5063.
    [29]商显震,韩萌,孙毓忠,等.融合生成对抗网络和朴素贝叶斯皮肤病诊断方法[J].计算机科学与探索,2019,13(6):1005-1015.
    [30]唐贤伦,杜一铭,刘雨微,等.基于条件深度卷积生成对抗网络的图像识别方法[J].自动化学报,2018,44(5):855-864.
    [31]Mehdi Mirza S O.Conditional generative adversarial nets[EB/OL].(2014)[2019].https://arxiv.org/abs/1411.1784.
    [32]Denton E,Chintala S,Szlam A,et al.Deep generative image models using a laplacian pyramid of adversarial networks[J].Computer Science,2015.
    [33]Arjovsky M,Chintala S,Bottou L.Wasserstein GAN[EB/OL].(2017)[2019].https://arxiv.org/abs/1701.07875.
    [34]Gulrajani I,Ahmed F,Arjovsky M,et al.Improved training of wasserstein GANs[EB/OL].(2017)[2019].https://arxiv.org/abs/1704.00028.
    [35]Radford A,Metz L,Chintala S.Unsupervised representation learning with deep convolutional generative adversarial networks[J].Computer Science,2015.
    [36]Mao X,Li Q,Xie H,et al.Least squares generative adversarial networks[C]//Proceedings of IEEE Conference on Computer Vision,Venice,Italy,2017:2813-2821.
    [37]Berthelot D,Schumm T,Metz L.BEGAN:boundary equilibrium generative adversarial networks[EB/OL].(2017)[2019].https://arxiv.org/abs/1703.10717.
    [38]Sabour S,Frosst N E,Hinton G.Dynamic routing between capsules[EB/OL].(2017)[2019].https://arxiv.org/abs/1710.09829.
    [39]Deng F,Pu S,Chen X,et al.Hyperspectral image classification with capsule network using limited training samples[J].Sensors,2018,18(9):22.
    [40]栗科峰,黄全振.融合深度学习与最大间距准则的人脸识别方法[J].计算机工程与应用,2018,54(5):206-210.
    [41]Wu R,Kamata S.A jointly local structured sparse deep learning network for face recognition[C]//Proceedings of IEEE International Conference on Image Processing(ICIP).Phoenix:IEEE,2016:3026-3030.
    [42]马晓,张番栋,封举富.基于深度学习特征的稀疏表示的人脸识别方法[J].智能系统学报,2016(3):279-286.
    [43]Shailaja K,Anuradha B.Effective face recognition using deep learning based linear discriminant classification[C]//Proceedings of IEEE International Conference on Computational Intelligence and Computing Research(ICCIC).Chennai:IEEE,2016:1-6.
    [44]孔英会,王之涵,车辚辚.基于卷积神经网络(CNN)和CUDA加速的实时视频人脸识别[J].科学技术与工程,2016(35):96-100.
    [45]李倩玉.基于改进深层网络的视频人脸识别研究[D].合肥:合肥工业大学,2016.
    [46]Fu T,Chiu W,Wang F.Learning guided convolutional neural networks for cross-resolution face recognition[C]//Proceedings of 27th International Workshop on Machine Learning for Signal Processing(MLSP).Tokyo:IEEE,2017:1-5.
    [47]Schroff F,Kalenichenko D,Philbin J.FaceNet:a unified embedding for face recognition and clustering[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Boston:IEEE,2015:815-823.
    [48]Sun Y,Wang X,Tang X.Deeply learned face representations are sparse,selective,and robust[J].Computer Science,2014.
    [49]Sun Y,Liang D,Wang X,et al.DeepID3:face recognition with very deep neural networks[J].Computer Science,2015.
    [50]Liu W,Wen Y,Yu Z,et al.SphereFace:deep hypersphere embedding for face recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Honolulu:IEEE,2017:6738-6746.
    [51]Yi D,Lei Z,Liao S,et al.Learning face representation from scratch[J].Computer Science,2014.
    [52]Omkar M.Parkhi A V A A.Deep face recognition[C]//British Machine Vision Conference(BMVC),2015:1-12.
    [53]Liu J,Deng Y,Bai T,et al.Targeting ultimate accuracy:face recognition via deep embedding[J].Computer Science,2015.
    [54]Taigman Y,Yang M,Ranzato M,et al.DeepFace:closing the gap to human-level performance in face verification[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE,2014:1701-1708.
    [55]刘吉,孙仁诚,乔松林.深度学习在医学图像识别中的应用研究[J].青岛大学学报(自然科学版),2018(1):69-74.
    [56]吕鸿蒙,赵地,迟学斌.基于增强AlexNet的深度学习的阿尔茨海默病的早期诊断[J].计算机科学,2017,44(S1):50-60.
    [57]李书通,肖斌,李伟生,等.基于3D-PCANet的阿尔兹海默病辅助诊断[J].计算机科学,2018,45(S1):140-142.
    [58]Mohamed A A,Berg W A,Peng H,et al.A deep learning method for classifying mammographic breast density categories[J].Medical Physics,2018,45(1):314-321.
    [59]何雪英,韩忠义,魏本征.基于深度学习的乳腺癌病理图像自动分类[J].计算机工程与应用,2018,54(12):121-125.
    [60]Xu J,Xiang L,Liu Q,et al.Stacked sparse autoencoder(SSAE)for nuclei detection on breast cancer histopathology images[J].IEEE Transactions on Medical Imaging,2016,35(1):119-130.
    [61]Coudray N,Moreira A L,Sakellaropoulos T,et al.Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning[J].bioRxiv,2017:197574.
    [62]李琼,柏正尧,刘莹芳.糖尿病性视网膜图像的深度学习分类方法[J].中国图象图形学报,2018,23(10):1594-1603.
    [63]翁铭,郑博,吴茂念,等.基于深度学习的DR筛查智能诊断系统的初步研究[J].国际眼科杂志,2018,18(3):568-571.
    [64]Ghesu F,Krubasik E,Georgescu B,et al.Marginal space deep learning:efficient architecture for volumetric image parsing[J].IEEE Transactions on Medical Imaging,2016,35(5):1217-1228.
    [65]Wurfl T,Hoffmann M,Christlein V,et al.Deep learning computed tomography:learning projection-domain weights from image domain in limited angle problems[J].IEEETransactions on Medical Imaging,2018,37(6):1454-1463.
    [66]Gao X,Lin S,Wong Y.Automatic feature learning to grade nuclear cataracts based on deep learning[J].IEEETransactions on Biomedical Engineering,2015,62(11):2693-2701.
    [67]Yan Z,Zhan Y,Peng Z,et al.Multi-instance deep learning:discover discriminative local anatomies for bodypart recognition[J].IEEE Transactions on Medical Imaging,2016,35(5):1332-1343.
    [68]Sun X,Park J,Kang K,et al.Novel hybrid CNN-SVMmodel for recognition of functional magnetic resonance images[C]//Proceedings of IEEE International Conference on Systems,Man,and Cybernetics(SMC).Banff:IEEE,2017:1001-1006.
    [69]Yu Z,Ni D,Chen S,et al.Hybrid dermoscopy image classification framework based on deep convolutional neural network and Fisher vector[C]//Proceedings of14th International Symposium on Biomedical Imaging(ISBI 2017).Melbourne:IEEE,2017:301-304.
    [70]王鑫,李可,徐明君,等.改进的基于深度学习的遥感图像分类方法[J].计算机应用,2019,32(2):382-387.
    [71]王鑫,李可,宁晨,等.基于深度卷积神经网络和多核学习的遥感图像分类方法[J/OL].电子与信息学报.doi:10.11999/JEIT80628.
    [72]Zhang H,Yang B,Fang T,et al.Learning deep features for classification of typical ecological environmental elements in high-resolution remote sensing images[C]//Proceedings of 10th International Symposium on Computational Intelligence and Design(ISCID).Hangzhou:IEEE,2017:223-227.
    [73]Huang Z,Cheng G,Wang H,et al.Building extraction from multi-source remote sensing images via deep deconvolution neural networks[C]//Proceedings of IEEE International Geoscience and Remote Sensing Symposium(IGARSS).Beijing:IEEE,2016:1835-1838.
    [74]Zhou Z,Li S.Peanut planting area change monitoring from remote sensing images based on deep learning[C]//Proceedings of 4th International Conference on Systems and Informatics(ICSAI).Hangzhou:IEEE,2017:1358-1362.
    [75]Lin J,Li X,Pan H.Aircraft recognition in remote sensing images based on deep learning[C]//Proceedings of33rd Youth Academic Annual Conference of Chinese Association of Automation(YAC).Nanjing:IEEE,2018:895-899.
    [76]Zou Q,Ni L,Zhang T,et al.Deep learning based feature selection for remote sensing scene classification[J].IEEE Geoscience and Remote Sensing Letters,2015,12(11):2321-2325.
    [77]Yang X,Liu W,Tao D,et al.Multiview canonical correlation analysis networks for remote sensing image recognition[J].IEEE Geoscience and Remote Sensing Letters,2017,14(10):1855-1859.
    [78]Deng Z,Lei L,Sun H,et al.An enhanced deep convolutional neural network for densely packed objects detection in remote sensing images[C]//Proceedings of International Workshop on Remote Sensing with Intelligent Processing(RSIP).Shanghai:IEEE,2017:1-4.
    [79]Tong G,Li Y,Cao L,et al.A DBN for hyperspectral remote sensing image classification[C]//Proceedings of12th IEEE Conference on Industrial Electronics and Applications(ICIEA).Siem Reap:IEEE,2018:1757-1762.
    [80]Tanase R,Datcu M,Randucanu D.A convolutional deep belief network for polarimetric SAR data feature extraction[C]//The Proceedings of IEEE International Geoscience and Remote Sensing Symposium(IGARSS).Beijing:IEEE,2016:7545-7548.
    [81]Lv F,Han M,Qiu T.Remote sensing image classification based on ensemble extreme learning machine with stacked autoencoder[J].IEEE Access,2017,5:9021-9031.
    [82]Krizhevsky A,Sutskever I,Hinton G E.ImageNet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing System,2012:1097-1105.
    [83]肖艳秋,杜江恒,闻萌莎,等.基于颜色特征和改进支持向量机算法的交通标志检测与识别[J].轻工学报,2018,33(3):57-65.
    [84]Xiong C,Wang C,Ma W,et al.A traffic sign detection algorithm based on deep convolutional neural network[C]//Proceedings of IEEE International Conference on Signal and Image Processing(ICSIP).Beijing:IEEE,2016:676-679.
    [85]Zuo Z,Yu K,Zhou Q,et al.Traffic signs detection based on faster R-CNN[C]//Proceedings of IEEE 37th International Conference on Distributed Computing Systems Workshops(ICDCSW).Atlanta:IEEE,2017:286-288.
    [86]Ciresan C,Meier U,Gambardella M,et al.Convolutional neural network committees for handwritten character classification[C]//Proceedings of International Conference on Document Analysis and Recognition.Beijing:IEEE2011:1135-1139.
    [87]Wu C,Fan W,He Y,et al.Handwritten character recognition by alternately trained relaxation convolutional neural network[C]//Proceedings of 14th International Conference on Frontiers in Handwriting Recognition.Heraklion:IEEE,2014:291-296.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700