摘要
针对传统手势识别算法存在手势定位不精确,识别率不高,鲁棒性不强等问题,提出改进的Faster RCNN网络进行手势的精准定位和识别。Faster RCNN采用强语义信息、低分辨率的顶层特征图作为RPN网络的输入,导致对小目标识别率不高。改进的Faster RCNN结合FPN网络算法,将高层特征通过上采样不断与前层特征融合,构造不同尺度的特征金字塔模型作为RPN网络的输入,提升了Faster RCNN对手势的检测效果。
Aiming at the problems of traditional gesture recognition algorithms such as inaccurate gesture positioning, low recognition rate and low robustness, an improved Faster RCNN network is proposed to accurately locate and identify gestures. Faster RCNN uses strong semantic information and low-resolution top-level feature maps as input to the RPN network, resulting in low recognition rate for small targets. The improved Faster RCNN combines the FPN network algorithm to integrate the highlevel features with the previous features through up-sampling, and constructs the feature pyramid model with different scales as the input of the RPN network, which improves the detection effect of Faster RCNN on gestures.
引文
[1]Cao Xinyan,Zhao Jiyin,Li Min.Monocular vision gesturesegmentation based on skin color and motion detection[J].Journal of Hunan University:Natural Sciences,2011, 38(1):78-83(in Chinese)
[2]任彧,顾成成.基于HOG特征和SVM的手势识别[J].科技通报,2011,27(2):211-214.Ren Wei, Gu Chengcheng. Gesture recognition based on HOGfeatures and SVM[J]. Science and Technology Bulletin,2011, 27(2):211-214.
[3]Ma M, Chen Z X, Wu J. A recognition method of hand ges-ture with CNN-SVM model[M]//Communications in Com-puter and Information Science. Heidelberg:Springer, 2016,681:399-404
[4]Ren S Q, He K M, Girshick R, et al. Object detection networkson convolutional feature maps[J]. IEEE Transactions on PatternAnalysis and Machine Intelligence, 2016, 39(7):1476-1481
[5]S AHMAD R ADZI, K H MOHAMAD, S S LIEW, et al,Convolutional neural network for face recognition with poseand illumi-nation variation[J],International Journal of En-gineering and Technology(IJET),2014,6(1):44-57.
[6]Lin, Tsung-Yi, Piotr Dollár, Ross B. Girshick, Kaiming He,Bharath Hariharan and Serge J. Belongie. Feature Pyramid Net-works for Object Detection[J]. IEEE Conference on ComputerVision and Pattern Recognition(CVPR)(2017):936-944.
[7]Ren,Shaoqing,Kaiming He,Ross B. Girshick and Jian Sun.Faster R-CNN:Towards Real-Time Object Detection withRegion Proposal Networks. IEEE Transactions on PatternAnalysis and Machine Intelligence. 39(2015):1137-1149.