A multi-pose face frontalization method based on encoder-decoder network (基于编解码网络的多姿态人脸图像正面化方法)

English title: A multi-pose face frontalization method based on encoder-decoder network
Authors: Haiyue XU (徐海月); Naiming YAO (姚乃明); Xiaolan PENG (彭晓兰); Hui CHEN (陈辉); Hongan WANG (王宏安)
Affiliations: Beijing Key Laboratory of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences; State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences
Keywords: face frontalization; convolutional neural network; encoder-decoder network; multitask learning; face recognition; facial expression recognition
Chinese journal name: PZKX
English journal name: Scientia Sinica (Informationis)
Publication date: 2019-04-15 14:00
Publisher: Scientia Sinica Informationis (中国科学:信息科学)
Year: 2019
Issue: v.49
Funding: National Key Research and Development Program of China (Grant No. 2016YFB1001405); National Natural Science Foundation of China (Grant No. 61661146002); Key Research Program of Frontier Sciences, Chinese Academy of Sciences (Grant No. QYZDY-SSW-JSC041)
Language: Chinese
Pages: PZKX201904006
Page count: 14
CN: 04
ISSN: 11-5846/TP
Classification number: 86-99

Abstract

Multi-pose face frontalization can alleviate the influence of head-pose variation on face analysis tasks. Earlier methods that synthesize a frontal face image directly from a multi-pose face image tend to lose fine facial details. To address this problem, we propose a face frontalization method based on an encoder-decoder network, the multitask convolutional encoder-decoder network (MCEDN). MCEDN introduces a frontal raw feature network that synthesizes the global raw features of the frontal face. The decoder then fuses these raw features with the local features that the encoder extracts from the multi-pose face image, compensating for missing detail and synthesizing a clearer frontal face image. A multitask learning mechanism builds an end-to-end model that unifies three modules (local feature extraction, frontal raw feature synthesis, and frontal image synthesis), and sharing parameters among them improves the performance of the whole model. Compared with existing methods, MCEDN synthesizes frontal face images with stable structure and clear details on multiple datasets. We use the synthesized frontal face images directly for face recognition and facial expression recognition; the recognition accuracy reaches the state of the art, which indicates that MCEDN effectively preserves facial detail features and supports face analysis tasks.
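
The data flow described in the abstract can be sketched roughly as follows. This is a toy illustration, not the authors' implementation: the class `ToyMCEDN`, the dense layers standing in for convolutional ones, the choice to feed encoder features into the raw feature network, fusion by concatenation, and the loss weights `w_raw` and `w_img` are all assumptions, since the abstract does not specify them.

```python
import random
from typing import List, Tuple

Vector = List[float]
Matrix = List[Vector]


def random_matrix(rows: int, cols: int, rng: random.Random) -> Matrix:
    """Small random weights; a stand-in for trained convolutional layers."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Plain matrix-vector product."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def relu(v: Vector) -> Vector:
    return [max(0.0, x) for x in v]


def mse(a: Vector, b: Vector) -> float:
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


class ToyMCEDN:
    """Hypothetical three-module layout: encoder, frontal raw feature network, decoder."""

    def __init__(self, in_dim: int, local_dim: int, raw_dim: int, seed: int = 0):
        rng = random.Random(seed)
        self.encoder = random_matrix(local_dim, in_dim, rng)
        self.raw_net = random_matrix(raw_dim, local_dim, rng)
        self.decoder = random_matrix(in_dim, local_dim + raw_dim, rng)

    def forward(self, posed_face: Vector) -> Tuple[Vector, Vector, Vector]:
        # Module 1: local features extracted from the multi-pose input.
        local = relu(matvec(self.encoder, posed_face))
        # Module 2: frontal raw features (assumed here to reuse the encoder output).
        raw = relu(matvec(self.raw_net, local))
        # Module 3: fuse both feature sets (assumed: concatenation) and decode.
        fused = local + raw
        frontal = matvec(self.decoder, fused)
        return local, raw, frontal


def multitask_loss(raw: Vector, raw_target: Vector,
                   frontal: Vector, frontal_target: Vector,
                   w_raw: float = 0.5, w_img: float = 1.0) -> float:
    """Weighted sum of the two task losses; the weights are illustrative only."""
    return w_raw * mse(raw, raw_target) + w_img * mse(frontal, frontal_target)


if __name__ == "__main__":
    model = ToyMCEDN(in_dim=16, local_dim=8, raw_dim=4)
    posed = [0.1 * i for i in range(16)]
    local, raw, frontal = model.forward(posed)
    print(len(local), len(raw), len(frontal))  # 8 4 16
```

Because the raw feature network and the decoder both consume the encoder's output, a gradient from either loss term would update the encoder, which is one plausible reading of "sharing parameters" in a multitask setup.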

