Face Detection and Recognition Based on Sparsity Methods

Original title: 基于稀疏性的人脸检测与识别方法研究
Author: 晏哲 (Yan Zhe)
Degree level: Master's
Discipline: Pattern Recognition and Intelligent Systems
Keywords: Face Detection; Face Recognition; Kernel Method; Sparse Representation Classifier; Classifier Ensemble; 1-norm Support Vector Machine
Degree year: 2010
Advisor: 张莉 (Zhang Li)
Discipline code: 081104
Degree-granting institution: 西安电子科技大学 (Xidian University)
Thesis submission date: 2010-01-01

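Abstract

The human face is an information-rich collection of patterns and the primary cue by which people distinguish, recognize, and remember one another. Face detection and recognition occupy an important place in computer vision, pattern recognition, and multimedia research, and they remain among the most challenging problems in pattern recognition and machine vision. This thesis addresses both face detection and face recognition. The main contributions are as follows: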
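     The sparse representation classification (SRC) algorithm draws on the basic principles of compressed sensing: it classifies a test sample by solving for the sparse coefficient vector that best represents the sample as a linear combination of all training samples. Experiments show that when samples from different classes are distributed along the same direction, SRC cannot classify them correctly once the samples are normalized to unit length. To address this, the thesis introduces Mercer kernels into SRC and proposes the kernel sparse representation classification (KSRC) algorithm. Because the Gaussian kernel function can serve as a similarity measure between samples, it resolves the problem in the original algorithm. Simulation experiments on artificial data, UCI data sets, and face databases all confirm the effectiveness of the improved algorithm.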
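     The random dimensionality-reduction projection used in both KSRC and SRC is a very effective way to reduce dimensionality in practice. However, different random projection matrices lead KSRC to different recognition results, so the stability of the classifier cannot be guaranteed. For an unstable classifier, ensembling is a good way to improve performance. We therefore propose a multiple-classifier ensemble method based on KSRC; the decision-level fusion rules that can be used include Max, Min, Sum, Mean, and Majority Vote. Experiments confirm the effectiveness of the ensemble and indicate that Sum and Mean are the better combination rules.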
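     The standard support vector machine (SVM) has already been applied to face detection, but its large number of support vectors makes detection slow. The 1-norm SVM replaces the 2-norm regularization term of the standard SVM with a 1-norm term, which induces sparsity. The solution of the 1-norm SVM has been proven to be sparser than that of the standard SVM, so we apply the 1-norm SVM to face detection with the aim of increasing detection speed. The 1-norm SVM face detection system is built on a classic face detection framework, with a final step added to remove overlapping detections of the same face. Experiments confirm that the 1-norm SVM does increase detection speed while achieving detection accuracy comparable to that of the SVM.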
The human face carries extremely rich information and is the key cue for distinguishing, recognizing, and remembering individuals. It plays an important role in computer vision, pattern recognition, and multimedia technology, and automatic face recognition is therefore one of the most challenging subjects in pattern recognition and computer vision. This thesis covers two areas of automatic face analysis: face detection and face recognition. The main work is as follows.
     Based on the theory of compressed sensing (CS), the sparse representation classifier (SRC) seeks a sparse vector that gives the best linear representation of a test sample in terms of all training samples, and uses that vector to classify the sample. If samples from different classes are distributed along the same direction, SRC cannot classify them correctly after they are normalized to unit length. To solve this problem, we propose the kernel sparse representation classifier (KSRC), which introduces Mercer kernels into SRC. Because the Gaussian (RBF) kernel function acts as a similarity measure between samples, it resolves the problem. Experiments on artificial data, UCI data sets, and the Extended Yale B database verify the effectiveness of KSRC.
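
The failure case described above can be illustrated with a small sketch. It is not the thesis's KSRC implementation: the toy data, the function names, and the kernel width `sigma` are all assumptions chosen for illustration. It only shows that unit normalization maps samples lying along one direction onto a single point, while a Gaussian kernel evaluated on the raw samples still separates the two classes.

```python
import math

def normalize(v):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def gaussian_kernel(a, b, sigma=1.0):
    """Gaussian similarity k(a, b) = exp(-||a - b||^2 / (2 * sigma^2))."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

# Hypothetical toy data: both classes lie on the ray through (1, 1).
class_a = [[1.0, 1.0], [1.5, 1.5]]
class_b = [[5.0, 5.0], [5.5, 5.5]]
test_sample = [5.2, 5.2]  # lies among the class B samples

# After unit normalization every sample maps to the same point,
# so a linear representation cannot tell the classes apart.
unit_points = {
    tuple(round(x, 9) for x in normalize(v))
    for v in class_a + class_b + [test_sample]
}
collapsed = len(unit_points) == 1

# The Gaussian kernel on the raw samples depends on distance,
# so the test sample remains far more similar to class B.
sim_a = max(gaussian_kernel(test_sample, v) for v in class_a)
sim_b = max(gaussian_kernel(test_sample, v) for v in class_b)

print("normalized samples coincide:", collapsed)
print("more similar to class B:", sim_b > sim_a)
```

In this toy setting the normalized samples are indistinguishable, whereas the kernel similarities clearly favor class B.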
     Building on KSRC, we present an ensemble method for KSRCs. The random projection used in KSRC and SRC is an effective means of dimensionality reduction, but different random matrices lead KSRC to different results. We therefore use an ensemble of KSRCs to stabilize the algorithm. Several rules can combine the multiple classifiers, including the Max, Min, Sum, Mean, and Majority Vote rules. Experiments verify the effectiveness of combining classifiers and indicate that Sum and Mean are the better rules.
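
The five combination rules named above can be sketched as follows. This is a minimal illustration, not the thesis's code: the function name `combine`, the score matrix, and the convention that a higher score means more support for a class are assumptions (a KSRC-style classifier would first have to turn its per-class residuals into such scores).

```python
from collections import Counter

def combine(scores, rule):
    """Fuse per-classifier class scores and return the winning class index.

    scores[i][c] is the score classifier i assigns to class c
    (assumed convention: higher means more support).
    """
    n_classes = len(scores[0])
    # Gather, for each class, the scores given by all classifiers.
    columns = [[s[c] for s in scores] for c in range(n_classes)]
    if rule == "max":
        fused = [max(col) for col in columns]
    elif rule == "min":
        fused = [min(col) for col in columns]
    elif rule == "sum":
        fused = [sum(col) for col in columns]
    elif rule == "mean":
        fused = [sum(col) / len(col) for col in columns]
    elif rule == "vote":
        # Each classifier votes for its own top-scoring class.
        votes = Counter(max(range(n_classes), key=s.__getitem__) for s in scores)
        fused = [votes.get(c, 0) for c in range(n_classes)]
    else:
        raise ValueError(f"unknown rule: {rule}")
    # Ties are broken in favor of the lowest class index.
    return max(range(n_classes), key=fused.__getitem__)

# Hypothetical scores from three classifiers over three classes.
scores = [
    [0.60, 0.30, 0.10],
    [0.20, 0.50, 0.30],
    [0.45, 0.35, 0.20],
]
for rule in ("max", "min", "sum", "mean", "vote"):
    print(rule, combine(scores, rule))
```

With equal numbers of scores per class, Sum and Mean differ only by a constant factor and therefore always pick the same class; the other rules can disagree, as the Min rule does on this toy input.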
     Support vector machines (SVMs) have been applied to face detection, but their test speed is unsatisfactory. To improve it, we apply 1-norm SVMs to face detection. The 1-norm SVM adopts 1-norm regularization, which induces sparsity. It has been shown that the solution of the 1-norm SVM is sparser than that of the standard SVM, so using the 1-norm SVM for face detection can improve detection speed. Experiments verify that detection time is reduced.
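
For orientation, the difference between the two regularizers can be written out. The abstract does not give the exact optimization problems, so the formulation below is one commonly used form stated as an assumption; the symbols $w$, $\beta$, $b$, $\xi_i$, $C$, $\phi$, and the kernel $k$ are introduced here for illustration.

```latex
% Standard soft-margin SVM (2-norm regularization)
\min_{w,\,b,\,\xi}\ \tfrac{1}{2}\lVert w\rVert_2^2 + C\sum_{i=1}^{n}\xi_i
\quad\text{s.t.}\quad y_i\bigl(w^{\top}\phi(x_i) + b\bigr) \ge 1 - \xi_i,\quad \xi_i \ge 0

% 1-norm SVM on the kernel expansion f(x) = \sum_j \beta_j k(x, x_j) + b
\min_{\beta,\,b,\,\xi}\ \lVert \beta\rVert_1 + C\sum_{i=1}^{n}\xi_i
\quad\text{s.t.}\quad y_i\Bigl(\sum_{j=1}^{n}\beta_j\,k(x_i, x_j) + b\Bigr) \ge 1 - \xi_i,\quad \xi_i \ge 0
```

Under this form, evaluating $f(x)$ on an image window needs one kernel evaluation per nonzero $\beta_j$, so a sparser $\beta$ translates directly into less computation per window.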

