Research on Human Object Detection and Behavior Recognition in Video Stream

Original title: 基于视频流的人体目标检测与行为识别研究
Author: Fu Zhaoxia (付朝霞)
Degree level: Doctoral
Discipline: Signal and Information Processing
Keywords: behavior recognition; space-time interest point; mean Hausdorff distance; centroid weighting; Earth Mover's Distance
Degree year: 2013
Supervisor: Wang Liming (王黎明)
Discipline code: 081002
Degree-granting institution: North University of China (中北大学)
Submission date: 2013-11-01

Abstract

Object detection, tracking, and recognition in video streams are active research topics in computer vision and pattern recognition, with broad application prospects in intelligent video surveillance, advanced human-computer interaction, mobile robot localization and navigation, and virtual reality. Decades of sustained research have produced many results. However, the complexity of the environments in which vision systems operate and the diversity of the targets themselves make detection, tracking, and recognition very difficult. Practice shows that general-purpose object detection, tracking, and recognition are still far from mature and remain some distance from practical use, so more practical and robust algorithms are needed. From both theoretical and practical perspectives, this dissertation studies key techniques for recognizing moving objects from video input, covering moving object detection, moving object tracking, and the representation and recognition of motion features.
     This dissertation studies background modeling methods and proposes a video-stream object detection algorithm based on statistical classification of pixels. Each pixel value is treated as a combination of a foreground Gaussian distribution and a background Gaussian distribution, which is used for background estimation and adaptive background updating. The number of foreground pixels in the current frame is counted to detect abrupt illumination changes, and the method is combined with frame differencing to detect moving objects. Simulation experiments show that the algorithm detects foreground moving objects accurately and in real time, with greater adaptability. Face detection experiments against complex backgrounds show that the algorithm also has practical value for face detection based on skin-color information. The dissertation also proposes a circle detection algorithm based on chain-code calibration: mathematical morphology is used to denoise and fill the binary image and to extract its edges, and the chain code is then used to compute a circularity parameter. Experiments show that the algorithm is simple and effective, with an error of less than one pixel.
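The detection steps described above can be illustrated with a minimal sketch. It is not the dissertation's implementation: it keeps only a single background Gaussian per pixel and treats outliers as foreground, it combines background subtraction and frame differencing with a logical AND, and the function names and the parameters `k`, `diff_thresh`, `max_fraction`, and `alpha` are all hypothetical choices made for illustration.

```python
from typing import List

Frame = List[List[float]]  # grayscale image as rows of pixel values


def classify_pixels(frame: Frame, prev: Frame, mean: Frame, var: Frame,
                    k: float = 2.5, diff_thresh: float = 15.0) -> List[List[bool]]:
    """Return a foreground mask for the current frame (assumed combination rule)."""
    mask = []
    for y, row in enumerate(frame):
        mask.append([
            # pixel is unlikely under the background Gaussian ...
            (v - mean[y][x]) ** 2 > k * k * var[y][x]
            # ... and it also changed since the previous frame
            and abs(v - prev[y][x]) > diff_thresh
            for x, v in enumerate(row)
        ])
    return mask


def illumination_changed(mask: List[List[bool]], max_fraction: float = 0.7) -> bool:
    """Flag an abrupt illumination change when most pixels look like foreground."""
    total = sum(len(row) for row in mask)
    foreground = sum(sum(row) for row in mask)
    return foreground > max_fraction * total


def update_background(frame: Frame, mask: List[List[bool]],
                      mean: Frame, var: Frame, alpha: float = 0.05) -> None:
    """Adapt the background Gaussian in place, using background pixels only."""
    for y, row in enumerate(frame):
        for x, v in enumerate(row):
            if not mask[y][x]:
                d = v - mean[y][x]
                mean[y][x] += alpha * d
                var[y][x] = (1 - alpha) * var[y][x] + alpha * d * d
```

In a frame loop, one plausible use is to call `classify_pixels`, reinitialize the background from the current frame when `illumination_changed` returns true, and otherwise call `update_background`.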
     For multi-target tracking, the dissertation proposes a tracking algorithm that incorporates corner features. An improved Harris operator extracts evenly distributed, stable feature points from the moving targets, and feature matching followed by match optimization completes the tracking of multiple moving targets in video. Tracking experiments show that the algorithm matches features stably under viewpoint changes, rotation, affine transformation, and illumination changes, and that it tracks targets stably under slight partial occlusion. The classic Mean shift tracking algorithm is also studied. Because it fails on fast-moving targets and suffers from error accumulation, the dissertation proposes a tracking algorithm based on a centroid-weighted Kalman filter. Background subtraction locks onto the region of the moving target; at the start of tracking, the Kalman filter predicts the target position; a centroid weighting step then refines the predicted position; and the measurement update is performed with the corrected state prediction, yielding a more accurate target location. Simulation experiments show that the algorithm detects moving objects effectively while tracking them quickly and accurately, with good real-time performance and robustness.
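The predict, refine, and update cycle of the centroid-weighted Kalman tracker can be sketched as follows. This is an illustration under stated assumptions rather than the dissertation's method: it uses an independent constant-velocity filter per image axis with a time step of one frame, it takes the weighted centroid of a foreground map inside a search window around the predicted position as the measurement, and the names `AxisKalman`, `weighted_centroid`, `track_step`, and the noise values `q` and `r` are hypothetical.

```python
from typing import List, Optional, Tuple


class AxisKalman:
    """Constant-velocity Kalman filter for one image axis (state: position, velocity)."""

    def __init__(self, pos: float, q: float = 0.01, r: float = 1.0) -> None:
        self.x, self.v = pos, 0.0
        # 2x2 state covariance; the velocity is initially very uncertain
        self.p00, self.p01, self.p10, self.p11 = 1.0, 0.0, 0.0, 10.0
        self.q, self.r = q, r  # process and measurement noise (assumed values)

    def predict(self) -> float:
        self.x += self.v  # time step of one frame
        p00 = self.p00 + self.p01 + self.p10 + self.p11 + self.q
        p01 = self.p01 + self.p11
        p10 = self.p10 + self.p11
        self.p00, self.p01, self.p10, self.p11 = p00, p01, p10, self.p11 + self.q
        return self.x

    def update(self, z: float) -> None:
        s = self.p00 + self.r                    # innovation variance
        k0, k1 = self.p00 / s, self.p10 / s      # Kalman gain
        y = z - self.x                           # innovation
        self.x += k0 * y
        self.v += k1 * y
        p00, p01 = self.p00, self.p01
        self.p00, self.p01 = (1 - k0) * p00, (1 - k0) * p01
        self.p10, self.p11 = self.p10 - k1 * p00, self.p11 - k1 * p01


def weighted_centroid(weights: List[List[float]], cx: float, cy: float,
                      half: int) -> Optional[Tuple[float, float]]:
    """Weighted centroid inside a square window centred on (cx, cy)."""
    h, w = len(weights), len(weights[0])
    x0, x1 = max(0, int(round(cx)) - half), min(w - 1, int(round(cx)) + half)
    y0, y1 = max(0, int(round(cy)) - half), min(h - 1, int(round(cy)) + half)
    total = sx = sy = 0.0
    for y in range(y0, y1 + 1):
        for x in range(x0, x1 + 1):
            wgt = weights[y][x]
            total += wgt
            sx += wgt * x
            sy += wgt * y
    if total == 0:
        return None  # nothing detected in the window
    return sx / total, sy / total


def track_step(kx: AxisKalman, ky: AxisKalman, weights: List[List[float]],
               half: int = 4) -> Tuple[float, float]:
    """One tracking step: predict, refine with the centroid, then update."""
    px, py = kx.predict(), ky.predict()
    c = weighted_centroid(weights, px, py, half)
    if c is not None:
        kx.update(c[0])
        ky.update(c[1])
    return kx.x, ky.x
```

Here `weights` stands for any per-pixel foreground weight map, for example the output of a background subtraction step; when the window is empty the sketch simply keeps the prediction.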
     For face recognition under complex, varying illumination, the dissertation proposes an algorithm based on the local binary pattern (LBP) operator and the Earth Mover's Distance (EMD). The image first undergoes a series of simple, effective preprocessing steps to improve robustness; local LBP features are then extracted to obtain the LBP histogram of the image, and EMD is computed between LBP histograms to measure image similarity. Experiments on the GTAV standard face database show that the algorithm significantly improves the recognition rate. Human behavior recognition and understanding is a higher-level vision task. After reviewing existing human behavior recognition algorithms, the dissertation proposes an algorithm based on space-time interest points. 3D Harris corners extract the spatio-temporal features of different behaviors; K-means clustering combined with locally linear embedding (LLE) then reduces the dimensionality of the extracted motion features and classifies them; and in training and recognition, similarity matching between image sequences is performed with a geometric measure, the mean Hausdorff distance. Experiments on the KTH database show that the algorithm is effective and feasible, and that the stream-based trajectory recognition method further improves recognition accuracy.
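The similarity matching step can be illustrated with a small sketch of the mean Hausdorff distance between two point sets. The abstract does not give the exact definition used, so this assumes a common variant: the directed distance averages each point's distance to its nearest neighbour in the other set, and the symmetric distance takes the larger of the two directions. The names `directed_mean_hausdorff`, `mean_hausdorff`, and `classify`, and the nearest-template decision rule, are hypothetical.

```python
import math
from typing import Dict, Sequence, Tuple

Point = Tuple[float, ...]


def directed_mean_hausdorff(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Average distance from each point of a to its nearest point in b."""
    return sum(min(math.dist(p, q) for q in b) for p in a) / len(a)


def mean_hausdorff(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Symmetric mean Hausdorff distance (assumed: maximum of both directions)."""
    return max(directed_mean_hausdorff(a, b), directed_mean_hausdorff(b, a))


def classify(sample: Sequence[Point], templates: Dict[str, Sequence[Point]]) -> str:
    """Label a sample with the template whose point set is closest."""
    return min(templates, key=lambda label: mean_hausdorff(sample, templates[label]))
```

Compared with the classic Hausdorff distance, which takes the maximum over points, averaging makes the measure less sensitive to a single outlying point, which is one reason it is often preferred for noisy feature sets.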

