用户名: 密码: 验证码:
基于视频的目标检测、跟踪及其行为识别研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
目标的检测、跟踪与行为识别作为视频监控的主要研究内容,是当前计算机视觉领域的研究热点,其不但具有重要的实际意义,而且对计算机视觉的其他研究领域有着重要的推动作用。视频监控技术研究的主要目的是赋予机器视觉系统人类的视觉感知功能,以能够在图像序列中发现目标、跟踪目标,并对目标的行为进行识别和理解。经过几十年尤其是近十年的不懈研究,上述技术取得了长足的进步,但实践表明一般意义上的目标检测、跟踪与行为识别技术还远未成熟,要开发出真正鲁棒、实用的视频监控系统还需要更为鲁棒的核心算法。本论文主要对视频监控相关的关键技术进行研究,研究内容涉及目标的检测、目标的跟踪以及人体行为识别等方向,内容涵盖了可见光图像和红外图像领域。现对论文的主要创新点概述如下:
     1)提出一种基于几何约束和颜色信息的人脸检测算法
     该方法在充分考虑人脸区域与头发区域的颜色特征与几何关系的基础上,给出了用于描述人脸区域和头发区域之间几何约束的表达模型;在对肤色区域和头发区域分别进行检测后,根据不同候选区域之间的几何关系,通过几何约束对人脸和头发可能存在的区域进行特征判别,完成对图像中的人脸检测。
     2)分别提出一种单幅红外图像中和序列红外图像中的人体检测方法
     在单幅红外图像中,针对红外图像中人体图像亮度较高的特点,首先通过亮度方向投影确定可能存在人体的候选区域,进而采用方向梯度直方图对候选目标进行描述。最后将方向梯度直方图作为输入向量采用Fisher线性判别和贝叶斯分类器对候选目标进行分类,完成对候选目标中存在的人体进行检测。在红外序列图像中,首先采用自适应高斯混合模型对序列图像进行背景建模,在准确分割出前景目标的基础上,设计了一种新的人体目标表达模型。以人体表达模型作为输入向量,构建支持向量机对人体目标进行分类判别。在不同的红外场景下进行人体检测实验时,所提出的两种算法均取得了满意的检测效果。
     3)从自适应目标表达特征的选取和遮挡情况下的目标跟踪两个角度出发对Mean-Shift框架下的目标跟踪问题进行了改进
     Mean-Shift框架下的目标跟踪大多采用静态目标表达模型,这在动态变化的场景中容易导致跟踪失败。针对该缺点,论文提出一种基于自适应特征生成模型的目标跟踪方法。通过构建目标和背景的局部信噪比,对当前目标所处特征空间的可跟踪性进行量化评估,选用性能最优的表达模型作为当前的特征跟踪模型。实验表明,与采用静态模型相比,提出的算法具有更好的鲁棒性和可行件。
     经典的Mean-Shift算法要求目标在连续两帧之间部分的重合,在目标发生遮挡时难以满足该条件。该论文将目标的运动在较短时间内看作一时不变系统,通过引入Kalman滤波进行参数辨识而使发生遮挡后的跟踪系统具有后续状态预测的能力。整个跟踪过程分为Mean-Shift跟踪下的Kalman参数辨识和基于Kalman状态估计的Bhattacharyya系数分析两个子过程交替执行。对不同的视频序列测试的结果表明,算法能够对发生遮挡后的目标进行持续、稳健的跟踪。
     4)在粒子滤波跟踪框架下提出一种多线索融合方法
     在复杂的动态背景下,采用多线索进行目标跟踪可以提高系统的鲁棒性。论文在注意到不同特征具有不同的鉴别性能的基础上,从分析采样粒子和参考样本的特征空间距离和物理距离之间的关系出发,提出相对鉴别系数这一概念对不同特征问的鉴别性能进行描述,进而采用二次加权的方法对不同特征进行融合。实验结果表明,所提出的算法在多个复杂场景下均能够对目标进行准确、鲁棒的跟踪。
     5)提出一种基于周期运动分析的人体运动识别方法
     人体运动识别是视频监控的最终研究目标之一,论文对此提出一种基于周期运动分析的人体运动识别方法。该方法首先通过背景抽取获取人体的前景图像,并采用3个参数对人体的轮廓变化进行测量,以获取人体运动的周期。然后将人体图像投影到一个低维的特征空间中,此时同一周期所包含的特征点组成一个封闭的环。最后设计了一种特征距离函数求取不同的运动环之间的距离,对不同的运动进行分类识别。
     上述的几个创新点,按照在视频监控任务中所处的层次,由低到高有机连接,为视频监控技术的实用化提供了理论支持。
As the main research tasks in video surveillance, object detection, tracking and behavior recognition are widely studied in computer vision. They not only have great practical significances, but also play important roles to other topics' progress in computer vision. The ultimate goal of object detection, tracking and behavior recognition is to give the machine vision system human's cognitive ability, so as to find, track and recognize behavior of the target in image sequences. After more than tens years' development, great progress has been made in this field especially during the past ten years. However, practical experience has shown that video surveillance technologies are currently far from mature. A great number of challenges need to be solved before one can implement a robust video surveillance system for commercial applications.
     This thesis tries to get insights on some key issues in video surveillance; these issues include face detection, human detection in infrared images, object tracking and human behavior classification. Some important technologies and solutions are studied which are necessary for robust and practical video surveillance systems. The main contributions of this thesis can be concluded as follows.
     1) A face detection algorithm is proposed based on the color and geometric information. In this method, a geometric model is designed to describe geometric relations between the face region and the hair region. After a coarse detecting for the skin likeness regions and hair likeness regions, the geometric constrains between them is adopted to detect the face and the hair. Different detecting results indicate that the proposed method is both efficient and robust.
     2) The pedestrian detection problem in infrared images is solved from two aspects: pedestrian detection in single image and pedestrian detection in image sequences. For pedestrian detection in single infrared image, the regions of interest (ROI) are located based on the high brightness property of the pedestrian pixels, and then histograms of oriented gradients (HOG) are adopted to describe the ROI. Taking HOG as input vector, the pedestrian region is detected through Fisher linear discriminant and Bayesian classifier. For pedestrian detection in infrared image sequences, we first adopted a GMM (Gauss Mixture Model) to construct the background model, and then on the basis of segmenting the forward object, a shape-based human representing vector was designed. Taking account of occlusions among multi-humans, the intensity projection curve was applied to separate single body from occlusions. Finally, we took human shape vectors as samples, training a SVM (Support Vector Machine) classifier to detect the humans among foreground objects. Experimental results show that both of the two proposed methods are robust and efficient.
     3) Two algorithms are proposed to improve the performances of Mean-Shift tracking framework. On the one hand, according to the poor tracking ability when adopts static feature model, this thesis presents an adaptive feature generating model based tracking program. In this program, the object is seemed as valid tracking signal, on the contrary, the background is seemed as noise; after constructing the likelihood maps, a local SNR (Signal Noise Ratio) is computed to evaluate the tracking ability of current feature space, and the feature space with maximal SNR is selected as the optimal tracking feature space. On the other hand, an improved Mean-Shift based tracking algorithm is proposed to solve the poor tracking ability problem in occlusions. A time-invariant system is used to describe the movement of the target during a short time sequences, and through Kalman filter we identify this system so as to make it have ability estimate the coming states while occlusions taken place. The whole tracking system can divided into two parts: a Kalman parameter identifying system based on the object tracking and a Bhattacharyya coefficient analyzing system based on the Kalman state estimating; in the tracking process those two parts run by turns according to different cases. Experiment results of variant video sequences demonstrate that the proposed methods are more robust and feasible than the classical one.
     4) An algorithm for fusing multiple cues adaptively in particle filter tracking framework is proposed. Though it is noted that the fusion of multiple cues will lead to an increased reliability of the tracking system, most of current tracking algorithms are based on single cue and are, therefore, often limited to a particular environment. This thesis present a novel multi-cue based tracking method under the particle filter framework. Taking account of both the practical distance and the Bhattacharyya distance between particles and target, a parameter which called Relative Discriminant Coefficient (RDC) is presented to measure the tracking ability for different features. Multi-cue fusion is carried out in a reweighing manner based on this parameter. Experimental results demonstrate the high robustness and effectiveness of our method in handling appearance changes, cluttered background, illumination changes and occlusions.
     5) A periodic motion analysis based human action recognition algorithm is proposed. Recognizing human action is a critical step in many computer vision applications. In this thesis the human behavior classification problem is addressed from a periodic motion analysis viewpoint. Our approach uses human silhouettes as motion features which can be obtained very efficiently, and subsequently the periodic motions are obtained by measuring the human shape's deformation. After a periodic analysis, each action unit is represented as a closed loop in the projection space and matching is performed to deicide the action's type by computing the distances among these loops. To demonstrate the effectiveness of this approach, human behavior classification experiments were performed on an open dataset. Classification results are very accurate and show that this approach is promising and efficient.Above proposed novel ideas in this thesis are try to solve three basic problems in video surveillance research: object detection, object tracking and object behavior recognition, and provide theoretic basis for commercial applications. These proposed novel ideas are associated with each other according to the research level in video surveillance.
引文
[1] D Marr. Vision [M]. W H Freeman and Co., 1982.
    [2] A Blake, A Zisserman. Surface Descriptions from Stereo and Shading. Image and Vision Computing, 1992, 3(2): 183-191.
    [3] A Yilmaz, C Rao, M Shah. View Invariant Representation and Recognition of Actions. International Journal of Computer Vision, 2002, 50(2):203-226.
    [4] Z Rasheed, M Shah. Scene Detection in Hollywood Movies and TV Shows. In Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, 2003:343-348.
    [5] I Haritaoglu, D Harwood, L Davis. W4: Real-time Surveillance of People and Their Activities. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2000, 22(8):809-830.
    [6] I Cohen, G Medioni. Detecting and tracking moving objects for video surveillance. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999, Vol. 2:319-325.
    [7] A M Elgammal, L S Davis. Probabilistic Framework for Segmenting People under Occlusion. In Proceeding of IEEE International Conference on Computer Vision, 2001: 145-152.
    [8] C Stauffer, W Grimson. Adaptive Background Mixture Models for Real-time Tracking. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999, Vol. 2:246-252.
    [9] N Ohta. A Statistical Approach to Background Subtraction for Surveillance Systems. In Proceedings of IEEE International Conference on Computer Vision, 2001, Vol.2:481-486.
    [10] A Mital, N Paragios. Motion-based Background Subtraction Using Adaptive Kernel Density Estimation. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004, Vol.2:302-309.
    
    [11] A J Lipton, H Fujiyoshi, R S Patil. Moving Target Classification and Tracking from Real-time Video. In Proceedings of IEEE Workshop on Applications of Compute Vision, 1998:8-14.
    
    [12] A Neri, S Colonnese, G Russo, et ah Automatic Moving Object and Background Separation. Signal Processing, 1998, 66(2): 219-232.
    [13] R Mech, M Wollborn. A Noise Robust Method for Segmentation of Moving Objects in Video Sequences. In Proceedings of International Conference on Acoustics, Speech and Signal Processing, 1997:41-45.
    [14] F Moscheni, S Bhattacharee, M Kunt. Spatial Temporal Segmentation Based on Region Merging. IEEE Transaction on Pattern Analysis and Machine Intelligence, 1998, 20(9):897-915.
    [15] A Verri, S Uras, E De Micheli. Motion Segmentation from Optical Flow. In Proceedings of the 5th Alvey Vision Conference, 1989:209-214.
    [16] J Barron, D Fleet, S Beauchemin. Performance of Optical Flow Techniques. International Journal of Computer Vision, 1994,12(1):42-77.
    [17] N Friedman, S Russell. Image Segmentation in Video Sequences: A Probabilistic Approach. In Proceedings of the 13th Conference on Uncertainty in Artificial Intelligence, 1997:175-181.
    [18] G McLach Ian, T Krishnan.The EM Algorithm and Extensions. John Wiley& Sons, Inc., 1997.
    [19] R P Wildes. A Measure of Motion Salience for Surveillance Applications. In Proceedings of International Conference on Image Processing, 1998:183-187.
    [20] E Stringa. Morphological Change Detection Algorithms for Surveillance Applications. In Proceedings of British Machine Vision Conference, 2000:402- 411.
    [21] Y Kuno, T Watanabe, Y Shimosakoda, et al. Automated Detection of Human for Visual Surveillance System. In Proceedings of IEEE International Conference on Pattern Recognition, 1996: 865-869.
    [22] R Cutler, L Davis. Robust Real-time Periodic Motion Detection, Analysis, and Applications. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2000, 22 (8):781-796.
    [23] A Selinger, L Wixson. Classifying Moving Objects as Rigid or Non-rigid without Correspondences. In Proceedings of DAPRA Image Understanding Workshop, 1998, Vol.l:341-358.
    [24] O Javed, M Shah. Tracking and Object Classification for Automated Surveillance. In Proceedings of European Conference on Computer Vision, 2002:343-357.
    [25] T Kanade, R Collins. Advances in Cooperative Multi-Sensor Video Surveillance. In Proceedings of DARPA Image Understanding Workshop, 1998:3-24.
    [26] C Stauffer. Automatic Hierarchical Classification Using Time-base Co-occurrences. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Rcognition, 1999: 333-339.
    [27] R Collins, A Lipton, H Fujiyoshi, et al. Algorithms for Cooperative Multisensor Surveillance. Proceedings of the IEEE, 2001, 89(10):1456-1477.
    [28] D Comaniciu, V Ramesh, P Meer. Kernel-based Object Tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25(5): 564-577.
    [29] M Oliver, B Rosario, N Pentland. A Bayesian Computer Vision System for Modeling Human Interactions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 831-843.
    [30] T Zhao, R Nevatia. Tracking Multiple Humans in Complex Situations. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(9):1208-1221.
    
    [31] S McKenna, S Jabri, Z Duric, et al. Tracking Groups of People. Computer Vision and Image Understanding, 2000, 80(1):42-56.
    [32] C Pai, H Tyan, Y Liang, et al. Pedestrian Detection and Tracking at Crossroads. Pattern Recognition, 2004, 37(5):1025-1034.
    [33] O Masoud, N Papanikolopoulos. A Novel Method for Tracking and Counting Pedestrians in Real-time Using a Single Camera. IEEE Transactions on Vehicular Technology, 2001, 50(5): 1267-1278.
    
    [34] Y Bar-Shalom, T Fortmann. Tracking and Data Association. Academic Press, 1988.
    [35] M Isard, A Blake. Condensation-conditional Density Propogation for Visual Tracking. International Journal of Computer Vision, 1998, 29 (1): 5-28.
    [36] Y Chen, J Z Wang. A Region-based Fuzzy Feature Matching Approach to Content-based Image Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(9): 1252 -1267.
    [37] Y Wu, T S Huang. Robust Visual Tracking by Integrating Multiple Cues Based on Co-Inference Learning. International Journal of Computer Vision, 2004, 58(1): 55-71.
    [38] P Pe'rez, J Vermaak, A Blake. Data Fusion for Tracking with Particles. Proceedings of the IEEE, 2004, 92 (3): 495-513.
    [39] P Li, F Chaumette. Image Cues Fusion for Object Tracking Based on Particle Filter. Lecture Notes in Computer Science, Vol. 3179, 2004: 99-107.
    [40] K Nummiaro, E Koller-Meier, L V Gool. An Adaptive Color-based Particle Filter. Image and Vision Computing, 2003,21 (1): 99-110.
    [41] T Wang, Q Diao, Y Zhang, et al. A Dynamic Bayesian Network Approach to Multi-cue based Visual Tracking. In Proceedings of The International Conference on Pattern Recognition, 2004: 167-170.
    [42] T N Tan, G D Sullivan, K D Baker. Model-based Localization and Recognition of Road Vehicles. International Journal of Computer Vision, 1998, 27(1):5-25.
    [43] D Koller, K Daniilidis, H Nagel. Model-based Object Tracking in Monocular Image Sequences of Road Traffic Scenes. International Journal of Computer Vision, 1993, 10(10):257-281.
    [44] S Ju, M Black, Y Yacoob. Cardboard People: A Parameterized Model of Articulated Image Motion. In Proceedings of International Conference on Automatic Face and Gesture Recognition, 1996:38-44.
    [45] S Watcher, H Nagel. Tracking Persons in Monocular Image Sequences. Computer Vision and Image Understanding, 1999, 74 (3): 174-192.
    [46] C R Wren, A Azarbayejani, T Darrell, et al. Pfinder: Real-time Tracking of Human Body. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(7):780-785.
    [47] N Paragios, R Deriche .Geodesic Active Contours and Level Sets for the Detection and Tracking of Moving Objects. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22 (3):266-280.
    [48] M Bertahnio, G Sapiroo, G Randll. Morphing Active Contours. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22 (7):733-737.
    [49] N Peterfreund. Robust Tracking of Position and Velocity with Kalman Snakes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22 (6):564-569.
    [50] M Isard, A Blake. Contour Tracking by Stochastic Propagation of Conditional Density. In Proceedings of European Conference on Computer Vision, 1996: 343-356.
    [51] A Baumberg, D Hogg. An Efficient Method for Contour Tracking Using Active Shape Models. In Proceedings of IEEE Workshop on Motion of Non-Rigid and Articulated Objects, 1994:194-199.
    [52] D S Jang, H I Choi. Active Models for Tracking Moving Objects. Pattern Recognition, 2000, 33 (7): 1135-1146.
    [53] J Segen, S Pingali. A Camera-based System for Tracking People in Real Time. In Proceedings of International Conference on Pattern Recognition, 1996:63-67.
    [54] M Oren, C Papageorgiou, P Sinha, et al. Pedestrian Detection Using Wavelet Templates. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1997: 193-199.
    [55] Y Wu, T Yu, G Hua. A Statistical Field Model for Pedestrian Detection. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, Vol. 1:1023-1030.
    [56] B Leibe, E Seemann, B Schiele. Pedestrian Detection in Crowded Scenes. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, Vol.1: 878-885.
    [57] Y Fang, K Yamada,Y Ninomiya, et al. A Shape-Independent Method for Pedestrian Detection with Far-Infrared Images. IEEE Transactions on Vehicular Technology, 2004, 53(6): 1679-1697.
    [58] M Bertozzi, A Broggi, C Caraffi, et al. Pedestrian Detection by Means of Far-Infrared Stereo Vision. Computer Vision and Image Understanding, 2007, 106(2-3): 194-204.
    [59] M Bertozzi, A Broggi, P Grisleri, et al. Pedestrian Detection in Infrared Images. In Proceedings of the International IEEE Conference on Intelligent Transportation Systems, 2003: 662-667.
    [60] H Nanda, L Davis. Probabilistic Template Based Pedestrian Detection in Infrared Videos. In Proceedings of IEEE Intelligent Vehicle Symposium, 2002, Vol.1: 15-20.
    [61] F Xu, X Liu, K Fujimura. Pedestrian Detection and Tracking with Night Vision. IEEE Transactions on Intelligent Transportation Systems, 2005, 6(1):63-71.
    [62] C Dai, Y Zheng, X Li. Layered Representation for Pedestrian Detection and Tracking in Infrared Imagery. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, Vol. 1:13-20.
    [63] E Goubet, J Katz, F Porikli. Pedestrian Tracking Using Thermal Infrared Imaging. Technical Report, TR2005-126. Mitsubishi Electric Research Laboratories, 2005.
    [64] M Yasuno, S Ryousuke, N Yasuda, et al. Pedestrian Detection and Tracking in Far Infrared Images. In Proceedings of IEEE Conference on Intelligent Transportation Systems, 2005:182-187.
    [65] R Collins, A Lipton. A System for Video Surveillance and Monitoring: VSAM Final Report. CMU-RI-TR-00-12, Robotic Institute Carnegie Mellon University, 2000.
    [66] P Remagnino, T Tan, K Baker. Multi-agent Visual Surveillance of Dynamic Scenes. Image and Vision Computing, 1998, 16 (8):529-532.
    [67] F Aaron, Bobick, J Davis. The Recognition of Human Movement Using Temporal Templates. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(3):257-267.
    [68] T Zhao, R Nevatia. Tracking Multiple Humans in Complex Situations. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(9): 1208-1221.
    [69] J W Davis. Sequential Reliable-Inference for Rapid Detection of human actions. In Proceedings of IEEE Conference on Advance Video and Signal Based Surveillance, 2003:169-176.
    [70] R Rosales, S Sclaroff. 3D Trajectory Recovery for Tracking Multiple Objects and Trajectory Guided Recognition of Actions. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999, Vol.2: 2117-2123.
    [71] A Kojima, M Izumi, T Tamura, et al. Generating Natural Language Description of Human Behavior from Video Images. In Proceedings of IEEE International Conference on Pattern Recognition, 2000: 728-731.
    [72] A F Bobick, A D Wilson. A State-based Technique for Summarization and Recognition of Gesture. In Proceedings of 5th IEEE International Conference on Computer Vision, 1995:382-388.
    [73] L W Campbell, A F Bobick. Recognition of Human Body Motion Using Phase Space Constraints. In Proceedings of 5th IEEE International Conference on Computer Vision, 1995:624-630.
    [74] L Wang, T Tan. Silhouette Analysis-based Gait Recognition for Human Identification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25(12): 1505-1518.
    [75] 王亮, 胡卫明,谭铁牛. 基于步态的身份识别. 计算机学报, 2003,26(3):353-360.
    [76]R Polana,R Nelson.Low Level Recognition of Human Motion.In Proceedings of IEEE Workshop on Motion of Non-Rigid and Articulated Objects,1994:77-82.
    [77]J Yamato,J Ohya.Recognizing Human Action in Time-Sequential Images Using Hidden Markov Models.In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition,1992:376-385.
    [78]A Psarrou,Gong S,Walter M.Recognition of Human Gestures and Behavior Based on Motion Trajectories.Image and Vision Computing,2002,20(5-6):349-358.
    [79]K Fukunaga,L D Hostetler.The Estimation of the Gradient of a Density Function with Applications in Pattern Recognition.IEEE Transaction on Information Theory,1975,21(I):32-40.
    [80]D Comaniciu,P Meer.Mean Shift:A Robust Approach toward Feature Space Analysis.IEEE Transaction on Pattern Analysis and Machine Intelligence,2002,24(5):603-619.
    [81]R Collins.Mean-Shift Blob Tracking through Scale Space.In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2003,Vol.2:234-240.
    [82]A Elgammal,R Duraiswami,L Davis.Probabilistic Tracking in Joint Feature-Spatial Spaces.In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2003,Vol.1:781-788.
    [83]C Yang,R Duraiswami,L Davis.Efficient Mean-Shift Tracking via a New Similarity Measure.In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2005,Vol.1:176-183.
    [84]彭宁嵩,杨杰,刘志,张风超.Mean-Shift跟踪算法中核函数窗宽的自动选取.软件学报,2005,16(9):1542-1550.
    [85]C Hue,J Cadre,P Perez.Sequential Monte Carlo Methods for Multiple Target Tracking and Data Fusion.IEEE Trans on Signal Processing,2002,50(2):309-325.
    [86]N Gordon,D Salmond,A Smith.Novel Approach to Nonlinear/Non-Gaussian Bayesian State Estimation.IEE Proceedings on Radar and Signal Processing,1993,140(2):107-113.
    [87]J Liu,R Chen.Sequential Monte Carlo Methods for Dynamical Systems.Journal of the American Statistical Association,1998,93(5):1032-1044.
    [88]G Kitagawa.Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models.Journal of Computational and Graphical Statistics,1996,5(1):1-25.
    [89]T Higuchi.Monte Carlo Filtering Using Genetics Algorithm Operator.Journal of Statistical Computation and Simulation,1997,59(1):1-23.
    [90]Y Mo,D Xiao.Hybrid System Monitoring and Diagnosing Based on Panicle Filter Algorithm.Acta Automation Sinica,2003,29(3):641-648.
    [91] R Collins, A Lipton, T Kanade. Introduction to the Special Section on Video Surveillance. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2000, 22(8):745-746.
    [92] R B Fisher. PETS04 Surveillance Ground Truth Data Set. In Proceeding of Sixth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2004:1-5.
    [93] P Ribeiro, J Santos-Victor. Human Activities Recognition from Video: modeling, feature selection and classification architecture. In Proceeding of Workshop on Human Activity Recognition and Modeling, 2005: 61-70.
    [94] T List, J Bins, R B Fisher, et al. Two Approaches to a Plug-and-play Vision Architecture - CAVIAR and Psyclone. In Proceeding of AAAI05 workshop on Modular Construction of Human-like Intelligence, 2005:16-23.
    [95] D Tweed, W Fang, R Fisher, et al. Exploring Techniques for Behaviour Recognition via the CAVIAR Modular Vision Framework. In Proceeding of Workshop on Human Activity Recognition and Modeling, 2005: 97-104.
    [96] S Blunsden , R Fisher, E Andrade. Recognition of Coordinated Multi Agent Activities: The Individual vs The Group. Technical Report, EDI-INF-RR-0830, The University of Edinburgh, 2006.
    [97] E Andrade, S Blunsden, R Fisher. Simulation of Crowd Problems for Computer Vision. In Proceeding of the First International Workshop on Crowd Simulation, 2005: 71-80.
    [98] C Maggioni, B Kammerer. Gesture Computer: History, Design, and Applications, Computer Vision for Human-Machine Interaction. Cambridge University Press, 1998.
    [99] W Freeman, C Weissman. Television Control by Hand Gestures. In Proceeding of International Conference on Automatic Face and Gesture Recognition, 1995: 179-183.
    [100] S Li, Z Zhang. FloatBoost Learning and Statistical Face Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(9): 1122-1123.
    [101] L Wang, T Tan, H Ning, et al. Fusion of Static and Dynamic Body Biometrics for Gait Recognition. IEEE Transactions on Circuits and Systems for Video Technology, 2004, 14(2): 149-158.
    [102] Y Fang, Y Wang, T Tan. Improving Face Detection through Fusion of Contour and Region Information. Chinese Journal of Computer, 2004, 27(4): 482-491.
    [103] H Liu, Y Wang, T Tan. Multi-Modal Data Fusion for Person Authentication using Improved ENN. Chinese Journal of Automation, 2004, 30(1):78-85.
    [104] C Xu, Y Wang, T Tan. Overview of Research on 3D Face Modeling. Journal of Image and Graphics, 2004, 9(8):893-903.
    
    [105] M Yang, D Kriegman, N Ahuja. Detecting Faces in Images: A Survey. IEEE Transaction on Pattern Analysis and Machine Intelligence, 2002, 24(1):34-58.
    [106] H Wu, Q Chen, M Yachida. Face Detection From Color Images Using a Fuzzy Pattern Matching Method. IEEE Transaction on Pattern Analysis and Machine Intelligence, 1999,22(6):557-562.
    [107] M Park, C Park, M Park. Algorithm for Detecting Human Faces Based on Convex-hull. Optics Express, 2001,10(6):274-279.
    [108] S L Phung, A Bouzerdoum, D Chai. A Novel Skin Color Model in YCbCr Color Space and its Application to Human Face Detection. In Proceedings of IEEE International Conference on Image Processing, 2002, Vol.1: 289-292.
    [109] C Wang, M Brandstein. Multi-Source Face Tracking with Audio and Visual Data. In Proceedings of IEEE International Workshop on Multimedia Signal Processing, 1999:169-174.
    [110] D Brown, I Craw, J Lewthwaite. A Som based Approach to Skin Detection with Application in Real Time Systems. In Proceedings of the British Machine Vision Conference, 2001: 491-500.
    [111] G R Bradski. Computer Vision Face Tracking as a Component of a Perceptual User Interface. In Proceedings of IEEE Workshop on Applications of Computer Vision, 1998: 214-219.
    [112] N Dalai, B Triggs. Histograms of Oriented Gradients for Human Detection. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, Vol.2: 886-893.
    [113] J Davis, M Keck. A Two-Stage Approach to Person Detection in Thermal Imagery. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2005, Vol. 1:364-369.
    [114] J Davis, V Sharma. Fusion-Based Background-Subtraction using Contour Saliency. In Proceedings of IEEE International Workshop on Object Tracking and Classification Beyond the Visible Spectrum, 2005, Vol.3:11-11.
    [115] R Collins, Y Liu, M Leordeanu. Online Selection of Discriminative Tracking Features. In Proceedings of Ninth IEEE International Conference on Computer Vision, 2003, Vol. 1:346-352.
    [116] R Collins, Y Liu, M Leordeanu. Online Selection of Discriminative Tracking Features. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10): 1631-1643.
    [117] J Wang, X Chen, W Gao. Online Selecting Discriminative Tracking Features Using Particle Filter. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, Vol.2: 1037-1042.
    [118] B Han, L Davis. Object Tracking by Adaptive Feature Extraction. In Proceedings of the 11th International Conference on Image Processing, 2004, Vol.3:1501-1504.
    [119] B Han, L Davis. Robust Observations for Object Tracking. In Proceedings of the 12th International Conference on Image Processing, 2005, Vol.2:442-445.
    [120] H Wang, D Suter. Efficient Visual Tracking by Probabilistic Fusion of Multiple Cues. Proceedings of 18th International Conference on Pattern Recognition, 2006, Vol.4: 892-895.
    [121] J Triesch, C Malsburg. Democratic Integration: Self-Organized Integration of Adaptive Cues. Neural Computation, 2001,13 (9): 2049-2074.
    [122] M Spengler, B Schiele. Towards Robust Multi-Cue Integration for Visual Tracking. Machine Vision and Applications, 2003, 14 (1): 50-58.
    [123] C Shen, A Hengel, A Dick. Probabilistic Multiple Cue Integration for Particle Filter Based Tracking. In Proceedings of the VIIth Digital Image Computing: Techniques and Applications, 2003:399-408.
    [124] G Loy, L Fletcher, N Apostoloff, et al. An Adaptive Fusion Architecture for Target Tracking. In Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition, 2002:261-266.
    [125] P Brasnett, L Mihaylova, D Bull, et al .Sequential Monte Carlo Tracking By Fusing Multiple Cues in Video Sequences. Image and Vision Computing (2006), doi:10.1016/j.imavis.2006.07. 017.
    [126] O Masoud, N Papanikolopoulos. A Method for Human Action Recognition. Image and Vision Computing, 2003, 21(8):729-743.
    [127] V Parameswaran, R Chellappa. Human Action-Recognition Using Mutual Invariants. Computer Vision and Image Understanding, 2005, 98(2): 294-324.
    [128] E Ramasso, M Rombaut, D Pellerin. State Filtering and Change Detection Using TBM Conflict Application to Human Action Recognition in Athletics Videos. IEEE Transactions on Circuits and Systems for Video Technology, 2007, 17(7):944-949.
    [129] L Zelnik-Manor, M Irani. Statistical Analysis of Dynamic Actions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006,28(9):1530-1535.
    [130] A Yilma, M Shah. Recognizing Human Actions in Videos Acquired by Uncalibrated Moving Cameras. In Proceedings of the 10th IEEE International Conference on Computer Vision, 2005, Vol.l:150-157.
    [131] C Schuldt, I Laptev, B Caputo. Recognizing Human Actions: a Local SVM Approach. In Proceedings of the 17th International Conference on Pattern Recognition, 2004, Vol.3:32-36.
    [132] H Ren, G Xu. Human Action Recognition in Smart Classroom. In Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition, 2002:399-404.
    [133] M Blank, L Gorelick, E Shechtman, et al. Actions as Space-Time Shapes. In Proceedings of the Tenth IEEE International Conference on Computer Vision, 2005, Vol.2:1395-1402.
    [134] Y Yacoob, M Black. Parameterized Modeling and Recognition of Activities. Computer Vision Image Understanding, 1999, 73 (2):232-247.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700