基于图像的视频事件分析方法

英文题名：Analysis Methods of Video Events Based on Images
作者：王威
论文级别：博士
学科专业名称：信息与通信工程
中文关键词：视频序列 ; 视频事件检测 ; 奇异性检测 ; 运动目标检测 ; 运动目标跟踪 ; 层次化混合高斯模型 ; 自适应水平集方法 ; 属性选择
英文关键词：video sequence ; video events detection ; abnormality detection ; moving targets detection ; moving targets tracking ; local hierarchical Gaussian Mixture Model(LHGMM) ; adaptive level set(ALS) method ; characters selection
学位年度：2010
导师：王润生
学科代码：081001
学位授予单位：国防科学技术大学
论文提交日期：2009-12-01

摘要

视频处理技术在科学研究和工程应用上有着十分诱人的前景。视频设备的连续工作,产生了大量需要处理的数据。如何对视频事件进行快速而准确的分析是一个值得重点关注的热点问题。论文针对视频事件分析的相关方法进行了深入研究,主要包括:视频序列段落划分、运动目标检测与跟踪和视频语义事件检测。视频序列中的事件主要由两方面原因引起:一是目标的运动或变化情况,二是场景整体的变化情况。因此相应的研究也从两方面展开,分别是基于目标属性约束的视频事件检测和基于复杂条件约束的足球感兴趣事件检测。
     在视频段落划分方面,提出了一种基于帧间信息的视频段落划分方法,可以从视频序列中检测出时空联合分布上和序列整体平均特性明显不一致的段落。该方法选择颜色变化信息、运动变化信息和运动变化率来描述场景的变化和场景中目标的变化,并对长时间视频序列进行段落划分。方法不需要提取关键帧和运动目标,克服了一般镜头检测方法处理静态背景视频时出现的效果下降问题,并且提高了处理效率。
     在运动目标检测与跟踪方面,提出了一个既适用于静态背景又适用于动态背景的目标检测和跟踪处理框架。框架由三个部分组成:基于连续帧差的背景类型确定,采用粒子滤波方法的目标跟踪,以及采用适合具体背景类型策略的目标精确检测。在静态背景中,提出了一种改进的自上而下的局部层次化混合高斯模型算法(LHGMM)进行目标精确检测,由于方法在局部区域内进行相应计算,可以提高准确性和处理效率。在动态背景中,提出了一种自适应水平集(ALS)方法进行目标轮廓精确检测。方法能够自动确定零水平集并在特定区域内进行曲线演化,使目标轮廓检测更加精确。
     在基于目标属性约束的视频事件检测方面,首先提出一种基于模糊粗糙集的属性选择方法。该方法能够结合具体应用背景,合理选择视频处理过程需要的特征,并建立事件的时空联合描述。然后研究了三种具体视频场景下的视频事件分析方法:(1)提出一种基于区域特征的过路行人异常行为检测方法,利用背景区域分割信息和目标区域变化信息检测过路行人的异常行为,可以满足实时性处理要求。(2)提出一种基于轮廓特征的人体姿态分析方法,利用轮廓特征的周期特性对人体姿态进行分类。(3)提出一种基于运动特征的交通路口视频事件检测方法,通过对运动轨迹的自动学习分析运动轨迹的区域和方向,并检测异常行为事件,实现监控视频的在线处理。
     在基于复杂条件约束的足球事件检测方面,首先提出一种基于层次化分类树模型的视频片段分类方法。该方法仅利用简单的低层特征,就能够对提取的视频片段实现迅速而有效的分类。在片段分类的基础上,提出一种基于时间结构信息的足球感兴趣事件检测方法。方法利用足球视频事件特定的时间结构信息与运动等低层特征相结合检测预先设定的感兴趣足球事件。由于检测方法直接利用场景条件约束,避免了难以实现的大量运动目标的检测与跟踪,提高了处理质量与效率。
The technology of video processing has promising future in science research and engineering application. Due to the continuous working of the equipments, a lot of video sequences are produced and need dealing with. How to analyze the video events rapidly and exactly is a hotspot which captures much attention. The thesis focuses on the analysis methods of the video events. The research includes the following aspects: division of the video sequence, moving targets detection and the tracking, and semantic events detection. Generally there are two kinds of reasons which cause the events: the motion or the change of the targets, and the change of the whole scene. So the corresponding research also outspread in two sides, namely events detection based on the restrictions of the targets’characters and events detection based on the restrictions of complex conditions in soccer games.
     In division of the video sequences, a novel inter-frame-information-based approach is proposed to detect the sections whose spatio-temporal distribution is obviously different from the average distribution of the whole video sequence. In this approach, the color information, motion information and moving ratio information are used to describe both the change of the scene and the changes of the targets in the scene, and divide the long video sequence into sections. This approach does not need to extract the key frames, and overcomes the ineffective flaw of the shot boundary detection methods when the background is stable. The processing efficiency is also improved at the same time.
     In the moving targets detection and tracking, a novel framework is proposed to process the video when the background is either stable or dynamic. It consists of three aspects: background type recognition based on difference information of the adjacent frames, targets tracking with the color-based particle filter, and objects precise detection by using the strategies which is suitable for special background types. In stable background, a top-to-bottom local hierarchical GMM (LHGMM) is proposed to detect the targets accurately. This approach only detects the targets in local regions and can improve both the veracity and the efficiency. On the other hand, when the background is dynamic, an adaptive level set (ALS) method is proposed to get the precise external contour of the targets. This method can set the zero level set automatically and evolution the level set curve in given areas. So this method can get the accurate external contour easily.
     In the events detection based on the restrictions of the targets’characters, the characters selection approach based on fuzzy-rough techniques is firstly proposed. This approach can be used to select the characters combined with the application conditions, and describe the events by using the spatio-temporal information. Then, the events analysis is carried out in three special aspects: (1) A region-based abnormal behaviors detection approach of the road-across pedestrian is proposed to detect the abnormality by using both the region-based segmentation information and the change information of the targets. This approach can satisfy the on-line video process requirement. (2) A contour-based approach is proposed to analyze the human poses. This approach uses the periodic characters of the moving human to categorize the human poses. (3) A motion-based detection approach of the video events in the traffic crossing scene is proposed to detect the abnormal behaviors by using the moving trajectories. With the self-acting study on moving trajectories, the approach can get the regions and the moving direction of the moving trajectories, and detect the abnormal behaviors. So the surveillance systems can work on-line.
     In the events detection based on the restrictions of the complex conditions in soccer games, a hierarchical classification tree is proposed to classify the video clips rapidly and effectively only by using simple low-level characters. Then based on the clips classification, a temporal structures-based approach is proposed. This approach can detect the prior-defined exciting events by using both fixed temporal structure of clips and the low-level characters, such as motion vector and so on. Neither the tracking of the targets nor the prior training is necessary because this approach uses the restrictions of the scene directly, and this can improve both the processing quality and the efficiency.

引文

[1]孙即祥.图像处理[M].北京:科学出版社, 2004.
    [2]王润生.图像理解[M].国防科技大学出版社, 1998.
    [3] Zelnik-Manor L and Irani M. Statistical Analysis of Dynamic Actions [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(9): 1530-1535.
    [4] Coilins R, Lipton A and Kanade T. A System for Video Surveillance and Monitoring:Vsam Final Report[Cmuri-Tr-00-12] [R]. Robotic Institute Carnegie Mellon University, 2000.
    [5] Coilins R, Lipton A and Kanade T. Introduction to the Special Section on Video Surveillance [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 745-746.
    [6] Romagnino P, Tan T and Baker K. Multi-Agent Visual Surveillance of Dynamic Scenes [J]. Image and Vision Computing, 1998, 16(8): 529-532.
    [7] Naylor M and Attwood C I. Annotated Digitai Video for Inlelligent Surveiilance and Optimized Retrieval: Final Report [R]. ADVISOR Consortium, 2003.
    [8]凌志刚,赵春晖,彦,潘泉,王燕.基于视觉的人行为理解综述[J].计算机应用研究, 2008, 25(9): 2570-2578.
    [9]秦莉娟.基于内容的自动视频监控研究[D].杭州:浙江大学, 2006.
    [10] Regazzoni C and Foresti G. Scanning the Issue/Technology: Special Issue on Video Communications, Processing and Understanding for Third Generation Surveillance Systems [M]. Proceedings of the IEEE, 2001.
    [11] Regazzoni C, Vernazza G and Fabri(Editor) G. Advanced Video-Based Surveillance Systems [M]. Kluwer Academic Publishers, 1999.
    [12] Diamantopoulos G and Spann M. Event Detection for Intelligent Car Park Video Surveillance [J]. Real-Time Imaging, 2005, 11: 233-243.
    [13] Polana R and Nelson R. Detecting Activities [C]. IEEE Conference on Computer Vision and Pattern Recognition, 1993.
    [14] Wang T, Li J, Diao Q, et al. Semantic Event Detection Using Conditional Random Fields [C]. International Conference on Computer Vision and Pattern Recognition Workshop (CVPRW' 06), 2006.
    [15] Duan L, Xu M, Chua T S, et al. A Mid-Level Representation Framework for Semantic Sports Video Analysis [C]. ACM Multimedia Conference, 2003.
    [16] Wang J, Xu C, E.Chng, et al. Automatic Replay Generation for Soccer Video Broadcasting [C]. ACM Multimedia Conference, 2004.
    [17] Ekin A, Tekalp A M and Mehrotr R. Automatic Soccer Video Analysis and Summarization [J]. IEEE Transactions on Image processing, 2003, 12(7): 796-807.
    [18] Phung D Q, Duong T V, S.Venkatesh, et al. Topic Transition Detection Using Hierarchical Hidden Markov and Semi-Markov Models [C]. ACM multimedia, 2005: 11-20.
    [19] Zhang D, Perez D G, Bengio S, et al. Semisupervised Adapted Hmms for Unusual Event Detection [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005: 611-618.
    [20] Lafferty J, McCallum A and Pereira F. Conditional Random Fields: ProbabilisticModels for Segmenting and Labeling Sequence Data [C]. International Conference on Machine Learning, 2001: 282-289.
    [21] Hu W, Tan T, Wang L, et al. A Survey on Video Surveillance of Object Motion and Behaviors [J]. IEEE Transactions on Systems, Man, and Cybernetics-PART C: Applications and Reviews, 2004, 34(3): 334-351.
    [22]丁忠校.视频监控图像的运动目标检测方法综述[J].电视技术, 2008, 32(5): 72-76.
    [23]侯志强,韩崇昭.视觉跟踪技术综述[J].自动化学报, 2006, 32(4): 603-617.
    [24]王亮,胡卫明,谭铁牛.人运动的视觉分析综述[J].自动化学报, 2002, 25(3): 225-237.
    [25] Coifman B, Beymer D, Mclauchlan P, et al. A Real-Time Computer Vision System for Vehicle Tracking and Traffic Surveillance [J]. Transportation Research Part C, 1998, 6(4): 271-288.
    [26]谷军霞,丁晓青,王生进.行为分析算法综述[J].中国图象图形学报, 2009, 14(3): 377-387.
    [27] Bobick A F and Davis J W. The Recognition of Human Movement Using Temporal Templates [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(3): 257-267.
    [28] Polana R and Nelson R. Low Level Recognition of Human Motion [C]. IEEE Work shop on Motion of Non-Rigid and Articulated Objects, Austin, 1994: 77-82.
    [29] Jezekiel B-A, Wang Z-q, Puvin P, et al. Human Activity Recognition Using Multidimensional Indexing [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(8): 1091-1104.
    [30] Junji Y, Jun O and Kenichiro I. Recognition Human Action in Time-Sequential Images Using Hidden Markov Model [C]. IEEE Conference on Computer Vision and Pattern Recognition, 1992: 397-385.
    [31] Robertson N and Reid I. Behavior Understanding in Videos: A Combined Method [C]. IEEE International Conference on Computer Vision, Beijing, 2005: 808-815.
    [32] Brand M, Oliver N and Pentland A. Coupled Hidden Markov Models for Complex Action Recognition [C]. IEEE Conference on Computer Vision and Pattern Recognition, San Juan,Puerto Rico, 1997: 994-999.
    [33] Hai-bing R and Guang-you X. Human Action Recognition with Primitive-Based Coupled-Hmm [C]. International Conference on Pattern Recognition, Quebec City, Canada, 2002: 494-498.
    [34] Murphy K. Dynamic Bayes Networks: Representation, Inference and Learning [D]. Berkeley: University of California, 2002.
    [35] Muncaster J and MA Y Q. Activity Recognition Using Dynamic Bayesian Networks Wilh Automatic State Selection [C]. IEEE Workshop on Motion and Video Computing, 2007.
    [36] Yang M H and Ahuja N. Recognizing Hand Gesture Using Motion Trajectories [C]. IEEE Intemational Conference on Computer Vision and Image Understanding, 1999.
    [37] Buccolieri F, Distante C and Leone A. Human Posture Recognition Using Active Contours and Radial Basis Function Neural Network [C]. Conference on Advanced Video and Signal Based surveillance, 2005.
    [38] HONG P, TURK M and HUANG T. Gesture Modeling and Recognition Using Finite State Machines [C]. IEEE Conference on Face and Gesture Recognition,2000.
    [39]王泽兵,陈朝晖.彩色图像分割技术研究[J].数字电视与数字视频, 2005, 274: 21-24.
    [40] Cheng H D, Jiang X H, Sun Y, et al. Color Image Segmentation: Advances and Prospects [J]. Pattern Recognition, 2001, 34: 2259-2281.
    [41] Orchard M T and Bouman C A. Color Quantization of Images [J]. IEEE Transactions on Signal Processing, 1991, 39(12): 2677-2690.
    [42] Comaniciu D and Meer P. Robust Analysis of Feature Spaces: Color Image Segmentation [C]. IEEE Conference on Computer Vision and Pattern Recognition, 1997: 750-755.
    [43] Pietikainen M. Accurate Color Discrimination with Classification Based on Feature Distributions [C]. International Conference on Pattern Recognition, C, 1996: 833-838.
    [44] Littmann E and Ritter H. Adaptive Color Segmentation-a Comparison of Neural and Statistical Methods [J]. IEEE Transactions on Neural Network, 1997, 8(1): 175-185.
    [45] Carron T and Lambert P. Color Edge Detector Using Jointly Hue, Saturation and Intensity [C]. IEEE International Conference on Image Processing, Austin, USA, 1994: 977-1081.
    [46] Kim W S and Park R H. Color Image Palette Construction Based on the Hsi Color System for Minimizing the Reconstruction Error [C]. IEEE International Conference on Image Processing, C, 1996: 1041-1044.
    [47] Golland P and Bruckstein A M. Why R.G.B.? Or How to Design Color Displays for Martians, Graphical Models [J]. Image Processing, 1996, 58(5): 405-412.
    [48]胡浩,王明照,杨杰.自适应模糊加权均值滤波器[J].系统工程与电子技术, 2002, 24(2): 15-17.
    [49]柯丽,杜强,苏哲.多级维纳滤波的oct图像除噪方法[J].光学精密工程, 2008, 16(4): 740-744.
    [50] Wang X. Adaptive Multistage Median Filter [J]. IEEE Transactions on Signal Processing, 1992, 40(4): 1015-1017.
    [51] Song H, Wang G and Zhao X. A New Adaptive Multistage Median Filter [C]. IEEE 6th International Conference on Parallel and Distributed Computing, Applications and Technologies, 2005: 826-828.
    [52]关新平,赵立兴,唐英干.图像去噪混合滤波方法[J].中国图象图形学报, 2005, 10(3): 332-337.
    [53]金良海,姚行中,李德华.彩色图像矢量滤波技术综述[J].中国图象图形学报, 2009, 14(2): 243-254.
    [54] Astola J, Haavisto P and Neuvo Y. Vector Median Filters [J]. Proceedings of the IEEE, 1990, 78(4): 678-689.
    [55] Trahanias P E and Venetsanopoulos A N. Vector Directional Filters: A New Class of Multichannel Image Processing Fillers [J]. IEEE Transactions on Image Processing, 1993, 2(4): 528-534.
    [56] Trahanias P E, Karakos D G and Venetsanopoulog A N. Directional Processing of Color Images: Theory and Experimental Results [J]. IEEE Transactions on Image Processing, 1996, 5(6): 868-880.
    [57] Karakss D G and Trahanias P E. Generalized Multichannel Image-Filtering Structures [J]. IEEE Transactions on Image Processing, 1997, 6(7): 1038-1045.
    [58] Gabhouj M and Cheickh F A. Vector Median-Vector Directional Hybrid Filter for Color Image Restoration [C]. The European Signal Processing Conference, 1996: 879-881.
    [59] Platanioths K N and Venetsanopoalos A N. Color Image Processing and Application [M]. Berlin: Springer, 2000.
    [60] Plstuaiofis K N, Androutsos D and Venetsanopoulos A N. Color Image Processing Using Adaptive Vector Directional Filters [J]. IEEE Transactions on Circuits and SystemsⅡ:Analog and Digital Signal Processing, 1998, 45(10): 1414-1419.
    [61] Shen Y and Barner K E. Fast Adaptive Optimization of Weighted Vector Median Filters [J]. IEEE Transations on Signal Processing, 2006, 54(7): 2497-2510.
    [62] Smolka B. Efficient Modifcation of the Central Weighted Vector Median Filter [J]. Lecture Notes in Computer Science, 2002, 2449: 166-173.
    [63] Lukac R and Marchevsky S. Adaptive Vector Lum Smoother [C]. IEEE International Conference on Image Processing, Thessaloniki, Greece, 2001: 878-881.
    [64] Lukac R. Adaptive Vector Median Filtering [J]. Pattern Recognition Letters, 2003, 24(12): 1889-1999.
    [65] Jin L and Li D. An Efficient Color Impulse Detector and Its Application to Color Images [J]. IEEE Signal Processing Letters, 2007, 14(6): 397-400.
    [66] Smolka B, Lukac R, Chydzinaki A, et al. Fast Adaptive Similarity Based Impulsive Noise Reduction Filter [J]. Real-Time Imaging, 2003, 9(4): 261-276.
    [67]杜振华,张艳宁,郑江滨,袁和金.基于提升框架的实时视频降噪方法[J].计算机应用研究, 2007, 27(3): 666-668.
    [68] Dekeyser F. Spatial-Temporal Wiener Filtering of Image Sequences Using a Parametric Model [C]. International Conference on Image Processing, 2000: 1586-1589.
    [69] Boo K J. A Motion-Compensated Spatial-Temporal Filter for Image Sequence with Signal Dependent Noise [J]. IEEE Transactions on Circuit s and Systems for Video Technology, 1998, 8: 287-298.
    [70]李岩,乔彦峰,高岩,孙志远,高丰端.基于运动补偿的自适应时域视频降噪算法研究[J].半导体光电, 2007, 28(5): 747-750.
    [71] Vidakovie B and Lozoya C B. On Time-Dependent Wavelet Denoising [J]. IEEE Transactions on Signal Processing, 1998, 46(9): 2549-2551.
    [72] Serra J. Morphological Filtering: An Overview [J]. Signal Processing, 1994, 38(1): 3-11.
    [73] Hawkins D. Identification of Outliers [M]. London: Chapman and Hall, 1980.
    [74] Hanjalic A. Shot-Boundary Detection: Unraveled and Resolved? [J]. IEEE Transactions on Circuit s and Systems for Video Technology, 2002, 12(2): 90-105.
    [75] Zhang H J, Low C Y and Smoliar S W. Video Parsing, Retrieval and Browsing: An Integrated and Content-Based Solution [C]. ACM Multimedia, 1995: 15-24.
    [76] Zabih R, Miller J and Mai K. A Feature-Based Algorithm for Detecting and Classifying Scene Breaks [C]. ACM Multimedia, 1995: 189-200.
    [77] Bouthemy P, Gelgon M and Ganansia F. A Unified Approach to Shot Change Detection and Camera Motion Characterization [J]. IEEE Transactions on Circuit s and Systems for Video Technology, 1999, 9: 1030-1044.
    [78] Yuan J, Wang H, Xiao L, et al. A Formal Study of Shot Boundary Detection [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2007, 17:168-186.
    [79] Zheng W, Yuan J, Wang H, et al. A Novel Shot Boundary Detection Framework [J]. Proc. of SPIE, 5960: 410-420.
    [80]李和平,胡占义,吴毅红,吴福朝.基于半监督学习的行为建模与异常检测[J].软件学报, 2007, 18(3): 527-537.
    [81] Stricker M and Orengo M. Similarity of Color Images [J]. Proceedings of SPIE Storage and Retrieval for Image and Video Databases, 1995, 2420: 381-392.
    [82] Zhou J and Zhang X-P. Video Shot Boundary Detection Using Independent Component Analysis [C]. International Conference on Acoustics, Speech, and Signal Processing, 2005.
    [83] Shawe-Taylor J and C. Williams e a. On the Eigenspectrum of the Gram Matrix and Its Relationship to the Operator Eigenspectrum [C]. The 13th International Conference on Algorithmic Learning Theory, Springer-Verlag, 2002: 23-40.
    [84] Li W X. The Research of Distributing Model of Blast Furnace Gas Flow Based on Data Mining [D]. Northeastern University, 2003.
    [85] Yang W and Zhang T. A New Method for the Detection of Moving Targets in Complex Scenes [J]. Computer Research & Development, 1998, 35(8): 724-728.
    [86]刘谦雷,杨绿溪,邹采荣.用于视频镜头突变切换检测的二次差分法和像素点匹配二次差分法[J].中国图象图形学报, 2003, 8(2): 161-168.
    [87]周艺华,曹元大,张龙飞,张洪欣.基于二次帧差与窗口最大值的镜头边界检测方法[J].北京理工大学学报, 2005, 25(11): 949-953.
    [88]边肇祺.模式识别[M].北京:清华大学出版社, 2003.
    [89] Zhang D, Gatica-Perez D, Bengio S, et al. Semi-Supervised Adapted Hmms for Unusual Event Detection [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005: 611-618.
    [90] Paragions N and Tziritas C. Detection and Location of Moving Objects Using Deterministic Relacation Algorithms [C]. International Conference on Pattern Recognition, 1996: 201-205.
    [91] Thakoor N and Gao J. Automatic Video Object Shape Extraction and Its Classification with Camera in Motion [C]. IEEE International Conference on Image Processing, 2005: 437-440.
    [92]李晓亮.基于小波变换和数学形态学的运动物体检测[J].南昌大学学报, 2006, 30(6): 624-626.
    [93] Rajagopalan R, Orchard M T and Brandt R D. Motion Field Modeling for Video Sequences [J]. IEEE Transactions on Image Processing, 1997, 6(11): 1503-1516.
    [94] Altunbasak Y, Mersereau R M and Patti A J. A Fast Parametric Motion Estimation Algorithm with Illumination and Lens Distortion Correction [J]. IEEE Transactions on Image Processing, 2003, 12(4): 395-408.
    [95] Criminisi A and Gross G. Bilayer Segmentation of Live Video [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2006: 53-60.
    [96] Haritaoglu I, Harwood D and Davis L. W4: Real-Time Surveillance of People and Their Activities [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 809-830.
    [97] Hayman E and Eklundh J-O. Statistical Background Subtraction for a Mobile Observer [C]. International Conference on Computer Vision, 2003: 67-74.
    [98] Liu Z and Sarkar S. Effect of Silhouette Quality on Hard Problems in Gait Recognition [J]. IEEE Transacions on Systems, Man and Cybernetics-Part B, 2005,35(2): 170-183.
    [99] Parag T, Elgammal A and Mittal A. A Framework for Feature Selection for Background Subtraction [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2006: 1916-1923.
    [100] Stauffer C and Grimson W E L. Learning Pattern of Activity Using Real-Time Tacking [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8): 246-252.
    [101] Smith K, Ba S O, Odobez J-M, et al. Tracking the Visual Focus of Attention for a Varying Number of Wandering People [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(7): 1212-1229.
    [102] Friedman N and Russell S. Image Segmentation in Video Sequences: A Probabilistic Approach [C]. The 13th Conference on Uncertainty in Artificial Intelligence, 1997.
    [103] Osher S and Sethian J A. Fronts Propagating with Curvature Dependent Speed: Algorithms Based on Hamilton-Jacobi Formulation [J]. Journal of Computer Physics, 1988, 79(1): 12-49.
    [104] Chang C J, Hsieh J W, Chen Y S, et al. Tracking Multiple Moving Objects Using a Level-Set Method [J]. International Journal of Pattern Recognition and Artificial Intelligence, 2004, 18(2): 101-125.
    [105]王长安,朱善.基于统计模型和活动轮廓的运动目标检测与跟踪[J].浙江大学学报(工学版), 2006, 40(2): 249-253.
    [106] Rogers S K, Colombi J M, Mariin C E, et al. Neural Networks for Automatic Target Recognition [J]. Neural Networks, 1995, 18(7): 1153-1184.
    [107]王哲,常发亮.一种基于立体视觉的运动目标检测算法[J].计算机应用, 2006, 26(11): 2724-2726.
    [108] Aggarwal J K and Nandhakumar N. On the Computation of Motion from Sequences of Images-a Review [J]. Proceedings of the IEEE, 1988, 76(8): 917-935.
    [109] Bascle B and Beriche R. Region Tracking through Image Sequences [C]. IEEE International Conference of Computer Vision, 1995: 302-307.
    [110] Birchfield S T and Rangarajan S. Spatiograms Versus Histograms for Region-Based Tracking [C]. IEEE Conference on Coputer Vision and Pattern Recognition, 2005: 1158-1163.
    [111] Stauffer C and Grimson W E L. Adaptive Background Mixture Models for Real-Time Tracking [C]. IEEE Conference of Computer Vision and Pattern Recognition, 1999: 246-252.
    [112] Isard M and Blake A. Condensation-Conditional Density Propagation for Visual Tracking [J]. International Journal of Computer Vision, 1996, 28(1): 5-28.
    [113] Kaneko T and Hori O. Feature Selection for Reliable Tracking Using Template Matching [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2003: 796-802.
    [114] MItra P, Murthy C and Pal S. Unsupervised Feature Selection Using Feature Similarity [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(3): 301-312.
    [115] Tissainayagam P and Surer D. Object Tracking in Image Sequences Using Point Feature [J]. Pattern Recognition, 2005, 38(1): 105-113.
    [116] Jain A K, Zhong Y and Lakshmanan. Object Matching Using Deformable Templates [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,1996, 18(3): 267-278.
    [117] Freedman D and Zhang T. Active Contours for Tracking Distributions [J]. IEEE Transactions on Image Processing, 2001, 10(10): 1467-1475.
    [118] Lee B H, Choi I and Jeon G J. Motion-Based Moving Object Tracking Using an Active Contour [C]. IEEE International Conference on Acoustics, Speech and Signal Processing, 2006: 649-652.
    [119] Luo H and Eleftheriadis A. Model-Based Segmentation and Tracking of Head-and-Shoulder Video Objects for Real Time Multimedia Services [J]. IEEE Transactions on Multimedia, 2003, 5(3): 379-389.
    [120] Balan A O and Black M J. An Adaptive Appearance Model Approach for Model-Based Articulated Object Tracking [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2006: 758-765.
    [121] Aggarwal J K and Cai Q. Human Motion Analysis: A Review [J]. Computer Vision and Image Understanding, 1999, 73(1): 82-98.
    [122] Gordon N J, Salmond D J and Smith A A M. A Novel Approach to Nonlinear/ Non- Gaussian Bayesian State Estimation [J]. IEE Proceedings on Radar and Signal Processing, 1993, 140(2): 107-113.
    [123] D.Crisan and Doucet A. A Survey of Convergence Results on Particle Filtering Methods for Practitioners [J]. IEEE Transactions on Speech and Audio Processing, 2002, 10(3): 173-185.
    [124] Liu J S and Chen R. Sequential Monte Carlo Methods of Dynamic System [J]. Journal of American Statistician, 1998, 83: 1032-1044.
    [125] Power P W and Schoonees J A. Understanding Background Mixture Models for Foreground Segmentation [C]. Proceedings of Image and Vision Computing, 2002: 267-271.
    [126]杨新.图像偏微分方程的原理与应用[M].上海交通大学出版社, 2003.
    [127] Sethian D A and Sethian J A. A Fast Level Set Method for Propagating Interfaces [J]. Journal of Computer Physics, 1995, 118: 269-277.
    [128] Sethian J A. A Fast Marching Level Set Method for Monotonically Advancing Fronts [J]. Proceedings of National Academy Sciences, 1996, 93: 1591-1608.
    [129] Paragios N and Derche R. Goedesic Active Contours and Level Set for the Detection and Tracking of Moving Objects [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(3): 266-280.
    [130] Bouttefroy P L M, Bouzerdoum A, Phung S L, et al. Vehicle Tracking Using Projective Particle Filter [C]. International Conference on Advanced Video and Signal Based Surveillance, 2009: 7-12.
    [131] Bouaynaya N and Schonfeld D. On the Optimality of Motion-Based Particle Filtering [J]. IEEE Transactions on Circuit s and Systems for Video Technology, 2009, 19(7): 1068-1072.
    [132] Wren C R. Pfinder: Real-Time Tracking of the Human Body [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(7): 780-785.
    [133] Francois A and Medioni G G. Adaptive Color Background Modeling for Real-Time Segmentation of Video Streams [C]. International Conference on Imaging Science, Systems, and Technology, 1999: 227-232.
    [134] Zhou Y and Tao H. A Background Layer Model for Object Tracking through Occlusion [C]. IEEE International Conference on Computer Vision, 2003: 1079-1085.
    [135]李斌,钟润添,王先基,庄镇泉.一种基于递增估计gmm的连续优化算法[J].计算机学报, 2007, 30(6): 979-985.
    [136] Yang S A and Hsu C T. Background Modeling from Gmm Likelihood Combined with Spatial and Color Coherency [C]. IEEE International Conference on Image Processing, 2006: 2801-2803.
    [137] Sanderson C, Gibbins D and Searle S. On Statistical Approaches to Target Silhouette Classification in Difficult Conditions [J]. Digital Signal Processing, 2008, 18: 375–390.
    [138] Wang L and Suter D. Visual Learning and Recognition of Sequential Data Manifolds with Applications to Human Movement Analysis [J]. Computer Vision and Image Understanding, 2008, 110: 153-172.
    [139] Maggio E and Cavallaro A. Learning Scene Context for Multiple Object Tracking [J]. IEEE Transactions on Image Processing, 2009, 18(8): 1873-1884.
    [140] Malladi R, Sethian J A and Vemuri B C. Shape Modeling with Front Propagation: A Level Set Approach [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, 17(2): 158-175.
    [141] Zhang T and Freedman D. Improving Performance of Distribution Tracking through Background Mismatch [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(2): 282-287.
    [142] Shi Y and Karl W C. Real-Time Tracking Using Level Sets [C]. IEEE Conference on Coputer Vision and Pattern Recognition, 2005: 34-41.
    [143] Joshi N and Brady M. Non-Parametric Mixture Model Based Evolution of Level Sets [C]. International Conference on Computing: Theory and Applications, 2007.
    [144] Bernard O, Friboulet D, Thevenaz P, et al. Variational B-Spline Level-Set Method for Fast Image Segmentation [C]. IEEE International Symposium on Biomedical Imaging, 2008: 177-180.
    [145] Mumford D and Shah J. Optimal Approximation by Piecewise Smooth Functions and Associated Variational Problems [J]. Communication Pure Apply of Mathematics, 1989, 42(4): 577-685.
    [146] Chan T F and Vese L A. Active Contours without Edges [J]. IEEE Transactions on Image Processing, 2001, 10(2): 266-277.
    [147] Li C, Xu C Y, Gui C F, et al. Level Set Evolution without Re-Initialization: A New Variational Formulation [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005.
    [148] Silveira M and Marques J S. Level Set Segmentation of Dermos Copy Images [C]. IEEE International Symposium on Biomedical Image, 2008: 173-176.
    [149] Flenner A. Finding Edge Features Using the Fast Level Set Transform and the Helmholtz Principle [C]. Southwest Symposium on Image Analysis & Interpretation, 2008: 9-12.
    [150] Almeida J and Araujo R. Tracking Multiple Moving Objects in a Dynamic Environment for Autonomous Navigation [C]. IEEE International Workshop on Advanced Motion Control 2008: 21-26.
    [151] Rittscher J, Krahnstoever N and Galup L. Multi-Target Tracking Using Hybrid Particle Filtering [C]. the Seventh IEEE Workshop on Applications of Computer Vision, 2005: 1-8.
    [152] Xiang T and Gong S. Incremental and Adaptive Abnormal Behaviour Detection [J]. Computer Vision and Image Understanding, 2008, 111: 59–73.
    [153] Oliver N, Rosario B and Pentland A. A Bayesian Computer Vision System for Modeling Human Interactions [J]. IEEE Transactions on Pattern Analysis andMachine Intelligence, 2000, 22 (8): 831-843.
    [154] Gong S and Xiang T. Recognition of Group Activities Using Dynamic Probabilistic Networks [C]. IEEE International Conference on Computer Vision, 2003: 742-749.
    [155] Duong T, Bui H, Phung D, et al. Activity Recognition and Abnormality Detection with the Switching Hidden Semi-Markov Model [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005: 838-845.
    [156] Dee H and Hogg D. Detecting Inexplicable Behaviour [C]. British Machine Vision Conference, 2004: 477-486.
    [157] Shet V, Harwood D and Davis L. Multivalued Default Logic for Identity Maintenance in Visual Surveillance [C]. European Conference on Computer Vision, 2006: 119-132.
    [158] Zhong H, Shi J and M. Visontai. Detecting Unusual Activity in Video [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2004: 819-826.
    [159] Hamid R, Johnson A, Batta S, et al. Detection and Explanation of Anomalous Activities: Representing Activities as Bags of Event N-Grams [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005: 1031-1038.
    [160] Boiman O and Irani M. Detecting Irregularities in Images and in Video [C]. IEEE International Conference on Computer Vision, 2005: 462-469.
    [161] Xiang T and S. Gong. Video Behaviour Profiling and Abnormality Detection without Manual Labelling [C]. IEEE International Conference on Computer Vision, 2005: 1238-1245.
    [162] Wang X, Tieu K and Grimson E. Learning Semantic Scene Models by Trajectory Analysis [C]. European Conference on Computer Vision, 2006: 111–123.
    [163] Wang Y, Huang K and Tan T. Human Activity Recognition Based on R Transform [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2007: 1-8.
    [164] Zhao H and Liu Z. Shape-Based Human Activity Recognition Using Edit Distance [C]. The 2nd International Congress on Image and Signal Processing, 2009: 1-4.
    [165] Ferreira J P, Crisóstomo M M and Coimbra A P. Human Gait Acquisition and Characterization [J]. IEEE Transactions on Instrumentation and Measurement, 2009, 58(9): 2979-2988.
    [166] Wada T and Matsuyama T. Multiobject Behavior Recognition by Event Driven Selective Attention Method [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22( 8): 873-887.
    [167] Ke Y, Sukthankar R and Hebert M. Efficient Visual Event Detection Using Volumetric Features [C]. International Conference on Computer Vision, 2005.
    [168] Lee C-K, Ho M-F, Wen W-S, et al. Abnormal Event Detection in Video Using N-Cut Clustering [C]. International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2006.
    [169] Zhou H and Kimber D. Unusual Event Detection Via Multi-Camera Video Mining [C]. International Conference on Pattern Recognition, 2006.
    [170] Itti L and Baldi P. A Principled Approach to Detecting Surprising Events in Video [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005.
    [171] Cristani M, Bicego M and Murino V. Audio-Visual Event Recognition in Surveillance Video Sequences [J]. IEEE Transactions on multimedia, 2007, 9(2): 257-267.
    [172] Foresti G L, Marcenaro L and Regazzoni C S. Automatic Detection and Indexingof Video-Event Shots for Surveillance Applications [J]. IEEE Transactions on Multimedia, 2002459-471.
    [173] Snoek J, Hoey J, Stewart L, et al. Automated Detection of Unusual Events on Stairs [J]. Image and Vision Computing, 2008, doi: 10.1016/ j.imavis.2008.04.021.
    [174] Snoek J, Hoey J, Stewart L, et al. Automated Detection of Unusual Events on Stairs [C]. the 3rd Canadian Conference on Computer and Robot Vision, 2006.
    [175] Chan M T, Hoogs A, Schmiederer J, et al. Detecting Rare Events in Video Using Semantic Primitives with Hmm [C]. IEEE International Conference on Pattern Recognition, 2004.
    [176] Chan M T, Hoogs A, Sun Z, et al. Event Recognition with Fragmented Object Tracks [C]. International Conference on Pattern Recognition, 2006.
    [177] Chan M T, Hoogs A, Bhotika R, et al. Joint Recognition of Complex Events and Track Matching [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2006.
    [178] Hakeem A and Shah M. Learning, Detection and Representation of Multi-Agent Events in Videos [J]. Artificial Intelligence, 2007, 171: 586–605.
    [179] Andrade E L, Blunsden S and Fisher R B. Modeling Crowd Scenes for Event Detection [C]. International Conference on Pattern Recognition, 2006.
    [180] Piciarelli C and Foresti G L. On-Line Trajectory Clustering for Anomalous Events Detection [J]. Pattern Recognition Letters, 2006, 27: 1835-1842.
    [181] Zhong D and Chang S F. Real-Time View Recognition and Event Detection for Sports Video [J]. Journal of Visual Communication and Image Representation, 2004, 15: 330-347.
    [182] Xiang T and Gong S. Video Behaviour Profiling for Anomaly Detection [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, in press.
    [183] Xiang T and Gong S. Activity Based Surveillance Video Content Modeling [J]. Pattern Recognition, 2008, 41: 2309-2326.
    [184] Pawlak Z. Rough Sets [J]. International Journal of Computer and Information Sciences, 1982, 11: 341-356.
    [185]张文修,仇国芳.基于粗糙集的不确定决策[M].清华大学出版社, 2005.
    [186]白根柱,裴志利,王建,孔英,刘丽莎.基于粗糙集理论和信息熵的属性离散化方法[J].计算机应用研究, 2008, 25(6): 1701-1703.
    [187]沈永红,王发兴.基于信息嫡的粗糙集属性离散化方法及应用[J].计算机工程与应用, 2008, 44(5): 221-224.
    [188] Wu Q, Bell D A, Prasad G, et al. A Distribution-Index-Based Discretizer for Decision-Making with Symbolic Ai Approaches [J]. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(1): 17-28.
    [189] Wong S K M and Ziarko W. On Optimal Decision Rules in Decision Tables [J]. Bulletin of Polish Academy of Sciences, 1985, 33(11/12): 693-696.
    [190] Hu Q, Yu D and Xie Z. Information-Preserving Hybrid Data Reduction Based on Fuzzy-Rough Techniques [J]. Pattern Recognition Letters, 2006, 27: 414-423.
    [191] Hu Q, Yu D, Xie Z, et al. Fuzzy Probabilistic Approximation Spaces and Their Information Measures [J]. IEEE Transactions on Fuzzy Systems, 2006, 14(2): 191-201.
    [192] Shan Y, Yang F and Wang R. Color Space Selection for Moving Shadow Elimination [C]. IEEE International Conference on Image and Graphics, 2007: 496-501.
    [193] Illingworth J and Kittlor J. A Survey of the Hough Transform [C]. Conference on Computer Vision and Image Processing, 1988, 44: 87-116.
    [194] Cheng Y X and Qi F H. Randomized Hough Transform Using Gradient Direction Information [J]. Journal of Infrared and Millimeter Wave, 1998, 17(5): 375-379.
    [195] Wang L, Hu W and Tan T. Recent Developments of Human Motion Analysis [J]. Pattern Recognition, 2003, 36: 585-601.
    [196]许建华,张学工译, Vladimir N.Vapnik著.统计学习理论[M].北京:电子工业出版社, 2004, 6.
    [197] Vapnik V N. The Nature of Statistical Learning Theory [M]. New York: Springer-Verlag, 1995.
    [198] Lee K K and Xu Y. Modeling Human Actions from Learning [C]. IEEE International Conference on Intelligent Robot Systems, 2004.
    [199] Shawe-Taylor J and Cristianini N.模式分析的核方法[M].北京:机械工业出版社, 2006.
    [200] Cristianini N and Shawe-Taylor J.支持向量机导论[M].北京:电子工业出版社, 2004.
    [201]任双桥.支撑矢量机理论与应用研究[D].国防科技大学, 2006.
    [202] Wu X, Ou Y, Qian H, et al. A Detection System for Human Abnormal Behavior [C]. IEEE Conference on Intelligent Robots and Systems, 2005.
    [203] Ou Y, Qian H, Wu X, et al. Real-Time Surveillance Based on Human Behavior Analysis [J]. International Journal of Information Acquisition, 2005, 2(4): 353-365.
    [204] Wang X, Ma X and Grimson W E L. Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(3): 539-555.
    [205]Horn B K P and Schunk B G. Determining Optical Flow [J]. Artificial Intelligence, 1981, 17(10): l 85-203.
    [206]王润生.信息融合[M].北京:科学出版社, 2007.
    [207] Sadlier D A and Connor N E O. Event Detection in Field Sports Video Using Audio-Visual Features and a Support Vector Machine [J]. IEEE Transactions on Circuit s and Systems for Video Technolegy, 2005, 15(10): 1225-1233.
    [208] Yu X and Farin D. Current and Emerging Topics in Sports Video Processing [C]. IEEE International Conference on Multimedia and Expo, 2005.
    [209]童晓峰,刘青山,卢汉清.体育视频分析[J].计算机学报, 2008, 31(7): 1242-1251.
    [210] Assfalg J, Bertini M, Colombo C, et al. Semantic Annotation of Sports Videos [J]. IEEE Transactions on Multimedia, 2002, 19: 52-60.
    [211] Wan K W, Yan X and Xu C. Automatic Mobile Sports Highlights [C]. IEEE International Conference on Multimedia and Expo, 2005.
    [212] Gong Y, Sin L, Chuan C, et al. Automatic Parsing of Tv Soccer Programs [C]. International Conference on Multimedia Computing and Systems, 1995: 167-174.
    [213] Li J, Wang T, Hu W, et al. Two-Dependence Bayesian Network for Soccer Highlight Detection [C]. IEEE International Conference on Multimedia and Expo, 2006: 1625-1628.
    [214] Huang Q, Hu J, Hu W, et al. A Reliable Logo and Replay Detector for Sport S Video [C]. IEEE International Conference on Multimedia and Expo, 2007: 1695-1698.
    [215] Wang J, Chng E and Xu C. Soccer Replay Detection Using Scene Transition Structure Analysis [C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004: 433-437.
    [216] Tong X, Wang T, Li W, et al. A Three-Level Scheme for Real-Time Ball Tracking [C]. Workshop on Multimedia Content Analysis and Mining., 2007: 161-171.
    [217] Wang L and Zeng B. Automatic Extraction of Semantic Colors in Sports Video [C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004: 617-620.
    [218] Liu J and Tong X. Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video [C]. the British Machine Vision Conference, 2007: 70-80.
    [219] Wan K, Yan X, Yu X, et al. Real-Time Goal-Mouth Detection in Mpeg Soccer Video [C]. ACM Multimedia, 2003: 311-314.
    [220] Jiang S, Liu H, Zhao Z, et al. Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis [C]. IEEE International Conference on Multimedia and Expo, 2007: 1475-1478.
    [221] Xie L, Chang S F, Divakaran A, et al. Structure Analysis of Soccer Video with Hidden Markov Models [C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002: 4096-4099.
    [222] Liu H, He T and Zhang H. Event Detection in Sports Video Based on Multiple Feature Fusion [C]. Fourth International Conference on Fuzzy Systems and Knowledge Discovery, 2007.
    [223] Chen S-C, Chen M, Zhang C, et al. Exciting Event Detection Using Multi-Level Multimodal Descriptors and Data Classification [C]. IEEE International Symposium on Multimedia, 2006: 193-200.
    [224] Ye Q, Huang Q, Gao W, et al. Exciting Event Detection in Broadcast Soccer Video with Mid-Level Description and Incremental Learning [C]. ACM Multimedia 2005: 455-458.
    [225] Ariki Y, Kubota S and Kumano M. Automatic Production System of Soccer Sports Video by Digital Camera Work Based on Situation Recognition [C]. IEEE International Symposium on Multimedia 2006.
    [226] Xu C, Wang J, Lu H, et al. A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video [J]. IEEE Transactions on Multimedia, 2008, 19(3): 421-436.
    [227] Chen C Y, Wang J C, Wang J F, et al. Event-Based Segmentation of Sports Video Using Motion Entropy [C]. International Symposium on Multimedia, 2007: 107-111.
    [228] Tjondronegoro D and Chen Y-P P. Using Decision-Tree to Automatically Construct Learned-Heuristics for Events Classification in Sports Video [C]. IEEE International Conference on Multimedia and Expo, 2006: 1465-1468.
    [229] Kolekar M H, Palaniappan K and Sengupta S. A Novel Framework for Semantic Annotation of Soccer Sports Video Sequences [C]. European Conference on Visual Media Production(CVMP2008), 2008: 1-9.
    [230] Zhang D, Raj R and .Chang S. General and Domain.Specific Techniques for Detecting and Recognizing Superimposed Text in Video [C]. IEEE International Conference on Image Processing, 2002: 593-596.
    [231] Yang X, Xue P and Tian Q. Repeated Video Clip Identification System [C]. ACM Multimedia, 2005: 227-228.
    [232] Tovinkere V. Detecting Semantic Events in Soccer Games: Towards a CompleteSolution [C]. IEEE International Conference on Multimedia and Expo, 2001: 1040-1043.
    [233] Benjamas N, Cooharojananone N and Jaruskulchai C. Flashlight and Player Detection in Fighting Sport for Video Summarization [C]. International Symposium on Communication and Information Technologies, 2005: 426-429.
    [234] Wang J, Xu C, Chng E, et al. Sports Highlight Detection from Keyword Sequences Using Hmm [C]. IEEE International Conference on Multimedia and Expo, 2004: 599-602.