用户名: 密码: 验证码:
视频对象分割技术及应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
在信息化的社会,视频内容因提供了最为丰富和全面的信息要素日益受到现代交通、网络媒体及计算机视觉等行业的青睐和重视。然而,原始视频往往含有的信息量非常大。其中部分甚至大部分信息对于行业具体应用而言意义不大。因此,如何提取有用信息,缩减视频的信息量就成为一个与实际应用紧密关联的重要问题。比如,在交通监控和安全监控视频中,使用视频对象分割技术将其中的主题信息(主要指运动对象信息)提取出来,才能更有效的对道路及生产环境安全进行监控。
     目前,视频对象分割技术就是最近几年发展起来的一种提取视频有效信息的重要基础性技术。该技术己广泛应用于交通流视频监控、工业自动化监控、安防、网络多媒体交互以及视频压缩编码等实际生产生活中。本文对视频对象分割技术及其应用进行了具体研究,并提出了一种基于高斯混合模型和小波理论的改进型的背景消减法。该方法有效削减了分割存在的噪声和空洞,提高了运动对象边缘的准确性。
     本文所做的主要工作如下:
     1.研究并实现了目前已有的多种分割算法。静态图像处理技术在视频分割中常用的工具及方法,如滤波方法、形态学算子及静态图像分割方法;视频对象分割技术,尤其是自动分割方法中的常用方法。
     2.研究并提出了一种改进型背景消减法。针对原有的基于高斯混合模型的背景消减方法存在噪声、空洞和边界分割不准确的不足。本文研究了一种由预分割与后处理组成的改进型背景消减法。其中,预分割基于高斯混合模型背景建模,并引入了图像的色彩模型和对比度模型,色彩模型主要用于提取每帧中大致的前景点,对比度模型主要用于确定前景和背景的边缘;后处理综合运用了中值滤波、小波理论中的金字塔分解和形态学开运算,以消除随机噪声和空洞。该方法有效地解决了原有背景消减法的不足,在削弱噪声和空洞的同时较准确地保留了对象边缘。
     3.研究了视频分割技术的实际应用。以现代交通中的道路监控为例,利用本文研究的算法,将视频中的车辆进行分割提取,然后使用轮廓逼近法,将分割出的对象加以标注,从而实现实时监控的辅助功能。最后,通过对本文算法与原有背景消减法及帧差法的的分割效果进行实验对比,验证了本文算法的优越性。
In the information society, by providing the most extensive and comprehensive information elements, video content is more and more popular in the industry of modern traffic, network media and computer vision. However, the original video often contain too large amount of information. Some of them even for most of the information are not meaningful to industry specific applications. So, How to extract useful information, to reduce the amount of information from the video will be an important problem seriously related with practical applications. For example, in traffic control and safety monitoring video, only when the main information of the video (Mainly referring to the information of moving object) is extracted the work on monitoring road safety and the production environment will be well done.
     At present, video object segmentation technology developed in recent years is an important foundational technology in the area of extracting effective video information. The technology has been widely used in traffic streaming video monitoring, industrial automation control, security, network and multimedia interactive video compression coding. In this paper, video segmentation technology and its application were researched. And an improved background extinction method based on Gaussian mixture model and wavelet theory was proposed. The method effectively reduced the noise and holes in the segmentation and improves the accuracy of moving object.
     In this paper, the major work done is as follows:
     1. A variety of segmentation algorithm existed were researched and realized. The common tools and methods used in the video segmentation from static image processing technology, such as filtering method, morphological operators and static image segmentation method; video object segmentation technology, especially the common methods in the automatic segmentation methods.
     2. An improved background extinction was researched and proposed. The background extinction segmentation method based on Gaussian mixture model had the disadvantages of holes and inaccuracy in the segmentation edge. An improved background extinction method which contained pre-segmentation and post-processing was proposed. Pre-segmentation was based on Gaussian mixture model first, and then color model and contrast model was used in this method. Post-processing used several methods, such as median filtering, pyramid decomposition of wavelet theory, morphological open computing, to eliminate random noise and holes. This method effectively solved the disadvantages of the original background extinction method. When it eliminated holes and noises the edge was well maintained.
     3. Practical application of video segmentation was researched. As an example of road monitoring in the modern traffic, the method proposed in this paper was used first, the cars was segmented, and then contour approximation was used to labeled the object, so real-time tracking and auxiliary monitoring was realized. In the end, the method proposed in this paper, original background extinction method and frame differencing method were compared to prove the effectiveness of the former.
引文
[1]A.Aydin,Leven Onural,Michael Wollborn,Roland Mech.Image Sequence Analysis for Emerging Interactive Multimedia Services[J].The European COST211 Framework,IEEE Trans.On CSVT.1998.8(7).pp:802-813.
    [2]Yin Li,Jian Sun,Heung-Yeung Shum,Video Object Cut and Paste[N],Microsoft Research Asia,SIGGRAPH 2005,pp:1-5.
    [3]Jian Sun,Weiwei Zhang,Xiaoou Tang,Heung-Yeung,Background Cut[N],Proc.ECCV 2006,pp:1-6
    [4]陈韩锋,基于时空联合分割框架的视频对象分割技术研究[D]。上海交通大学博士学位论文。2004,pp:5-20。
    [5]D.S.Zhang,G Lu.Segmentation of Moving Objects in Image Sequence:AReview[J].Circuits Systems and Signal Processing(Special Issue on Multimedia Communication Services).2001,20(2).pp:143-183.
    [6]JunKi Kim,Ho Suk Lee.Real-time Pre-processing and Video Object Segmentation for High Compression and Content-based MPEG-4 coding[C].Processing of the Data Compression Conference(DCC 2003).2003,3.pp:424-434
    [7]Petros Daras,Ioannis Kompatsiaris,Ilias Grinias,Georgios Akrivas.MPEG4Authoring Tool Using Moving Object Segmentation and Tracking in Video Shots[J].EURASIP Journal on Applied Signal Processing.2003,9.pp:1-18.
    [8]Ju Guo,C.Jay kuo,Semantic Video Object Segmentation for Content-Based Multimedia Applications[M],Kluwer Academic Publishers,2001,pp:115-126.
    [9]J.Serra,Image Analysis and Mathematical Morphology[M],New York,Academic Press,1982,pp:224-226.
    [10]Serra,ED.Image Analysis and Mathematical Morphology[M],Vol.2:Theoretical Advances,New York.Academic Press,1988,pp:122-134.
    [11]沈清,汤霖.模式识别导论[M].湖南:国防科技大学出版社,1991,pp:15-28
    [12]P.Salembier and F.Marques,Region-based Representations of Image and Video: Segmentation Tools foe Multimedia Services.IEEE Trans CSVT.1999,99(8),pp:1147-1169
    [13]A.Wang,E.H.Adelson,Representing moving images with layers[J],IEEE Trans.Image Processing,1994,3:625-638.
    [14]Georgi D.Borshukov,Gozde Bozdagi.Motion segmentation by multistage affine classification[J],IEEE Trans.On Image Processing,1997,6(11):1591-1594.
    [15]H.Li.B.J.Tye,E.P.Ong,W.S.Lin,C.C.Ko.Multiple Motion object segmentation based on homogeneous region merging[J],IEEE International Symposium on Circuits and Systems,2001.5:175-178.
    [16]Black,M.J.and Anandan.P.The robust estimation of multiple motions:Parametric and piecewise-smooth flow fields[J],Computer Vision and Image Understanding.CVIU,1996,63(1):75-104.
    [17]N.Diehl,Object-oriented motion estimation and segmentation in image sequences[J],Signal Processing:Image Communication,1991.3:23-56.
    [18]M.M.Chang,M.I.Sezan,and A.M.Yekalp,An algorithm for simultaneous motion estimation and scene segmentation[J],IEEE Int.Conf.Acust.,Speech.Signal Processing,ICASSP'94,Adelaide,Australia,Apr.1994.vol.V.221-224.
    [19]Srinivas Sista,Rangasami L.Kashyap,Unsupervised video segmentation and object tracking[J],Computers in Industry,2000,42:127-416.
    [20]Jean-mare Odobez,Patrick Bouthemy,Direct incremental model-based inage motion segmentation for video analysis[J],Signal Processing,1998,66:143-155.
    [21]Montoliu,R,Pla,F.Multiple parametric motion model estimation and segmentation,Proceedings of International Conference on Image Processing,2001,2:933-936.
    [22]李白杨,陈纯,钱英,视频分割技术的发展[J],计算机研究和发展,2001,38(1):36-42.
    [23]Thomas Meier,King N.Ngan,Automatic Segmentation of moving objects for video object plane generation,IEEE Trans,on circuits and systems for video technology,1998,8(5):525-538.
    [24]Thomas Meier,King N.N,video Segmentation for content-based coding[J], IEEE Trans onCircuits and Systems for Video Technology.1999.9(8):1190-1201.
    [25]韩军,熊璋,孔文彦等,自动分割及跟踪视频运动对象的一种实现方法[J],中国图像图形学报,2001,6A(8):732-737.
    [26]Y.Z.Hsu,H.H.Nagel,New Likelihood test methods for change detection in image sequences[J],Computer Vision,Graphics and Image Processing.1984,26:73-106.
    [27]R.Mech and M.Wollbom,A noise robust method for 2D shape estimation of moving objects in video sequences considering a moving camera[J],Signal Processing,1998,66:203-217.
    [28]Til Aach.Andre kaup.Bayesian algorithms for adaptive change detection in image sequences using markov random fileds[J],Signal Processing:Image Communication,1997,7:147-160.
    [29]黄波,杨勇,王桥,吴乐南,基于模糊聚类和时域跟踪的视频分割[J],通信学报vol.22,No.12,2001,22-28.
    [30]Hua Zhong,Liu Wenyin.Shipeng Li,Interactive tracking-A semiautomatic video object tracking and segmentation system[J],Proc.IEEE ICIP,2001,645-648.
    [31]Ju Guo:Jongwon Kim,C.C.J.,An interactive object segmentation system for MPEG video[J],Image Processing,ICIP 1999,vol.2:140-144.
    [32]Gatical-Perez,Chuang Gu,Ming-Ting Lee,Semantic Video Object Extraction Using Four-Band Watershed and Partition Lattice Operators[J],IEEE Trans.on Circuits and Systems for Video Technology.2001,1(5):603-618.
    [33]Roberto Castagno,Touradj Ebrahimi,Video Segmentation Based on Multiple Features for Interactive Multimedia Applications[J].IEEE Trans on Circuits and Systems for Video Technology.1998,8(5):562-567.
    [34]Moscheni.F.Bhattacharjee S.Kunt M,Spafio-temporal segmentation based on region merging[J],IEEE Trans.on Pattern Analysis and Machine Intekkugence.1998,20(9):897-915.
    [35]Lu,Z.;Pearlman,W.A,Semi-automatic semantic video object extraction by active contour Model[J],IEEE International Conference on Multimedia and Expo, 2000. vol. 2: 645-648.
    [36] Luo, H,; Eleftheriadis, A, Spatial temporal active contour interpolation for semi-automatic video object generation[J]. IEEE Intl. Conf. on Image Processing, 1999, vol, 2:944-948.
    [37] Zaletelj, J. Tasic, J. F, Video object segmentation based on edge tracking[J], Procceding of Image Processing. IEEE 2001, VII: 813-816.
    [38] Todd Schoepflin, vikram Chalana, video object tracking with a sequential hierarchy of template deformations[J]. IEEE Trans. on Circuits and Systems for Video Technology. 2001,11(11): 1171-1182.
    [39] Munchurl Kim, J. G. Jeon, Moving Object segmentation in vedio sequences by user interaction and automatic object tracking[J], Image and Vision Computing, 2001, 19:245-260.
    [40] Jungeun Lim; Jong Beom Ra, A semantic video object tracking algorithm using three-step boundary refinement[J], IEEE Intl. Conf. on Image Processing. 1999. vol.2: 159-163.
    [41] Songxiang Xu, Jenq-Neng Hwang: Jun Yu, An Accurate region based object tracking for video sequences[J], IEEE 3rd Workshop on Multimedia Signal Processing. 1999. 271-276.
    [42] Chuang Gu, Ming-Chieh Lee, Semantic video object segmentation and tracking using mathematical morphology and perspective motion model[J], Proceedings of IEEE International Conference on Image Processing. 1997.2: 514-517.
    [43] Jungeun Lim, Jong Bcom Ra, Semi-Automatic video segmentation for object tracking[J]. Proceedings of IEEE International Conference on Image Processing. 2001.2:81-84.
    [44] Valette. S. Magnin. I, Prost. R. Active mesh for video segmentation and objects tracking[J], International Conference on Image Processing. 2001. 2: 77-80.
    [45] Candemir Toklu, A. Murat Tekalp, A. Tanju Erdem, Semi-Automatic video object segmentation in the presence of occlusion[J], IEEE Trans. On Circuits and Systems for Video Technology, 2000, 10(4): 624-629.
    [46] Fuhui Long. Dagan Feng, etc. Extracting semantic video objects[J], IEEE Computer Graphics and Applications.2001,21(1):48-55.
    [47]Munchurl Kim,Jac Gark choi,A VOP generation tool:automatic segmentation of moving objects in image sequences based on spatio-temporal information[J].IEEE Trans on Circuits and Systems for Video Technology.1999.19(8):1216-1226.
    [48]Demin Wang,Unsupervised video segmentation based on watersheds and temporal tracking[J],IEEE Trans on Circuits and Systems for Video Technology.1998.8(5):539-546.
    [49]Yu Huang,Dietrich Paulus,Background-foreground segmentation based on dominant motion estimation and static segmentation,Proceedings of the 1~(st)Intl[J].Workshop on Image and Signal Processing and Analysis,2000:69-74.
    [50]黄波,杨勇,王桥,吴乐南,一种基于时空联合的视频分割算法[J],电子学报,2001,29(11):1419-1494.
    [51]J.G.Choi,S.W.Lee,and S.D.Kim,Spatio-temporal video segmentation using a joint similarity measure[J],IEEE Trans.Circuits and Systems.Video Technology,1997.7(4):279-286.
    [52]M.Kim,J.G.Jeon,et al.User's g-uide for a user-assisted video object segmentation tool[S].ISO/IEC JTCl/SC29/WG11 MPEG98/M3935,1998.
    [53]贾云得,机器视觉[M],北京,科学出版社,2000年4月,125-131
    [54]Adiv G.Determining three dementional motion and structure and from optical flow generated by several moving objects[J].IEEE Trans.PAMI,1985,7:384-401
    [55]Shao-Yi Chien,Shyh-Yih Ma,Liang-Gee Chen.Efficient Moving Object Segmentation Algorithm Using Background Regstration Technique[J].IEEE Transaction on Circuits and Systems for Video Technology,2002,12(7):577-586
    [56]Christof Ridder,Olaf Munkelt,Harald Kirchner.Adaptive background estimation and foreground detection using Kalman-filtering[J].ICRAM'95,1995,pp:193-199
    [57]Burt P.J.,Adelson E.H.,The laplacian pyramid as a compact image code[J].IEEE Transaction on Communication,1983,31(4):532-540
    [58] Chris Stauffer, W. Eric. L Grimson. Learning Patterns of Activity Using Real-Time Tracking[J]. IEEE Transaction On Pattern Analysis and Machine Intelligence, 2000, 22(8):747-757
    [59] Ioannis Pavlidis, Vassilios Morellas, Panagiotis Tsiamyrtzis, et al. Urban Surveillance Systems: From the Laboratory to Commercial World[J]. Proceedings of the IEEE, 2001, 89(10): 1478-1497

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700