视频流图像内容检索与运动目标检测研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
视频流图像内容检索与运动目标检测研究是在监控信息流中融入图像理解技术。图像理解是关于图像内容的描述或者是某种分类,其描述或分类是由计算机代替人的视觉完成,并由计算机自动给出对图像的评价。根据捕获到的图像对实际物体和场景做出有意义的判定,并对其行为进行理解与描述,实现“视觉智能”。
     本文针对图像颜色分布特征,提出对图像进行子块分割和合理的区域划分方法来对不同子块和区域分别进行颜色特征提取,并给出利用离散余弦变换和奇异值分解进行颜色特征提取和降维操作算法。提出自相似特征编码方法,将编码空间由灰度空间扩展到彩色空间,充分体现了图像的颜色自相似特征。采用中心扩散算法,能够在保证一定程度匹配误差的基础上有效缩短计算时间,增强算法的可行性,通过对自相似特征编码进行奇异值分解的方法提取特征向量来实现图像检索。
     文中给出利用提升小波变换和自适应二进制算术编码方法的图像压缩算法,提出动态压缩倍数方法,较好地解决了网络带宽问题。
     文中最后提出一种嵌入式系统工程化设计思想,并依此设计了一种网络摄像机硬件架构。讨论了嵌入式硬件实现中的关键问题-算法并行性、数据复用性、系统安全性问题。
The purpose of image understanding lies in intelligentizing image information processing, moreover it can replace parts of human work. Image understanding is a kind of description or certain sort concerning image contents, its description or sort is completed and evaluated by the computer instead of human vision. The object of computer vision research is to make meaningful judgment on practical objects and scenes according to image construction scene and acquired images, comprehend and describe its behavior and realize“vision intelligence”
     The research of this paper is based on the leading projects“Identity authentication system based on biologic feature recognition”and“Key science and technology development plan project of Jilin province”of the national 863 plan, and is applied in the image retrieval and motion object detection of network intelligent camera. By performing motion detection to the images in video stream and retrieval based on content on sensitive information, we can make judgment to the given images, such as, identify“human”activities in surveillant scene, detect“fire”alarm information in the forest fire monitoring situation etc. This paper focuses on the vision information of images and discusses the problems of retrieval based on content and object detection.
     Color feature is not neglectable in the process of image retrieval because it’s very straight forward. This paper focuses on the color distribution feature of image, and puts forward to extract color feature of different sub blocks and regions respectively by sub block partition to the image and reasonable region, and introduces an algorithm that makes use of DCT and SVD to extract color feature and reduce dimension. The experiment proves that the method has achieved a better complete check rate and accuracy check rate, having preferable retrieval effect. At the same time we analyze the limitation of color feature and discuss method that combines texture, shape and fractal, trying to achieve better retrieval effect. In order to fully express the self resemblance color feature of the images, this paper puts forward the self resemblance feature coding method, which expands the code space from the gradation dimension to chromatic dimension. At the same time, central diffusion algorithm which can effectively shorten computing time, enhance the feasibility of the algorithm while ensuring certain degree of matching error is adopted because of the great computation quantity, low speed problems of the traditional fractal encoding. Finally, image retrieval can be implemented by extracting feature vector from the self resemblance feature coding using singularity value decomposition. The experimental result shows that this algorithm has better real-time quality.
     The given image compression algorithm makes use of lifting wavelet transformation and self-adapting binary arithmetic coding method, and presents the dynamic variety diploid method which solves the network bandwidth problem preferably.
     At last this paper introduces an embedded system engineering design idea and designs the hardware architecture of network camera based on it. We discuss the key problems in the embedded hardware implementation-parallelism of algorithm, data reusing, and system security. We resolve the huge amount of information and limited bandwidth resource problems that as merely an input end, the traditional network camera will send the image information captured directly to the server without any processing 24 hours ceaselessly. At the same time, this paper presents a retrieval method that can detect accidents and security hidden trouble immediately from massive information. Security monitoring has experienced three generations of development, the first generation is simulated image monitoring, the second is monitoring based on PC technology, the third is network digital monitoring. The“network camera based on image retrieval”which has“vision intelligence”and based on the research achievement of this paper belongs to the new generation network monitoring machine.
     “Image content retrieval and motion object detection of video stream”is a problem that involves multiple subjects. The research of image retrieval and motion object detection have obtained some achievement, but compared to human perception ability, there is still a large gap between them. The successive research of this paper will try to apply semantic technique in the compile technology in the research of image understanding and plan to solve the difficulty of how to extract semantic feature from images and describe it. Searching and developing new technology can help to improve image target identification and understanding, it also can reduce the computing complexity effectively.
引文
[1] 谭铁牛.智能视频监控技术概述,第一届全国智能视觉监控学术会议.北京,2002.5
    [2] R Rickman, S John, Content Based Image Retrieval Using Color Tuple Histograms. Proc of SPIE: Storage and Retrieval for Image and Video Database . San Jose . CA .1996. 2670
    [3] G Pass, R Zabih, J Miller. Comparing Images Using Color Coherence Vectors. Proc of ACM Intern Conf Multimedia. Boston ,MA,1996
    [4] M J Swain,D H Ballard. Color Indexing, IJCV, 1991, 7(1):11-32
    [5] M J Swain,D H Ballard. Indexing via Color Histograms. Proc ICCV90:390-393
    [6] R Collins, A Lipton et al. A system for video surveillance and monitoring;VSAM final report.CMU-RI-TR-00-12, Robotic Institute Carnegie Mellon University,2000
    [7] Aki.Kobayashi, Toshiyuki Yoshida, Sakai. Image Retrieval by Estimating Parameters. of Distance Measure. SPIE. San Jose, California, 2000, 3972.432-441
    [8] J.Huang, S. Kumar, M.Mitra, W.J.Zhu, and R.Zabih. Image Indexing Using Color Correlogram. Proc. of IEEE Conf. On Computer Vision and Pattern Recognition, 1997
    [9] R M Haralick, Shangmugam, Dinstein. Textural Feature for Image Classification. IEEE Trans on Systems. Man , Cybernetics ,1973, SMC-3(6):610-621
    [10] R M Haralick.Statistical and Structural Approaches to Texture[J]. Proc.IEEE,1979,67:786-804
    [11] 刘静,钟伟才,刘芳,焦李成,“基于分数维的图像检索新方法”,计算机研究与发展,第 41 卷第 7 期,2004 年 7 月,pp1182-1187
    [12] Kato,T.,A sketch retrieval method for full color image database 11th Inter. Conf. Pattern Recog.,pp530-533
    [13] Towards An Advanced Multimedia Oriented Retrieval Engine, the sixth International World Wide Web conference which took place in California from 7-11th April 1997.
    [14] 唐立军, 段立娟, 高文. 基于内容的图像检索系统[J ] . 计算机应用, 2000 ,10 (7) :41-45.
    [15] JAIN A K,VAILAYA A. Image retrieval using color and shape. Pattern Recognition ,1997 ,29 (8) :1233-1244.
    [16] ORACHARD M T , BOUMAN C. Color quanti zation of image. IEEE Trans Signal Processing. 1991 ,39 (12) :2677-2690.
    [17] 章毓晋. 基于内容的视觉信息检索. 北京:科学出版社,2003
    [18] FinnR.QuerybyImageContent[M].IBM Research.1996
    [19] 金红, 周源华. 基于内容检索的视频处理技术[J ].中国图像图形报. 2000(4)。
    [20] 庄越挺, 潘云鹤.用语义联想支撑基于内容的检索[J ].计算机研究与发展. 1999
    [21] Hongjiang Ghang, "Content-based Video Analysis, Retrieval And Browsing", in Multimedia Information Retrieval and Management一Technological Fundamentals and Applications, D.Feng,WC. Siu, and H.J.Zhang. led.), Springer, 2002
    [22] N. Dimitova, H.J.Zhang, et al, "Applications of Video-Content Analysis and Retrieval", IEEE Multimedia. July, 2002
    [23] Remagnino P, Tan T, Baker K. Mufti-agent visual surveillance ofdynamic scenes[J]. Image and Vision Computing, 1998, 16 (8):529- 532
    [24] Collins, Lipton and Kanade, "A System for Video Surveillance and Monitoring," Proc. American Nuclear Society (ANS) Eighth International Topical Meeting on Robotics and Remote Systems, Pittsburgh, PA, April 25-29, 1999.
    [25]. Maggioni C, Kammerer B. Gesture Computer: History, Design, and Applications. Computer Vision for Human-Machine Interaction. Cambridge: Cambridge University Press, 1998
    [26]. Freeman W, Weissman C. Television control by hand gestures[C]. In: Proc International Conference on Automatic Face and Gesture Recognition, Zurich, Switzerland, 1995. 179-183
    [27] 李向阳,鲁东明,潘云鹤, 基于色彩的图像数据库检索方法的研究 ,计算机研究与发展, 1999年 第3期 No.3 1999
    [28] Haralick R M, Shanmngam K, Dinstein I. Texture feature for image classification[J]n. IEEE Transactions on Systems, Man, and Cybernetics, 1973,3(6):610-621.
    [29]Peitgen H O, Saupe D. The science of fractal image[M]. New York: Springer-Verlag, 1988. 20-21.
    [30] Fisher Y. Fractal image compression: theory and application[M]. New York: Springer-Verlag, 1995. 49-51.
    [31] Suchendra M.Bhandarkar and Xingzhi Luo.Fast and Robust Background Updating for Real-time Traffic Surveillance and Monitoring. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR’05),2005.
    [32] 李俊韬,张海,范跃祖.复杂交通场景中多运动目标分割算法.北京航天航空大学学报,2006,32(3):297~300
    [33] 肖梅,韩崇昭,张雷.基于时空背景差的运动目标检测算法.计算机辅助设计与图形学学报,2006,18(7):1044~1048
    [34] Abdol-Reza Mansouri, mar Mitiche, Christophe Langevin . JOINT SPACE-TIME MOTION-BASED SEGMENTATION OF IMAGE SEQUENCES WITH LEVEL SET PDES . Proceedings of the Workshop on Motion and Video Computing(MOTION’02)
    [35] Markus Latzel, Emilie Darcourt, John K.Tsotsos.People Tracking using Robust Motion Detection and Estimation . Proeeding of the SecondCanadian Conference on Computer and Robot Vision(CRV’05)
    [36] Jiang Gao, Alexander G.Hauptmann, Howard D.Wactlar.Combining Motion Segmentation with Tracking for Activity Analysis.Proceedings of the Sixth IEEE Intemational Conference on Automatic Face and Gesture Recogniton(FGR’04)
    [37] Alireza Bab-Hadiashar, Niloofar Gheissari, David Suter.Robust Model Based Motion Segmentation . Proceedings of the 16th International Conference on Pattern Recognition(ICPR’02)
    [38] Nikos Paragios and Rachid Deriche.Geodesic Active Contours and Level Sets for the Detection and Detection and Tracking of Moving Objects.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, Vol 22.No3,2000, Pages 266~280.
    [39] Daniel Majchrzak, Sudeep Sarkar, Barry Sheppard, and Robin Murphy.Motion Detection From Temporally Integrated Images.Porceedings of the International Conference on Pattern Recognition(ICPR’00)
    [40] Abhiji S . Ogale, Cornelia Femuller, Yiannis Aloimonos . Motion Segmentation Using Occlusions.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, VOL27, NO.6, pages 988~992
    [41]方帅 ,“视频监控系统中小运动目标分类算法”,信息与控制 vol34 No2,pp201-204,Apr 2005
    [42] 周西汉,刘勃,周荷琴.一种基于对称差分和背景消减的运动检测方法.计算机仿真,2005,22(4):117~120
    [43] Markus Latzel, Emilie Darcourt, John K.Tsotsos.People Tracking using Robust Motion Detection and Estimation.Proeeding of the Second Canadian Conference on Computer and Robot Vision(CRV’05)
    [44] Ashwin Thangali, Stan Sclaroff . Periodic Motion Detection and Estimation via Space-Time Sampling.Proceeding of the IEEE Workshop on Motion and Video Computing (WACV/ MOTIO N ’05)
    [45] 肖梅,韩崇昭,张雷.基于时空背景差的运动目标检测算法.计算机辅助设计与图形学学报,2006,18(7):1044~1048
    [46] 张继平,刘直芳.视频中运动目标的实时检测和跟踪.计算机测量与控制,2004,12(11):1036~1039
    [47]郑世友,费树屺,龙飞.基于小波提升框架的图像序列中运动目标检测算法.中国图象图形学报,2005,10 (5):596-602.
    [48].J.Villasenor,B.Belzer,J.Liao.Wavelet Filter Evaluation for Image Compression[J]. IEEE Transactions on Image Processing, Vol.2,August, 1995:1053-1060.
    [49].M.Antonini,M.Barlaud,P.Mathieu,I.Daubechies.Image coding using wavelet transform[J] .IEEE Transactions on Image Processing, Vol.5,Apr, 1992:205-220.
    [50].张雄伟,陈亮,徐光辉.DSP 集成开发与应用实例.电子工业出版社,2002.6.
    [51].邹彦 主编,唐冬,宁志刚 副主编.DSP 原理及应用.电子工业出版社,2005.1:4-7.
    [52] Cavan R A. Improved Tracking and Data Fusion Through Sensor Management and Control Proc. Data Fusion Symp. 1987: 66-65.
    [53] 杨枝灵等,数字图像获取、处理及实践应用,人民邮电出版社,2003.1
    [54] 胡硕,基于小波理论的目标检测与快速目标跟踪算法研究,中国科学院研究生院(长春光学精密机械与物理研究所)博士论文。
    [55] Paul L Rosin, Tim Ellis. Image difference threshold strategies and shadow detection.BMVC95, 1995, 347-356.
    [56]孙君顶, 武学东, 周利华. 基于颜色和形状的图像检索[J]. 计算机科学, 2004, 31(5):180-183.
    [57]刘洋. 基于内容的图像检索算法研究, 硕士学位论文, 吉林大学, 2004.
    [58]郑慧娆, 陈绍林, 莫忠息 等. 数值计算方法[M]. 武汉:武汉大学出版社, 2002.
    [59]刘小军, 王东峰, 张丽飞 等. 一种基于奇异值分解和隐马尔可夫模型的人脸识别方法[J].计算机学报, 2003, 26(3):1-5.
    [60]李明水, 欧珊蝴, 张珩. 分形图像压缩方法研究的新进展[J]. 工程图学学报, 2004, (2):143-152.
    [61]Mandelbrot B B. The fractal geometry of nature[M]. San Francisco, Freeman, 1982. 15-18.
    [62]齐东旭. 分形及其计算机生成[M]. 北京: 科学出版社, 1996. 3-5.
    [63]Peitgen H O, Saupe D. The science of fractal image[M]. New York: Springer-Verlag, 1988. 20-21.
    [64]John E Huchinson. Fractals and self similarity[J]. Indiana University Mathematics Journal, 1981, 35(5):713-747.
    [65] Fisher Y. Fractal image compression: theory and application[M]. New York: Springer-Verlag, 1995. 49-51.
    [66]Barnsley M F, Sloan A D. A better way to compress images[J]. Byte, 1988, 13(1): 215-223.
    [67]Fisher Y. Fractal encoding: Theory and application to digital image[M]. New York: Springer-Verlag, 1994. 12-14.
    [68]Jacquin A E. Fractal image coding: A review[J]. Proceeding of the IEEE, 1993, 81(10): 1451-1465.
    [69]Fisher Y. Fractal image compression[J]. Fractals,1994, 2(3): 347-361.
    [70] 许相莉、周春光、于哲舟、张利彪:分形域内彩色图象检索算法,吉林大学学报(理学版),Vol.44, Suppl, p.135-138,Aug.2006。
    [71] Shoichi Araki, Takashi Matsoukis, Naokaau Yokoya, et al. Real-time tracking of multiplemoving object contours in a moving camera image sequence . IEEE Trans Inf.&Syst, 2000,(7).
    [72] Jain R, Nagel H. On the analysis of accumulative difference of picture from image sequences of real world scenes [J]. IEEE Transactions PAMI, 1979. 206 -214.
    [73]陈朝阳,张桂林.基于图像对称差分运算的运动小目标检测方法[[J].华中理工大学学报,1998 ,26 (9).
    [74] Alan J Lipton, H Fujiyoshi. Raju S Patil. Moving target classification and tracking from real-time video [J]. IEEE Transactions on Workshop Application of Computer Vision, 1998: 8-14.
    [75] 程正兴,小波分析算法与应用,西安交通大学出版社 2003 年 3 月
    [76] Hong L, Wang W C, Logan M, et al. Multiplatform Multisensor Fusion with Adaptive-Rate Data Communication [J]. IEEE Trans. on Aerospace and Electronic Systems,33(1),1997,123-126
    [77]顾海军,赵晓晖,基于小波模板匹配的目标跟踪,吉林大学学报(信息科学版)Vol. 21No. 4 Nov, 2003
    [78] W. Seldens, The lifting scheme: A construction of second generation wavelets, Tech. Rep 1995:6,Industrial Mathematics Initiative, Department of Mathematics, University of SouthCarolina, 1995.
    [79] W. Sweldens and P. Schr oder, Building your own wavelets at home, Tech.Rep. 199_5:5.Industrial Mathematics Initiative, Department of Mathematics, University of South Carolina,1995.
    [80] Daubechies I, Sweldens W. Factoring Wavelet Transforms into Lifting Steps [R]. USA, Princeton University, 1996.
    [81]Swain M J, Ballard D H. Color Indexing[J]. International Journal of Computer Vision, 1991, 7 (1):11-32.
    [82]Hafner J, Sawhney H S, etal. Efficient color histogram indexing for quadratic form distance functions[J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1995, 17(7):729-736.
    [83]Stricker M, Orengo M. Similarity of color images[C]. In: Niblack W R, Rceds J eds. Proceedings of the SPIE2420, Storage and Retrieval for Images and Video DatabaseIII, San Jose, CA: SPIE, 1995:381-392.
    [84]Androustsos D A. Novel vector-based approach to color image retrieval using a vector angular-based distance measure[J]. Computer Vision and Image Understanding,1999, 75(1):46-58.
    [85]Pass G, Zabih R. Histogram refinement for content-based image retrieval[C]. In:IEEE Workshop on Applications of Computer Vision, 1996:96-102.
    [86]Lim S, Lu G. Spatial statistics for content based image retrieval[C]. In: IEEE International Conference on Information Technology: Computers and Communications, Monash Univ, Clayton, Vic, Australia, 2003:155-159.
    [87]Huang J, Kumar S R, Mitra M, etal. Image indexing using color correlograms[A]. In: IEEE Conference on Computer Vision and Pattern Recognition[C]. 1997:762-768.
    [88]Smith J R, Chang S F. Tools and techniques for color image retrieval[A]. In: Proc SPIE of the Storage & Retrieval for Image and Video Databases IV[C]. San Jose, CA:[s.n.], 1996, 2670:426-437.
    [89]Stricker M, Dimai A. Color indexing with weak spatial constraints[A].In: Proc SPIE Storage Retrieval Still Image Video DatabasesIV[C]. San Jose, CA:[s.n.], 1996, 2670:29-40.
    [90]顾红飞, 孙光灵, 屠菁. 一种基于颜色布局的压缩图像检索方法. 安庆师范学院学报(自然科学版), 2005, 11(2):6-10.
    [91]马棣麟. 摄影艺术构图. 上海: 复旦大学出版社[M], 1991.
    [92]Sloan A D. Retrieving database contents by image reconition: New fractal power[J]. Advanced Imaging, 1994, 9(5): 5-15.
    [93]Zhang A D, Cheng D, Raj Acharya. An approach to query-by-texture in image database systems[A]. In: Proceedings of the SPIE conference on Digital Image Storage and Archiving System, Bellingham. Society of Photo-Optical Instrumentation Engineers[C], WA, USA:1995, 338-349.
    [94]王志勇, 池哲儒, 余英林. 分形编码在图像检索中的应用[J]. 电子学报, 2000, 28(6):19-23.
    [95]Schouten B, Zeeuw P. Image databases, scale and fractal transforms. Proceeding of ICIP, Canada, Sept. 2000, 2:534-537.
    [96]徐艳, 王加俊. 使用拼贴误差直方图的分形图像检索. 电子与信息学报. 2006, 28(4):603-605.
    [97]张正炳, 朱耀庭, 朱光喜等. 基于近距自相似模型的分形图像编码方法[J]. 通信学报, 1997, 18(2):29-34.
    [98] Y Kameda, M.M. A Human Motion Estimation Method Using 3-Successive in International Conference on Irrtual System and Multimedia. 1996.
    [99] A. Nakazawa, H.K., S. Inokuchi. Human Using Distributed Video Systems, in International Conference on Pattern Recognition. 1998.
    [100] Reddy, R. and 1976, Speech recognition 妙 machine: A review. IEEE Proceedings. 6(4):pp. 502-531.
    [101] Guo G D, Yu S, Ma S D. Unsupervised segmentation of color images. In:Proceeding of 1998 IEEE International Conference on Image Processing .Chicago, IL, USA, 1998:pp299-302
    [102]林瑶,田捷.医学图像分割方法综述[J].模式识别与人工智能.2002, 15(2):pp192-201 [103」刘重庆,程华.分割彩色图像的一种有效聚类方法.模式识别与人工智能.1995, 8 (1) :133-138 页
    [104]王兴伟,沈兰荪,卫保国等.基于改进的 K 一均值聚类和数学形态学的彩 色眼科图像病灶分割.中国生物医学工程学报,2002, 21(5) :pp443-448
    [105] Lezoray 0, Cardot H. Cooperation of color pixel classification schemes and color watershed:a study for microscopic images .IEEE Transactions on Image Processing, 2002, 11(7): 783-789P
    [106] Shii A,Hamada N. Color image segmentation method using watershed algorithm and contour information. In:Proceeding of 1999 IEEE International Conference on Image Processing.Kobe,Japan, 1999: 305-309P
    [107] S. Mallat, S.Zhong. Characterization of signals from multiscale edges[J]. IEEE Trans. PAMI. 1992,14(7)
    [108]S.Mallat, W L. Hwang. Singularity detection and processing with wavelets[J]. IEEE Trans. IT. 1992, 38(2)
    [109]John Canny A Computational Approach to Edge Detection[J],IEEE Trans. On PAMI VOI.8,No.6,November 1986:679 一 698. [110〕马丽红,张宇,邓健平.基于形态开闭滤波二值标记和纹理特征合并的分水岭算法「J7.中国图象图形学报.2003, 8 (1) : 77-83 页 116
    [111] 乔予杰,胡俊,苏祥芳,孙洪,茹国宝.基于分级的 MRF 视频运动检测算法及其实现.武汉大学学报(理学版),2002,48(1):99~102
    [112] 刘红梅,黄金科,肖自美,梁凡.多分辨率运动估值的改进算法.中山大学学报(自然科学版),2001,40(2):34~39
    [113] 罗宁,方向忠,张文军.一种带运动检测的去隔行中值滤波算法.计算机工程与应用,2004,62~65
    [114] 陈前荣,陆启生。成礼智.运动模糊图像的运动模糊方向辨别.国防科技大学学包,2004,26(1):41~46
    [115] 胡俊,苏祥芳,刘立海,沈芸,管鲍,王延平.图像序列运动检测算法的研究及其应用.武汉大学学报(自然科学版),2000,613~617
    [116].陈传波,金先级.数字图像处理.机械工业出版社,2004.7.129. 152-153,148-149.
    [117] 刘循,智能交通中运动汽车检测及识别技术研究,四川大学博士论文
    [118] 许相莉, 周春光, 于哲舟, 张利彪, 孙昊翔. 基于自相似特征的彩色图像检索算法. 小型微型计算机系统,(已录用).
    [119] 张成林,提升小波算法的研究与 TDS642EVM 板上的实现,吉林大学硕士论文。
    [120]K.R.Castleman 著,朱志刚等译,数字图像处理,电子工业出版社

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700