基于内容的新闻视频检索

作者：党明辉
论文级别：硕士
学科专业名称：计算机技术
中文关键词：视频检索 ; 色彩空间 ; 关键帧 ; 镜头边界 ; 场景分割 ; 语义分析
英文关键词：Video Search ; Color model ; Keyframe ; Shot boundary ; video scene boundary ; Semantic Analysis
学位年度：2011
导师：耿国华
学科代码：081203
学位授予单位：西北大学

摘要

随着互联网和数字媒体技术的发展以及海量数字视频的出现,研究如何从大量数字视频中获取用户感兴趣的信息的基于内容的视频检索(CBVR)技术成为计算机科学研究的重点课题。本文以新闻视频为研究对象,根据新闻视频其自身特殊的结构、组织和检索方式,对已有的CBVR技术进行了改进,实现对新闻视频检索系统检索功能的改善。本文研究工作如下：
     (1)提出了一种改进的镜头边界检测算法。在完成对视频的结构化分析及关键技术的研究工作中,比较了像素比较镜头边界检测法、基于图像边缘轮廓特征镜头边界检测法、灰度差值法、直方图镜头检测法以及双直方图镜头检测法,权衡各种算法的优缺点,提出了基于像素比较法、直方图法、双直方图法互补性特点的一种改进的镜头边界检测方法。
     (2)提出了固定区域模板匹配法用于对新闻视频主持人镜头进行检测的方法。在新闻视频结构化特征提取部分,首先通过扩展人脸区域,即EFR方法选取标准主持人镜头作为通用模板,并由此确定出视频的检索区域,然后采用分块HSV颜色直方图作为模板参数,对初步选定的镜头的关键帧进行模板匹配实现主持人镜头的最终确认。该方法既能保证系统检测的灵活性,又能降低误检率。
     (3)构造了新闻视频语义分析提取原型系统。实现了对视频关键帧的语义分析、视频镜头的语义分析和视频场景的分割及语义提取。在完成新闻视频镜头分割、主持人镜头检测、字幕检测和广告视频场景分割的基础上,根据新闻视频中音频信息的特点,结合提取的音频语义构造了新闻视频场景分割及自动生成语义系统。该系统为用户查找视频进行视频检索提供了有力的技术支持,具有较高的时效性。
With the development of the Internet and digital media technology and the emergence of mass digital video, we need to facilitate access to a large number of video information in video information of interest, Thus, content-based video retrieval(CBVR) become an important issue in the field of computer science. This paper choose news videos as its research object and is based news videos'features of special structure,organization and retrieval way,which make it one of the most important topics。We have do research as follow.
     (1) This put forward a improved the shot boundary detection algorithm. Upon completion of the structure analysis of video and key technology research work, compared method of compares the pixel, edge profile characteristic method, gray level difference method, and dual histogram histogram shot detection method, We propose an improved method for shot boundary detection, based the complementary characteristics of pixel comparison, the histogram method and double histogram.
     (2) In news feature extraction part of this paper, a fixed area template matching shot detection for news video host is proposed. First of all standards host lens is detected by EFR Methods and thus determine the video search area, with block HSV color histogram as a template parameter, on behalf of all the possible lens frame template matching to achieve the ultimate host Lens confirmed. This approach can guarantee the detection of the flexibility of the system can reduce the false detection rate.
     (3) Based on the shot segmentation, shot detection of host, theme subtitle detection, advertising video scene segmentation, audio streams semantic extraction of the division, we determine the completion of the video scene. For video indexing, browsing to provide the organizational framework, the semantic information automatically generated based feature extraction, can quickly point users to find video clips with better efficiency. This comprehensive utilization of features of video and audio and its own model processing of news specific, makes the process steps to clear, the processing results to objective and accurate. With the current outstanding research achievements, this paper can realize content-based video retrieval better.

引文

[1]周明全,耿国华,韦娜.基于内容图像检索技术[M].北京：清华大学出版社,2007：1-144
    [2]沈兰荪,张菁,李晓光.图像检索与压缩域处理技术的研究[M].北京：人民邮电出版社2008：30-382
    [3]李玉峰.基于内容视频检索的镜头检测及场景检测研究[D].天津：天津大学博士学位论文,2009
    [4]史迎春.基于内客的视频检索语义提取若干问题研究[D].南京：南京理大学,2005
    [5]TimnanigLiu, Hongjiang Zhang, Feihu Qi. A novel video key-frame-extraction algorithm based on Perceived motion energy model. IEEE transactions on circuits and systems for video technology.2003,13(10):1006 - 1013
    [6]盛永华.基于内容的图像检索技术[J].信息科学,2010,3：57-58
    [7]E. Wold, T. blum, D. Keislar, J. Wheaten. Content-based lassification, seareh, and retrieval of audio. IEEE Multimedia.1996,3(3):27-36.
    [8]于海珠,司瑾.基于内容的图像检索技术[J]. 电脑知识与技术,2010,6(10)：2440-2441
    [9]梁刘红,富亮,薛向阳.电视节目自动分割算法[J].计算机研究与发展,2004,41(9)：1514-1519
    [10]杨强,尹德辉,马森.视频检索技术应用及发展趋势[J].电视技术,2006,31(2)：88-89
    [11]焦保军.基于新闻视频字幕的检测与提取分析[D].南京：南京理工大学硕士学位论文,2007
    [12]Y Rui, T S huang, S F Chang, Image retrieval:current techniques, Promising directions, and open issues. Journal of Visual communication and image representation,1999,10:39-62
    [13]黄祥林.基于压缩域图像检索技术的初步研究[D].北京：北京工业大学博士论文,2001
    [14]黄祥林,沈兰荪.基于内容的图像检索技术研究[J].电子学报,2002,30(7)：1065-1071
    [15]M J Swain, D H Ballard. Color indexing. Int. Journal of Computer Vision, 1991,7(1):11-32
    [16]M Stricker, M Orengo. Similarity of color images. SPIE Storage and Retrieval for image and video Datavbase Ⅲ,1995,2185(2):381-392
    [17]H Derek, S Rahul, S Henry, et al. Object-based image retrieval using the statistical structure of images IEEE Computer Society Conference on computer Vision and pattern Recognition, Washington DC, June27-July2,2004, 2(7):490-497
    [18]S K Chang. Iconic indexing by 2D string. IEEE trans pattern analysis and machine intelligence,1984,6(4):413-428
    [19]卢悦.基于内容的视频镜头检测与分类研究[D].济南：山东大学硕士论文,2010
    [20]曹莉华,胡晓峰,李国辉.基于内容检索中的视频处理技术研究[J].计算机工程与应用,1999,(6)：39-55
    [21]E. world, T. Blum, D. Keislar,J. Wheaten. Content-based classification, search, and retrieval of audio. IEEE Multimedia.1996,3(3):27-36
    [22]R.Lienhart, Comparison of Automatic Shot Boundary Detection Algorithms, SPIE,1999,3656:290-301
    [23]黄茜,张海泉,杨文亮.基于灰度和直方图的阈值自适应镜头边界检测[J].科学技术与工程,2007,4(7)：1671-1819
    [24]邓雪飞,夏定元.几种改进的镜头边界检测方法[J].数字电视与数字视频2005,7(2)：112-116
    [25]朱小俊,老松杨.一种有效的镜头探测方法[J].计算机工程与应用,2003,(8)：59-61
    [26]彭天强,李弼程.一种有效的抗闪光灯新闻视频镜头检测方法[J].信息工程大学学报,2007,8(4)483-486
    [27]Yeo B L, Liu B. Rapid scene analysis on compressed video[J]. IEEE Trans om Circuits and Systems for Video Technology,1995,5:533-544
    [28]滑勇.基于关键帧的视频内容检索问题的研究[D].大连：大连理工大学,2005
    [29]M. M. Yeung and B. Liu. Efficient matching and clustering of video shots[J]. Proceedings ofIEEE IXIP, pp.338-341,1995
    [30]Zhang Z, Wu J, Zhong D, et al.An Integrated System for Contentbased Video Retrieval and Browsing[J]. Pattern Recognition,1997,30(4):643.
    [31]H. Zhang, J Wu, D Zhong et al. An integrated system for content-based video retrieval and browsing[J]. Pattern pecognition,1997,30(4):643-658
    [32]W Wolf. key frame Selection by motion analysis[C]. in:Proc IEEE int Conf Acoust,Speech and Signal Proc,1996
    [33]刘佳兵.视频检索中的关键帧提取技术[C].福建电脑,2007,12：49-55
    [34]王旭军.视频分割方法研究[D].重庆：重庆大学硕士学位论文,2010,4
    [35]陆海斌,章毓晋,杨卫平.基于镜头和情节的视频非线性组织.计算机学报2000,23(5)：548-552
    [36]王策,何炎祥,王云.基于视音频特征和文本信息的新闻视频自动场景分割.计算机工程,2005,31(6)：171-199
    [37]W. A. C. Femando, C. N, Canagarajiah, D. R. Bull. Scene Change Detection Algorithms for Content-based Video Indexing and Retrieval. IEEE Electronics and Communications engineering Journal.2001,13(3):117-126
    [38]Seong Whan Lee, Young Min Kim, Sung Woo Choi. Fast Scene Chang Detection Using Direct Feature Extraction from Mpeg Compress Videos. IEEE Trans on Multimedia.2000,2(4):240-254
    [39]陈茜,赵进创.滚动字幕条件下的新闻视频检索研究[J].中国有线电视,2010(3)：296-298
    [40]柴旭请,崔红志,吕佳.基于内容的视频检索关键技术研究[J].科技信息,2010(17)：593-597
    [41]黄少年,赵跃龙等.一种基于镜头的视频场景检测方法[J].计算机工程与应用,2006,19：170-173
    [42]徐骏,李玲青,周洞汝.基于语义信息提取的新闻视频场景分割方法[J].计算机工程与应用,2003(4)：204-206
    [43]于俊清,汤旸,周向东.基于主色特征识别的新闻视频口播帧[J].计算机工程与科学,2004,26(8)：28-32
    [44]Hsu W, Kennedy L, Huang C W, et al. News Video Story Segmentation Using Fusion of Multi-level Multi-model Features in TRECVID 2003[C]//Proc. of IEEE ICASSP'04. Montreal, Canada:[s. n.],2004
    [45]杨娜,罗航哉,薛向阳.一种用于电视新闻节目的播音员镜头检测算法[J].软件学报,2002,13(8)：1559-1567
    [46]赵锞锞,彭天强,李弼程.新闻视频主持人镜头检测方法[J].计算机工程2008,34(19)：238-241
    [47]蓝昭华,良永忠.基于字幕的新闻视频检索算法[J].广播与电视技术,2010,(5)：64-67
    [48]A.K.Jain, and B. Yu, Automatic Text Location in Image and Video Frames, Pattern recognition, vol.31, no.12,1998,pp.2055-2076
    [49]K. Y. Jeong, K. Jung, E. Y. Kim, and H. J. Kim, Neural network based text location for news video indexing, in Proceedings of Int. Conf. on Image Processing, vol.3,1999, pp.319-323
    [50]C.Garcia and X. Apostolidis, Text detection and segmentation in complex colorimages, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol.4,2000, pp.2326-2329
    [51]R. Lienhart, F. Stuber, Automatic text recognition in digital videos, in Proceedings of SPIE Image and Video Processing Ⅳ 2666:180-188,1996.
    [52]H. K. Kim, Efficient Automatic Text Location Method and Content-Based Indexingand Structuring of Video Database, J. Visual Commun. Image Representation, vol.7, no.4,1996, pp.336-344
    [53]K. Sobottka, H. Bunke, and H. Kronenberg, Identification of Text on Colored Book and Journal Covers, in Proc. ICDAR'99,1999, pp.57-62
    [54]A. Wernicke and R. Lienhart, On the segmentation of text in videos, in Proc. IEEE Int.Conf. Multimedia Expo, vol.3,2000,pp.1511-1514
    [55]黄剑,赵黎,杨士强.视频文字检测与多尺度定位算法[J].清华大学学报(自然科学版),2004,44(1)：49-53
    [56]张引.复杂背景下文本提取方法研究与应用[D].杭州：浙江大学博士学位论文1999.7
    [57]Yan Xue-qiang, Ye Xiu-qing etc. Maximum entropy image thresholding algorithm based on the histogram defined on quantization image [J]. Pattern recognition and artificial intelligent,1998,11(3)pp.352-358
    [58]Nobuyuki OTSU. A Threshold Selection Method from Gray-level Histograms[J]. IEEE Transactions on Systems, Man, And Cybernetics,1979,9(1)
    [59]张洋.电视视频字幕文字[D].北京：中国科学技术大学硕士论文,2009.5
    [60]胡峰丽.基于内容的图像检索技术研究[D].北京：中国科学技术大学硕士论文,2007
    [61]谢光艺.视频字幕检测与提取的算法研究[D].西安：西安电子科技大学硕士学位论文,2005.1
    [62]文军,吴玲达,曾璞.新闻视频相似关键帧识别与故事单元关联分析研究.软件学报,2010,21(11)：2971-2975
    [63]徐新文.基于播音员识别的新闻视频故事分割方法[D].北京：中国科学技术大学硕士论文,2008
    [64]杨强.基于内容的新闻视频检索语义提取技术研究[D].成都：成都理工大学硕士论文,2007
    [65]李士进.新闻视频中广告片段精确定位方法研究[J].中国图象图形学报,2009,14(7)：1432-1439
    [66]徐骏,周晓峥.基于事件流的新闻视频场景分割方法[J].计算机辅助设计与图形学学报,2003,15(2)：228-232
    [67]张箭.基于内容的新闻视频检索系统研究[D].西安：西安电子科技大学硕士论文,2006.1
    [68]李默.新闻视频场景分割技术研究[D].郑州：解放据信息工程大学硕士论文,2005.6
    [69]梁宵.基于内容的新闻视频检索系统的研究与实现[D].沈阳：吉林大学硕士论文,2009
    [70]吴玲达,文军,陈丹雯.新闻视频故事单元关联分析技术研究综述[J].计算机科学,2010,37(6)：5-10
    [71]杨强.基于内容的新闻视频检索语义提取技术研究[D].成都：成都理工大学硕士学位论文,2006
    [72]屈洁.新闻视频内容结构分析研究[D].西安：西安电子科技大学学位论文,2010

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700