基于H.264的多视点立体视频关键技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
多视点立体视频的技术发展与图像获取、压缩编码、网络传输及立体显示等关键技术的发展密不可分。多视点立体视频本身具有合成困难、数据量大、难于存储和传输的特点,由这些特点引起的多视点立体视频预处理、压缩、传输等问题一直未得到妥善解决,这些问题是多视点立体视频产业化、实用化的主要瓶颈。本文通过对基于多视点的立体视频原理的深入研究,给出了一个基于H.264的多视点立体视频处理系统的整体解决方案并对其中的压缩编码、网络传输、立体显示等关键的技术进行了研究与实现。
     首先,本文提出了一套基于H.264的多视点立体视频的编码、传输、显示系统框架。在此框架上对所提出的的算法、协议进行了实现与验证。
     其次,基于JMVC测试模型,把2D视频压缩的搜索算法S-UMHexagon Search应用到了3D视频的压缩编码中,并对此算法进行了优化,从而大大提高了帧间预测时的运动搜索速率;此外,还在模式选择部分采用了快速模式选择,提前判断最优模式,降低了编码复杂度,大幅度提高了编码速率。通过以上的优化改进,压缩编码的时间降为原来的30%左右。
     然后,本文首次提出了基于H.264的多视点立体视频的传输协议:多视点数据管道协议(MVDC:Multi-View Data Conduit),并对协议进行了设计与实现。实际的网络传输表明,本文所提出的协议能够实现对多视点视频数据的实时、可靠传输,并能够较好的实现多视点视频数据的同步。
     最后,基于国家科技重大专项新一代宽带无线移动通信网―终端共性技术开放式研究(多视点自由立体移动终端显示)‖项目研制的立体显示终端,提出了小尺寸屏幕上的立体视频合成算法,设计了基于H.264的8视点立体视频的播放器,在Windows Mobile的无线环境和PC上设计并实现了多视点立体视频的播放。利用立体视频播放器,用户可以在具有立体显示屏幕的手机和PC上方便地观看到多视点立体视频。
The Multi-view stereo video technology has developed to a new level with the development of the stereo video capture technologies, efficient multi-view compression algorithms, increasing bandwidth of communication links and three-dimensional video display techniques. However,multi-view stereo video has some new characteristics, such as hard to synthesize, massive volumes of data, difficult to store and transmit. These new characteristics cause problems in pre-processing, compression and transmission of multi-view stereo video, and these problems have not yet been well-solved and blocked the industrialization of multi-view stereo video technology. Based on the deep research on the multi-view stereo video technology, this paper envisages a scenario of stereo video application,put forward an overall solution of multi-view stereo video processing system based on H.264 and implement some key technologies such as compression algorithms, network transmission, 3D display.
     Firstly, the system of multi-view stereo video transmission and display based on H.264 is proposed, and all the algorithms and protocols are verified on this system.
     Secondly, based on the JMVC test model, this paper apply 2D video compression algorithm - the S-UMHexagon Search in 3D video for compressing and encoding. This paper also optimizes the multi-view stereo video coding algorithm. By this way, the speed of motion search is increasedgreatly; the coding efficiency is improved significantly by decreasing the coding complexity and determining the optimal mode in advance. As the result, the encoding time is about 30 percent of the original.
     Thirdly, this paper put forward a transmission protocol for multi-view stereo video transmitting over IP network for the first time, so called Multi-View Data Conduit protocol (MVDC) which is an efficient network transmission technology for stereoscopic video. After the implementation of MVDC protocol and experimental results are analyzed and verified,this protocol can achieve real-time and reliable transmission of multi-view video data,and synchronization of multi-view video data at the same time.
     Finally, based on the National major sicence and technology project, 8-view stereo video player on small-size screen is achieved for the first time, and display of the multi-view stereo video based on Windows Mobile’s wireless platform has also been implemented. Through the multi-view stereo video player on Windows Mobile’s wireless platform, users can easily watch the stereo video on the three-dimensional screen.
引文
[1]V.S.Nalwa. A Guided Tour of ComputerVision: Addson-Wesley, 1993
    [2]S.Aljoscha,M.C.Chen, 3DAV exploration of video-based rendering technology in MPEG, IEEE Transaction on Circuit and Systems for Video Technology ,2004,14(3):348~356
    [3]T.Kanade, P.Rander, P.Narayanan, Virtualized reality: Constructing virtual worlds from real scenes, IEEE Multimedia, Immersive Telepresence, 1997, P34~47
    [4]S. Adedoyin, W.A.C. Fernando, A.Aggoun.A Joint Motion & Disparity Motion Estimation Technique for 3DIntegral Video Compression Using Evolutionary Strategy, IEEE Transactions on Consumer Electronics,2007,53(2)
    [5]C.T.E.R,Hewage, S.Worrall, S.Dogan,etc, STEREOSCOPIC TV OVER IP, 2007,24(6)
    [6]T.Okishi, Three-Dimensional imaging techniques, Academic Press.1976.
    [7] www.stereo3d.com
    [8]G.P.Li, Y.He, A novel multi-view video coding scheme based on H.264, ICICS-PCM, 2003, 493~497
    [9]W.Woontack, O.Antonio, Overlapped block disparity compensation with adaptive windows for stereo image coding, IEEE Trans.on Circuits and Systems for Video Technology, 2000,10(2):194~200
    [10]G.P.Michael, Data compression of stereopairs, IEEE Trans.on Communications, 1992, 40(4):684~696
    [11]ISO/IEC JTC1/SC29/WG11, Call for evidence on multi~view video coding[S], MPEG document N6720, Palma de Mallorca, 2004.
    [12]ISO/IEC JTC1/SC29/WG11,Survey of algorithms for multi-view video coding[S],MPEG Document N6909,Hong Kong,2005.
    [13] J. Konrad, Visual Communications of Tomorrow: Natural, Efficient and Flexible, IEEE CommunicationsMagazine, 2001, 39: 126~133
    [14]陈嘉健,李崇荣,基于下一代互联网的立体视频传输,大连理工大学学报,2005(1):219~224
    [15] Iain E.G.Richardson著,欧阳合,韩军译,H.264和MPEG-4视频压缩,长沙:国防科技大学出版社,2004,112~130
    [16]Baoliang Wang, Chunping Hou, Zhen Yuan, Transmission Protocol for Stereoscopic Video Based on H.264,Broadband Multimedia Systems and Broadcasting (BMSB), 2010 IEEE International Symposium on ,2010,1~5
    [17]Baoliang Wang,Chunping Hou, Yi Wei,etc, Implementation of simple UMHexagon search algorithm in stereoscopic video encoding, Broadband Multimedia Systems and Broadcasting (BMSB), 2010 IEEE International Symposium on,2010
    [18] ITU-T and ISO/IEC JTC1, Draft ITU-T recommendation and final draft standard of joint video specification (ITU-T Rec. H.264/ISO/IEC 14496-10 AVC) in Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT G050, 2003.
    [19]T.Wiegard, G.J. Sullivan,G.B. Bjontedaara,etc,Overview of the H.264/AVC video coding standard,IEEE Trans. C.S.V.T,2003:560~570
    [20]毕厚杰,新一代视频压缩编码标准H.264/AVC,北京:人民邮电出版社,2005
    [21]Special Issue on the H.264/AVC Video Coding Standard, IEEE Trans. CircuitsSyst, Video Technol, 2003,
    [22]W.A. IJsselsteijn, P.J.H. Seuntens, L.M.J. Meesters, State-of-the-art in human factors and quality issues of stereoscopic broadcast television, DeliverableATTEST/WP5/01, Aug. 2002
    [23]马宇峰,魏维,视频通信中的错误隐藏技术,北京:国防工业出版社, 2007:81~95
    [24]霍俊彦,提高多视点视频编码效率的技术研究:[博士学位论文],西安:西安电子科技大学,2008
    [25]陈斌,李耀华,朱祥华,流媒体系统,中国数据通信,2002,89~91
    [26]M.E.Lukacs, Predictive coding of multi-viewpoint image sets, Proc.IEEE Int.Conf.Acoust., Speech, Signal Processing,1986,1:521~524
    [27]ISO/IEC 13 818-2, AMD 3, MPEG-2 multiview profile,ISO/IEC JTC1/SC29/WG11, document no.N1366,1996.
    [28]S.Sethuraman, Stereoscopic image sequence compression using multi-resolution andquadtree decomposition based disparity and motion-adaptive segmentation, Carnegie Mellon University, 1996
    [29]W.Woontack, Rate-distortion based dependent coding for stereo images and video: disparity estimation and dependent bit allocation, Dept.of Electrical Engineering, Faculty of the graduate school, University of Southern California, 1999
    [30]W.X.Yang, N.K.Ngan, MPEG-4 based stereoscopic video sequences encoder, IEEE International Conference on Acoustics, Speech and Signal Processing, 2004, 3: 741~744
    [31]T.Chiang, Y.Q. Zhang, A new rate control scheme using quadratic rate distortionmodel, IEEE Trans.Circuits Syst.Video Technol, 1997, 7:246-250
    [32]Y.Luo, Z Y.Zhang, P.An, Stereo video coding based on frame estimation andinterpolation, IEEE Transaction on Broadcasting, 2003, 49(1):14~21.
    [33] Christine Guillemot, Fernando Pereira, Luis Torres, et al.Distributed Monoview and Multiview Video Coding. IEEE Signal Processing Magazine. 2007.pp.67-76.
    [34]F.Ulrich, K.Andre, H.264/AVC compatible coding of dynamic light fields using transposed picture ordering, 2005 European Signal Processing Conference, Antalya, Turkey, 2005.
    [35]X.Guo,Q.M.Huang, Multiview video coding based on global motion model,Proceeding of pacific-rim conference on multimedia,2004,665 672
    [36]M.Waldowski, A new segmentation algorithm for videophone application based on stereo imagepairs,IEEE Transactions on Communicaitons,1991,39(12):1856~1868
    [37]M.Ziegler and S.Panis, An object-based stereoscopic coder, Intl.workshop on Stereoscopic and Three Dimensional Imaging, 1995, 40~45
    [38]李世平,基于H.264的立体视频编码方法,计算机工程与应用,2005
    [39]Jeong Hyu Yang, A Motion Vector Prediction Method For Multi-View Video Coding, IEEE computer society, 2008
    [40]Sang Heon Lee, Sang Hwa Lee, Nam Ik Cho, A Motion Vector Prediction Method for Multi-View Video Coding, IEEE Trans. Circuits and Systems for Video Technology.2008
    [41]Alexis Michael Tourapis, Hye-Yeon Cheong, Pankaj Topiwala. Fast ME in the JM reference software, Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6). Poznań, PL, 2005
    [42]Zhibo Chen, Peng Zhou, Yun He. Fast Motion Estimation for JVT,Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6). Pattaya II, Thailand, 2003
    [43]于飞,H.264运动估计算法分析,计算机技术与发展,2009, 19(4)
    [44]王彦杰,H.264运动估计算法优化研究,信息化研究,2009, 35(1)
    [45]何涌,刘桂华,H.264帧间模式选择优化算法,数字电视与数字视频,2007,31(12)
    [46]张淑芳,李华,侯玲等.基于H.264的快速帧间模式选择算法,计算机应用研究. 2008. 25(1)285~288.
    [47]Andy C. Yu, Efficient Block-Size Selection Algorithm For Inter-frame Coding In H.264/MPEG-4 AVC, IEEE Communications Magazine, 2004,9:126~133
    [48]沈方,支琤.H.264标准的帧间模式选择快速算法.微计算机信息.2006,22(6-1)
    [49]杨虹,基于H.264标准的立体视频压缩编码研究:[硕士学位论文],吉林,吉林大学,2007
    [50]王世刚,基于H.264标准的双目立体视频压缩编码与实现,中国体视学与图像分析,2008,13(1)
    [51]Xiaoquan Yi, Jun Zhang, Nam Ling,Improved and simplified fast motion estimation for JM. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6). Poznan, Poland. 2005.
    [52]白茂生,田裕鹏,田晓冬,基于UMHexagonS的快速帧间模式选择算法,计算机应用,2007, 27(9)
    [53]杨育红,快速运动估计UMHexagonS算法的探讨与改进,计算机工程与应用,2006
    [54] Byeungwoo Jeon,Fast mode decision for H.264,Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG,JVT-J033,2003
    [55]刘玉珍,H.264帧间预测编码块匹配模式的研究,信息技术,2007,10
    [56]陆璐,适用于H .264的快速模式选择算法,通信学报,2006, 27(7)
    [57]张惠,新一代视频编码标准H.264/AVC的关键技术研究,现代电子技术,2009
    [58]Dapeng Wu, Yiwei Thoms Hou, Ya-Qin Zhang, Transporting real-time video over the Internet: challenges and approaches,Proceedings of theIEEE, 2000, 88(12): 1855 ~1877
    [59]S. Wenger, H.264/AVC over IP, IEEE Trans. Circuits Syst. Video Technol, 2003, 13: 645–656
    [60]ShipingLi, Mei YU,Gangyi JIANG,Tae-Young CHOI,Yong-Deak KIM,―Approaches To H.264-Based Stereoscopic Video Coding[C]‖, Proceedings of the third International Conference on Image and Graphics(ICIG’04),2004 ,9
    [61]叶树华,高志红,网络编程实用教程,北京:人民邮电出版社,2007,134~179
    [62]邓全良,WinSock网络程序设计,北京:中国铁道出版社,2002,53~57
    [63]殷肖川,姬伟锋,陈靖等,网络编程与开发技术,西安:西安交通大学出版社,2009,110~111
    [64]Jon C.Snader著,刘江林译,高级TCP/IP编程,北京:中国电力出版社, 2001,50~58
    [65]胡鸣,Windows网络编程技术,北京:科学出版社,2008,154~173
    [66]夏超,一种基H.264的快速帧间预测算法,微计算机应用,2007, 28(10)
    [67]郝文化,Windows多线程编程技术与实例,北京:中国水利水电出版社,9~11
    [68]顾兵,XML实用技术教程,北京:清华大学出版社,2007.09,86~102
    [69]吴洁,XML应用教程,北京:清华大学出版社,2008.08,167~186
    [70]Sas Jacobs,XML基础教程入门、DOM、Ajax与Flash,北京:人民邮电出版社,2007.07,79~96
    [71]姜浩,立体视频显示及编码相关技术研究,西南交通大学硕士学位论文
    [72] http://wenku.baidu.com/view/c55c79fc700abb68a982fb5a.html
    [73] http://www.dzsc.com/news/html/2008-3-5/68632.html
    [74]田波民。电子显示(Electronic Display)[M]。清华大学出版社,2001
    [75]侯春萍,阿陆南,俞斯乐,立体成像系统数学模型和视差控制方法[J],天津大学学报,2005,38(5):1~6
    [76]Neil A. Dodgson,Autostereoscopic 3D Displays[J],Published by the IEEE Computer Society,2005,38(8):31~36
    [77]Cees van Berkela and John A Clarke , CHARACTERISATION AND OPTIMISATION OF 3D-LCD MODULE DESIGN[A],Proc SPIE,Philips Research Laboratories UK,1997,30(12):179~187
    [78]Jung-Young Son and Bahram Javidi,Three-Dimensional Imaging Methods Based on Multiview Images[J],JOURNAL OF DISPLAY TECHNOLOGY,2005(9),1(1):130~133
    [79]宋晓炜,多视点光栅立体图像合成及多视点立体视频处理研究:[博士学位论文],天津;天津大学,2007
    [80]王元庆,自由立体显示器的应用与现状[J],现代显示,2003(1):39~41
    [81]付永锋,孙建平,多视点自由立体显示器关键技术研究,第二届立体图象技术及其应用(国际)研讨会文集
    [82]Julio Sanchez , Maria P. Canton著.罗骏,等译.Windows图像编程[M],清华大学出版社,2000;
    [83] Microsoft Developer NetWork[M]. MicroSoft . July 2000
    [84]林晓森. 3D手机的未来在3GJ[.]通信世界2009.
    [85]TSAI Yuh-chou, LIN Chia-wen. H.264 error resilience coding based on multihypothesis motion compensated prediction [C]//ICME 2005. IEEE International Conference. Amsterdam: IEEE, 2005:952-955.
    [86]PENG Q, YANG T W, ZHU C Q. Block-based temporal error concealment for video packer using motion vector extrapolation [C]//IEEE Communications, Circuits and Systems and West Sino Expositions. [S.I.]: IEEE, 2002:10-14.
    [87]CHEN Yu, YU Ke-man, LI Jiang, et al. An error concealment algorithm for entire frame loss in video transmission [C]//Picture Coding Symposium. San Francisco:[S.N.], 2004.
    [88]BELFIORE S, GRANGETTO M, MAGLI E, et al. Concealment of whole-frame losses for wireless low bit-rate video based on multiframe optical flow estimation [J].IEEE Transactions on Multimedia, 2005, 2(7), 316-329.
    [89]HUANG Chun-ming, YANG Kai-chao, WANG Jia-shung. Error resilience supporting bi-directional frame recovery for video streaming [C]//ICIP 2004. Singapore:[s.n.],2004: 537-540.
    [90]GIROD B, FARBER N. Feedback-based error control for mobile video transmission [C]//Proceedings of the IEEE [S.I.]; IEEE, 1999:1707-1723.
    [91] ITU-T and ISO/IEC JTC1, Advanced Video Coding for Generic Audiovisual Services, ITU-T Recommendation H.264–ISO/IEC 14496-10 AVC, 2003.
    [92]Services,ITU-T Recommendation H.264–ISO/IEC 14496-10 AVC,2005.MPEG Video subgroup.Introduction to Multi-view Video Coding.73Th MPEG meeting, N7328,July 2005
    [93]Yun He,Jorn Ostermann,Masayuki Tanimoto,et al.Introduction to the Special Section on Multiview Video Coding.IEEE Trans.Circuits and Systems forVideo Technology.2007,17(11):1433-1435.
    [94]霍俊彦;常义林;李明;马彦卓,多视点视频编码的研究现状及其展望通信学报2010(5)
    [95]ITU-T and ISO/IEC JTC1, Advanced Video Coding for Generic Audiovisual Services, ITU-T Recommendation H.264–ISO/IEC 14496-10 AVC, 2005.
    [96]ITU-T and ISO/IEC JTC1, Advanced video coding for generic audio visual services Amendment 1: Support of additional colour spaces and removal of theHigh 4:4:4 Profile, ITU-T Recommendation H.264 (2005)–Amendment 1, 2005
    [97]Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, Joint DraftITU-T Rec.H.264|ISO/IEC 14496-10/Amd.3 Scalable video coding, JVT-X201, July, 2007
    [98]MPEG Video.Description of Exploration Experiments in 3DAV.66th MPEG meeting, W5959, October 2003.
    [99]MPEG Test, Video Subjective test results for the CfP on Multi-view Video Coding.75th MPEG meeting, W7779, January 2006.
    [100]MPEG Video.Description of Core Experiments in MVC.75th MPEG meeting, W7798, January 2006.
    [101]Anthony Vetro, Yeping Su, Hideaki Kimata et al.Joint Multiview Video Model (JMVM) 1.0.JVT-T208.July, 2006.
    [102]Gary J. Sullivan, Pankaj Topiwala, and Ajay Luthra (2007) The H.264/AVC Advanced Video Coding Standard: Overview and Introduction to the Fidelity Range Extensions. Retrieved 2009-10-08.
    [103]ITU-T and ISO/IEC JTC1,Advanced Video Coding for Generic AudiovisualServices,ITU-T Recommendation SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMSInfrastructure of audiovisual services– Coding of movingvideo,2010-03.
    [104]J.R.Ohm, Three dimensional subband coding with motion compensation, IEEETrans on Image Processing, 3(5) pp.559-571, Sep.1994.
    [105]Bernd Girod,Anne Margot Aaron,Shantanu Rane,David Rebollo-Monedero, Distributed Video Coding,Proceeding of the IEEE, Vol.93, No.1, pp. 71-83, January 2005
    [106]Ralf Schafer.Review and Future Directions for 3D-Video.25th PCS Proceedings: Picture Coding Symposium, 2006.
    [107]Yun He,Jorn Ostermann,Masayuki Tanimoto,et al.Introduction to the SpecialSection on Multiview Video Coding.IEEE Trans.Circuits and Systems forVideo Technology.2007,17(11):1433-1435.
    [108]MPEG Video Subgroup.Introduction to Multiview Video Coding.83th MPEGmeeting, W9580, January 2008.
    [109]ITU-R Recommendation BT.500-11.Methodology for the subjective assessment of the quality of television pictures, 2002.
    [110]Jens-Rainer Ohm.Submissions received in CfP 75th MPEG meeting, M12969, January 2006.
    [111]P.Merkle, A.Smolic, K.Muller, and T.Wiegand.Efficient Prediction Structuresfor Multiview Video Coding.IEEE Trans.on Circuits and Systems for Video Technology, 2007, 17 (11):pp.1461-1473.
    [112]G.Bjontegaard Calculation of average PSNR differences between RD-Curves, VCEG-M33, 2001, http://ftp3.itu.ch/av-arch/video-site/0104_Aus/.
    [113]JOHANSON M. Stereoscopic video transmission over the Internet: Internet applications[C]//Proc of the 2nd IEEE Workshop on WIAPP. 2001:12-19
    [114]Joint Video Team of ITU-T VCEG and ISO/IEC MPEG,―On SVC&MVC Random Access and Layer/View Switching‖Doc. JVT-V041, Jan. 2007.
    [115]Joint Video Team of ITU-T VCEG and ISO/IEC MPEG,―Comparative Study of MVC Prediction tructures‖Doc. JVT-V132- Q, Jan, 2007.
    [116]Philipp Merkle, Aljoscha smolic, Karsten Mueller, et al, Statistical Evaluation of Spatialtemporal Prediction for Multi-view Video Coding,Proc.ICOB 2005, Berlin, Germany, October 2005.
    [117]Ulrich Fecker and Andre Kaup, Statistical Analysis of Multi-Reference Block Matching for Dynamic Light Field Coding, VMN2005.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700