Research on the Theory and Application of Scalable Video Coding and Transmission
Abstract
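Advances in scalable video coding have created new opportunities for video applications delivered over networks. As networks grow and the services they carry diversify, the inherent heterogeneity of the Internet and the dynamic variation of its performance become increasingly pronounced; at the same time, the terminals joining the network are becoming more diverse. All of this challenges network-based video applications, and conventional video coding schemes cannot meet the challenge. A scalable video coding scheme generates a bit stream with multiple truncation points, so a suitable sub-stream can be selected for transmission according to the application, which provides adaptability to heterogeneous networks and diverse terminals; it has therefore attracted wide attention from researchers in China and abroad. This thesis first analyzes the problems facing existing video transmission systems and studies existing scalable video coding, stereoscopic video coding, rate control, and network bandwidth measurement techniques, and then proposes solutions to those problems. The main contributions are as follows.
     1. An implementation of fully scalable video coding is proposed. It provides layered coding of video in the temporal, quality, and spatial dimensions, and uses the average motion vector to judge the degree of motion in a video sequence and select a corresponding group-of-pictures (GOP) structure, which gives it considerable practical value.
     2. A scalable stereoscopic video coding scheme based on H.264/MVC is proposed. Taking the binocular suppression effect fully into account, and building on the existing stereoscopic video coding standard H.264/MVC, the scheme encodes the left view with high-quality single-layer coding and the right view with layered coding, achieving scalable coding of stereoscopic video while preserving the stereoscopic effect as far as possible.
     3. A novel bit-stream selection scheme suited to H.264/SVC is proposed. The scheme computes a temporal importance coefficient for each bit-stream segment from the temporal layer it belongs to and the dependencies among temporal layers, and a quality importance coefficient from the different frequency coefficients the segment contains. From these it derives the overall importance coefficient of the segment, and finally selects a specific combination of segments according to the target bit rate and the ranking by importance coefficient. Experimental results show that the proposed scheme has low computational cost and improves on the basic H.264/SVC extraction process, especially at high bit rates.
     4. To address the shortcomings of existing available-bandwidth measurement mechanisms, a fast and accurate available-bandwidth measurement scheme is proposed. The scheme obtains the network bandwidth of the previous time interval by measuring the time a segment of the media stream takes to travel from sender to receiver, and uses a prediction model based on the Gaussian distribution to predict the bandwidth at the next moment. Experimental results show that the scheme predicts the bandwidth at the next moment quickly and accurately without affecting the ongoing media transmission.
     5. A multimedia sharing scheme suited to home M2M networks is proposed. The scheme is based on scalable video coding to accommodate differences among networks and terminals, uses the home gateway as the management center of the whole home network, and applies the bandwidth measurement and prediction mechanism proposed in this thesis to predict the currently available network bandwidth. Experimental results show that the scheme solves the problems caused by network heterogeneity and terminal diversity and enables any media information to be shared among the devices within a home M2M network.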
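The selection step in contribution 3 (rank bit-stream segments by importance, then pick a combination that fits the target bit rate) can be illustrated with a minimal sketch. It is not the thesis's algorithm: the `Segment` type, the `select_segments` function, the example importance values, and the rule of stopping at the first segment that does not fit are all assumptions made for illustration, and the importance coefficients are taken as given rather than derived from temporal layers or frequency coefficients.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """Hypothetical stand-in for one bit-stream segment."""
    name: str
    size_bits: int
    importance: float  # assumed to be precomputed; the formula is not given here


def select_segments(segments, budget_bits):
    """Greedy selection: take segments in descending importance until the
    bit budget (target bit rate x duration) would be exceeded."""
    chosen = []
    used = 0
    for seg in sorted(segments, key=lambda s: s.importance, reverse=True):
        if used + seg.size_bits > budget_bits:
            # Assumption: stop at the first segment that does not fit, so a
            # less important segment is never kept without the ones above it.
            break
        chosen.append(seg)
        used += seg.size_bits
    return chosen, used


segments = [
    Segment("base", 4000, 1.0),
    Segment("temporal-1", 3000, 0.6),
    Segment("quality-1", 5000, 0.4),
    Segment("quality-2", 2000, 0.2),
]
chosen, used = select_segments(segments, budget_bits=8000)
print([s.name for s in chosen], used)
```

Stopping at the first segment that does not fit is one simple way to avoid selecting an enhancement segment whose more important predecessors were dropped; a real extractor would need to check the actual layer dependencies.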
The development of scalable video coding technology has brought new opportunities for video applications based on network transmission. With the continuous expansion of network scale and the growing diversity of services provided over the network, the inherent heterogeneity of the network becomes more prominent; at the same time, more and more devices with different hardware and software configurations join the network. All of this poses great challenges to traditional video transmission systems. In this situation, scalable video coding (SVC), in which the generated bit stream can be divided into a base layer and enhancement layers, has attracted the attention of many scholars at home and abroad. With SVC, the appropriate multimedia data can be selected according to the network conditions and device configuration, so SVC can adapt to different machines and networks. This thesis first analyzes the problems of current video transmission systems, then studies SVC technology, stereo video coding technology, rate control technology, and bandwidth measurement technology, and finally gives solutions to the problems identified. The main innovations of this thesis are summarized as follows.
     1. A fully scalable video coding system is realized. In the system, video can be hierarchically coded in the temporal, spatial, and fidelity domains; in addition, average motion vectors are used to determine the degree of motion of a video sequence, and a suitable GOP structure is then adopted to improve coding efficiency.
     2. A scalable stereo video coding scheme based on H.264/MVC is proposed. The scheme takes full account of binocular suppression theory: it encodes the left view of a stereoscopic video in a single layer and the right view in multiple layers, generating a scalable stereoscopic bit stream while maintaining stereoscopic perception as well as possible.
     3. A novel bit-stream extraction strategy suitable for H.264/SVC is presented. The strategy calculates a weighting coefficient for each bit-stream slice according to its temporal level and the DCT coefficients it contains, then selects a particular combination of bit-stream slices according to the weighting coefficients and the target bit rate. Experimental results show that the proposed scheme achieves a quality improvement over the existing JSVM basic extractor, especially at high bit rates.
     4. A fast and accurate available-bandwidth measurement and prediction scheme is proposed. The scheme obtains the available network bandwidth by measuring the time a piece of media data takes to be transmitted from server to client, then predicts the available bandwidth at the next moment according to a normal-distribution prediction model. Experimental results show that the proposed scheme predicts available bandwidth quickly and accurately without influencing the transmission of the streaming media.
     5. A video sharing solution for home M2M networks is proposed. The solution is based on SVC, adopts the home gateway as the home management center, and employs the available-bandwidth prediction scheme proposed in Chapter 5 to obtain the currently available bandwidth. The presented multimedia sharing system effectively solves the problems brought by diverse terminals and heterogeneous networks, and realizes smooth sharing of multimedia in home M2M networks.
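The measurement and prediction idea in item 4 can be sketched as follows. The abstract does not specify the prediction model beyond saying it is based on the normal distribution, so everything here is an assumption for illustration: the function names, the use of the sample mean and standard deviation of recent measurements, the conservative `mean - k * std` estimate, and the premise that sender and receiver timestamps are comparable.

```python
import statistics


def measured_bandwidth_bps(payload_bytes, send_time_s, receive_time_s):
    """Bandwidth of the previous interval, from how long one piece of media
    data took to travel from sender to receiver (clocks assumed comparable)."""
    elapsed = receive_time_s - send_time_s
    if elapsed <= 0:
        raise ValueError("receive time must be later than send time")
    return payload_bytes * 8 / elapsed


def predict_next_bandwidth_bps(samples_bps, k=1.0):
    """Hypothetical predictor: treat recent measurements as normally
    distributed and return a conservative estimate, mean - k * std."""
    if not samples_bps:
        raise ValueError("at least one measurement is required")
    mean = statistics.fmean(samples_bps)
    std = statistics.pstdev(samples_bps)
    return max(0.0, mean - k * std)


# 125000 bytes delivered in 1 second corresponds to 1 Mbit/s.
sample = measured_bandwidth_bps(125_000, send_time_s=0.0, receive_time_s=1.0)
history = [800_000.0, 1_200_000.0]
print(sample, predict_next_bandwidth_bps(history))
```

Because the measurement reuses media data that is being sent anyway, a scheme of this kind needs no extra probe traffic, which is consistent with the claim that the media transmission is not disturbed.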
