精细可分级视频编码技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着视频编码技术的发展和一系列视频编码标准的相继出台,视频编码技术在很多领域得到了应用,然而传统的面向存储领域的视频编码技术主要解决的是如何提高压缩编码效率。随着近年来互联网和多媒体技术的不断发展,人们对基于网络的视频传输需求越来越广泛,如何产生适应网络传输的码流就成为了视频编码中的关键问题。因此,视频编码的目的已经由以前单纯的提高压缩效率转向提高压缩效率与适应网络传输并重。精细可分级视频编码(FGS),因为其能有效地解决网络视频中存在的大数据量、带宽变化和丢包等问题,被视为网络环境下的一种很有前途的视频编码方案。
     本文的工作主要是针对FGS编码效率较低的缺点,在保证其精细可分级特性的前提下,研究提高其编码效率的方法。论文首先介绍了视频图像压缩的基本思想和关键技术,并简述了视频编码标准的发展历程和各种标准的特点,特别是H.264中的一些关键技术。之后总结了各种可分级编码的原理、方法和改进策略,尤其是精细可分级编码方法和针对它的各种改进方案。在此基础上本文将H.264编码与MPEG-4 FGS编码相结合,提出了一种新的FGS编码方法——基于H.264的FGS编码,它以H.264编码标准作为方案的基本层,MPEG-4 FGS的增强层作为方案的增强层,使它既具有了H.264的高压缩率,又具有了FGS的精细可分级特性。实验证明,与MPEG-4 FGS相比,这种方法明显的提高了编码效率,改善了视频图像的主客观质量。之后,本文又提出了两种新的FGS增强算法——水环扫描算法和肤色选择增强算法,分别对水环中心区域和肤色区域进行增强。实验表明,水环扫描算法能在不增加编码复杂度和不降低整幅图像的质量的前提下,明显的提高水环中心区域的质量,而肤色选项增强算法则能在适当降低整帧图像质量的条件下,明显提升肤色区域的视觉质量。
While developing of video coding technology and a series of video coding standards coming on one after the other, video coding technology has been widely used in many fields. However, the major task of the traditional video coding technologies for storage-based applications is how to improve the coding efficiency. With the rapid development of Internet and multimedia technologies, the demand of video communication based on network has been more and more aboard, and how to produce a video streaming which can adapt to communication has been a key question for video coding technology, so the aim of video coding has changed from improving coding efficiency to paying equal attention to coding efficiency and communication. Fine granular scalability coding (FGS) can effective solve the problems of video streaming on network, such as great data quantity, change of band width and lose of packages, and hence is regarded as a promising video coding scheme in the Internet scenario.
     Because of the low coding efficiency of FGS, this paper’s work is looking for a method to improve coding efficiency while keeping scalable capability. This paper firstly introduced the principle of video compressing, key technology of video coding, development of coding standards and their essential characters. The introduction of essential characters in H.264 was in particular. This paper also introduced each scalable coding method, summarized theirs principles, methods and improvements, and especially introduced fine granular scalability coding and its improvements. In this foundation, this paper combined H.264 and MPEG-4 FGS, and proposed a new method of FGS --- FGS video coding base on H.264. This method use H.264 as the base layer and the enhancement layer of MPEG-4 FGS as the enhancement layer, so it have more high coding efficiency while keeping scalable capability. Experimental results show that, compare with MPEG-4 FGS, this method can obviously improve the efficiency of coding and improve the subjective and objective picture quality. Afterwards, this paper introduced two enhanced methods for FGS---water ring scanning arithmetic and skin color selective enhancement arithmetic, respective enhanced to the centre area of water ring and the area of skin color. The experiments proves that, the methods of water ring arithmetic can significantly improve the picture quality in the centre area of water ring while keeping the efficiency of the whole picture and coding complexity, and the method of skin color selective enhancement arithmetic can significantly improve the skin color area’s picture quality while propriety reducing the whole picture’s coding efficiency.
引文
[1].Dapeng Wu, Yiwei Thomas, Wenwu Zhu et al. Streaming Video over the Internet: Approaches and Directions[J]. IEEE Transactions on Circuits and Systems for Video Technology, Vol.11, 2001, 11 (3):282-300.
    [2].Gregory J. Conklin, Gary S. Greenbaum et al. Video coding for streaming media delivery on the Internet[J]. IEEE Transactions on Circuits and Systems for Video Technology, Vol.11, 2001, 11 (3):269-281.
    [3].姜恩华,姜文彬,基于组播的视频信息传输技术研究[J]淮北煤炭师范学院学报, Vol.25, No.1, Mar.2004:50-53.
    [4].郭庆琳,樊孝忠,网络广播的实现及其瓶颈问题的解决[J]北京广播学院学报(自然科学版), 2003.6:11-16.
    [5].王原丽,张杰基于码流切换的SP/SI帧技术研究[J],武汉理工大学学报(信息与管理工程版), Vol. 28, No. 7, Jul. 2006:79-82.
    [6].Vivek K Goyal, Multiple Description Coding: Compression Meets the Network[J], IEEE Signal Processing Magazine, 2001, 9:74-93.
    [7].Ishfaq Ahmad, Xiaohui Wei, Yu Sun et al. Video Transcoding: An Overview of Various Techniques and Research Issues[J]. IEEE Transactions on Multimedia, Vol.7, 2005, 7 (5):793-804.
    [8].Dapeng Wu, Y. Thomas Hou, Ya-Qin Zhang, Scalable Video Coding and Transport over Broadband Wireless Networks[Z], http://research. microsoft.com/research/pubs/view.aspx?pubid =874.
    [9].Weiping Li, Fellow, IEEE, Overview of Fine Granularity Scalability in MPEG-4 Video Standard[J], IEEE Transactions on circuits and systems for video technology, Vol.11, 2001, 11 (3):301-317.
    [10].Hayder M. Radha, Mihaela van der Schaar, Yingwei Chen, The MPEG-4 Fine-Grained Scalable Video Coding Method for Multimedia Over IP[J], IEEE Transactions on Multimedia, Vol.3, 2001, 3 (1):53-68.
    [11]. W.P. Li, F. Ling, X.M. Chen, Fine granularity scalability in MPEG-4 for streaming video[C], IEEE International Symposium on Circuits and System, May 28-31, 2000. Geneva, Switzerland.
    [12].R. Kalluri, M. Schaar, Single-Loop Motion-Compensated based Fine-Granular Scalability (MC-FGS) with cross-checked results[C], ISO/IECJTC1/SC29/WG11, M6831, Pisa, Italy, Jan, 2001.
    [13]. M. van der Schaar, H. Radha, Motion-compensation Fine-granular-scalability(MC-FGS) for Wireless Multimedia[J], IEEE Multimedia Signal Processing, Oct. 3, 2001:454-458.
    [14]. Mihaela S. Adaptive Motion Compensation Fine Granular Scalability (AMC-FGS) for Wireless Video[J], IEEE Trans. Circuits Syst. Video Technol, Vol.12, 2002,12(6) :360-371.
    [15].Feng Wu, Shipeng Li, Ya-Qin Zhang, DCT-Prediction based progressive fine granularity scalable coding[C], International Conference on Image Processing (ICIP), Vancouver,2000,3.
    [16].Wu, Feng, Li, Shi-peng, Zhang, Ya-qin. A framework for efficient progressive fine granularity scalable video coding[J]. IEEE Transactions on Circuit and Systems for Video Technology, Vol.11, 2001, 11(1):332-344.
    [17].吴枫,李世鹏,张亚勤,渐进、精细的可伸缩性视频编码[J]计算机学报Vol. 23 No. 12 Dec. 2000:1276-1282.
    [18].孙晓艳,高文,吴枫,李世鹏,张亚勤基于宏块的渐进、精细可伸缩的视频编码[J],软件学报Vol.13, No.11 2002:2134-2141.
    [19].Xiaoyan Sun, Feng Wu, Shipeng Li et al. Macroblock-based progressive fine granularity scalable coding[J], International Conference on Multimedia and Expo, Tokyo, Japan, Aug. 2001.
    [20].Peng W S, Chen Y K. Mode-adaptive fine granularity scalability[C], ICIP 2001, Greece, Oct. 2001: 993-996.
    [21]. Mihaela van der Schaar, Hayder Radha, A Hybrid Temporal-SNR Fine-Granular Scalability for Internet Video[J] IEEE Transactions on circuits and systems for video technology, Vol. 11, No. 3, March 2001:318-331.
    [22]. M. Vander Schaar, H. Radha, and Y. Chen, An all FGS solution for hybrid temporal-SNR scalability[C]. ISO/IEC JTC1/SC29/WG11, MPEG99/m5552, Maui meeting, 1999.
    [23].X. Sun, F. Wu, S. Li, W. Gao, Y-Q. Zhang, Macroblock-based temporal-SNR progressive fine granularity scalable video coding[J], IEEE International Conference on Image Processing (ICIP),1025-1028, Thessaloniki, Greece, October, 2001:1025-1028.
    [24]. Van der Schaar M, Radha H. A novel MPEG-4 based hybrid temporal-SNR scalability for Internet video[C]. In : Proceedings of ICIP2000 , Vancouver , British Columbia , Canada ,2000: 548-551.
    [25].孙晓艳,高文,吴枫,李世鹏,基于宏块的具有时域和SNR精细可伸缩性的视频编码[J].计算机学报, Vol. 26 No. 3, Mar. 2003:346-352.
    [26]. Qi Wang, Feng Wu, Shipeng Li, Yuzhuo Zhong, Ya-Qin Zhang, Fine-granularity spatially scalable video coding[C], Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 3, May 2001:1801-1804.
    [27]. Shih-Ta Hsiang, Highly scalable subband/wavelet image and video coding[D], Rensselaer Polytechnic Institute, Troy, New York, January 2002.
    [28]. Marcellin M W, Gormish M, Bilgin A, et al. An overview of JPEG2000 [C]. Data Compression Conference, Snowbird, USA, 2000.
    [29].MPEG-4 Video Group. MPEG-4 video verification model version 18.0[S]. ISO/IEC/JTC/SC29/WG11, N3908, Pisa, January 2001.
    [30].沈兰荪,卓力编著,小波编码与网络视频传输[M],科学出版社, 2005.4.
    [31].姚庆栋,毕厚杰等编著,图像编码基础(第3版)[M],清华大学出版社, 2006.
    [32].张春田,苏育挺等编著,数字图像压缩编码[M],清华大学出版社, 2004.
    [33].沈兰荪等著,视频编码与低速率传输[M],电子工业出版社, 2001.
    [34].刘峰编著,视频图象编码技术及国际标准[M],北京邮电大学出版社, 2005.
    [35].Iain E.G.Richardson著,欧阳合,韩军译,视频编解码器设计:开发图像与视频压缩系统[M],中国环境科学出版社, 2005.
    [36].ITU-T Recommendation H.261, video Coding for Audio visual Service at P×64Kbit/s[S], 1993.
    [37]. ISO/IEC IS 11172: Information technology一coding of moving pictures and associated audio for digital storage media at up to about LS Mbps(MPEG-1) [S], 1993.
    [38].杨品钟玉琢等译, MPEG运动图象压缩编码标准(ISO/IEC 11172)[S],机械工业出版社, 1995.
    [39].ISO/IEC 13818, ITU-T Draft Recommendation H.262 information Technology Generic Coding of Moving Pictures and Associated Audio[S], Jul 1995.
    [40].钟玉琢乔秉贵等译,运动图象及其伴音通用编码国际标准—MPEG-2[S],清华大学出版社, 1997.
    [41].ITU-T Recommendation H.263, Video Coding for Low Bitrate communication[S], 1996.
    [42].ITU-T Recommendation H.263 Version 2, Video Coding for Low Bitrate communication[S], 1998.
    [43].ITU-T Recommendation H.263 Version 3, Video Coding for Low Bitrate communication[S], 2000.
    [44].ISO/IEC 1S 14496-2: Information technology一coding of audio-visual objects一part2: Visual(MPEG-4 Video) [S], 1999.
    [45].钟玉琢,王琪,贺玉文编著,基于对象的多媒体数据压缩编码国际标准—MPEG-4及其校验模型[M],科学出版社, 2000.
    [46].Thomas Wiegand, Gary J. Sullivan, Gisle Bjontegaard, and Ajay Luthra, Overview of the H.264 / AVC Video Coding Standard[J], IEEE Transactions on circuits and systems for video technology, Vol.11, July, 2003:1-19.
    [47].Atul Puria, Xuemin Chenb, Ajay Luthra, Video coding using the H.264/MPEG-4 AVC compression standard[C], Signal Processing: Image Communication 19 (2004):793-849.
    [48].H.264 / MPEG-4 Part 10 White Paper[Z], http://www.vcodex.com..
    [49].ISO/IEC International Standard 14496-10:2003, Information Technology–Coding of Audiovisual Objects– Part 10: Advanced Video Coding[S], 2002.
    [50].毕厚杰主编,新一代视频压缩编码标准—H.264/AVC[M],人民邮电出版社, 2005.5.
    [51].余兆明等编著,图像编码标准H.264技术[M],人民邮电出版社, 2005.3.
    [52].ITU-T Study Group 16, ITU-T Study Group 16: Study Period 2005-2008[Z]. http://www.itu.int/ITU-T/studygroups/com16/sg16-q6.html.
    [53].Peter Amon, Klaus Illgner, Jürgen Pandel, SNR Scalable Layered Video Coding[Z], http://amp.ece.cmu.edu/packetvideo2002/papers/59-ethpsnsons.pdf.
    [54]. F. Ling, W. P Li and H. Q. Sun, Bitplane coding of DCT coefficients for image and video compression[C], Proceeding of SPIE VCIP'99, San Jose, Jan.25-27,1999: 25-27.
    [55]. Kim, S.H. and Ho, Y.S, HVS-Based Frequency Weighting for Fine Granular Scalability[C].Proc. Information and Communication Technologies (2003), 127–131.
    [56]. Seung-Hwan Kim and Yo-Sung Ho, Frequency Weighting and Selective Enhancement for MPEG-4 Scalable Video Coding[C], Advances in multimedia Information Processing PCM 2004:159-166.
    [57]. Zheng Ruo-bin. Scalable multiple description coding and distributed video streaming over 3g mobile networks [C]. Ontario, Canada, Waterloo, 2003.
    [58].赵海涛,王养利,闫凤霞,杨艳梅.多描述可分级编码的研究进展[J],计算机工程与设计Vol. 26, No. 8, 2005.8:2118-2120.
    [59]. Schwarz H, Marpe D, Schierl T, Wiegand T, Combined scalability support for the scalable extension of H.264/AVC[C] Multimedia and Expo, 2005. ICME 2005, IEEE International Conference.
    [60]. ITU-T and ISO/IEC JTC1, Scalable Video Coding– Working Draft 1[S], JVT-N020, Jan, 2005.
    [61]. ITU-T and ISO/IEC JTC1, Joint Draft 10 of SVC Amendment[S], JVT-W201, Apri, 2007
    [62].Gwang Hoon Park and Kyuheon Kim, Adaptive Scanning Method for Fine Granularity Scalable Video Coding[Z], http://etrij.etri.re.kr/Cyber/servlet/GetFile?fileid=SPF-1092278631360
    [63].?ukasz B?aszak, Marek Domański, Spiral Scan in Video Compression[Z], http://www.arehna.di.uoa.gr/Eusipco2005/defevent/papers/cr1698.pdf.
    [64].Joint Video Team, Water Ring Scan method for H.26L based FGS[C]. ISO/IEC JTC1/SC29/WG11 and ITU-TSG16 Q.6, 2002.
    [65].M Schaar, Y T Lin, Content-based selective enhancement for streaming video[J]. IEEE Transactions on Multimedia, Vol.11, March 2001:977-980.
    [66].陈锻生刘政凯肤色检测技术综述[J]计算机学报VOl.29 NO.2, 2006.2:194-207.
    [67].Douglas Chai, King N Ngan, Automatic Face Location for Videophone Images[C], IEEE TENCON-Digital Signal Processing Application, 1996.
    [68]. Xie, L.T.Chia, B.S.Lee, Optimal bit allocation for FGS coding[C], IEE Electronics Letter, vol. 40: no. 16, 2004.
    [69].史翠竹,余松煜,王嘉, FGS视频流的码率分配算法研究[J],计算机工程与应用, 2004.14:49-52.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700