基于率失真函数的保证质量的编码方法
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着互联网和数字多媒体技术的迅猛发展,数字视频的应用越来越广泛。经过多年的发展,视频编码技术已经较为成熟,最新的视频编码国际标准H.264/AVC以及我国的AVS在编码效率上都比以往的标准提高了一倍以上。然而,随着技术的持续进步,从VCD到DVD,从标清到高清,人们对视频质量的要求也随之提高。因此,对视频编码中的关键技术进行深入研究,不断进行优化,是非常具有理论和现实意义的。此外,由于视频编码标准不断的推陈出新,在新旧标准之间的转码成为研究热点。在我国,目前被广泛应用的MPEG-2标准与新一代的AVS标准之间的转换,具有极大的应用价值。本文研究视频编码中率失真函数与码率控制等若干关键问题,提出了基于H.264标准的率失真模型以及Two-pass VBR码率分配算法;研究MPEG-2/AVS转码中的码率控制问题,提出了平滑质量的码率控制算法。本文研究的主要内容包括:
     1.基于H.264/AVC标准的率失真函数分析
     本文首先利用信息论和量化理论的知识,着重研究如何应用码率失真理论分析实际编码的码率失真曲线(也称码率失真函数、率失真函数),得到基于H.264/AVC编码标准的率失真模型。具体使用广义高斯分布GGD来描述编码中变换系数的分布,并在H.264/AVC特定编码量化模式下进行推导,最终得到其率失真模型,为平滑的视频质量控制打下基础。
     2.基于率失真模型的码率分配算法
     在得到率失真模型之后,本文进一步研究如何利用该模型,实现保证质量的码率控制方法。为此,本文基于H.264/AVC编码标准,开发了一种全新的Two-pass VBR编码算法。该方法首先通过第一次编码确定视频序列中每一帧图像的率失真函数模型,并利用该模型计算目标码率,再根据目标码率调整量化系数,从而在第二次编码中得到质量恒定的编码结果。实验表明,和原有编码算法相比,这种新方法能够在保证目标码率的前提下提供更稳定的视频质量。
     3. MPEG-2/AVS视频转码中的码率控制算法
     本文还对MPEG-2/AVS视频转码中的码率控制进行了一定研究,得到了一种保证质量平滑的转码方法。首先,利用一种复杂度率失真模型对码流进行分析,根据目标码率计算初始量化系数。接下来,在转码过程中根据原始MPEG-2序列,使用一定的控制算法动态调整量化系数,以达到合理的目标码率。实验表明,本算法能够在保证目标码率的前提下得到较为平滑的视频转码质量,提供更好的主观质量感受。
     这些视频编码与转码的问题既有一定的理论深度和学术价值,同时又对应用有直接的指导意义。
Over the past few decades, digital video compression technique was extensively used in video communication, TV broadcast, video on demand, and digital storage. Several international industry standards have been established, including MPEG-1/2/4, H.263 and H.264. The newest H.264/AVC and AVS video coding standards doubled the coding efficiency relative to the performance of prior standards such as MPEG-2.
     For all video encoders, rate control plays an important role. It directly affects the bit rate and quality of the output bit streams. So it is very important for us to have an in-depth study on the key techniques used in rate control and improve the coding efficiency of the new coding schemes. In this paper, we present a two-pass variable bit rate (VBR) coding algorithm for H.264/AVC, using a novel R-D model. The model is developed based on the statistical analysis of the integer transform coefficients with generalized Gaussian distributions (GGD). Also, we develop a new VBR bit allocation method for MPEG-2/AVS video transcoding. The main contributions in this paper are listed as follows:
     1. Rate-distortion analysis and modeling of the H.264/AVC standard
     In this paper, we first present a new rate-distortion model for H.264. This model is developed from the statistical analysis of integer transform coefficients using GGD. The analysis and modeling are based on information theory and the quantization scheme of H.264.
     2. A novel Two-pass VBR bit allocation algorithm for H.264/AVC standard
     In this paper an effective VBR algorithm for H.264/AVC is developed, which adopts two-pass encoding to achieve a constant video quality. Characteristics of the entire video sequence are collected and analyzed in the first pass, using our new R-D model, and then the sequence could be optimally re-encoded in the second pass. Experimental results show that the proposed algorithm is effective and efficient to reduce the PSNR variation and provide consistent visual quality.
     3. A new rate control method for MPEG-2/AVS transcoding
     In this paper, a simple and effective rate control method for MPEG-2/AVS transcoding is presented. First, the initial quantization parameter of the transcoding is achieved by a new R-D model in transcoding. Then, during the AVS encoding, quantization parameter is adjusted according to the MPEG-2 bit rate using our new rate control algorithm. Experimental results show that the proposed algorithm can effectively reduce the PSNR variation and provide consistent visual quality.
引文
[1] T. Sikora, "Trends and perspectives in image and video coding," Proceedings of the IEEE, vol.93, no.1, pp. 6-17, Jan.2005
    [2] N. Ahmed, T. Natrajan, and K. R. Rao, "Discrete cosine transform," IEEE Trans. Comput., vol. C-23, no. 1, pp. 90–93, Dec. 1984.
    [3] D. Legall, "The MPEG video compression algorithm," Image Commun., vol. 4, pp. 129–140, Apr. 1992.
    [4] B. G. Haskell, A. Puri, and A. N. Netravali, Digital Video: An Introduction to MPEG-2. London, U.K.: Chapman & Hall, 1997, Digital Multimedia Standards Series.
    [5] F. Pereira and T. Ebrahimi, The MPEG-4 Book. Englewood Cliffs, NJ: Prentice-Hall, 2002.
    [6] T. Wiegand, G. J. Sullivan, G. Bjntegaard and A. Luthra, "Overview of the H.264/AVC video coding standard," IEEE Trans. Circuits Syst. Video Technol., vol.13, no.7, pp. 560-576, July.2003
    [7] A. Netravali and B. Haskel, Digital Pictures: Representations, Compression, and Standards (Applications of Communications Theory). New York: Plenum, 1995.
    [8] P. Kauff, B. Makai, S. Rauthenberg, U.G?lz, J. L. P. DeLameillieure, and T. Sikora, "Functional coding of video using a shape-adaptive DCT algorithm and object-based motion prediction toolbox," IEEE Trans. Circuits Syst. Video Technol., vol. 7, no. 1, pp. 181–196, Feb.1997.
    [9] A. Smolic, T. Sikora, and J.-R. Ohm, "Long-term global motion estimation and its application for sprite coding, content description, and segmentation," IEEE Trans. Circuits Syst. Video Technol., vol. 9, no.8, pp. 1227–1242, Dec. 1999.
    [10] M. Karczewicz, J. Niewglowski, and P. Haavisto, "Video coding using motion compensation with polynomial motion vector fields," Signal Process. Image Commun., vol. 10, no. 1–3, pp. 63–91, 1997.
    [11] A. Smolic, B. Makai, and T. Sikora, "Real-time estimation of long-term 3-D motion parameters for SNHC face animation and model-based coding applications," IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 2, pp. 255–263, Mar. 1999.
    [12] Iain E. G. Richardso, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia, John Wiley & Sons Ltd, 2003
    [13] K. R. Rao and P. Yip, Discrete Cosine Transform, Academic Press, 1990.
    [14] D. Marpe, H. Schwarz, and T. Wiegand, “Context-adaptive binary arithmetic coding in the H.264/AVC video compression standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, pp. 620–636, July 2003.
    [15] G. J. Sullivan and T. Wiegand, "Rate-distortion optimization for video compression," IEEE Signal Processing Magazine, vol.15, no.6, pp. 74 - 90, Nov.1998
    [16] T. Wiegand, H. Schwarz, A. Joch, F. Kossentini and G. J. Sullivan, "Rate-constrained coder control and comparison of video coding standards," IEEE Trans. Circuits and Systems for Video Technology, vol.13, no.7, pp. 688-703, Jul.2003
    [17] C. E. Shannon, "A mathematical theory of communication," The Bell System Technical Journal, vol.27, pp. 379-423, 623-656, Jul, Oct.1948
    [18] C. E. Shannon, "Coding theorems for a discrete source with a fidelity criterion," Claude Elwood Shannon: Collected Papers, N.J.A. Sloane and A.D. Wyner, Eds. Piscataway, IEEE Press, pp. 325-350, 1993
    [19] Z. He and S. K. Mitra, "A unified rate-distortion analysis framework for transform coding," IEEE Trans. Circuits and Systems for Video Technology, vol.11, no.12, pp. 1221-1236, Dec.2001
    [20] J. Xin, C.-W. Lin, and M.-T. Sun, "Digital video transcoding," Proceedings of the IEEE, vol. 93, no. 1, pp. 84-97, Jan.2005.
    [21] I. Ahmad, X. Wei, Y. Sun, and Y.-Q. Zhang, "Video transcoding: an overview of various techniques and research issues", IEEE TRANSACTIONS ON MULTIMEDIA, vol. 7, no. 5, pp. 793 804, Oct. 2005.
    [22] H.-M. Hang and J.-J. Chen, "Source model for transform video coder and its application. I. Fundamental theory," IEEE Trans. Circuits and Systems for Video Technology, vol.7, no.2, pp. 287-298, Apr.1997
    [23] E. Y. Lam and J. W. Goodman, "A mathematical analysis of the DCT coefficient distributions for images," IEEE Trans. Image Processing, vol.9, no.10, pp. 1661-1666, Oct.2000
    [24] R. Reininger and J. Gibson, "Distributions of the Two-Dimensional DCT Coefficients for Images," IEEE Trans. Communications, vol.31, no.6, pp. 835-839, Jun.1983
    [25] N. Farvardin and J. Modestino, "Optimum quantizer performance for a class of non-Gaussian memoryless sources," IEEE Trans. Information Theory, vol.30, no.3, pp. 485-497, May.1984
    [26] K. Sharifi and A. Leon-Garcia, "Estimation of shape parameter for generalized Gaussian distributions in subband decompositions of video," IEEE Trans. Circuits and Systems for Video Technology, vol.5, no.1, pp. 52-56, Feb.1995
    [27] F. Muller, "Distribution shape of two-dimensional DCT coefficients of natural images," Electronics Letters, vol.29, no.22, pp. 1935-1936, Oct.1993
    [28] 孙俊, "流媒体编码和传输中若干关键技术的研究," 博士论文,中国科学院计算技术研究所, 2006
    [29] T. Berger, Rate Distortion Theory. Englewood Cliffs, Prentice- Hall, 1971.
    [30] T. Berger and J. D. Gibson, "Lossy source coding," IEEE Trans. Information Theory, vol.44, no.6, pp. 2693-2723, Oct.1998
    [31] R. M. Gray and D. L. Neuhoff, "Quantization," IEEE Trans. Information Theory, vol.44, no.6, pp. 2325-2383, Oct.1998
    [32] H. Gish and J. Pierce, "Asymptotically efficient quantizing," IEEE Trans. Information Theory, vol.14, no.5, pp. 676-683, Sep.1968
    [33] L.-J. Lin and A. Ortega, "Bit-rate control using piecewise approximated rate-distortion characteristics," IEEE Trans. Circuits and Systems for Video Technology, vol.8, no.4, pp. 446-459, Aug.1998
    [34] X. M. Zhang, A. Vetro, Y. Q. Shi and H. Sun, "Constant quality constrained rate allocation for FGS-coded video," IEEE Trans. Circuits and Systems for Video Technology, vol.13, no.2, pp. 121-130, Feb.2003
    [35] T. Chiang and Y.-Q. Zhang, "A new rate control scheme using quadratic rate distortion model," IEEE Trans. Circuits and Systems for Video Technology, vol.7, no.1, pp. 246-250, Feb.1997
    [36] J. R. Corbera and S. Lei, “Rate control in DCT video coding for low-delay communications,” IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 1, pp. 172–185, Feb. 1999.
    [37] F. Moscheni, F. Dufaux, and H. Nicolas, “Entropy criterion for optimal bit allocation between motion and prediction error information,” in Proc. SPIE Visual Communications and Image Processing, Cambridge, Nov. 1993.
    [38] J. Sun, W. Gao, D. Zhao and Q. Huang, "Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos," IEEE Trans. Circuits and Systems for Video Technology, vol.16, no.4, pp. 535-539, Apr.2006
    [39] N. Kamaci, Y. Altunbasak and R. M. Mersereau, "Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models," IEEE Trans. Circuits and Systems for Video Technology, vol.15, no.8, pp. 994-1006, Aug.2005
    [40] Ning Wang, Yun He, “A New Bit Rate Control Strategy for H.264”, PCM2003, Singapore, Dec. 2003
    [41] Y. Yu, J. Zhou, Y. Wang and C. W. Chen, "A novel two-pass VBR coding algorithm for fixed-size storage application," IEEE Trans. Circuits and Systems for Video Technology, vol.11, no.3, pp. 345-356, Mar.2001
    [42] P. H. Westerink, R. Rajagopalan and C. A. Gonzales, "Two-pass MPEG-2 variable-bit-rate encoding," IBM J.RES.DEVELOP, vol.43, no.4, Jul.1999
    [43] C. Que, G. Chen, and J. Liu, “An Efficient Two-Pass VBR Encoding Algorithm for H.264”, 2006 International Conference on Communications, Circuits and Systems Proceedings, Vol.1, pp. 118-122, Jun. 2006
    [44] ISO/IEC JTC1/SC29/WG11, MPEG-2 Test Model 5 (TM5) [S]. 1993.
    [45] H.B. Yin, X.Z. Fang, L. Chen, and J. Hou, “A practical consistent-quality two-pass VBR video coding algorithm for digital storage application”, IEEE Trans. Consumer Electronics, Vol. 50, No. 4, pp. 1142-1150, Nov. 2004
    [46] Shanableh T, Ghanbari M. “Transcoding of video into different encoding formats”. IEEE Int Conf on Acoustics Speech and Signal Processing, 2000, 4: 1927-1930.
    [47] Seo Kwabg-deok, Kim Jae-kyoon. “Fast motion vector refinement for MPEG-1 to MPEG-4 transcoding with spatial down-sampling in DCT domain”. IEEE Int Conf on Image Processing, 2001, 1: 469 - 472.
    [48] Y. Su, M.-T. Sun, and V. Hsu, “Global motion estimation from coarsely sampled motion vector field and the applications,” in Proc. IEEE Int. Symp. Circuits and Systems, vol. 2, 2003, pp. 628–631.
    [49] G. Fernandez-Escribano, P. Cuenca, L. Orozco-Barbosa and A. Garrido, "A fast Intra-Frame prediction algorithm for MPEG-2/H.264 video transcoders". Proceeding of ICIP 2005, Genoa, Italy, Sep. 2005. pp. 684-687.
    [50] G. Fernandez-Escribano, H. Kalva, P. Cuenca, and L. Orozco-Barbosa, “Speeding-up the Macroblock Partition Mode Decision in MPEG-2/H.264 Transcoding,” IEEE International Conference on Image Processing (ICIP) 2006, October 9-12, 2006, Atlanta, USA.
    [51] Y.-P. Tan and H. Sun, "Fast motion re-estimation for arbitrary downsizing video transcoding using H.264/AVC standard," IEEE Transactions on Consumer Electronics, Vol. 50, No. 3, Aug. 2004
    [52] T. Shanableh and M. Ghanbari, "Heterogeneous video transcoding to lower spatio-temporal resolutions and different encoding formats," IEEE Trans. Multimedia, vol. 2, pp. 101-110, June 2000.
    [53] Z. Zhou, S. Sun, S. Lei, and M.T. Sun, "Motion information and coding mode reuse for MPEG-2 to H.264 transcoding," Proc. IEEE International Symposium on Circuits and Systems, vol.2, no.1, pp.1230–1233, May 2005
    [54] Y. Sun, X. Wei, and I. Ahmad, "Low delay rate-control in video transcoding," in Proc. IEEE Int. Symp. Circuits and Systems (ISCAS) 2003, vol. 2, Bangkok, Thailand, May 2003, pp. II-660–II-663.
    [55] Z. Lei and N.D. Georganas, "Rate adaptation transcoding for precoded video streams," Proc. ACM Multimedia02, pp.127–136, Juan-les-Pins, France, Dec. 2002.
    [56] G. Chen, S.X. Lin, and Y.D. Zhang, "Rate Control Algorithm for MPEG-2 to H.264/AVC Transcoding", IbPRIA07 (II: 72-79), 2007.
    [57] J. A. Domínguez-Molina, G. González-Farías and R. M. Rodríguez-Dagnino, “A practical procedure to estimate the shape parameter in the generalized Gaussian distribution,” Technique Report I-01-18, Sep. 2001, available: http://www.cimat.mx/reportes/enlinea/I-01-18_eng.pdf
    [58] M. K. Varanasi and B. Aazhang, “Parametric generalized Gaussian density estimation,” J. Acoust. Soc. Am. vol. 86, no. 4, pp. 1404-1415, Oct. 1989.
    [59] H.S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low-complexity transform and quantization in H.264/AVC”, IEEE Trans. Circuits Syst. Video Technology, Vol. 13, No. 7, pp. 598-603, Jul. 2003。

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700