基于小波变换的视频压缩算法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
视频压缩及传输由于具有广泛的应用前景,成为国际研究的热点。它是HDTV、DVD、视频会议、Internet视频及其它相关应用的核心技术。目前,由于现有标准的局限性,国际上提出了嵌入式小波变换编码方案,与传统的DCT变换相比,小波变换具有时频域局部化特性和多分辨性质,且小波是对整个图像进行变换,避免了DCT变换的方块效应,更适合可伸缩的编码。随着用于图像压缩的EZW、SPIHT和EBCOT模型及用于视频压缩的3D-EZW、3D-SPIHT和3D-ESCOT模型的出现,基于小波变换的嵌入编码已成为图像视频压缩研究的主流。嵌入编码具有率伸缩特性,很适合在带宽随时延变化的互联网上使用。
    目前,嵌入编码都是基于两种类型:一种是分层的嵌入编码,主要是利用小波不同子带间系数的偏序性和自相似性,将不同子带间的系数形成一棵棵树结构,通过将整棵树量化为零来减少压缩比。另一种是基于上下文模型,主要是利用小波系数同子带内相邻系数的相关性,组织高效的上下文,来显著地提高压缩效果。
    受EZW、SPIHT、EBCOT模型及3D-EZW、3D-SPIHT、3D-ESCOT模型的启发,本文综合利用小波系数不同子带间的自相似性、偏序性和同子带内相邻系数的相关性,提出了一种新的三维小波嵌入编码上下文模型3D-PEM。实验结果表明:利用这种模型获得的PSNR高于3D-ESCOT模型。同时也具有3D-ESCOT模型的优良性质。在此基础上,本文又提出了基于自适应运动补偿的三维小波模型MC-3DPEM,利用运动估计和补偿消除视频间的运动,对补偿后的亮度、色度分量先进行空间域的小波变换,再利用亮度系数、色度系数之间的相关性,将三个分量的系数形成一个变换系数矩阵,对变换系数矩阵进行时间域的小波变换,用先前的模型对变换后的三维小波系数编码。实验结果表明:相同码率情况下,基于运动补偿的MC-3DPEM模型获得的PSNR高于3D-PEM模型,同时视频三个分量的混合编码比每个分量单独编码时具有更好的PSNR。
Video compression has long been one of the main topics of compression coding. Since it is critical to HDTV, DVD, Video conference, etc. In order to avoid the faults of international standards, a number of embedded wavelet coefficients coding models have been proposed. Due to its multiresolution, time/frequency localization and lower time complexity, wavelet is suitable for scalable video coding, which has been applied to coding algorithms such as EZW, SPIHT and EBCOT. Embedded coding schemes are SNR scalable, and, even farther, which are essential in transmission of image and video on Internet.
    There have been two type of embedded coding. The first is based on partial order and self-similarity between subbands of wavelet coefficients, which organizes the wavelet coefficients across subbands into a zerotree, while the second is based on context models, using the correlation of neighboring wavelet coefficients.
    Inspired by typical coding schemes such as EZW, SPIHT, EBCOT, 3D-IEZW, 3D-SPIHT and 3D-ESCOT, a new context model of wavelet coefficients is proposed, which make use of partial order, self-similarity between subbands of wavelet coefficients and the correlation of neighboring wavelet coefficients. Experimental results show that the model can achieve higher PSNR than 3D-ESCOT .while at the same time keeping all the advantages of 3D-ESCOT. In order to further improve the compression performance, the model is modified with the help of motion compensation technique. Experimental results also show that at various conditions, MC-3DPEM is superior to 3D-PEM without motion compensation
引文
[1] Wallace, G. K. The JPEG Still Picture Compression Standard, Comm. ACM, vol. 34, no. 4, April 1991, pp. 30-44.
    [2] ISO/IEC, Information Technology — Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to 1.5 Mbps. ISO/IEC 11172-2, 1992.
    [3] ISO/IEC, Information Technology — Generic Coding of Moving Pictures and Associated Audio Informations: Video. ISO/IEC 13818-2, 1995.
    [4] ITU Telecommunication Standardization Sector of ITU, Video Codec for Audiovisual Services at p × 64 kbits/sec. ITU-T Recommendation H.261,1994.
    [5] ITU Telecommunication Standardization Sector of ITU, Video Coding for Low Bitrate Communication. ITU-T Recommendation H.263, Mar. 1996.
    [6] ITU Telecommunication Standardization Sector of ITU, Video Coding for Low Bitrate Communication. ITU-T Recommendation H.263 Version 2, Jan.1998.
    [7] ISO/IEC, Information Technology — Coding of Audio-Visual Objects —Part2: Visual. ISO/IEC 14496-2, Dec. 1999.
    [8] ISO/IEC FCD15444-1. JPEG 2000 Part I Final Committee Draft Version 1.0[S].
    [9] B.Girod, U.Horn, B.Belzer, Scalable video coding with multiscale motion compensation and unequal error protection, Proc.symp.on Multimedia Communications and Video Coding, new York: Plenum press, pp. 475-482, Oct.1995.
    [10] J M.Shapiro Embedded image coding using zerotree of wavelet coefficients. IEEE Trans. Signal Processing, vol. 41, no. 12,:pp.3445-3462, Dec. 1993 .
    [11] A.Said and W.A.Pearlman. A new,fast,and efficient image codec based on set partitioning in hierarchical trees, IEEE Transactions on Circuits and Systems for Video Technology,vol.6, no. 3,pp.243-250,June 1996.
    [12] S. A. Martucci, I. Sodagar, T. Chiang, and Y.-Q. Zhang, A zerotree wavelet video coder, IEEE Transactions on Circuits and Systems for Video Technology,vol. 7, no. 1, pp. 109-118, February 1997.
    [13] David Taubman,High Performance Scalable Image Compression with EBCOT,IEEE Trans. On Image Processing,vol. 9,No. 7, July 2000
    
    
    [14] W. K. Carey, L. A. Von Pischke, and S. S. Hemami, Rate-distortion based scalable progressive image coding, in SPIE Conf. on the Mathematics of Data and Image Coding, Compression, and Encription, (San Diego, CA),July 1998.
    [15] J. Li and S. Lei, An embedded still image coder with rate-distortion optimization, IEEE Trans. Image Processing, vol. 8, pp. 913–924, July 1999.
    [16] P. J. Sementilli, A. Bilgin, J. H. Kasner, and M. W. Marcellin, Wavelet TCQ: submission to JPEG-2000, in SPIE Conf. Applications of Digital Image Processing XXI, vol. SPIE 3460, (San Diego, CA), pp. 2–12, July 1998.
    [17] S. McCanne, V. Jacobson, and M. Vetterli, Receiver-driven layered multicast, in Proc. SIGCOMM'96, ACM, Stanford, CA, Aug. 1996,pp. 117-130.
    [18] 崔锦泰,程正兴,小波分析导论,第1版,中国西安,西安交通大学出版社,1995:1~28.
    [19] 李建平,唐远炎. 小波分析方法的应用,第1版,中国重庆,重庆大学出版社,1999,10
    [20] I.Daubechies, Orthonormal bases of compactly supported wavelets, Communications on Pure and Applied mathematics, 1988, Vol. 41, No. 11, pp.909-996.
    [21] S.Mallat, A theory for multiresolution signal decomposition:the wavelet representation, IEEE Trans. Pattern Analysis and machine Intelligence, 1989, Vol. 11, pp. 674-693.
    [22] W.Sweldens, The lifting scheme:A custom-design construction of bi-orthogonal wavelets, Appl. Comput., Harmon. Anal, 1996, Vol. 3, No. 2, pp.186-200.
    [23] W.Sweldens , The lifting scheme: A construction of second generation wavelets, SIAM J. Math. Anal., 1997.
    [24] I.Daubechies, And W Sweldens, Factoring wavelet transforms into lifting steps, J. Fourier Anal. Appl., 1998, Vol. 4, No. 3, pp.247-269.
    [25] M. Antonini, M. Barlaud, P. Mathieu, and I. Daubechies, Image coding using wavelet transform, IEEE Transactions on Image Processing, vol. 1, no. 2,pp. 205-220, April 1992.
    [26] R. A. DeVore, B. Jawerth, and B. J. Lucier, Image compression through wavelet transform coding," IEEE Transactions on Information Theory, vol. 38,pp. 719-746, March 1992.
    [27] I. H. Witten, R. Neal, and J. G. Cleary, Arithmetic coding for data compression, Communications of the ACM, vol. 30, no. 6, pp. 520-540, June 1987.
    
    
    [28] D. Taubman and A. Zakhor, Multirate 3-D subband coding of video, IEEE Transactions on Image Processing, vol. 3, no. 5, pp. 572-588, September 1994.
    [29] B.-J. Kim andW. A. Pearlman, An embedded wavelet video coder using three dimensional set partitioning in hierarchical trees (SPITH), Proceedings of the IEEE Data Compression Conference, pp. 251-260, Snowbird, Utah, March 25-27, 1997.
    [30] B.-J. Kim and W. A. Pearlman, Low-delay embedded 3-D wavelet color video coding with SPIHT, Proceedings of SPIE - Visual Communications and Image Processing, pp. 955-964, San Jose, California, January 28-30, 1998.
    [31] I. Sodagar, H.-J. Lee, P. Hatrack, and Y.-Q. Zhang, Scalable wavelet coding for synthetic/ natural hybrid images, IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 2, pp. 244-254, March 1999.
    [32] Jizheng Xu, Zixiang Xiong, Shipeng Li and Ya-Qin Zhang,Three-Dimensional Embedded Subband Coding with Optimized Truncation (3-D ESCOT), Applied and Computational Harmonic Analysis, Volume 10, Issue 3, May 2001, Pages 290-315
    [33] 张正阳,高性能图象编码研究[博士论文].西安:西安电子科技大学,1999
    [34] 王琪,钟玉琢,一种结合量化的零树小波图象编码器,清华大学学报(自然科学版),2000年第40卷第7期
    [35] Y.Chen and W.A.Pearlman, Three-dimensional subband coding of video using the zero-tree method. Visual Communication and Image Processing,96,proc SPIE2727 pp:1302-1309.
    [36] S.-J. Choi and J. W. Woods, Motion-compensated 3-D subband coding of video, IEEE Transactions on Image Processing, vol. 8, no. 2, pp. 155-167,February 1999.
    [37] K. Shen and E. J. Delp, A control scheme for a data rate scalable video codec,Proceedings of the IEEE International Conference on Image Processing, vol. II,pp. 69-72, Lausanne, Switzerland, September 16-19, 1996.
    [38] M. L. Comer, K. Shen, and E. J. Delp, Rate-scalable video coding using a zerotree wavelet approach, Proceedings of the Ninth Image and Multidimensional Digital Signal Processing Workshop, pp. 162-163, Belize City, Belize,March 3-6, 1996.
    [39] K. Shen, A Study of Real Time and Rate Scalable Image and Video
    
    Compression. Ph.D. thesis, School of Electrical and Computer Engineering, Purdue University, December 1997.
    [40] K. Shen and E. J. Delp, Wavelet based rate scalable video compression, IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 1,pp. 109-122, February 1999.
    [41] A. N. Netravali and C. B. Rubinstein, Luminance adaptive coding of chrominance signals, IEEE Transactions on Communications, vol. COM-27, no. 4,pp. 703-710, April 1979.
    [42] J. O. Limb and C. B. Rubinstein, Plateau coding of the chrominance component of color picture signals, IEEE Transactions on Communications,vol. COM-22, no. 3, pp. 812-820, June 1974.
    [43] K. Shen and E. J. Delp, Color image compression using an embedded rate scalable approach, Proceedings of the IEEE International Conference on Image Processing, vol. III, pp. 34-37, Santa Barbara, California, October 26-29, 1997.
    [44] Q. Wang and M. Ghanbari, Scalable coding of very high resolution video using the virtual zerotree, IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, no. 5, pp. 719-727, October 1997.
    [45] Subhasis Saha Image Compression : From DCT to Wavelets - A Review. University of California, Davis , March 2000.
    [46] A. S. Lewis and G. Knowles, Image compression using the 2-D wavelet transform, IEEE Trans. Image Processing, vol. 1, pp. 244-250, Apr.1992.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700