视频编码中运动估计算法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着近年来多媒体技术的飞速发展,许多应用领域对视频图像的实时压缩提出了更高的要求,快速、高效的压缩算法是解决这一问题的关键。H.264是ITU-T VCEG组织和ISO/IEC MPEG组织共同制定的新一代视频编码标准。虽然H.264采用了传统的混合编码框架,但通过使用包括帧内预测、整数变换、可变预测块大小、多参考帧、1/4像素精度预测、自适应算术编码、去块效应滤波以及基于率失真优化的模式选择等新技术,编码效率获得了很大提高。AVS(Audio Video Coding Standard)是目前我国正在制定的具有自主知识产权的音视频编码技术标准,它的编码效率与H.264相当,其优势为:性能高、复杂度低、实现成本低、专利授权费用低。
     运动估计是视频编码中关键技术之一。本文对H.264和AVS中的运动估计技术进行了深入分析,并提出了相应的优化算法。UMHexagonS整像素快速运动估计算法性能优秀,先后被JVT和AVS接受为标准算法。本文在对UMHexagonS算法进行了详细分析后,提出了优化算法。同原算法比较,能够在性能仅下降0.03dB的情况下,节约30%~90%的运动估计时间。AVS采用了可变大小块以及1/4像素精度的运动估计技术以提高编码效率。本文基于AVS参考软件提出了一种增强亚像素自适应范围搜索算法。该算法主要使用了小菱形窗的搜索策略和阈值判决方法。与亚像素全搜索算法比较,在平均性能仅下降0.0093dB的情况下,减少了30.25%的搜索点数。
     随着数字音视频产业的发展,嵌入式视频压缩技术应用越来越广泛。本文阐述了基于TMS32DM642 DSP平台的通用视频编码器优化方法,包括系统算法优化、存储器的分配、使用EDMA传输数据和调用TI库函数等。同时阐述了对基于MPEG-4平台的PMVFAST快速运动估计算法的优化方法。基于这些方法,在DM642平台上采用MPEG-4编码器对CIF格式图象进行编码,帧率可达40fps,完全满足实时编码的要求。
With the fast develop of multimedia techniques, there are many areas need more efficient real-time compression techniques for digital image processing. Fast compression algorithm is the key for this problem. H.264 is the current project of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Motion Picture Experts Group (MPEG). Although H.264 follows the hybrid coding scheme in common with earlier standards, improvements are achieved through the inclusion of a number of new features that distinguish itself from all existing ones, including Intra Prediction, Integer transform, different predicted block, muti-conference frames, CAVLC,CABAC, quarter-pel precision prediction, Deblocking filter, R-D mode strategy, etc. The being finished Chinese audio and video coding standards AVS has approximate coding efficiency as H.264, AVS’s advantages includes high performance, low complexity , low implementation price and sample licensing.
     Motion Estimation is a key technique for video coding. This thesis is concerned with the Motion estimation techniques of H.264&AVS and proposes improved methods in several fields. UMHexagonS for Fast Integer Motion Estimation is adopted in previous JVT and AVS meetings because it achieves a very good trade-off between ME acceleration and R-D performance. This thesis present some characters within UMHexagonS process and makes corresponding improvements. Compared with the original technique, 30~90% ME time is reduced while the limited performance lose about 0.03dB. AVS applied the variable block sizes and quarter-pel motion compensation to improve coding efficiency. In this paper, an adaptive range fractional pel search algorithm is proposed to further speed up the encoding process and reduce the computation complexity. Small diamond search and threshold judgment are used to this algorithm. Compared to the half-pel full search algorithm, experiments show that the proposed algorithm can reduce the sub-pel search point by 30.25% on average with the limited performance lose about 0.0093dB.
     With the developonent of the digital audio and video industry, embedded video compression technology is applied more widely. This thesis describes the optimization techniques for a general video encoder based on TMS32DM642 DSP. The algorithm optimization, memory distribution, EDMA and TI’s function libraries usage are discussed based on DM642. And the optimization technique for the PMVFAST Fast Integer Motion Estimation base on MPEG-4 are provided. An MPEG-4 encoder based on DM642 using these methods which encodes images of CIF can carry out real-time encoding with 40fps.
引文
[1]毕厚杰.新一代视频压缩标准--H.264[M].北京:人民邮电出版社,2005:1~148.
    [2] Iain E.G.Richardson.H.264 and MPEG-4 Video Compression-Video Coding for Next-generation Multimedia[M].John Wiley & Sons Ltd, 2005:1~60.
    [3] K.R.Rao, Z.S. Bojkovic, and D.A. Milovanovic. Multimedia Communication Systems: techniques, Standards, and Networks[M]. Prentice Hall, 2002:88~89.
    [4]蔡安妮,孙景鳌等编著,多媒体通信技术基础[M].北京:电子工业出版社,2000:41~96.
    [5] Yazbeck, S. The US road to 3G: an overview of telecom regulations, carrier strategies, and the consumer market[A]. 10th international Conference on Telecommunications[C]. 2003,1:25~32.
    [6] Miah, A and Tan, K. An overview of 3G mobile network infrastructure[A]. 2002. Student Conference on research and development[C]. 2002(16):228~232.
    [7] S.J.Wee and J.Q Apostolopoulos, Eficient processing of compressed video[A], Conference Record of the thirty-second Asilomar Conference on Signals, Systems&Computers[C]. 1998(1):853-857.
    [8] A.M.Tekalp, Digital Video Processing[M]. Englewood Cliffs: Printice Hall, 1995:1~50.
    [9]樊昌信、詹道庸、徐炳祥、吴成柯.通信原理[M],北京:国防工业出版社,1995: 20~50.
    [10] Gonzalez Rafael C., Woods Richard E.数字图象处理[M],电子出版社,2003.
    [11] N.Ahmed, T.Natarajan and K.R.Rao. Discrete Transform[J]. IEEE Transaction on Computation, 1974,23:90~93.
    [12] I.Daubechies. Orthogonal bases of compactly supported wavelets[J], Communications on Pure and Applied Mathematics, 1988, 41(11):909~996.
    [13]沈兰荪,图像编码与异步传输[M].北京,人民邮电出版社,1998.
    [14] Z. Chen, P. Zhou, and Y. He, Fast Integer and Fractional Pel Motion Estimation for JVT[R], JVT-F017r.doc, Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG. 6th Meeting, Awaji, Island, Japan, 2002, 12.
    [15] ISO/IEC 11172-2: Information technology—Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s—Part 2: Video[S],1993.
    [16] ISO/IEC DIS 13818-2 Information technology—Generic coding of moving pictures and associated audio information: Video[S].
    [17] Editor: Rob Koenen. Overview of the MPEG-4 Version 1 Standard [J]. http://wwwam.hhi.de/mpeg-video/standards/mpeg-4.htm, 2004.
    [18] ISO/IEC 14496-2(committee draft). Information technology-generic coding of audio-visual objects:visual[S], Tokyo, 1998.
    [19] Moving Picture Experts Group (MPEG) MPEG Video Webpage[S]. http://wwwam.hhi.de/mpeg-video/#MPEG1-Ref, 2004.
    [20] ITU-T H.261. Video codec for audiovisual services at P*64kbit/s[S]. Helsinki, 1993:1~30.
    [21] ITU-T H.263. Video codec for low bit rate communication[S]. Helsinki, 1996: 1~40.
    [22] ITU-T H.263+(Draft). Video coding for low bit rate communication[S]. Helsinki, 1998:24~35.
    [23] ITU-T Draft for H.263++. annexes U,V and W to recommendation H.263[S]. Helsinki, 2000.
    [24] ITU-T Recommendation H.264 / ISO/IEC 11496-10. Advanced Video Coding. Final Committee Draft Document, JVT-G050[C], 2003.
    [25] MPEG video group, Anticipatory Joint Mode (JM) of Enhanced Compression Video Coding. ISO/IEC JCC1/SC29/WG11, N-4355[C], 2001.
    [26]黄铁军,高文. AVS标准制定背景与知识产权状况[J].电视技术. 2005(7):4~7.
    [27] Liang Fan, Siwei Ma, Feng Wu. Overview of AVS Video Standard[C]. Proc. 2004 IEEE Intl. Conf. Multimedia & Expo, 2004: 423~426.
    [28]虞露,音视频编码技术标准AVS-视频技术概述[J].中国多媒体视讯,2004(3):34~35.
    [29]信息技术先进音视频编码第二部分:视频(国家标准GB/T 20090.2)[S]. 2006.
    [30]朱光喜,文强,田晓华等.一种环路滤波方法和环路滤波器[P].中国:专利申请号: 200410012658.0.
    [31] Z. Chen, P. Zhou, and Y. He, Fast Integer and Fractional Pel Motion Estimation for JVT[R], JVT-F017r.doc, Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG. 6th Meeting, Awaji, Island, Japan, 2002,12.
    [32] E.GRicbardson, JR.Martyn, Digital Video Communications[M]. Artech House, 1997.
    [33] Chen M J, Chen L G, Chiueh T.D. A new block-matching criterion for motion estimation and its implementation[J]. IEEE Trans Circuits System Video Technology, 1995,1(5):231~236.
    [34] T.Koga, K.linuma, A.Hirano etal. Motionc ompensated interframe coding for video conferencing[C]. National Telecommunications Conf, New Orleans, LA,1981.
    [35] E.GRicbardson, JR.Martyn. Digital Video Communications[M]. Copyright, Artech House, 1997.
    [36] R.Li and M.L.Liou. A new three-step search algorithm for block motion estimation[J]. IEEE Trans on Circuits System Video Technology, 1994,4(4):438~442.
    [37] L.M.Po and W.C.Ma. A novel four step search algorithm for fast block motion estimation[J]. IEEE Trans on Circuits System Video Technology, 1996,6 (3): 313 ~317.
    [38] S.Zhu and K.-K. Ma. A new diamond search algorithm for fast block-matching motione stimation[J]. IEEE Trans. Image Processing, 2000,9(2):287~290.
    [39] C.Zhu, X.Lin and L.P.Chau. Hexagon-based search patern for fast block motion estimation[J]. IEEE Trans on Circuits System Video Technology, 2002,12(5):349~355.
    [40] P.I.Hosur and K.K.Ma, Motion vector field adaptive fast motion estimation[C]. Proc. 2nd Int. Conf. Information, Communications and Signal Processing (ICICS’99), Singapore, 1999:7~10.
    [41] A.M.Tourapis, O.C.Au, and M.L.Liou. Highly Efficient Predictive Zonal Algorithms for Fast Block-Matching Motion Estimation[J]. IEEE Transactions on circuits and systems for video technology. 2002,12(10):934~947.
    [42] ISO/IEC JTC1/SC29/WG11 N2932. MPEG-4 Video Verification Model version14.0 [S], 1999:301~03.
    [43]杨育红,徐烜,季晓勇.快速运动估计UMHexagonS算法的探讨与改进[J].计算机工程与应用, 2006,11:52~54.
    [44] Hye- Yeon Cheong Tourapis, Alexis Michael Tourapis, Pankaj Topiwala. Fast Motion Estimation within the JVT codec[C]. In: JVT- E023,5th Meeting: Geneva, Switzerland, 2002.10.
    [45] G.Bjontegaard, Calculation of average PSNR differences between RD-curves[R], ITU-T Q.6/SG16 (VCEG) Document VCEG-M33, 2001, 3.
    [46] Bernd Girod. Motion Compensation Prediction with Fractional-PelAccuracy[J]. IEEE Transactions on Communication, 1993, 41(4):604~612.
    [47] Du Cheng, He Yun, Zheng Junli. PPHPS: a parabolic prediction-based, fast half-p ixel search algorithm for very low bitrate moving picture coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2003, 13(6):514~518.
    [48] Dong H, Zhang Q, A fast half-pixel motion vector search algorithm[J]. Jourmal of Image and Graphics, 2005, 7.(11):884~887.
    [49] Zhibo Chen, Peng Zhou, Yun He,“Fast Integer Pel and Fractional Pel Motion estimation in for JVT”, JVT-F017r1.doc[C]. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, 6th meeting, Awaji, Island, JP, 5-13 December, 2002.
    [50] C Du, Y He. A Comparative Study of Motion Estimation for Low Bit Rate Video Coding[C]. In:SPIE Proc VCIP2000,2000;4067(3):1239~1249.
    [51] X Li, C Gonzales. A Locally Quadratic Model of the Motion Estimation Error Criterion Function and Its Application to Subpixel Interpolation[J]. IEEE Trans Circuts Syst Video Technol, 1996;6:118~122.
    [52]黄雪华,李锦涛,刘金刚. H.264中1/4像素运动估计快速算法的比较[J].计算机工程与应用,2004,35:94~95.
    [53] P.Yin, H.Y Cheong, A.M. Tourapis, and J. Boyce, Fast mode decision and motion estimation for JVT/H.264[J], Proceedings of the 2003 IEEE International Conference on Image Processing , 2003 9.(3): 853~856.
    [54]王念旭,赵远等.DSP应用系统设计.北京:电子工业出版社,2004.
    [55] Texas Instruments Incorporated.TMS320C6000 Peripherals Reference guide. USA:Texas Instrument,2001.
    [56]李方慧,王飞,何佩琨.TMS320C6000系列DSPs原理与应用(第2版).北京:电子工业出版社,2003.
    [57] Texas Instruments Incorporated . TMS320C6000 CPU and Instruction Set Reference Guide.USA:Texas Instrument,2000.
    [58]尹勇,欧光军,关荣峰.DSP集成开发环境CCS使用指南.北京:北京航空航天大学出版社,2003.
    [59] Alexandros Eleftheriadis. Recent Advances in Audio-Visual Information Representation:MPEG-4 and Beyond[M]. Department of Electrical Engineering Columbia University,1997.
    [60] TI. TMS320C6000 C/C++ Optimizing Compiler User.s Guide[M]. 2002,10.
    [61] TI. Image Processing Examples Using the TMS320C64x Image / Video Processing Library(IMGLIB)[M]. 2003,5.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700