H.264与AVS-M视频转码器设计与实现
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着无线网络技术和音视频技术的发展,移动多媒体编解码应用在人们的生活中扮演越来越重要的角色。为了支持不同编码标准之间的设备能够相互通信,需要对已经编码的码流数据进行相互的转换,这一转换过程就叫做视频转码。AVS-M是我国专门针对移动多媒体应用自行制定、拥有自主知识产权的数字音视频编解码标准,为AVS系列编解码标准的第七部分;H.264-Baseline为H.264/AVC(Advanced Video Coding,先进视频编码)标准的基本档次,为国际移动多媒体应用的主流标准。本文的研究工作围绕两种标准相互间的转码展开。
     本文的主要工作如下:
     (1)对H.264-Baseline和AVS-M两种视频编解码标准的基本技术要点进行了简要介绍,并通过对比两者之间的不同,提出了实现转码器所需解决的主要问题。
     (2)通过对H.264-Baseline和AVS-M之间整数变换方面的比较,本文提出了一种基于变换域的变换系数转码快速算法,该算法通过简单的蝶形计算将两种码流的变换系数矩阵互转,结合量化过程实现残差数据的快速转码。
     (3)通过对帧内预测编码部分的比较分析,本文提出了一种帧内预测模式的快速映射算法,该算法结合16×16亮度宏块切割模式映射等方法,重用了码流中包含的帧内模式预测结果,极大程度的提高了帧内最佳预测模式的判断速度。
     (4)通过对帧间预测编码部分的比较分析,本文提出了一种树形结构的帧间预测转码算法,该算法针对不同的宏块选择性进行运动矢量预测的修正和插值数据残差的重计算,在保证图像质量的前提下提高帧间转码的效率。
     (5)将本文提出的算法应用于转码器的构建程中,利用一系列测试完成对转码器的转码质量和效率测试。结果表明,该转码器能够正确实现H.264-Baseline和AVS-M之间的转码,并能在保证图像质量的同时大大提高转码效率。
With the development of wireless network and audio and video technology, the application of mobile multimedia codec plays an important role more and more in social life. For supporting the communication between different sets with different code standard, it is necessary for transforming the coded bit-stream into another style. This process is called video transcode. As a digital audio and video codec standard established and owned by China, AVS, Audio and Video Standard, includes a video standard called AVS-M as its seventh part; H.264-Baseline, the base level of H.264/AVC(Advanced Video Coding), is the mainstream standard of international mobile multimedia communication. The essay's work is the transformation between them:
     (1) Introducing the principle of H.264-Baseline and AVS-M, comparing with each other and taking the keys about realizing the transcoder.
     (2) After comparing the part of integer transform between H.264-Baseline and AVS-M, the essay presents a transcoding fast algorithm about transform coefficient in the transform field, which transforming the transform coefficient matrix between them by using butterfly-type algorithms, realizes the remnant data 's fast transcoding.
     (3) After analyzing and finding difference between the part of intra predication, the essay presents a fast algorithm for mapping the intra prediction modes. Using the way of cutting the 16×16 luminance macro block and mapping its prediction mode, it is nice for reusing the results about intra mode prediction, advancing the speed of getting the best prediction mode.
     (4) After comparing the part of inter predication, the essay proposes a tree-type inter prediction transcoding algorithm, which using the motion vector correction and the regain of insert-value, advances the efficiency of inter prediction transcoding and keeps the quality of image.
     (5) At last, the essay builds a transcoder by using such three algorithms, tests the quality and efficiency of transcoding. This architecture can realize the transcoding between H.264-Baseline and AVS-M, and advance the efficiency of transcoding that kept the quality of image.
引文
[1] AVS工作组. 信息技术—先进音视频编码第7部分:移动视频(报批稿)[Z].2006.4.
    
    [2] N. Feamster, C. Christopoulos. Transcoder architectures for video Coding[J]. IEEE Trans. Consumer Electronics, 1998,44:88-98.
    [3] Y.H.Moon, G.Y.Kim, J.H.Kim. An improved early detection algorithm for all-zero blocks in H.264 video encoding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2005,15:1053-1057.
    [4] A. Ahmad, N. Khan, S. Masud, M.A.Maud. Selection of variable block sizes in H.264[C]. Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004. Quebec, Canada, 2004:173-176.
    [5] M. Horowitz, A. Joch, F. Kossentini, A. Hallapuro. H.264/AVC baseline profile decoder complexity analysis[J]. IEEE Transactions on Circuits and System for Video Technology, 2005, 5:704-716.
    [6] C.C.Cheng, T.S.Chang. Fast three step intra prediction algorithm for 4×4 blocks in H.264[C]. Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS), Kobe, Japan, 2005:1509-1512.
    [7] Z. Wei, B.Jiang, X.Zhang, Y.Chen. A new Full-pixel and Sub-pixel Motion Vector Search Algorithm for Fast Block-matching Motion Estimation in H.264[C]. Proceedings of the Third International Conference on Image and Graphics(ICIG). Hong Kong, China, 2004:345-348.
    [8] Lei, N. D. Georganas. H.263 Video Transcoding for spatial Resolution Down scaling Proe[C]. IEEE International Conference on Information Technology: Coding and Computing, Las Vegas, Nevada. 2002:125-128.
    [9] S.F.Lin, M. T. Lu, H. Chen. Fast Multi-Frame Motion Estimation for H. 264 and Its Applications to Complexity-Aware Streaming[C]. Proceedings of IEEE International Conference on Circuits and Systems. 2005:1505-1508.
    [10] R. Lee. A new algorithm to compute the discrete cosine transforms[J]. IEEE Transactions on Signal Processing. 1984, 32:1243-1245.
    
    [11] G.Y.Kim, Y. H. Moon, J.H.Kim. An early detection of all-zero DCT blocks in H.264[C]. Proceedings of the IEEE International Conference on Image Processing(ICIP). Singapore, 2004:453-457.
    
    [12] Y.Shen, C.Huang, L. Yu, et al. Fast Multi-frame Motion Estimation Algorithm in H.264[C]. Proceedings of IEEE International Conference on Signal Processing. CA, US.2004:1187-1190.
    [13]Joint Video Team of ITU-T and ISO/IEC JTC1.Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification(ITU-T Rec.H.264|ISO/IEC 14496-10 AVC)[Z].Joint Video Team(JVT)of ISO/IEC MPEG and ITU-T VCEG,JVTG050r1,2003.
    [14]Y.D.Zhang,F.Dai,S.X.Lin.Fast 4x4 intra-prediction mode selection for H.264[C].Proveedings of IEEE international Conference on Multimedia and Expo(ICME),2004:537-540.
    [15]Z.Chen,Y.He.Prediction based directional refinement algorithm for fractional pixel motion search strategy.Document Joint Video Team-D069.July,2002.
    [16]S.M.Park,M.Lee,S.Kim,et al.VLSI Implementation of H.264 Video Decoder for Mobile Multimedia Application.ETRI Journal,2006,28:525-528.
    [17]A.Vetro,T.Hata,N.Kuwahara.Complexity-quality analysis of transcoding architectures for reduced spatial resolution[J].IEEE Trans.Consumer Electronics,Aug.2002,48:515-521.
    [18]毕厚杰.新一代视频压缩编码标准:H.264/AVC[M].北京:人民邮电出版社,2005:4-6.
    [19]Minyue Wang,Guangmin Sun,Qiang Wu.Converting AVS coefficients to H.264/VAC transform coefficients[C],IEEE International Conference on Industrial Informatics (INDIN08).Daejeon,Korea,2008,7.
    [20]Joo-Kyong Lee,Ki-Dong Chung.Quantization/DCT Conversion Scheme for DCT-Domain MPEG-2 to H.264/AVC Transcoding[J].IEICE Trans.Communication.2005,E88-8(7):2856-2863.
    [21]YongTaek Hong,KyungHo Leet,Jingsang Kim,Won-Kyung Chot.High Speed Architecture for MPEG-2/H.264 Video transcoding.2006 Communications and Information Technologies,International Symposium on Oct.18 2006-Sept.20.2006:674-678.
    [22]B.Meng,O.C.Au.Fast intra-prediction mode selection for 4x4 blocks in H.264[C].Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing(ICASSP' 03).Hong Kong,China,2003:389-392.
    [23]B.Meng,O.C.Au,C.W.Wong,H.K.Lam.Efficient Intra-Prediction Algorithm in H.264[C].Proceedings of IEEE International Conference on Multimedia and Expo (ICME2003).Maryland,USA,2003:837-840.
    [24]G Keesman,R Hellinghuizen,F Hoeksema.Transcoding of MPEG bitstreams[J].Signal Processing,Image Communications.1996,8:481-500.
    [25]Jingyu Yang,Qionghai Dai,Wenli Xu.A Rate Control Algorithm for MPEG-2 to H.264Real-time Transcoding[C].Proc.of SPIE.2005,Vol.5960:1995-2003.
    [26]O.Werner.Requantization for transcoding of MPEG-2 intraframes[J].IEEE Trans.Image Process.1999,8:179-191.
    [27]V.Lappanlainen.Performance of an advanced video codec on a general-purpose processor with media ISA extensions[J].IEEE Transactions on Consumer Electronics.2000,46:706-716.
    [28]A.Chang,O.C.Au,Y.M.Yeung.A novel approach to fast multi-frame selection for H.264 video coding[C].Proceedings of International Symposium on Circuits and Systems.2003,2:704-707.
    [29]X.S.Zhou,E.Q.Li,Y.K.Chen.Implementation of H.264 decoder on general-purpose processor with media instructions[C].Proceedings of SPIE Conference on Image and Video Communications and Processing.Vol.5022.
    [30]E.Feig,S.Winograd.Fast algorithms for the discrete cosine transform[J].IEEE Transactions on Signal Processing.1992,40:2174-2193.
    [31]M.E.Al-Mualla,C.N.Changarajah,D.R.Bull.Video coding for mobile communications efficiency,complexity and resilience[M].Elsevier Science,2002.
    [32]Yao Wang,Jorn Ostermann,Ya-Oin Zhang.视频处理与通信[M].北京:电子工业出版社,2003.6.
    [33]陈际红.典型国际视频标准专利许可协议的分析研究[J].电子信息产业标准化动态,2006,6:42-47.
    [34]楮晶辉,俞斯乐,鲁照华.视频转换编码及其实现技术的研究[J].电子学报,2004,10(32):1678-1683
    [35]郭映,李桂苓,徐岩.视频码流转换编码器结构及其分析[J]。中国图像图形学报,2004,10(9):1153-1159.
    [36]Iain E.G.Richardson.H.264和MPEG-4视频压缩—新一代多媒体视频编码技术[M].长沙:国防科技大学出版社,2004,11.
    [37]王丽莉.MPEG-2到AVS视频转码器的研究与实现[D].哈尔滨:哈尔滨工程大学,2007.
    [38]王敏越.AVS向H.264转码的关键技术研究与实现[D].北京:北京工业大学,2008.
    [39]沈兰荪,卓力.小波编码与网络视频传输[M].北京:科学出版社,2005,12.
    [40]欧建平,娄生强.网络与多媒体[M].北京:人民邮电出版社,2002,2.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700