视频编码和编码转换中的运动矢量估计

作者：董武
论文级别：硕士
学科专业名称：电路与系统
中文关键词：视频编码 ; 编码转换 ; 运动估计 ; 块匹配 ; 自适应 ; 降低帧率 ; 合成运动矢量
英文关键词：Video coding ; Transcoding ; Motion estimation ; Block matching ; Adaptive ; Reduced frame rate ; Resultant motion vector
学位年度：2004
导师：李晓辉
学科代码：080902
学位授予单位：安徽大学
论文提交日期：2004-05-11

摘要

随着无线通信的广泛应用，视频图像的编码和传输技术面临巨大的挑战。由于视频的数据量巨大，为了满足视频在频带受限的无线信道上传输的实时性要求，必须使用数据量压缩比大而且复杂度小的快速编码算法，尽量用最少的数据传输最大的信息量。运动估计是运动图像压缩中的关键技术之一，视频信号在时间上有很强的相关性，利用块匹配估计和运动补偿技术，可以有效地去除图像帧间冗余度，实现高压缩比。通常，在编码器运行中，运动估计算法需要消耗70％左右的执行时间，因此为了提高编码器的速度必须首先提高运动估计算法的效率。
     此外，由于Internt和移动通信的高速发展，出现了各种具有不同性能的客户机，如蜂窝手机、PDA、手提电脑和膝上电脑等等，这些客户机迫切要求能够无线接入Internet，浏览Internet上的内容。由于Internet和无线网络具有不同的带宽，因而也就对应着不同的传输码率。如果将己压缩的视频信号流由互联网直接通过无线网络传送给客户机，将会出现视频编码流与传输信道失配的情况。此时，就需要在Internet和客户机之间设置代理服务器，对已压缩编码的视频信号流进行码率转换，将已压缩的高速视频码流转换成低速率的视频码流，以保证视频信号流在移动无线网络中的正确传输，为移动用户提供不同服务质量的视频服务。
     本文在以下几个方面进行了研究：
     (1)在传统的运动估计算法的基础上，根据视频中物体的运动情况提出了一种基于图像运动特征的快速运动估计算法。仿真实验结果表明，本文算法适用于各种运动情况的图像序列，其性能接近于全搜索算法，同时极大地降低了计算复杂度。
     (2)介绍了视频编码转换中的各种转换模型。视频编码转换既可以在像素域中进行，也可以在变换域—DCT域中进行。具体的转换方式有三种：码率转换、分辨率转换、编码制式的转换。码率转换一般是降低视频的码率，提高不同网络的兼容性；分辨率转换一般是降低视频的空间分辨率和时间分辨率；编码制式的转换是对已用一种标准编码后的视频流用另外一种标准来编码。
     (3)在基于降低时间分辨率的转换中，本文分析了合成运动矢量的线性内插法、FDVS方法，并在此基础上提出了一种运动矢量合成的新方法。仿真实验结果表明，该算法同已有的合成算法相比，不仅提高了视频的转换质量，而且提高了转换速度。
With great applications of wireless communication, there are great challenges in the coding and transmission technology of video. Because data quantity of video is very big, fast coding algorithm used must have high data compression ratio and low complexity, and least data quantiy used can transmit most information quantity in order to satisfy video's real time request, when video is transmitted in narrow wireless channel. Motion Estimation is one of key technology in the video image compression coding. Video signals have very high motion correlation in temporal direction, motion estimation and motion compensation technology based on block matching can eliminate redundancy of inter-frame's effectively to achieve high compression ratio. Commonly, motion estimation algorithm consumes about 70% computing time in coder, so to improve the coder's speed, the efficiency of motion estimation must be improved firstly.
    Furthermore, with Internet and mobile communication's high development, various client devices having different performance appear, such as: cellular phone, PDA, hand-held computer, laptop computer, etc. These client devices hope to gain access to Internet in wireless channel, and browse Internet's content. As Internet and wireless network have different bandwidth, they have different code rate accordingly. If video signal stream compressed is transmitted in wireless channel from Internet to client devices, coded video signal stream will not match the wireless channel. So, proxy need to be set between generic WWW servers and client devices, transcoding video signal stream which has been coded and compressed. Proxy can transform high speed video stream into low speed video stream to make video signal stream transmitted accurately in mobile wireless network, and provides client devices different quality of service.
    The works we have researched include several aspects as follows:
    (1) Traditional motion estimation algorithms are analysed, and a fast motion estimation algorithm based on video objects' motion character is proposed. The simulation experiment result shows this algorithm is adapted to image sequences with different motion characteristic, and its performance is close to full search algorithm while greatly reducing computation complexity.
    (2 ) Various transcoding models in video transcoding are analysed. Video transcoding can be carried out not only in pixel domain but also in transform domain, namely DCT domain. Video




    transcoding has three means: code rate conversion, resolution conversion, conversion between coding standard. Code rate conversion commonly is reducing video's code rate. Resolution conversion commonly is reducing video's temporal resolution and spatial resolution. Conversion coding standard is recoding video with another coding standard which has been coded with a standard.
    (3) In reducing temporal resolution transcoding two motion vector composed methods are analysed: linear interpolation method and FDVS method. A new motion vector composed method is proposed. Experiment results reveal the proposed algorithm is computationally efficient while keeping better image quality compared with the previous presented algorithms.

引文

[1] ITU-T Ree, H.261, Video Codec for Audiovisual Services at P64k bit/s, Rev.2,[S] 1993.
    [2] ITU-T Rec, H.262 ISO/IEC 13818-12, Information Technology-Generic Coding of Moving Pictures and Associated Audio Information, Part 2: Video[S], 1995.
    [3] ITU-T Rec. H.263, Video Coding for Low Bit Rate Communication[S], 1995.
    [4] ISO/IEC 11172, Information Technology-Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5Mb/s, Part 1:system, Part2:video, part 3:Audio[S], 1993.
    [5] ISO-MPEG-2 ISO 13818-2, Coding of Moving Picture Associated Audio[S], 1994, 12.
    [6] ISO-IEC, 144961-2, Information Technology-Coding of Audio-Visual Objects, Part 2: Visual[S], 1998.
    [7] A. Jamalipour, S. Tekinay. Fourth Generation Wireless Networks and Interconnecting Standards[J]. IEEE Personal Communications, 2001, Vol. 8, No. 5, page: 8～9.
    [8] W. Takayuki et al. Video Transcoding Proxy for 3Gwireless Mobile Internet Access[J]. IEEE Communication Magazine, October 2000, page: 66～71.
    [9] S. Ota et al. Architecture of Multimedia Data Transcoding System for Mobile Computing[C]. 2000 IEICE General Conf., B-5-174, Mar 2000, page:559.
    [10] Ghanbari M. The Cross-Search Algorithm for Motion Estimation[J]. IEEE Transactions on Communications, 1990, Vol. 38, No. 7, page: 950～953.
    [11] Po L M, Ma W C. A Novel Four-Step Algorithm for Fast Block Motion Estimation[J]. IEEE Transactions on Circuits and Systems on Video Technology, 1996, Vol.6, No. 3, page: 313～317.
    [12] Jain J R, Jain A K. Displacement Measurement and Its Application in Interframe Image Coding[J]. IEEE Transactions on Communications, 1981, Vol.29, No. 12, page: 1799～1808.
    [13] Zhu S, Ma K K. A New Diamond Search Algorithm for Fast Block Matching Motion Estimation[C]. Int'1 Conf. on Information, Commu. and Signal Proc.(ICICS'97), Singapore, 1997, page: 292～296.
    [14] P. Assuncao, M. Ghanbari. Optimal Transcoding of Compressed Video[C]. 1997 International Conference on Image Processing (ICIP'97), Vol.3, page: 739～742.
    [15] G. Keesman, etal. Transcoding of MPEG bitstreams[J]. Signal Processing Image Comm, 1996, vol.8, page: 481～500.


    [16] N. Bjork, C. Christopoulos. Transcoder Architectures for Video Coding[J]. IEEE Trans. On CE, 1998, Vol.44, No.1, page: 88～98.
    [17] Peng Yin, Min Wu and Bede Liu. Video Transcoding By Reducing Spatial Resolution[C]. the Proceedings of the IEEE Int'1 Conf. on Image Processing(ICIP), Vancouver, Canada, Sept. 2000. Vol.1, page: 972～975.
    [18] A. Vetro, T. Hata, N. Kuwahara, H. Kalva, and S. Sekiguchi. Complexity-quality analysis of transcoding architectures for reduced spatial resolution[J]. IEEE Transactions on Consumer Electronics, August 2002, Vol.48, No.3, page: 515～521.
    [19] Hwang J H, Wu T D, and Lin C W. Dynamic frame-skipping in video transcoding[J]. In Proceedings of the IEEE Second Workshop on Multimedia Signal Processing, December 1998, page: 616～621.
    [20] Mei-Juan Chen, Ming-Chung Chu, and Chih-Wei Pan. Efficient Motion-Estimation Algorithm for Reduced Frame-Rate Video Transcoder[J]. IEEE Transactions on Circuits and System for Video Technology, 2002, Vol. 12, No. 4, page: 269-275.
    [21] K.T.Fung, Y. L. Chan, and W. C. Siu. New architecture for dynamic frame-skipping transcoding[J]. IEEE Trans.Image Processing, Aug, 2002, vol.11, page. 886～900.
    [22] Vetro A, Yin P, Liu B, and Sun H. Reduced spatio-temporal transcoding using an intra-refresh technique[C]. IEEE International Symposium on Circuits and Systems, 2002, page: 723～726.
    [23] M. Ghanbari, T. Shanableh. Transcoding of video into different encoding formats[J]. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2000), 2000, page: 1927～1930.
    [24] Jun Xin, Ming-Ting Sun, Byung-Sun Choi, Kang-Wook Chun. An HDTV-to-SDTV Spatial Transcoder[J]. IEEE Transactions on Circuits and System for Video Technology, November 2002, Vol. 12, No.11. page: 998～1008.
    [25] K. Seo, J. Kim. Fast motion vector refinement for MPEG-1 to MPEG-4 transcoding with spatial downsampling in DCT domain[C]. Proceedings of IEEE ICIP2001,2001, page: 469～472.
    [26] Soam Acharya, Brian Smith. Compressed Domain Transcoding of MPEG[C]. IEEE International Conference on Multimedia Computing and Systems. 1998, page: 295～304.
    [27] Junichi Nakajima, Hiroyuki Tsuji, Yoshiyuki Yashima, Naoki Kobayashi. Motion Vector Re-estimation for Fast Video Transcoding from MPEG-2 to MPEG-4[C]. Proc. Of the 2nd Workshop and Exhibition on MPEG-4, June 2001, page: 87～90.


    [28] Nick Feamster, Susie Wee. An MPEG-2 to H.263 Transcoder[C]. SPIE International Symposium on Voice, Video, and Data Communications, Boston, MA, September, 1999.
    [29] S.Dogan, A.H.Sadka, A. M. Kondoz. Efficient MPEG-4/H.263 video transcoder for interoperability of heterogeneous multimedia networks[J]. IEE Electronics Letters, May 1999, vol. 35, page: 863～864.
    [30] 郭晓强，门爱东，全子一。H.263与MPEG-4视频码流相互转换的实现[J]。电视技术，2003，11，page：31～34。
    [31] T. Shanableh, M.Ghanbari. Transcoding architectures for DCT-domain heterogeneous video transcoding[C]. Proceedings. 2001 International Conference on Image Processing, 2001, Vol. 1, page: 433～436.
    [32] Chia-Wen Lin, Yuh-Reuy Lee. Fast algorithms for DCT-domain video transcoding[C]. Proceedings. 2001 International Conference on Image Processing, 2001, Vol.1, page: 421-424.
    [33] Shizhong Liu, Alan C. Bovik. Local Bandwidth Constrained Fast Inverse Motion Compensation for DCT-Domain Video Transeoding[J]. IEEE Transactions on Circuit and System for Video Technology, May 2002, Vol.12, No.5. page: 309～319.
    [34] S. J. Wee, J. G. Apostolopoulos, N. Feamster. Field-to-frame transcoding with spatial and temporal downsampling[C]. Proc. IEEE International Conference on Image Processing (ICIP), October 1999, page: 24～28.
    [35] Susie J. Wee and John G. Apostolopoulos. Efficient Processing of Compressed Video[C]. Conference Record of the Thirty-Second Asilomar Conference on Signals, Systems&Computers, 1998, page: 855～859.
    [36] C. E. Shannon. A mathematical theory of communication[J]. Bell System Technical Journal, 1948, Vol. 27.
    [37] H. G. Musman et al. Advances in picture coding[C]. Proc. IEEE, 1985, Vol.73: 523～548.
    [38] 纪中伟，许林峰，朱维乐。基于预测性菱形搜索法的快速去隔行技术[J]。系统工程与电子技术，2002，Vol．24，No．11，page：112～116。
    [39] 肖自美。图像信息理论与压缩编码技术。中山出版社，2000。
    [40] Po L M, Ma W C. A Novel Four-Step Algorithm for Fast Block Motion Estimation[J]. IEEE Transactions on Circuits and Systems on Video Technology, 1996, Vol.6, No. 3, page: 313～317.
    [41] A. Puri, H. M. Hang, and D. L. Schilling. An efficient block matching algorithm for motion-compensated coding[C], in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal

    Processing, 1987, page: 1063～1066.
    [42] Yun-Hee Choi, Tae-Sun Choi. Fast motion estimation techniques with adaptive variable search range[J]. IEICE Trans. on Fundamental Electronics, Communications and Computer, 1999, Vol. E82-A, page: 905～910.
    [43] C. T. Chen. Video compression: standards and application[J]. Journal of Visual Communication and Image Representation, Vol.4, No.2. page: 103～111.
    [44] P. A. A. Assuncao, M. Ghanbari. A frequency-domain video transcoder for dynamic bit rate reduction of MPEG-2 bit streams[J]. IEEE Trans. Circuits and Systems for Video Technology, 1998, Vol.8, page: 953～967.
    [45] P. A. A. Assuncao, M. Ghanbari. Buffer analysis and control in CBR video transcoding[J]. IEEE Transactions on Circuits and Systems for Video Technology. 2000, Vol.10, No 1, page: 83～92.
    [46] P. A. A. Assuncao, M. Ghanbari. Bit Rate Control of Internetworking Video Transcoders[C]. Fifth IEEE International Conference on Electronics, Circuits and Systems, ICECS'98, vol.2, page: 247～250.
    [47] P. A. A. Assuncao, M. Ghanbari. Fast computation of MC-DCT for video transcoding[J]. Electronics Letters, 1997, Vol.33, No.4, page: 284～286.
    [48] P. A. A. Assuncao, M. Ghanbari. Transcoding of MPEG-2 video in the frequency domain[C]. ICASSP-97, 1997, Vol.4, page: 2633～2636.
    [49] Kou-Sou Kan, Kuo-Chin Fan, Yin-Hwa Huang. Low-complexity and low-delay video transcoding for compressed MEPG-2 bitstream[C]. IEEE International Symposium on Circuits and System, 1997, page: 99～102.
    [50] Hiroyuki Kasai, Tsuyoshi Hanamura, etal. Rate control scheme for low-delay MPEG-2 vidio transcoder[C]. Proceedings of International Conference on Image Processing, 2000, page: 473～484.
    [51] E. Feig, S. Winograd. Fast algorithms for discrete cosine transform[J]. IEEE Trans.Signal Processing, 1992, 40, page: 2174～2193.
    [52] 李晓辉。基于降低分辨率模型视频代码转换的研究[J]。应用科学学报，200l，Vol．19，No．2，page：127～130。
    [53] Peng Yin, Anthony Vetro, Bede Liu, Huifang Sun. Drift compensation for reduced spatial resolution transcoding[J]. IEEE Transactions on circuits and system for video technology,

    2002, Vol.12, No.11, page: 1009～1019.
    [54] H. Sun, W. Kwok, J. Zdepski. Architectures for MPEG compressed bitstream scaling[J]. IEEE Transaction on Circuits and System for. Video Technology, 1996, Vol.6, page: 191～199.
    [55] Pedro A. A. Assuncao and Mohammed Ghanbari. Post-processing of MPEG-2 coded video for transmission at lower bit rates[J]. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, May 1996, page: 1998～2001.
    [56] Jun Xin, Ming-Ting Sun, Kangwook Chun, Byung Sun Choi. Motion re-estimation for HDTV to SDTV transcoding[C]. 2002 IEEE International Symposium on Circuits and Systems, 2002, Vol. 4, page: 715～718.
    [57] Rakesh Dugad, Narendra Ahuia. A Fast Scheme for Downsampling and Upsampling in the DCT Domain[C]. Proceedings of the 1999 International Conference on Image Processing, 1999, Vol.Ⅱ, page: 909～913.
    [58] S. F. Chang, D.G. Messerschmitt. Manipulation and composition of MC-DCT compressed video[J]. IEEE Journal on Selected Areas in Communication, 1995, Vol.13, page: 1～11.
    [59] Youn J, Sun M T, and Lin C W. Motion vector refinement for high performance transcoding[J]. IEEE Transactions on Multimedia, 1999, Vol.1, No.1, page: 30～40.
    [60] Vetro A, Yin P, Liu B, and Sun H. Reduced spatio-temporal transcoding using an intra-refresh technique[C]. IEEE International Symposium on Circuits and Systems, 2002, page: 723～726.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700