H.264到HEVC视频转码技术研究

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

H.264到HEVC视频转码技术研究

详细信息本馆镜像全文| 推荐本文 | | 获取CNKI官网全文

英文题名：Research on Video Transcoding Techniques from H.264to HEVC
作者：蒋炜
论文级别：博士
学科专业名称：电子信息技术及仪器
中文关键词：视频转码 ; H.264 ; HEVC ; 统计分析 ; 区域特征分析 ; 视觉显著性
英文关键词：video transcoding ; H.264 ; HEVC ; statistical analysis ; region feature
英文关键词：analysis ; visual saliency
学位年度：2013
导师：陈耀武
学科代码：0810
学位授予单位：浙江大学
论文提交日期：2013-10-01

摘要

视频编码技术发展的趋势之一是追求更高的编码效率。H.264视频编码标准在提高编码效率和灵活性方面取得了巨大成功,它使得数字视频有效地应用于各种各样的网络类型和工程领域。然而,多样化的服务、高清视频的普及、以及超高清格式(4K×2K或8K×4K分辨率)的出现对于比H.264编码效率更高的下一代视频编码标准提出了强烈的需求。在这样的背景下,MPEG和VCEG组织于2010年成立了视频编码联合协作小组(JCT-VC),经过多年的努力,研发出了H.264标准的继承者,新一代视频编码标准HEVC。与H.264相比,HEVC虽然可以在相似的视频感知质量下节省高达约50%的比特率,但由于H.264广泛而深入的应用,在相当长一段时间内,这两个技术需要共存。因此H.264到HEVC的转码在网络传输和存储方面具有重要的现实意义。然而,HEVC为了提高编码效率,引入了一系列相当耗时的编码算法,给实时视频转码应用带来了新的挑战。针对HEVC编码算法特性,在转码过程中充分利用H.264码流信息来加速转码中HEVC重编码过程是提高转码器性能的关键之一。此外,由于转码的目标是为了在同样的视频质量下获取更高的压缩效率,而视觉显著性分析已成为计算机视觉和图像处理领域一个重要的研究课题,因此如何从人眼视觉感知的角度,在H.264码流压缩域提取视觉显著性进而指导H.264到HEVC的转码过程也成为提高转码效率的关键之一。正是在这样的背景下,本文展开了对H.264到HEVC视频转码方法的研究。
     第一章首先阐述了选题的意义,接着对视频编解码技术、H.264和HEVC编码技术、视频转码技术以及视觉显著性及其应用进行了简单综述,最后介绍了本文的主要研究内容和论文结构。
     第二章从统计分析的角度对H.264到]HEVC快速视频转码方法进行研究。针对帧间转码,首先通过大量统计分析找出HEVC码流中Skip模式与H.264码流中各种模式的映射关系,并利用其对Skip模式进行提前判决,然后通过对编码比特数进行数理统计分析快速选择预测单元的对称与非对称分割模式,最后依据运动矢量的相似性优化了HEVC运动估计过程中预测单元的搜索起点和搜索范围,进一步减少了转码过程的计算量。针对帧内转码,首先利用H.264和HEVC帧内编码模式之间的关系自适应地选择编码树单元的搜索深度范围,然后通过计算编码单元区域的梯度大小和方向,找出其与帧内方向预测模式之间的关系,减少HEVC帧内预测的候选模式集中的模式个数,进而加速帧内转码过程。
     第三章提出了一种基于区域特征分析的H.264到HEVC快速视频转码方法。该方法首先根据图像复杂度和编码比特数之间的关系将每帧图像以编码树单元为单位划分为三种复杂度区域,其次按照不同区域类型决定每个编码树单元的搜索深度范围。然后对运动矢量进行去噪滤波和聚类以分析每个编码单元的区域特征,依据其分析结果择优选择编码单元的最小搜索深度和预测单元的最可能分割模式。最后,将聚类中心的运动矢量加入预测运动矢量作为候选,从而减小运动搜索窗的大小,以达到在保证几乎相同的率失真视频质量下,大幅减少转码过程的计算复杂度。
     第四章针对H.264到HEVC视频转码效率进行研究,通过分析视觉显著性及其在视频转码领域中的应用,根据人眼视觉的特点,提出一种基于视觉显著性分析的H.264到HEVC视频转码算法。该方法首先利用H.264码流中的运动矢量场进行全局运动估计和局部运动分割得到运动显著性图,然后结合编码比特数的分布特点加以修正生成最终的视觉显著性图,最后在HEVC重编码过程中,利用视觉显著性图对非显著性区域进行自适应频率系数压制,以在保持视频主观质量的前提下,进一步提高视频转码的效率。
     第五章总结归纳了本文的创新点和研究成果并提出了进一步的研究方向和任务。
One of the development trend in video coding technology is the pursuit of higher coding efficiency. The H.264standard has been a great success in terms of both coding efficiency enhancement and flexibility for effective use over a broad variety of network and application domains. However, an increasing diversity of services, the growing popularity of HD video, and the emergence of beyond-HD formats (e.g.4k×2k or8k×4k resolution) are creating even stronger needs for coding efficiency superior to H.264's capabilities. Under this circumstance, MPEG and VCEG have established a Joint Collaborative Team on Video Coding (JCT-VC) to develop a successor to H.264. This new international standard is referred to as High Efficiency Video Coding (HEVC). Although HEVC can achieve up to approximately50%bit rate savings for similar perceived video quality compared to H.264, the wide and deep penetration of H.264creates a need for the co-existence of these technologies in a fairly long period of time. Therefore, transcoding from H.264to HEVC has important practical significance in the network transmission and storage. However, on one hand, HEVC introduces a series of time-consuming coding algorithm in order to improve the coding efficiency, thus brings new challenges to real-time video transcoding applications. Considering the characteristics of HEVC coding algorithm, making full use of the H.264bitstream information to speed up the HEVC re-encoding process of transcoding is one of the key technologies to improve the performance of transcoder. On the other hand, since the goal of transcoding is to obtain higher compression efficiency at the same video quality and the analysis of video saliency in computer vision and image processing has become an important research field, how to extract the video saliency from the compress domain of H.264bitstream from a human visual perception to guide the optimization of the transcoding from H.264to HEVC is also one of the key technologies to improve the performance of transcoder. In this context, the research of video transcoding from H.264to HEVC is carried out.
     In chapter1, the importance of the research work is firstly presented. Secondly, the video codec technology, H.264and HEVC codec technology, video transcoding technology and video saliency and its applications are biefly summarized. Finally, the main research contents and the structure of the thesis are illustrated.
     In chapter2, low-complexity video transcoding algorithm from H.264to HEVC is studied based on statistical analysis. For inter frame transcoding, by exploiting the mapping correlations of Skip mode in HEVC and all modes in H.264/AVC, an early decision of Skip mode is firstly introduced. Secondly, through the statistical analysis of coding bits, a fast prediction unit (PU) partition selection for both symmetric and asymmetric partitions is described. Finally, the motion estimation process is optimized according to the motion similarity between H.264and HEVC bitstreams. For intra frame transcoding, the searching depth range of coding tree unit (CTU) is firstly decided based on the the intra coding modes in H.264. Secondly, gradient directions are statistically calculated and a gradient-mode histogram is generated for each coding unit. Finally, based on the distribution of the histogram, only a small part of the candidate modes are chosen for the intra coding processes.
     In chapter3, a fast H.264to HEVC transcoding algorithm based on region feature analysis is proposed. First, each frame is segmented into three regions in units of CTU based on the correlation between image coding complexities and coding bits of the H.264source stream. Then the searching depth range of each CTU is adaptively decided according to the region type. After that, motion vectors are de-noise filtered and clustered in order to analyze the region features of coding unit (CU). Based on the analysis results, the minimum searching depth of CU and partitions of PU are optimally selected, and the motion vector predictor and search window size of motion estimation are also optimally decided for further reduction of the computational complexity.
     In chapter4, based on the characteristics of human vision system, a video saliency based video transcoding algorithm from H.264to HEVC is proposed. Firstly, the motion vector field included in the H.264bitstream is utilized to do the global motion estimation and motion segmentation. Secondly, the distribution of coding bits is combined to produce the visual saliency map which indicates the salient regions in the videos. Finally, during the re-encoding process, a frequency coefficient suppression technique in the transform domain is used in the non salient region for further bitrate reduction while keeping the subjective quality of salient region.
     In the final chapter, the novel achievements of the research in this thesis and the prospect of the future research are concluded.

引文

[1]Weigand T, Sullivan G J, Bj(?)ntegaard G, et al.. Overview of the H.264/AVC video coding standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2003,13(7):560-576.
    [2]Sullivan G J, Ohm J R, Han W J, et al. Overview of the high efficiency video coding (HEVC) standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12):1649-1668.
    [3]Xin J, Lin C W, Sun M T. Digital video transcoding [J]. Proceedings of the IEEE,2005,93(1):84-97.
    [4]Vetro A, Christopoulos C, Sun H. Video transcoding architectures and techniques:an overview [J]. IEEE Signal Processing Magazine,2003,20(2): 18-29.
    [5]Yin P, Vetro A, Xia M H, et al.. Rate-Distortion Models for Video Transcoding [J]. Image and Video Communications and Processing (SPIE), 2003,5022:479-488.
    [6]Feamster N, Wee S. MPEG-2 to H.263 transcoder [J]. Multimedia Systems and Applications (SPIE),1999,3845:164-175.
    [7]Jang S H, Jayant N. An adaptive non-Linear motion vector resampling algorithm for down-scaling video transcoding [C]. IEEE International Conference on Multimedia and Expo,2003:229-232.
    [8]Yeh J, Cheung G. Complexity scalable mode-based H.263 video transcoding [C], IEEE International Conference on Image Processing (ICIP), 2003:169-172.
    [9]Acharya S, Smith B. Compressed Domain Transcoding of MPEG [C], IEEE International Conference on Multimedia Computing and Systems,1998: 295-304.
    [10]Liu S, Lu L G, Kuo CCJ. Efficient MPEG-2 to MPEG-4 video transcoding [C], International Conference on Image and Video Communications and Processing (SPIE),2003:186-195.
    [11]Youn J, Sun M T. An HDTV-to-SDTV spatial transcoder [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,12(11): 998-1008.
    [12]Pantoja M, Ling N. A two-level rate control approach for video transcoding [C]. IEEE International Conference on Image Processing (ICIP), 2009:3657-3660.
    [13]von dem Knesebeck M, Nasiopoulos P. A Fast Mode Decision Algorithm for Downscaled Transcoding of H.264 Preencoded Video [C]. IEEE International Conference on Consumer Electronics (ICCE),2010.
    [14]Cavallaro A, Steiger O, Ebrahimi T. Semantic segmentation and description for video transcoding [C]. IEEE International Conference on Multimedia and Expo (ICME),2003:597-600.
    [15]Corrales-Garcia A, Martinez J L, Fernandez-Escribano G, et al.. Wyner-Ziv to Baseline H.264 Video Transcoder [J]. EURASIP Journal on Advances in Signal Processing,2012,(135):1-19.
    [16]Wang J, Yang E H, Yu X. An efficient motion estimation method for H.264-based video transcoding with spatial resolution conversion [C]. IEEE International Conference on Multimedia and Expo (ICME),2007:444-447.
    [17]Lei Z J, Nicolas D Georganas. Accurate bit allocation and rate control for DCT domain video transcoding [C]. IEEE Canadian Conference on Electrical and Computer Engineering (CCECE),2002:968-973.
    [18]Lefol D, Bull D, Canagarajah, N, et al.. An efficient complexity-scalable video transcoder with mode refinement [J]. Signal Processing-Image Communication,2007,22(4):421-433.
    [19]Kasai H, Hanamura Tsuyoshi, Kamayama W, et al.. Rate control scheme for low-delay MPEG-2 video transcoder [C]. IEEE International Conference on Image Processing (ICIP),2000:964-967.
    [20]Shen GB, He YW, Cao WY, et al.. MPEG-2 to WMV transcoder with adaptive error compensation and dynamic switches [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,16(12):1460-1476.
    [21]Xin J, Vetro A, Sekiguchi S I, et al.. Motion and Mode Mapping for MPEG-2 to H.264/AVC Transcoding [C]. IEEE International Conference on Multimedia and Expo (ICME),2006:313-316.
    [22]Shen B, Alto P. Motion drift modeling and correction for downscale video transcoding [C]. IEEE International Conference on Image Processing (ICIP), 2005:680-683.
    [23]Bjork N, Christopoulos C. Transcoder architectures for video coding [J]. IEEE Transactions on Consumer Electronics,2002,44(1):88-98.
    [24]de los Reyes G, Reibman A R, Chuang J C, et al.. Video Transcoding for Resilience in Wireless Channels [C]. IEEE International Conference on Image Processing (ICIP),1998:338-342.
    [25]Liang Y F, Chebil F, Islam A. Compressed Domain Transcoding Solutions for MPEG-4 Visual Simple Profile and H.263 Baseline Videos in 3GPP Services and Applications [J]. IEEE Transactions on Consumer Electronics,2006, 52(2):507-514.
    [26]Lu L G, Xiao S, Kouloheris J L, et al.. Efficient and low cost video transcoding [J]. Visual Communications and Image Processing,2002,4671: 154-163.
    [27]Takahashi K, Satoh K, Suzuki T, et al.. Motion vector synthesis algorithm for MPEG-2-to-MPEG-4 transcoder [J]. Visual Communications and Image Processing,2001,4310:872-882.
    [28]Song B C, Kim T H, Chun K W. Efficient video transcoding with scan format conversion [C]. IEEE International Conference on Image Processing (ICIP), 2002:709-712.
    [29]Ozawa K, Ito H, Watanabe K, et al.. Low Complexity Real-Time Video Transcoders for Video Upload and Retrieval Applications [C]. IEEE Workshop on Signal Processing Systems,2007:368-372.
    [30]Ahmad I, Wei X H, Sun Y, et al.. Video Transcoding:An Overview of Various Techniques and Research Issues [J]. IEEE Transactions on Multimedia,2005,7(5):793-804.
    [31]ITU. Encoding Parameters of Digital Television for Studios. ITU-R Recommendation BT.601-1.
    [32]Richardson I E G, H.264 and MPEG-4 Video Compression [M], John Wiley & Sons, England,2003.
    [33]毕厚杰,王健.新一代视频压缩编码标准-H.264/AVC(第2版)[M].人民邮电出版社,2009.
    [34]Ahmed N, Natarajan T, Rao K R. Discrete cosine transform [J]. IEEE Transactions on Computers,1974, C-23(1):90-93.
    [35]Chen W, Smith C H, Fralick S C. Fast computational algorithm for the discrete cosine transform [J]. IEEE Transactions on Communications,1977, COM-25(9):1004-1009.
    [36]CCITT. Video codec for audiovisual services at p×64 kbit/s (Recommendation H.261) [S]. Consultative Committee of International Telegraph and Telephone, version 1,1990, version 2,1993.
    [37]ITU-T. Video coding for low bit rate communication (Recommendation H.263) [S].Telecommunication standardization sector of ITU, version 1,1995, version 2,1998; version 3,2000.
    [38]ITU-T. Advanced video coding for generic audiovisual services (Recommendation H.264) [S].Telecommunication standardization sector of ITU,2007.
    [39]Schwarz H, Marpe D, Wiegand T. Overview of the scalable video coding standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2007,17(9):1103-1120.
    [40]Vetro A, Wiegand T, Sullivan G J. Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard [J]. Proceedings of the IEEE,2011,99(4):626-642.
    [41]ISO/IEC/JTC1. Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s-Part 2:Video (ISO/IEC 11172-2 MPEG-1) [S]. ISO/IEC,1993.
    [42]ISO/IEC/JTC1. Generic coding of moving pictures and associated audio information-Part 2:Video (ISO/IEC 13818-2 MPEG-2) [S]. ISO/IEC,1994.
    [43]ISO/IEC/JTC 1. Coding of audio-visual objects-Part 2:Visual (ISO/IEC 14496-2 MPEG-4) [S]. ISO/IEC,1999-2003.
    [44]Kwon S K, Tamhankar A, Rao K R. Overview of H.264/MPEG-4 part 10 [J]. Journal of Visual Communication and Image Representation,2006,17(2): 186-216.
    [45]Bross B, Han W J, Ohm J R, et al.. High Efficiency Video Coding (HEVC) text specification draft 10 (for FDIS & Last Call) [C].2013, JCT-VC Meeting Document:JCTVC-L1003.
    [46]Bossen F, Flynn D, Suehring, K. HEVC HM 10 Reference Software. [C]. 2013, JCT-VC Meeting Document:JCTVC-L1010.
    [47]Ugur K, Andersson K, Fuldseth A, et al.. High performance, low complexity video coding and the emerging HEVC standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2010,20(12):1688-1697.
    [48]Ostermann J, Bormans J, List P, et al.. Video coding with H.264/AVC:Tools, performance, and complexity [J]. IEEE Circuits and Systems Magazine,2004, 4(1):7-28.
    [49]Koumaras H, Kourtis M A, Martakos D, et al.. Benchmarking the encoding efficiency of H.265-HEVC and H.264/AVC [C]. Future Network & Mobile Summit,2012:1-7.
    [50]Pourazad M T, Doutre C, Azimi M, et al.. HEVC:The New Gold Standard for Video Compression:How Does HEVC Compare with H.264/AVC? [J]. IEEE Consumer Electronics Magazine,2012,1(3):36-46.
    [51]Bossen F. Common HM test conditions and software reference configurations [C].2013, JCT-VC Meeting Document:JCTVC-L1100.
    [52]McCann K, Bross B, Han W J, et al.. High Efficiency Video Coding (HEVC) Test Model 10 (HM 10) Encoder Description [C].2013, JCT-VC Meeting Document:JCTVC-L1002.
    [53]Kim K, Min J, Lee T, et al.. Block Partitioning Structure in the HEVC Standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12):1697-1706.
    [54]Chen J W, Kao C Y, Lin Y L. Introduction to H.264 advanced video coding [C]. Asia and South Pacific Conference on Design Automation,2006: 736-740.
    [55]Lainema J, Bossen F, Han W J, et al.. Intra Coding of the HEVC Standard [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012, 22(12):1792-1801.
    [56]Lainema J, Ugur K. Intra mode coding in HEVC standard [C]. IEEE International Conference on Visual Communications and Image Processing (VCIP),2012:1-6.
    [57]Van Wallendael G, Van Leuven S, De Cock J, et al.. Improved intra mode signaling for HEVC [J]. IEEE International Conference on Multimedia and Expo (ICME),2011:1-6.
    [58]Gabriellini A, Flynn D, Mrak M, et al.. Combined Intra-Prediction for High-Efficiency Video Coding [J]. IEEE Journal of Selected Topics in Signal Processing,2011,22(12):1282-1289.
    [59]Lainema J, Ugur K. Angular intra prediction in High Efficiency Video Coding [C]. IEEE International Workshop on Multimedia Signal Processing (MMSP),2011:1-5.
    [60]Lin J L, Chen Y W, Tsai Y P, et al.. Motion vector coding techniques for HEVC [C]. IEEE International Workshop on Multimedia Signal Processing (MMSP),2011:1-6.
    [61]Lin J L, Chen Y W, Tsai Y P, et al.. Motion Vector Coding in the HEVC Standard [J]. IEEE Journal of Selected Topics in Signal Processing,2013, PP(99):1-13.
    [62]Zhao L, Guo X, Lei S, et al.. Simplified AMVP for High Efficiency Video Coding [C]. IEEE International Conference on Visual Communications and Image Processing (VCIP),2012:1-4.
    [63]Budagavi M, Fuldseth A, Bjontegaard, et al.. Core Transform Design for the High Efficiency Video Coding (HEVC) Standard [J]. IEEE Journal of Selected Topics in Signal Processing,2013, PP(99):1.
    [64]Nguyen T, Helle P, Winken M, et all.. Transform Coding Techniques in HEVC [J]. IEEE Journal of Selected Topics in Signal Processing,2013, PP(99):1.
    [65]Winken M, Helle P, Marpe D, et all.. Transform codinginthe HEVC Test Model [C]. IEEE International Conference on Image Processing (ICIP),2011: 3693-3696.
    [66]Sang Y P, Meher P K. Flexible integer DCT architectures for HEVC [C]. IEEE International Symposium on Circuits and Systems (ISCAS),2013: 1376-1379.
    [67]Joshi R, Reznik Y A, Karczewicz M. Efficient Large Size Transforms for High-Performance Video Coding [C]. SPIE Proceedings of Applications of Digital Image Processing,2010,7798:77980W.
    [68]Sze V, Budagavi M. High Throughput CABAC Entropy Coding in HEVC [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012, 22(12):1778-1791.
    [69]Sole J, Joshi R, Nguyen T, et al.. Transform coefficient coding in HEVC [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012, 22(12):1765-1777.
    [70]Norkin A, Bjontegaard G, Fuldseth A, et al.. HEVC Deblocking Filter [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012, 22(12):1746-1754.
    [71]Fu C M, Chen C Y, Huang Y W, et al.. Sample adaptive offset for HEVC [C]. IEEE International Workshop on Multimedia Signal Processing (MMSP), 2011:1-5.
    [72]Chi C C, Alvarez-Mesa M, Juurlink B, et al.. Parallel Scalability and Efficiency of HEVC Parallelization Approaches [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12):1827-1838.
    [73]Vanne J, Viitanen M, Hamalainen T D, et al.. Comprarative rate-distortion-complexity analysis of HEVC and AVC video codecs [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12): 1885-1898.
    [74]Bossen F, Bross B, Suhring K, et al.. HEVC Complexity and Implementation Analysis [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12):1685-1696.
    [75]Correa G, Assuncao P, Agostini L, et al.. Performance and Computational Complexity Assessment of High-Efficiency Video Encoders [J]. IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12): 1899-1909.
    [76]Girod B, Aaron A M, Rane S, et al.. Distributed video coding [J]. Proeeedings of the IEEE,2005,93(1):71-83.
    [77]Pirsch P, Demassieux N, Gehrke W. VLSI architecture for video compression-a survey [J]. Proeeedings of the IEEE,1995,83(2):220-246.
    [78]Chang S F, Messerschmitt D G. Manipulation and compsiting of MC-DCT compressed video [J]. IEEE Journal on Selected Areas in Communications, 1995,13(1):188-198.
    [79]Merhav N, Bhaskaran V. Fast algorithms for DCT-domain image downsampling and for inversemotion compensation [J]. IEEE Transactions on Circuits and Systems for Video Technology,1997,7(3):468-476.
    [80]Assuncao P A A, Ghanbari M. A frequence-domain video transcoder for dynamic bit-rate reduction of MPEG-2 bit streams [J]. IEEE Transactions on Circuits and Systems for Video Technology,1998,8(8):953-967.
    [81]Fung K T, Chan Y L, Siu W C. New Architecture for dynamic fram-skipping transcoder [J]. IEEE Transactions on Image Processing,2002,11(8):886-900.
    [82]Yin P, Vetro A, Liu B, et al.. Drift compensation for reduced spatial resolution transcoding [J]. IEEE Transactions on Circuits and Systems for Video Technology,2002,12(11):1009-1020.
    [83]郑艳,MPEG-2到H.264/AVC视频转码及相关技术研究,博士论文,2008.
    [84]Nakajima Y, Hori H, Kanoh T. Rate conversion of MPEG coded video by re-quantization process [C]. International Conference on Image Processing (ICIP),1995:408-411.
    [85]Sun M T, Wu T D, Hwang J N. Dynamic bit allocation in video combining for multipoint conferencing [J]. IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing,1998,45(5):644-648.
    [86]Werner O. Requantization for transcoding of MPEG-2 intraframes [J]. IEEE Transactions on Image Processing,1999,8(2):179-191.
    [87]Assuncao P A A, Ghanbari M. Transcoding of single-layer MPEG video into lower rates [J]. IEEE Proceedings-Version, Image and Signal Processing, 1997,144(6):377-383.
    [88]Shanableh T, Ghanbari M. Heterogeneous video transcoding to lower spatio-temporal resolutions and different encoding formats [J]. IEEE Transactions on Circuits and Systems Ⅱ:Analog and Digital Signal Processing,2000,2(2):101-110.
    [89]Sun H, Kwok W, Zdepski J W. Architectures for MPEG compressed bitstream scaling [J]. IEEE Transactions on Circuits and Systems Ⅱ:Analog and Digital Signal Processing,1996,6(2):191-199.
    [90]Hwang J N, Wu T D. Motion vector re-estimation and dynamic frame-skipping for video transcoding [C]. IEEE International Conference on Signals, Systems and Computers,1998:1606-1610.
    [91]Youn J, Sun M T, Lin C W. Motion vector refinement for high-performance transcoding [J]. IEEE Transactions on Multimedia,1999,1(1):30-40.
    [92]Chen M J, Chu M C, Pan C W. Efficient motion-estimation algorithm for reduced frame-rate video transcoder [J]. IEEE Transactions on Circuits and Systems Ⅱ:Analog and Digital Signal Processing,2002,12(4):269-275.
    [93]Bjork N, Christopoulos C. Transcoder architectures for video coding [J]. IEEE Transactions on Consumer Electronics,2005,44(1):88-98.
    [94]Yin P, Wu M, Liu B. Video transcoding by reducing spatial resolution [C]. IEEE International Conference on Image Processing (ICIP),2000:972-975.
    [95]Xin J, Sun M T, Chun K, et al.. Motion re-estimation for HDTV to SDTV transcoding [C]. IEEE International Symposium on Circuits and Systems (ISCAS),2002:715-718.
    [96]Zhu W, Yang K, Beacken M. CIF-to-QCIF video bitstream down-conversion in the DCT domain [J]. Bell Labs Technical Journal,1998,3(3):21-29.
    [97]Shanableh T, Ghanbari M. Transcoding architectures for DCT-domain heterogeneous video transcoding [C]. IEEE International Conference on Image Processing (ICIP),2001:433-436.
    [98]Hur J H, Kwon H K, Lee Y L. H.264/AVC baseline profile to MPEG-4 visual simple profile transcoding to reduce the spatial resolution [J]. International Journal of Imaging Systems and Technology,2006,16(1):24-33.
    [99]Fernandez-Escribano G, Kalva H, Cuenca, P, et al.. A fast MB mode decision algorithm for MPEG-2 to H.264 P-frame transcoding [J]. IEEE Transactions on Circuits and Systems for Video Technology,2008,18(2):172-184.
    [100]Seo K D, Heo S C, Kwon S K, et al.. Dynamic bit-rate reduction based on requantization and frame-skipping for MPEG-1 to MPEG-4 transcoder [J]. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences,2004, E87A(4):903-911.
    [101]Tsai T H, Lin H Y, Lee Y X, et al.. Complexity Reduction of H.263 to H.264 Transcoder with Fast Mode Decision [C]. IEEE International Syposium on Circuits and Systems,2007:1999-2002.
    [102]Elliott K. A historical look at research into the human visual system and its current application toward 3D video distribution [C]. Conference on Stereoscopic Displays and Applications,2010.
    [103]Wang Z, Sheikh H R, Bovik A C. Objective video quality assessment [M]. The Handbook of Video Database:Design and Applications, CRC Press, Boca Raton, Florida,2003, pp.1041-1078.
    [104]寿天德.视觉信息处理的脑机制[M].上海：上海科技教育出版社,1997.
    [105]Henderson J M, Hollingworth A. High-level scene perception [J]. Annual Review of Psychology,1999,50:243-271.
    [106]Niebur E, Koch C. Computational architectures for attention [M]. The Attentive Brain, MIT Press, Cambridge, Massachusetts,1998.
    [107]Itti L, Baldi P. A principled approach to detecting surprising events in video [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2005,1:631-637.
    [108]Walther D, Koch C. Modeling attention to salient proto-objects [J]. Neural Networks,2006,19:1359-1407.
    [109]James W. The principles of psychology [M]. New York:Holt,1890.
    [110]Treisman A M, Gelade G. A feature-integration theory of attention [J]. Cognitive Psychology,1980,12(1):97-136.
    [111]Liu L, Fan G. A new JPEG2000 region-of-interest image coding method: Partial significant bit Planes shift [J]. IEEE Signal Proeessing Letters,2003, 10(2):35-39.
    [112]Nystrom M, Gibsont J D, Anderson J B. Multiple deseription image coding using regions of interest [C]. Asilomar Conferences on Signals, Systems and Computers,2007:925-928.
    [113]Itti L. Automatie foveation for video compression using a neurobiological model of visual attention [J]. IEEE Transactions on Image proeessing,2004, 13(10):1304-1318.
    [114]Han S, Vasconcelos N. Object-based regions of interest for image compression [J]. Proeeedings of the Data Compression Conference,2008: 132-141.
    [115]刘伟,张宏,童勤业.视觉注意计算模型及其在自然图像压缩中的应用,浙江大学学报(工学版),2007,41(4)：650-654.
    [116]Ouerhani N, Aiehip N, Hugli H, et al..Visual attention guided seed seleetion for color image segmentation [J]. Lecture Notes in Computer Science,2001, (2124):630-637.
    [117]罗彤,陈裕泉.基于视觉注意引导和区域竞争控制的医学图像分割.浙江大学学报(工学版),2007,41(11)：1797-1500.
    [118]Lee S H, Moon J, Lee M. A region of interest based image segmentation method using a biologically motivated seleetive attention model [C]. International Joint Conference on Neural Networks,2006:1413-1420.
    [119]Fu Y, Cheng J, Li Z L, et al.. Saliency Cuts:Anautomatic approach to object segmentation [C]. International Conference on Pattem Reeognition,2008:1-4.
    [120]Ko B C, Nam J Y. Objeet-of-interest image segmentation based on human attention and semantic region clustering [J]. Journal of Optical Soeiety,2006, 23(10):2462-2470.
    [121]Mendi E, Milanova M. Image segmentation with active contour based on seleetive visual attention [C]. Proeeedings of the 3rd WSEAS international symposium on Wavelets theory and applied mathematies, signal Proeessing & modern science,2009:79-84.
    [122]Ban S W, Lee M, Yang H S. A face detection using biologically motivated bottom-up saliency map model and top-down perception model [J]. Neurocomputing,2004,56(1):475-480.
    [123]Walter D, Itti L, Riesenhuber M, et al.. Attentional seleetion for object recognition-a gentle way [J]. Lecture Notes in Computer Science,2002, (2525): 472-479.
    [124]吴田富.视觉注意机制计算模型及其在物体识别中的应用.合肥工业大学硕士学位论文,2005.
    [125]Oliva A T, Orralba A, Castelhano M S, et al.. Top-down control of visual attention in object deteetion [C]. Intemational Conference on Image Process, 2003:14-17.
    [126]Yu Y L, Mann G, Gosine R G. Task-driven moving objeet deteetion for robots using visual attention [C]. International Conference on Humanoid Robots, 2007:428-433
    [127]Vu K, Hua K A, Tavanapong W. Image retrieval based on regions of interest [J]. IEEE Transactions on Knowledge and Data Engineering,2003, 15(4):1045-1049.
    [128]陈媛媛.图像显著区域提取及其在图像检索中的应用.上海交通大学硕士学位论文,2006.
    [129]Muneesawang P, Guan L. Using knowledge of the region of interest (ROI) in automatic image retrieval learning [C]. Proeeedings of International Joint Confereneeon Neural Networks,2005:1854-1859.
    [130]Rajashekhara, Chaudhuri S. Segmentation and region of interest based image retrieval in low depth of field observations [J]. Image and Vision Computing, 2007,25(11):1709-1724.
    [131]黎曦.基于感兴趣区域的图像分类技术研究.国防科技大学硕士学位论文,2006.
    [132]Dong L, Izquierdo E. A Biologically Inspired System for Classification of Natural Images [J]. IEEE Transactions on Circuits and Systems for Video Technology,2007,17(5):590-604.
    [133]宋雁斓,张瑞,支睁,杨小康,陈尔康.一种基于视觉注意模型的图像分类方法.中国图像图形学报,2008,13(10)：1886-1889.
    [134]Lin C W, Chen Y C, Sun M T. Dynamic region of interest transcoding for multipoint video conferencing [J]. IEEE Transactions on Circuits and Systems for Video Technology,2003,13(10):982-992.
    [135]Khan J I, Guo Z. Fast perceptual region tracking with coding-depth sensitive access for stream transcoding [J]. Journal of Visual Communication and Image Representation,2008,19(6):355-371.
    [136]Lievens J, Lambert Peter, Van de Walle D, et al.. Compressed-domain motion detection for efficient and error-resilient MPEG-2 to H.264 transcoding [C]. International Conference on Applications of Digital Image Processing XXX, (SPIE),2007:6696.
    [137]Yeh, C H, Chen S M, Chern S J. Content-aware video transcoding via visual attention model analysis [C]. International Conference on Intelligent Information Hiding and Multimedia Processing,2008:429-432.
    [138]Liu S, Bovik, A C. Foveation embedded DCT domain video transcoding [J]. Journal of Visual Communication and Image Representation,2005,16(6): 643-667.
    [139]Sinha A, Agarwal G, Anbu A. Region-of-interest based compressed domain video transcoding scheme [C]. IEEE International Conference on Accoustics, Speech, and Signal Processing (ICASSP),2004:161-164.
    [140]Huang S F, Chen M J, Tai K H, et al.. Region-of-interest determination and bit-rate conversion for H.264 video transcoding [J]. EURASIP Journal on Advances in Signal Processing,2013:112.
    [141]Xie R, Yu S Y. Region-of-interest-based video transcoding from MPEG-2 to H.264 in the compressed domain [J]. Optical Engineering,2008,47(9): 097001.
    [142]Su C J, Lin Y. Zero-block inter/intra mode decision for MPEG-2 to H.264/AVC inter P-frame transcoding [J]. IET Image Process,2010,52(6): 494-504.
    [143]Moriron S, Faria S, Navarro A, et al. Video transcoding from H.264/AVC to MPEG-2 with reduced computational complexity [J]. Signal Processing: Image Communication,2009,24:637-650.
    [144]Martinez J L, Fernandez-escribano G, Kalva H, et al. Wyner-Ziv to H.264 Transcoder for Low Cost Video Encoding [J]. IEEE Transactions on Consumer Electronics,2009,55(3):1453-1461.
    [145]Kalva H, Kunzelmann P. Dynamic motion estimation for transcoding P frames in H.264 to MPEG-2 transcoders [J]. IEEE Transactions on Consumer Electronics,2008,54(2):657-662.
    [146]Bialkowski J, Barkowsky M, Kaup A. Fast video transcoding from h.263 to H.264/MPEG-4 AVC [J]. Multimedia Tools and Applications,2007,35(2): 127-146.
    [147]Fernandez-escribano G, Kalva H, Martinez J L, et al. An MPEG-2 to H.264 video transcoder in the Baseline profile [J]. IEEE Transactions on Circuits and Systems for Video Technology,2010,20(5):763-768.
    [148]Fernandez-escribano G, Bialkowski J, Gamez J A, et al. Low-Complexity Heterogeneous Video Transcoding Using Data Mining [J]. IEEE Transactions on Multimedia,2008,10(2):286-299.
    [149]Liu X G, Yoo K Y, Kim S W. Low Complexity Intra Prediction Algorithm for MPEG-2 to H.264/AVC Transcoder [J]. IEEE Transactions on Consumer Electronics,2010,56(2):987-994.
    [150]Su Y P, Xin J, Vetro A, et al.. Efficient MPEG-2 to H.264/AVC intra transcoding in transform-domain [C]. IEEE International Symposium on Circuits and Systems (ISCAS),2005:1234-1237.
    [151]Wang M Y, Sun J M, Wu Q. Efficient intra mode decision for AVS to H.264/AVC transcoding [C]. IEEE International Symposium on Signal Processing and Information Technology,2007:948-951.
    [152]Wang Z H, Gao W, Zhao D B, et al.. A fast intra mode decision algorithm for AVS to H.264 transcoding [C]. IEEE International Conference on Multimedia and Expo (ICME),2006:61-64.
    [153]Pasqualini S, Pierleoni P, Fioretti F, et al.. Adaptive threshold for intra frame prediction in H.263 to H.264 smart-transcoder [C]. IEEE International Conference on Advanced Communication Technology (ICACT), 2008:1439-1444.
    [154]Jing X, Siu W C, Chau L P, et al.. Fast intra mode decision algorithm for H.263 to H.264/AVC transcoding [J]. IEEE International Conference on Signal and Image Processing,2008:666-670.
    [155]Zhao L, Zhang L, Ma S W, et al.. Fast Mode Decision Algorithm for Intra Prediction in HEVC [C]. IEEE International Conference on Visual Communications and Image Processing (VCIP),2011:1-4.
    [156]Da Silva T L, Agostini L V, da Silva Cruz L A. Fast HEVC intra prediction mode decision based on EDGE direction information [C]. Proceedings of the 20th European Signal Processing Conference (EUSIPCO),2012:1214-1218.
    [157]Sun H M, Zhou D J, Goto S. A Low-Complexity HEVC Intra Prediction Algorithm Based on Level and Mode Filtering [C]. IEEE International Conference on Multimedia and Expo (ICME),2012:1085-1090.
    [158]Kim J, Yang J, Lee H, et al.. Fast intra mode decision of HEVC based on hierarchical structure [C]. IEEE International Conference on Information, Communications and Signal Processing (ICICS),2011:1-4.
    [159]Kim Y, Jun D S, Jung S, et al.. A fast intra prediction method using hadamard transform in high efficiency video coding [C]. SPIE Proceedings of Visual Information Processing and Communication,2012,8305:83050A.
    [160]Bjontegaard G. Calculation of Average PSNR Differences between R D curves [C]. Document VCEG-M33, April 2001.
    [161]Shen T, Lu Y, Wen ZY, et al. Ultra fast H.264/AVC to HEVC transcoder [C]. Data Compression Conference (DCC),241-250 (2013).
    [162]Shanableh T, Peixoto E, Izquierdo E. MPEG-2 to HEVC video transcoding with content-based modeling [J]. IEEE Transactions on Circuits and Systems for Video Technology,23(7),1191-1196 (2013).
    [163]Peixoto E, Izquierdo E. A complexity-scalable transcoder from H.264/AVC to the new HEVC codec [C]. IEEE International Conference on Image Processing (ICIP),737-740 (2012).
    [164]Achanta R, Shaji A, Smith K, et al.. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence.2012,34(11):2274-2281.
    [165]Wiegand T, Schwarz H, Joch A, et al.. Rate-constrained coder control and comparison of video coding standards [J]. IEEE Transactions on Circuits and Systems for Video Technology,2003,13(7):688-703.
    [166]Courty N, Marchand E. Visual perception based on salient features[C]. International Conference on Intelligent Robots and Systems,2003:1024-1029.
    [167]Chen Q, Yang X K, Song L, et al.. Robust Video Region-of-Interest Coding Based on Leaky Prediction [J]. IEEE Transactions on Circuits and Systems for Video Technology,2009,19(9):1389-1394.
    [168]Achanta R, Estrada E, Wils P, et al.. Salient region detection and segmentation [C]. International Conference in Computer Vision Systems,2008: 66-75.
    [169]Lu T, Yuan Z, Huang Y, et al.. Video retargeting with nonlinear spatial-temporal saliency fusion [C]. International Conference on Image Processing,2010:1801-1804.
    [170]Xu L F, Li H L, Zeng L Y, et al.. Saliency detection using joint spatial-color constraint and multi-scale segmentation [J]. Journal of Visual Communication and Image Representation,2013,24(4):465-476.
    [171]Yi Y, Ding J, Lai J L. A novel video salient object extraction method based on visual attention [J]. Signal Processing-Image Communication,2013,28(1): 45-54.
    [172]Imamoglu, N, Lin W S, Fang Y M. A Saliency Detection Model Using Low-Level Features Based on Wavelet Transform [J]. IEEE Transactions on Multimedia,2013,15(1):96-105.
    [173]Fang Y M, Chen Z Z, Lin W S, et al.. Saliency Detection in the Compressed Domain for Adaptive Image Retargeting [J]. IEEE Transactions on Image Processing,2012,21(9):3888-3901.
    [174]Chen Y M, Bajic, I V. A joint approach to global motion estimation and motion segmentation from a coarsely sampled motion vector field [J]. IEEE Transactions on Circuits and Systems for Video Technology,2011,21(9): 1316-1328.
    [175]Xie R, Yu S Y. Region-of-interest-based video transcoding from MPEG-2 to H.264 in the compressed domain [J]. Optical Engineering,2008,47(9): 097001.
    [176]Khan J I, Guo Z. Fast perceptual region tracking with coding-depth sensitive access for stream transcoding [J]. Journal of Visual Communication and Image Representation,2008,19(6):355-371.
    [177]Zhang D, Li B, Xu J Z, et al.. Fast Transcoding from H.264/AVC to High Efficiency Video Coding [C]. International Conference on Multimedia and Expo (ICME),2012:651-656.
    [178]Su Y P, Sun M T, Hsu V. Global motion estimation from coarsely sampled motion vector field and the applications [J]. IEEE Transactions on Circuits and Systems for Video Technology,2005,15(2):232-242.
    [179]Chen Y M, Bajic I V. Motion vector outlier rejection cascade for global motion estimation [J]. IEEE Signal Processing Letter,2010,17(2):197-200.
    [180]Schuur B, Wedi T, Wittmann S, et al.. Frequency selective update for video coding [C]. IEEE International Conference on Image Processing (ICIP),2006: 1709-1712.
    [181]Zheng Y Y, Zhou F, Tian X, et al.. Lightweight Content-Adaptive Coding in Joint Analyzing-Encoding Framework [J]. IEEE Transactions on Consumer Electronics,2008,54(2):614-622.
    [182]Song H J, Kuo C-C J. A region-based H.263+ codec and its rate control for low VBR video [J]. IEEE Transactions on Multimedia,2004,6(3):489-500.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700