多视点视频编码关键技术研究

英文题名：Research on Key Techniques for Multi-view Video Coding
作者：王凤随
论文级别：博士
学科专业名称：电路与系统
中文关键词：多视点视频编码 ; 算法优化 ; 低复杂度 ; 模式选择 ; 帧间预测 ; 视间预测 ; 运动估计 ; 视差估计
英文关键词：Multi-view video coding ; algotithm optimization ; low complexity ; mode
英文关键词：decision ; inter prediction ; inter-view prediction ; motion estimation ; disparity
英文关键词：estimation
学位年度：2013
导师：都思丹
学科代码：080902
学位授予单位：南京大学
论文提交日期：2013-11-01

摘要

随着计算机图形学和计算机视觉技术的发展,多视点视频(MVV)越来越多地引起人们的普遍关注。同传统的单视点视频相比,多视点视频拥有丰富的三维深度信息,能够为用户提供无法比拟的立体感和交互性。然而,由于多视点视频是由位置固定的多个摄像机同时从不同角度拍摄同一场景而获得的一组视频信号,其数据量会随着摄像机数目的增多而成倍增加。巨大的视频数据量对存储和传输提出了更高的要求,多视点视频编码(MVC)就是对多视点视频数据的有效压缩。随着新一代显示技术的发展及网络传输能力的快速提高,多视点视频编码越来越受到国内外学者及研究机构的青睐。多视点视频编码沿用传统的混合视频编码框架结构,并对该框架进行了拓展和创新,复杂的预测结构带来了计算复杂度的急剧增加,巨大的计算量严重影响了MVC的实际应用和推广。因此,研究MVC低复杂度快速算法至关重要。在多视点视频中,除了具有同一视点的时间和空间相关性以外,还具有同一时刻不同视点间的视点相关性。因此,如何有效地利用这些视点内及视点间的相关性信息来去除冗余是提高多视点视频编码速度的关键。论文在对多视点视频编码相应关键技术深入分析的基础上,对其中的耗时模块进行了一系列的优化。
     首先,在对多视点视频编码各个模式分析的基础上,针对MVC模式分析计算量大的缺点,提出了一种有效的Direct模式提前终止模式选择快速算法。基于Direct模式最有可能成为最优模式这一观察,算法首先计算当前宏块的Direct模式的率失真代价(RD cost)值,并与自适应阈值进行比较,以提供一个提前终止的机会。如果当前宏块的RD cost值小于自适应阈值,那么Direct模式将直接被选为最优模式,其余的模式选择过程不必检查；否则,将进行穷尽模式搜索来选择最优模式。自适应阈值的设计是算法实现的关键,该算法综合利用了当前宏块与其相邻宏块的空间、时间及视点间的相关性来共同确定。实验结果表明,同MVC参考软件的穷尽模式选择算法相比,提出快速算法降低了约72.38%的计算复杂度,总比特率平均减少了1.06%,而PSNR仅下降0.05dB。
     其次,通过对MVC帧间预测可变尺寸块中各个尺寸块分布特点的分析,提出了基于模式复杂度的多视点视频编码帧间预测快速算法。在提出的算法中,根据所定义的模式复杂度将宏块分成3个不同的模式类型,每种类型仅检查相对应的模式分块,其余不必要的模式分块就可以提前终止,从而使得计算量大大减少。实验结果表明,同全模式选择算法相比,提出算法在保持编码效率基本不变的同时,计算复杂度减少62.75%。
     再次,针对多视点视频编码视间预测效率低的问题,提出了视差估计提前终止的视间预测快速算法。提出的方法是基于帧间各分块模式之间的预测方向的相关性而提出来的,采用帧间16×16模式在视点方向的预测结果来确定其他模式是否进行视差估计。实验结果表明,提出算法能够有效地跳过不必要的视差估计过程,从而有效地降低多视点视频编码视间预测的计算复杂度。
     最后,基于上述算法,本文提出一种融合算法。该算法融合了Direct模式提前终止算法、可变尺寸块帧间预测算法和视差估计提前跳过算法。实验结果表明,该融合算法能够最大限度地降低多视点视频编码的计算复杂度,平均可降低78.79%,同时比特率可以降低0.07%,而PSNR值仅仅降低了0.04dB。
     综上所述,本文分析了多视点视频编码的各关键技术,并对相应的模块进行了优化研究。所提出的快速优化算法能够很好地降低多视点视频编码的计算复杂度,对多视点视频编码的应用具有重要的参考价值。
With the development of compute graphics and computer vision technology, multi-view video attracts more and more attention. Compared with the traditional single-view video, multi-view video comprises rich three-dimensional depth information, which can provide people with the highly-welcome experience of3D stereoscopic and interactive. However, multi-view video is captured by a set of video cameras from various viewpoints but at the same time. With the increasing number of cameras, the amount of video data is linearly increased. Huge amount of video data highly requires for efficient storage and transmission. Multi-view video coding is efficient compression for multi-view video data. With the advances in the new display and network transmission techniques, multi-view video coding attracts more and more attention.MVC follows the classic block-based hybrid video coding framework, and the development and innovation of the framework. Intricate prediction structure brings out rapid increase in computational complexity, which obstructs MVC from practical application and promotion. Therefore, it is very essential for MVC to study low complexity fast algorithms. In MVV, it is also with inter-view correlation between different views but at the same time instant, besides spatial correlation and temporal correlation within a single view. Hence, the key of speeding up encoding for MVC is how to effectively utilize these correlations within a sigle view and between views to remove the redundancy. This research paper dedicates much effort to series of optimizations for those time-consuming modules of MVC, based on the analysis for the key techniques of MVC.
     First, an efficient early Direct mode decision for MVC is proposed in order to overcome heavy computation of mode analysis, on the basis of analyzing each mode of MVC. Based on the observation that the Direct mode is highly possible to be the optimal mode, the proposed method first computes the rate distortion cost of the Direct mode of the current macroblock and compares this RD cost value with an adaptive threshold for providing an early termination chance as follows. If this RD cost value is smaller than the adaptive threshold, the Direct mode will be selectd as the optimal mode and the checking process of remaining modes will be skipped; otherwise, exhaustive mode decision is used to check all the modes to select the optimal mode. The key of the proposed algorithm is the design of the adaptive threshold, which is determined by using the spatial, temporal and inter-view correlations between the current macroblock and its neighboring macroblocks, respectively. Experimental results have shown that the proposed method is able to reduce the computational load by72.38%and the total bit rate by1.06%, while only incurring a negligible loss of PSNR (about0.05dB on average), compared with exhaustive mode decision in the reference software of MVC.
     Second, a fast inter prediction algorithm based on mode complexity for multi-view video coding is proposed, after analyzing the characteristic of each variable block size in inter prediction of the MVC. In the proposed algorithm, macroblocks are divided into three different mode classes on the basis of the mode complexity defined. Each class only checks the specified mode size(s), and the other unnecessary mode sizes can be early terminated. Thus, computational load can be greatly reduced. Experimental results have demonstrated that the proposed method is able to reduce62.75%with negligible loss of coding efficiency, compared with the full mode decision in the reference software of MVC.
     Third, a fast inter-view prediction algorithm based on an early disparity estimation skipping is presented aim at impoving the prediction efficiency between inter-views for MVC. This method is proposed via using prediction direction correlation between inter mode sizes. The prediction result of mode16×16selecting inter-view prediction as its optimal prediction can be used to decide whether disparity estimation of the other mode sizes is selected or not. Experimental results have shown that the proposed method can omit the unnecessary disparity estimation process, and effectively reduce the computational complexity in inter-view prediction for MVC.
     Finally, a fusion algorithm is proposed based on the above-mentioned algorithms. This algorithm combines the Direct mode early termination, variable size inter prediction and early disparity estimation skipping. Experimental results have shown that the fusion algorithm is able to significantly reduce the computational complexity of MVC by78.79%on average and the total bit rate by0.07%on average, while only incurring a negligible loss of PSNR (about0.04dB on average), compared with exhaustive mode decision in the reference software of MVC.
     In summary, the key techniques in multi-view video coding are analyzed in this paper, and the optimizations of the corresponding modules are studied. The proposed fast optimization algorithms can significantly reduce the computational complexity of MVC,which has an important reference value to the practical applications of MVC

引文

[1]Dis ISO/IEC.11172-2 (MPEG-1), Coding of moving pictures and associated audio for digital storage media up to 1.5Mbits/s [S].1993.
    [2]ISO/IEC ITU-T And.13818-2 (MPEG-2), Generic coding of moving pictures and associated audio information [S].1994.
    [3]Dis ISO/IEC.14496 (MPEG-4), Coding of audio-visual objects [S].1999.
    [4]ITU-T. ITU-T Rec. H.261, Video codec for audio visual services at p×64 kbits/s [S].1993.
    [5]H.263 ITU Recommendation. Video coding for low bit rate communication [S]. March 1995.
    [6]H.264 ITU Draft Recommendation. JVT-G050rl, ISO/IEC draft international standard 14496-10 [S]. May 2003.
    [7]ITU-T. ITU-T Rec. H.63, Video coding for low bit rate communication [S].2000.
    [8]Joch Anthony, Kossentini Faouzi, Schwarz Heiko, Wiegand Thomas, et al. Performance comparison of video coding standards using lagrangian coder control [C]. Proceedings of International Conference on Image Processing.2002: Ⅱ-501-Ⅱ-504 vol.502.
    [9]Schafer Ralf, Wiegand Thomas,Schwarz Heiko. The emerging H.264/AVC standard [J]. EBU technical review,2003,293.
    [10]Pandit P Vetro A, Chen Y. Joint multiview video model (JMVM) 8.0 software, jvt-aa208 [R]. Geneva:JVT,2008.
    [11]Lipton L. Stereographies(?) developers' handbook:Background on creating images for crystaleyes(?) and simuleyes(?) [J]. StereoGraphics Corporation,1997: 15-26.
    [12]Pastoor Siegmund. Human factors of 3d displays in advanced image communications [J]. Displays,1993,14(3):150-157.
    [13]Matusik Wojciech,Pfister Hanspeter.3d tv:A scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes [J]. ACM Transactions on Graphics (TOG),2004,23(3):814-824.
    [14]Marpe Detlev, Schwarz Heiko,Wiegand Thomas. Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2003,13(7):620-636.
    [15]Smolic A,Kimata H. Description of exploration experiments in 3dav [J]. ISO/IEC JTC1/SC29/WG11, WG,2003,11.
    [16]Smolic A,Yamashita R. Report on status of 3dav exploration [J]. ISO/IEC JTC,1.
    [17]Subgroup Mpeg Video. Draft call for evidence on multiple views video coding [C]. W6374,68th MPEG Meeting, Munich, Germany,2004.
    [18]Subgroup Mpeg Video. Draft call for proposals on multi-view video coding [C]. W6910,71st MPEG Meeting, Hong Kong, China,2005.
    [19]Vetro Anthony, Su Yeping, Kimata Hideaki,Smolic Aljoscha. Joint multiview video model JMVM 2.0 [J]. ITU-T and ISO/IEC Joint Video Team, Document JVT-U207,2006.
    [20]Vetro A, Pandit P.Kimata H. Text of ISO/IEC 14496-10:200x/fdam 1 multiview video coding [J]. W9978,85th MPEG Meoting,2008.
    [21]ISO/IEC 14496-10:2008/FDAM 1:2008(E), Information technology-coding of audio-visual objects-part 10:Advanced video coding, amendment 1:Multiview video coding [S].
    [22]Merkle Philipp, Smolic Aljoscha, Muller Karsten,Wiegand Thomas. Efficient prediction structures for multiview video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2007,17(11):1461-1473.
    [23]Koo Han-Suh, Jeon Yong-Joon,Jeon Byeong-Moon. Mvc motion skip mode, document jvt-w081 [R]. San Jose:JVT,2007.
    [24]Yang Haitao, Chang Yilin,Huo Junyan. Fine-granular motion matching for inter-view motion skip mode in multiview video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2009,19(6):887-892.
    [25]Hur Jae-Ho, Cho Sukhee,Lee Yung-Lyul. Adaptive local illumination change compensation method for H.264/AVC-based multiview video coding [J].Circuits and Systems for Video Technology, IEEE Transactions on,2007,17(11): 1496-1505.
    [26]Seshadrinathan Kalpana,Bovik Alan Conrad. Motion tuned spatio-temporal quality assessment of natural videos [J]. Image Processing, IEEE Transactions on,2010,19(2):335-350.
    [27]Zhao Yin, Yu Lu, Chen Zhenzhong,Zhu Ce. Video quality assessment based on measuring perceptual noise from spatial and temporal perspectives [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2011,21(12): 1890-1902.
    [28]Tanimoto Masayuki, Tehrani Mehrdad Panahpour, Fujii Toshiaki,Yendo Tomohiro. Free-viewpoint tv [J]. Signal Processing Magazine, IEEE,2011,28(1): 67-76.
    [29]Muller Karsten, Merkle Philipp,Wiegand Thomas.3-D video representation using depth maps [J]. Proceedings of the Ieee,2011,99(4):643-656.
    [30]Kim Jae Hoon, Lai Polin, Lopez Joaquin, Ortega Antonio, et al. New coding tools for illumination and focus mismatch compensation in multiview video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on, 2007,17(11):1519-1535.
    [31]Yea S.,Vetro A. View synthesis prediction for multiview video coding [J]. Signal Processing-Image Communication,2009,24(1-2):89-100.
    [32]San Xing, Cai Hua, Lou Jian-Guang,Li Jiang. Multiview image coding based on geometric prediction [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2007,17(11):1536-1548.
    [33]Tsung Pei-Kuei, Ding Li-Fu, Chen Wei-Yin, Chuang Tzu-Der, et al. Video encoder design for high-definition 3d video communication systems [J]. Communications Magazine, IEEE,2010,48(4):76-86.
    [34]Lai P. L.,Ortega A. Predictive fast motion/disparity search for multiview video coding-art. No.607709 [J]. Visual Communications and Image Processing 2006, Pts 1 and 2,2006,6077:7709-7709.
    [35]Lu Jiangbo, Cai Hua, Lou Jian-Guang,Li Jiang. An epipolar geometry-based fast disparity estimation algorithm for multiview image and video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2007,17(6):737-750.
    [36]Ding Li-Fu, Tsung Pei-Kuei, Chien Shao-Yi, Chen Wei-Yin, et al. Content-aware prediction algorithm with inter-view mode decision for multiview video coding [J]. Multimedia, IEEE Transactions on,2008,10(8):1553-1564.
    [37]Ding L. F., Tsung P. K., Chien S. Y., Chen W. Y., et al. Computation-free motion estimation with inter-view mode decision for multiview video coding [J].2007 3dtv Conference,2007:217-220.
    [38]Yu Mei, Peng Zongju, Liu Weiyue, Shao Feng, et al. Fast macroblock selection algorithm for multiview video coding based on inter-view global disparity [C]. Image and Signal Processing,2008. CISP'08. Congress on,2008:575-578.
    [39]Shen Liquan, Liu Zhi, Yan Tao, Zhang Zhaoyang, et al. Early skip mode decision for MVC using inter-view correlation [J]. Signal Processing:Image Communication,2010,25(2):88-93.
    [40]Li Xiaoming, Zhao Debin, Ma Siwei,Gao Wen. Fast disparity and motion estimation based on correlations for multiview video coding [J]. Consumer Electronics, IEEE Transactions on,2008,54(4):2037-2044.
    [41]Ding Li-Fu, Chien Shao-Yi,Chen Liang-Gee. Joint prediction algorithm and architecture for stereo video hybrid coding systems [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2006,16(11):1324-1337.
    [42]Li Xiaoming, Zhao Debin, Ji Xiangyang, Wang Qiang, et al. A fast inter frame prediction algorithm for multi-view video coding [C]. Image Processing,2007. ICIP 2007. IEEE International Conference on,2007:Ⅲ-417-Ⅲ-420.
    [43]Lee Seo-Young, Shin Kwang-Mu,Chung Ki-Dong. An object-based mode decision algorithm for multi-view video coding [C]. Multimedia,2008. ISM 2008. Tenth IEEE International Symposium on,2008:74-81.
    [44]Cernigliaro Gianluca, Jaureguizar Fernando, Ortega Antonio, Cabrera Julian, et al. Fast mode decision for multiview video coding based on depth maps [C]. IS&T/SPIE Electronic Imaging,2009:72570N-72570N-72510.
    [45]Peng Zongju, Jiang Gangyi,Yu Mei. A fast multiview video coding algorithm based dynamic multi-threshold [C]. Multimedia and Expo,2009. ICME 2009. IEEE International Conference on,2009:113-116.
    [46]Huo Junyan, Chang Yilin, Li Ming,Ma Yanzhuo. Scalable prediction structure for multiview video coding [C]. Circuits and Systems,2009. ISCAS 2009. IEEE International Symposium on,2009:2593-2596.
    [47]Shen Liquan, Liu Zhi, Liu Suxing, Zhang Zhaoyang, et al. Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding [J]. Broadcasting, IEEE Transactions on,2009,55(4): 761-766.
    [48]Shen Liquan. Yan Tao, Liu Zhi, Zhang Zhaoyang, et al. Fast mode decision for multiview video coding [C]. Image Processing (ICIP),2009 16th IEEE International Conference on,2009:2953-2956.
    [49]Zhu Wei, Tian Xiang, Zhou Fan,Chen Yaowu. Fast inter mode decision based on textural segmentation and correlations for multiview video coding [J]. Consumer Electronics, IEEE Transactions on,2010,56(3):1696-1704.
    [50]Zeng Huanqiang, Ma Kai-Kuang,Cai Canhui. Mode-correlation-based early termination mode decision for multi-view video coding [C]. Image Processing (ICIP),2010 17th IEEE International Conference on,2010:3405-3408.
    [51]Yang Shih-Hsuan,Liou Yu-Shiuan. Fast reference frame and mode selection for multiview video coding based on coded block patterns [C]. Multimedia and Expo (ICME),2010 IEEE International Conference on,2010:1102-1107.
    [52]Zhang Yun, Kwong Sam, Jiang Gangyi,Wang Hanli. Efficient multi-reference frame selection algorithm for hierarchical B pictures in multiview video coding [J]. Broadcasting, IEEE Transactions on,2011,57(1):15-23.
    [53]Shen Liquan, Liu Zhi, An Ping, Ma Ran, et al. Low-complexity mode decision for MVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on, 2011,21(6):837-843.
    [54]Zeng Huanqiang, Ma Kai-Kuang,Cai Canhui. Fast mode decision for multiview video coding using mode correlation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2011,21(11):1659-1666.
    [55]Deng Zhi-Pin, Chan Yui-Lam, Jia Ke-Bin, Fu Chang-Hong, et al. Fast motion and disparity estimation with adaptive search range adjustment in stereoscopic video coding [J]. Broadcasting, IEEE Transactions on,2012,58(1):24-33.
    [56]Zhang Q. An efficient inter mode decision method for MVC [J].2012 International Workshop on Information and Electronics Engineering,2012,29: 38-42.
    [57]Khattak. S., Hamzaoui R., Ahmad S.,Frossard P. Low-complexity multiview video coding [J].2012 Picture Coding Symposium (Pcs),2012:97-100.
    [58]Shafique M, Zatt B.,Henkel J. A complexity reduction scheme with adaptive search direction and mode elimination for multiview video coding [J].2012 Picture Coding Symposium (Pcs),2012:105-108.
    [59]Seo Jungdong,Sohn Kwanhoon. Early disparity estimation skipping for multi-view video coding [J]. Eurasip Journal on Wireless Communications and Networking,2012.
    [60]Chan Chia-Chi,Tang Chih-Wei. Coding statistics based fast mode decision for multi-view video coding [J]. Journal of Visual Communication and Image Representation,2013,24(6):686-699.
    [61]Khattak S., Hamzaoui R., Ahmad S.,Frossard P. Fast encoding techniques for multiview video coding [J]. Signal Processing-Image Communication,2013, 28(6):569-580.
    [62]Zhang Yun, Kwong Sam, Xu Long,Jiang Gangyi. Direct mode early decision optimization based on rate distortion cost property and inter-view correlation [J]. 2013.
    [63]Wang Fengsui, Zeng Huanqiang, Shen Qinghong,Du Sidan. Efficient early direct mode decision for multi-view video coding [J]. Signal Processing-Image Communication,2013,28(7):736-744.
    [64]Liu Yebin, Dai Qionghai.Xu Wenli. A point-cloud-based multiview stereo algorithm for free-viewpoint video [J]. Visualization and Computer Graphics, IEEE Transactions on,2010,16(3):407-418.
    [65]Liu Yebin, Dai Qionghai, You Zhixiang,Xu Wenli. Rate-prediction structure complexity analysis for multi-view video coding using hybrid genetic algorithms [C]. Electronic Imaging 2007,2007:650804-650804-650808.
    [66]Li Dong, Zhang Yongbing, Liu Qiong, Ji Xiangyang, et al. Enhanced block prediction in stereoscopic video coding [C].3DTV Conference:The True Vision-Capture, Transmission and Display of 3D Video (3DTV-CON),2011, 2011:1-4.
    [67]Wang Qifei, Ji Xiangyang, Dai Qionghai,Zhang Naiyao. Region based rate-distortion analysis for 3d video coding [C]. Data Compression Conference (DCC),2010,2010:555-555.
    [68]Cheng Xiaoyu, Sun Lifeng.Yang Shiqiang. A multi-view video coding approach using layered depth image [C]. Multimedia Signal Processing,2007. MMSP 2007. IEEE 9th Workshop on,2007:143-146.
    [69]Li Yanjie,Sun Lifeng. A novel upsampling scheme for depth map compression in 3DTV system [C]. Picture Coding Symposium (PCS),2010,2010:186-189.
    [70]Liu Yanwei, Huang Qingming, Ma Siwei, Zhao Debin, et al. RD-optimized interactive streaming of multiview video with multiple encodings [J]. Journal of Visual Communication and Image Representation,2010,21(5):523-532.
    [71]Xiang Xinguang, Zhao Debin, Ma Siwei,Gao Wen. Auto-regressive model based error concealment scheme for stereoscopic video coding [C]. Acoustics, Speech and Signal Processing (1CASSP),2011 IEEE International Conference on,2011: 849-852.
    [72]Zhang Nan, Ma Siwei.Gao Wen. Shape-based depth map coding [C]. Intelligent Information Hiding and Multimedia Signal Processing,2009. IIH-MSP'09. Fifth International Conference on,2009:316-319.
    [73]Zhang Nan,Ma Siwei. H.264/AVC-based depth map sequence coding using improved loop-filter [C]. Intelligent Information Hiding and Multimedia Signal Processing,2009. IIH-MSP'09. Fifth International Conference on,2009: 312-315.
    [74]Zhu Wei, Tian Xiang, Zhou Fan,Chen Yaowu. Fast disparity estimation using spatio-temporal correlation of disparity field for multiview video coding [J]. Consumer Electronics, IEEE Transactions on,2010,56(2):957-964.
    [75]Fu Deliang, Zhao Yin,Yu Lu. Temporal consistency enhancement on depth sequences [C]. Picture Coding Symposium (PCS),2010,2010:342-345.
    [76]Zhao Yin, Zhu Ce, Chen Zhenzhong,Yu Lu. Depth no-synthesis-error model for view synthesis in 3-D video [J]. Image Processing, IEEE Transactions on,2011, 20(8):2221-2228.
    [77]Chen Jianle, Liu Jilin, Wang Xingguo.Chen Guobin. Modified edge-oriented spatial interpolation for consecutive blocks error concealment [C]. Image Processing,2005. ICIP 2005. IEEE International Conference on,2005: Ⅲ-904-907.
    [78]Zhang Guofeng, Hua Wei, Qin Xueying, Wong Tien-Tsin, et al. Stereoscopic video synthesis from a monocular video [J]. Visualization and Computer Graphics, IEEE Transactions on,2007,13(4):686-696.
    [79]Huo Jun-Yan, Chang Yi-Lin, Yang Hai-Tao,Wan Shuai. Color compensation for multi-view video coding based on diversity of cameras [J]. Journal of Zhejiang University SCIENCE A,2008,9(12):1631-1637.
    [80]Yuan Hui, Chang Yilin, Li Ming,Yang Fuzheng. Model based bit allocation between texture images and depth maps [C]. Computer and Communication Technologies in Agriculture Engineering (CCTAE),2010 International Conference On,2010:380-383.
    [81]Yuan Hui, Chang Yilin, Huo Junyan, Yang Fuzheng, et al. Model-based joint bit allocation between texture videos and depth maps for 3-D video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2011,21(4): 485-497.
    [82]Liu Xiaoxian, Chang Yilin, Yang Haitao,Yuan Hui. Adaptive non-uniform quantization of depth data in free-viewpoint television system [J]. Journal of Xi'an Jiaotong University,2010,2:016.
    [83]Zongju Peng, Gangyi Jiang, Mei Yu.Qionghai Dai. Fast macroblock mode selection algorithm for multiview video coding [J]. Eurasip Journal on Image and Video Processing,2009,2008.
    [84]Zhang Yun, Jiang Gangyi, Yu Mei, Yang You, et al. Depth perceptual region-of-interest based multiview video coding [J]. Journal of Visual Communication and Image Representation,2010,21 (5):498-512.
    [85]Peng Zongju, Yu Mei, Jiang Gangyi, Shao Feng, et al. Fast macroblock mode selection algorithm for multiview depth video coding [J]. Chinese Optics Letters, 2010,8(2):151-154.
    [86]Peng Zongju, Yu Mei, Jiang Gangyi, Si Yuehou, et al. Virtual view synthesis oriented fast depth video encoding algorithm [C]. Industrial and Information Systems (ⅡS),2010 2nd International Conference on,2010:204-207.
    [87]Shao Feng, Jiang Gangyi, Yu Mei, Chen Ken, et al. Asymmetric coding of multi-view video plus depth based 3-D video for view rendering [J]. Multimedia, IEEE Transactions on,2012,14(1):157-167.
    [88]Zhu Bo, Jiang Gangyi, Zhang Yun, Peng Zongju, et al. View synthesis oriented depth map coding algorithm [C]. Information Processing,2009. APCIP 2009. Asia-Pacific Conference on,2009:104-107.
    [89]Lei Yang, Xiaowei Song, Chunping Hou, Jichang Guo, et al. A general multiview lcd stereo image composition method based on optical plate technology [C]. 3DTV Conference:The True Vision-Capture, Transmission and Display of 3D Video,2009,2009:1-4.
    [90]Hou Chunping, Yang Jiachen,Zhang Zhuoyun. Stereo image displaying based on both physiological and psychological stereoscopy from single image [J]. International Journal of Imaging Systems and Technology,2008,18(2-3): 146-149.
    [91]Yang Jiachen, Hou Chunping, Zhou Yuan, Zhang Zhuoyun, et al. Objective quality assessment method of stereo images [C].3DTV Conference:The True Vision-Capture, Transmission and Display of 3D Video,2009,2009:1-4.
    [92]Li Sumei, Xiang Wei, Cheng Fuwei, Zhao Ruichao, et al. HVS-based quality assessment metrics for 3-D images [C]. Intelligent Systems (GCIS),2010 Second WRI Global Congress on,2010:86-89.
    [93]Yang Shuai, Zhao Yan, Wang Shigang,Chen Hexin. Error concealment for stereoscopic video using illumination compensation [J]. Consumer Electronics, IEEE Transactions on,2011,57(4):1907-1914.
    [94]Wei Jian, Wang Shigang,Chen Liwei. Multi-view video coding with adaptive selection of prediction mode based on hierarchical B picture [C]. Wireless Mobile and Computing (CCWMC 2009), IET International Communication Conference on,2009:339-343.
    [95]Shen Liquan, Liu Zhi, Yan Tao, Zhang Zhaoyang, et al. View-adaptive motion estimation and disparity estimation for low complexity multiview video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2010, 20(6):925-930.
    [96]Shen Liquan, Liu Zhi, An Ping, Ma Ran, et al. Fast mode decision for scalable video coding utilizing spatial and interlayer correlation [J]. Journal of Electronic Imaging,2010,19(3):033010-033010-033018.
    [97]Yan T, An P, Shen Lq,Zhang Zy. Bit allocation and rate control algorithm for M VC [J]. Imaging Science Journal, The,2011,59(4):202-210.
    [98]Shen Liquan, Feng Guorui, Liu Zhi, Zhang Zhaoyang, et al. Macroblock-level adaptive search range algorithm for motion estimation in multiview video coding [J]. Journal of Electronic Imaging,2009,18(3):033003-033003-033008.
    [99]Shen Liquan, Liu Zhi, Zhang Zhaoyang,Shi Xuli. Selective vs-mrf-me and intra coding in h.264 based on spatiotemporal continuity of motion field [J]. Signal Processing:Image Communication,2009,24(5):405-414.
    [100]Shen Liquan, Sun Yiwen, Liu Zhi,Zhang Zhaoyang. Efficient skip mode detection for coarse grain quality scalable video coding [J]. Signal Processing Letters, IEEE,2010,17(10):887-890.
    [101]Liu Suxing, An Ping, Zhang Zhaoyang, Zhang Qian, et al. Multi-view video coding based on vector estimation and weighted disparity interpolation [J]. Circuits, Systems and Signal Processing,2009,28(6):913-923.
    [102]Zhang Qiuwen, An Ping, Zhang Yan, Shen Liquan, et al. Low complexity multiview video plus depth coding [J]. Consumer Electronics, IEEE Transactions on,2011,57(4):1857-1865.
    [103]Liu Sx, An P, Zhang Zy, Zhang Q, et al. On the relationship between multi-view data capturing and quality of rendered virtual view [J]. Imaging Science Journal, The,2009,57(5):250-259.
    [104]Zhang Qiuwen, An Ping, Zhang Yan. Shen Liquan, et al. Improved multi-view depth estimation for view synthesis in 3D video coding [C].3DTV Conference: The True Vision-Capture, Transmission and Display of 3D Video (3DTV-CON), 2011,2011:1-4.
    [105]Kim Woo-Shik, Ortega Antonio, Lai Polin, Tian Dong, et al. Depth map coding with distortion estimation of rendered view [J]. Visual Communication and Information Processing,2010,7543.
    [106]Kim Woo-Shik, Ortega Antonio, Lee Jaejoon,Wey Hocheon.3-D video quality improvement using depth transition data [C]. Multimedia and Expo (ICME), 2011 IEEE International Conference on,2011:1-6.
    [107]Shen Godwin, Kim Woo-Shik, Ortega Antonio, Lee Jaejoon, et al. Edge-aware intra prediction for depth-map coding [C]. Image Processing (ICIP),2010 17th IEEE International Conference on,2010:3393-3396.
    [108]Shen Godwin, Kim W-S, Narang Sunil K, Ortega Antonio, et al. Edge-adaptive transforms for efficient depth map coding [C]. Picture Coding Symposium (PCS),2010,2010:566-569.
    [109]Muller K, Merkle Philipp, Tech Gerhard,Wiegand Thomas.3D video formats and coding methods [C]. Image Processing (ICIP),2010 17th IEEE International Conference on,2010:2389-2392.
    [110]Merkle P, Wang Y, Muller K, Smolic A, et al. Video plus depth compression for mobile 3D services [C].3DTV Conference:The True Vision-Capture, Transmission and Display of 3D Video,2009,2009:1-4.
    [111]Muller K, Smolic Aljoscha, Dix Kristina, Merkle Philipp, et al. Coding and intermediate view synthesis of multiview video plus depth [C]. Image Processing (ICIP),2009 16th IEEE International Conference on,2009: 741-744.
    [112]Wildeboer Meindert Onno,Yendo Tomohiro, Tehrani Mehrdad Panahpour, Fujii Toshiaki, et al. Color based depth up-sampling for depth compression [C]. Picture Coding Symposium (PCS),2010,2010:170-173.
    [113]Wildeboer Mo, Yendo T, Panahpour Tehrani M, Fujii T, et al. Depth up-sampling for depth coding using view information [C].3DTV Conference: The True Vision-Capture, Transmission and Display of 3D Video (3DTV-CON), 2011,2011:1-4.
    [114]Yang Lu, Wildeboer Meindert Onno, Yendo Tomohiro, Tehrani Mehrdad Panahpour, et al. Reducing bitrates of compressed video with enhanced view synthesis for FTV [C]. Picture Coding Symposium (PCS),2010,2010:5-8.
    [115]Lee Jin Young, Wey Hochen,Park Du-Sik. A novel approach for efficient multi-view depth map coding [C]. Picture Coding Symposium (PCS),2010, 2010:302-305.
    [116]Oh Kwan-Jung, Vetro Anthony,Ho Yo-Sung. Depth coding using a boundary reconstruction filter for 3-D video systems [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2011,21(3):350-359.
    [117]Oh Byung Tae, Lee Jaejoon,Park Du-Sik. Depth map coding based on synthesized view distortion function [J]. Selected Topics in Signal Processing, IEEE Journal of,2011,5(7):1344-1352.
    [118]Vetro Anthony, Wiegand Thomas,Sullivan Gary J. Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard [J]. Proceedings of the Ieee,2011,99(4):626-642.
    [119]Wu Dajun, Pan Feng, Lim Keng Pang, Wu Si, et al. Fast intermode decision in H.264/AVC video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2005,15(7):953-958.
    [120]You Jongmin, Kim Wonkyun,Jeong Jechang.16x16 macroblock partition size prediction for H.264 p slices [J]. Consumer Electronics, IEEE Transactions on, 2006,52(4):1377-1383.
    [121]Kuo Tien-Ying,Chan Chen-Hung. Fast variable block size motion estimation for H.264 using likelihood and correlation of motion field [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2006,16(10):1185-1195.
    [122]Kim Byung-Gyu. Novel inter-mode decision algorithm based on macroblock (mb) tracking for the p-slice in H.264/AVC video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2008,18(2):273-279.
    [123]Kim Jong-Ho,Kim Byung-Gyu. Fast block mode decision algorithm in H.264/AVC video coding [J]. Journal of Visual Communication and Image Representation,2008,19(3):175-183.
    [124]Yu Andy C-W, Martin Graham R,Park Heechan. Fast inter-mode selection in the H.264/AVC standard using a hierarchical decision process [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2008,18(2):186-195.
    [125]Shen Liquan, Liu Zhi, Zhang Zhaoyang.Shi Xuli. Fast inter mode decision using spatial property of motion field [J]. Multimedia, IEEE Transactions on, 2008,10(6):1208-1214.
    [126]Liu Zhi, Shen Liquan,Zhang Zhaoyang. An efficient intermode decision algorithm based on motion homogeneity for H.264/AVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2009,19(1):128-132.
    [127]Zeng Huanqiang, Cai Canhui,Ma Kai-Kuang. Fast mode decision for H.264/AVC based on macroblock motion activity [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2009,19(4):491-499.
    [128]Pan Feng, Lin Xiao, Rahardja Susanto, Lim Keng Pang, et al. Fast mode decision algorithm for intraprediction in H.264/AVC video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2005,15(7): 813-822.
    [129]Wang Jia-Ching, Wang Jhing-Fa, Yang Jar-Ferr,Chen Jang-Ting. A fast mode decision algorithm and its vlsi design for H.264/AVC intra-prediction [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2007,17(10): 1414-1422.
    [130]Zeng Huanqiang, Ma Kai-Kuang,Cai Canhui. Hierarchical intra mode decision for H.264/AVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2010,20(6):907-912.
    [131]Martinez-Enriquez Eduardo,Jimenez-Moreno Amaya, Angel-Pellon Miguel,Diaz-De-Maria Fernando. A two-level classification-based approach to inter mode decision in H.264/AVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2011,21(11):1719-1732.
    [132]Jung Seung-Won, Baek Seung-Jin, Park Chun-Su,Ko Sung-Jea. Fast mode decision using all-zero block detection for fidelity and spatial scalable video coding [J]. Circuits and Systems for Video Technology,IEEE Transactions on, 2010,20(2):201-206.
    [133]Lin Hung-Chih, Peng Wen-Hsiao,Hang Hsueh-Ming. Fast context-adaptive mode decision algorithm for scalable video coding with combined coarse-grain quality scalability (CGS) and temporal scalability [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2010,20(5):732-748.
    [134]Kuo Tien-Ying, Lai Yun-Yang,Lo Yi-Chung. Fast mode decision for non-anchor picture in multiview video coding [C]. Broadband Multimedia Systems and Broadcasting (BMSB),2010 IEEE International Symposium on,2010:1-5.
    [135]Ding Li-Fu, Tsung Pei-Kuei, Chien Shao-Yi, Chen Wei-Yin, et al. Computation-free motion estimation with inter-view mode decision for multiview video coding [C].3DTV Conference,2007,2007:1-4.
    [136]Tourapis Alexis Michael, Wu Feng,Li Shipeng. Direct mode coding for bi-predictive pictures in the JVT standard [C]. Circuits and Systems,2003. ISCAS'03. Proceedings of the 2003 International Symposium on,2003: 11-700-11-703 vol.702.
    [137]Guo Xun, Lu Yan, Wu Feng,Gao Wen. Inter-view direct mode for multiview video coding [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2006,16(12):1527-1532.
    [138]Sullivan Gary J,Wiegand Thomas. Rate-distortion optimization for video compression [J]. Signal Processing Magazine, IEEE,1998,15(6):74-90.
    [139]Wiegand Thomas, Schwarz Heiko, Joch Anthony, Kossentini Faouzi, et al. Rate-constrained coder control and comparison of video coding standards [J]. Circuits and Systems for Video Technology,IEEE Transactions on,2003,13(7): 688-703.
    [140]Su Yeping, Vetro Anthony,Smolic Aljoscha. Common test conditions for multiview video coding [J]. JVT-T207, Klagenfurt, Austria,2006.
    [141]Koo Han-Suh, Jeon Yong-Joon,Jeon Byeong-Moon. MVC motion skip mode [J]. ISO/IECJ TC1/SC29/WG11 and ITU,2007.
    [142]Bjontegard Gisle. Calculation of average PSNR differences between RD-curves [J]. ITU-T VCEG-M33,2001.
    [143]Li Reoxiang, Zeng Bing,Liou Ming L. A new three-step search algorithm for block motion estimation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,1994,4(4):438-442.
    [144]Po Lai-Man,Ma Wing-Chung. A novel four-step search algorithm for fast block motion estimation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,1996,6(3):313-317.
    [145]Zhu Shan,Ma Kai-Kuang. A new diamond search algorithm for fast block-matching motion estimation [J]. Image Processing, IEEE Transactions on, 2000,9(2):287-290.
    [146]Zhu Ce, Lin Xiao,Chau Lap-Pui. Hexagon-based search pattern for fast block motion estimation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2002,12(5):349-355.
    [147]Zhu Ce, Lin Xiao, Chau Lappui,Po Lai-Man. Enhanced hexagonal search for fast block motion estimation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2004,14(10):1210-1214.
    [148]Cheung Chun-Ho,Po Lai-Man. Novel cross-diamond-hexagonal search algorithms for fast block motion estimation [J]. Multimedia, IEEE Transactions on,2005,7(1):16-22.
    [149]Chen Zhibo, Xu Jianfeng, He Yun,Zheng Junli. Fast integer-pel and fractional-pel motion estimation for H.264/AVC [J]. Journal of Visual Communication and Image Representation,2006,17(2):264-290.
    [150]Tourapis Alexis M, Au Oscar C,Liou Ming L. Predictive motion vector field adaptive search technique (pmvfast)-enhancing block based motion estimation [C]. Proceedings of SPIE,2001:883-892.
    [151]Tourapis Alexis M, Au Oscar C,Liou Ming L. Highly efficient predictive zonal algorithms for fast block-matching motion estimation [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2002,12(10):934-947.
    [152]Tourapis Alexis M. Enhanced predictive zonal search for single and multiple frame motion estimation [C]. Electronic Imaging 2002,2002:1069-1079.
    [153]Li Wenhua,Salari Ezzatollah. Successive elimination algorithm for motion estimation [J]. Image Processing, IEEE Transactions on,1995,4(1):105-107.
    [154]Gao Xq, Duanmu Cj,Zou Cr. A multilevel successive elimination algorithm for block matching motion estimation [J]. Image Processing, IEEE Transactions on, 2000,9(3):501-504.
    [155]Ahn Tae Gyoung, Moon Yong Ho,Kim Jae Ho. Fast full-search motion estimation based on multilevel successive elimination algorithm [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2004,14(11): 1265-1269.
    [156]Chen M-J, Li G-L, Chiang Yi-Yen,Hsu Ching-Ting. Fast multiframe motion estimation algorithms by motion vector composition for the MPEG-4/AVC/H. 264 standard [J]. Multimedia, IEEE Transactions on,2006,8(3):478-487.
    [157]Su Yeping,Sun Ming-Ting. Fast multiple reference frame motion estimation for H.264/AVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2006,16(3):447-452.
    [158]Huang Yu-Wen, Hsieh Bing-Yu, Chien Shao-Yi, Ma Shyh-Yih, et al. Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC [J]. Circuits and Systems for Video Technology, IEEE Transactions on,2006,16(4):507-522.
    [159]Aydinoglu Haluk,Hayes Monson H. Stereo image coding:A projection approach [J]. Image Processing, IEEE Transactions on,1998,7(4):506-516.
    [160]Kim Yongtae, Kim Jiyoung,Sohn Kwanghoon. Fast disparity and motion estimation for multi-view video coding [J]. Consumer Electronics, IEEE Transactions on,2007,53(2):712-719.
    [161]Xu Xiaozhong.He Yun.Fast disparity motion estimation in MVC based on range prediction [C]. Image Processing,2008. ICIP 2008.15th IEEE International Conference on,2008:2000-2003.
    [162]Ding Li-Fu, Tsung Pei-Kuei, Chen Wei-Yin, Chien Shao-Yi, et al. Fast motion estimation with inter-view motion vector prediction for stereo and multiview video coding [C]. Acoustics, Speech and Signal Processing,2008. ICASSP 2008. IEEE International Conference on,2008:1373-1376.
    [163]Fecker Ulrich,Kaup Andre. Statistical analysis of temporal and spatial block matching for multi-view sequences [C]. ISO/IEC JTC1/SC29/WG11, MPEG2005/M11546,2005.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700