多视点视频编码方法的研究

英文题名：Research on Multi-view Video Coding
作者：陈建乐
论文级别：博士
学科专业名称：通信与信息系统
中文关键词：多视点视频编码 ; 视差预测 ; 全局视差估计 ; 颜色差异补偿 ; 码率控制
英文关键词：multi-view video coding ; disparity estimation ; global disparity estimation ; color-variation compensation ; rate control
学位年度：2006
导师：刘济林
学科代码：081001
学位授予单位：浙江大学
论文提交日期：2006-05-01

摘要

为满足视频场景自然和真实再现需求，具有3D视觉功能的多视点视频技术正越来越受到学术界和工业界的重视，并成为近年来视频研究的热点之一。多视点视频蕴涵了景物的深度信息，在自然场景的表征上更具真实感，在3D电视、自由视点电视、具有临场感的可视会议及虚拟现实等领域展现了广阔的应用前景。然而多视点视频具有巨大的数据量，存贮和传输十分困难，必须对其进行高效的压缩。在多视点视频中，除了各个视频流内具有很强的空间和时间相关性，各视点之间也具有一定的交叉相关性，如何有效地利用这些相关性是提高多视点视频编码效率的关键。为提高多视点视频的压缩效率，本文在多视点视频编码的预测框架、运动与视差矢量的预测、基于颜色差异补偿的视差预测编码以及码率控制等方面进行了研究。
     本文首先分析了多视点视频中视差预测特性和各种相关性的相对大小，在此基础上，提出了基于H.264的多视点视频编码方案，使用H.264的帧内方向预测和帧间多模式预测解除多视点视频的空间相关性、时间相关性和交叉相关性。在视差预测中引入全局视差预测编码模式，并将其集成到H.264的多模式预测编码中，提高了压缩效率：为减少编码视差矢量和运动矢量所需的比特数，提出了改进的视差矢量和运动矢量预测方法，该方法除了利用视差矢量和运动矢量的空间相关性，还利用了它们在相邻视点或相邻时刻的对应关系。另外，本文还提出了一种基于分层结构的视差预测框架，使多视点视频码流具有“视点可分级”、“视点随机访问”和“部分视点解码”的功能。
     在多视点视频中，由于各摄像机所处方位不同的影响，接收到的光线强度存在差异，同时各摄像机的增益、电平等也不能保证完全一致，导致实际获得的多视点视频图像之间存在着颜色(包括亮度和色差信号)差别，从而严重影响多视点视频的压缩性能。为进一步提高多视点视频的压缩效率，本文深入研究了基于颜色差异补偿的视差预测编码。在分析了不同视点图像之间的颜色差异基础上，对其进行建模，然后根据对模型的简化程度，提出和实现了多种基于颜色差异补偿的视差预测编码方法：全局线性颜色差异补偿、全局非线性颜色差异补偿、局部颜色差异补偿、全局与局部自适应的颜色差异补偿。实验结果验证了本文的颜色差异补偿方法明显改善了视差预测，提高了多视点视频的压缩效率。
     码率控制是视频编码中非常重要的技术之一，它是任何视频编码都不能回避的问题。对于单视点视频编码，由于传统的多通道VBR码率控制算法计算复杂度高，不适用于实时视频应用，本文提出了一种低复杂度的单通道VBR码率控制算法。该算法根据图像复杂度来分配当前图像的目标比特数，算法中的模型参数随输入视频自适应更新，适应了视频的场景变化。
    实验结果表明，该算法在满足码率条件下可获得相对稳定的视频质量。目前，对多视点视频编码码率控制的研究尚未深入，本文提出了一种适合本文多视点视频编码方案的码率控制算法。该算法根据每个视点图像的编码复杂度来分配图像的目标编码比特数；另外该算法为每个视点的图像建立了独立的二次信源模型，能够精确控制每个视点的实际编码比特数。实验结果表明，该算法能够有效地控制多视点视频的码率，同时获得了较高的编码效率。
Multi-view video, which can provide viewers with the benefits of added realism, selective viewing, and improved scene understanding, will be widely used in the fields of 3D TV, free-viewpoint TV, immersive videoconferencing, virtual reality etc. However a large amount of data is one major obstacle for using multi-view video is the large amount of data. A multi-fold increase in bandwidth over the existing single-view makes it extremely tough to transmit and store multi-view video data. This thesis mainly concerns the problems of highly efficient multi-view video coding (MVC). To achieve high compression efficiency, co-relation between the different views must be exploited in multi-view video coding scheme. Based on this scenario, we propose various efficient encoding schemes.
    Firstly, we proposed an H.264-based multi-view video coding scheme. It uses the advanced predictive method of H.264 to eliminate the spatial, temporal and inter-view co-relation in multi-view video. According to the characteristic of multi-view video, global disparity coding method is employed. An eight-parameter global disparity model is used and two global disparity coding modes are proposed. To decrease the coding bit number of motion vector (MV) and disparity vector (DV), an optimized MV&DV predicted method is proposed. It utilizes not only the MV&DV of casual neighboring blocks, but also that of corresponding blocks in the adjacent images. To support view-scalability, a layered disparity prediction structure is proposed. Using the layered disparity prediction structure, our MVC stream can be random accessed and decoded partly.
    Due to the distinct performance of cameras, there exists serious color-variation (including brightness variation) between the images of different views. These variations well impact the compression performance of multi-view video coding. To improve the coding efficiency of MVC, we investigate the method of color-variation compensation disparity prediction. Firstly we analyze and model color-variation based on the image formation theory. Then, various color-variation compensation methods are proposed according to the simplified models. They are global linear color-variation compensation, global non-linear color-variation compensation, local color-variation compensation and global-local-adaptive color-variation compensation. Experiment results shows these methods can greatly improve the performance of disparity prediction and the coding efficiency of MVC.
    Rate-distortion (R-D) analysis and rate control play an important role in video coding and communication systems, which are to prevent buffer malfunction and provide the highest possible
    video quality under the constraints of rate and delay. In this thesis, rate control algorithms of both traditional single-view video coding and MVC are provided. Due to the high computational complexity of multi-pass variable bitrate (VBR) rate control algorithm, we proposed a low-complexity single-pass VBR rate control algorithm for real-time video coding application. It allocates the target bit number for each picture according to its complexity, and can achieve constant video quality. Although the rate control of video coding is deeply investigated, rate control of MVC is seldom concerned. In this thesis, we proposed a rate control algorithm for H.264-based MVC scheme. The algorithm employs an independent quarter rate-distortion model for each viewpoint video. Simulation results showed that it could accurately control the used bit number of each frame.

引文

[Ahmed, 1974] N. Ahmed, T. Natarajan and K. R. Rao, "Discrete transform", IEEE Transaction on Computation, vol.23, pp: 90-93, 1974.
    [Aljoscha, 2004] S. Aljoscha, M.C. Chen, "3DAV exploration of video-based rendering technology in MPEG [J]", IEEE Transaction on Circuit and Systems for Video Technology, vol.14(3), pp:348-356, 2004.
    [Anthony, 2004] V. Anthony, M. Wojciech, "Coding approaches for end-to-end 3D TV systems", Mitsubish Lab technique report, 2004.
    [Borner, 2000] R. Borner, B. Duckstein, O. Machui, and T. Sikora, "A family of single-user autostereoseopie displays with head-tracking capabilities", IEEE Transactions Circuits and Systems for Video Technology, vol.10(2), pp: 234 - 243, 2000.
    [Boyce, 2004] J. Boyce, "Weighted prediction in the H.264/MPEG4 AVC video coding standard", ISCAS, pp: 789-792, 2004.
    [Buehler, 2001] C. Buehler, M. Bosse, L. Mcmillan, S. Gortler, and M. Cohen, "Unstructured lumigraph rendering", Proceedings ofSIGGRAPH, pp: 425-432, 2001.
    [Chang, 1996] Y.C. Chang, J. F Reid, "RGB calibration for color image analysis in machine vision [J]", IEEE Transaction on Image Processing, vol.5(10), pp: 1414-1422, 1996.
    [Chen, 1995] S.E. Chen, "Quick Time VR-an image based approach to virtual environment navigation", Proceedings of SIGGRAPH, pp: 29-38, 1995.
    [Cheung, 2004] H.K. Cheung, W.C. Siu, D.G. Feng, "Novel illumination compensation scheme for sprite coding [C]", International Conference on Signal Processing, vol.2, pp: 1223-1226, 2004.
    [Chiang, 1997] T. Chiang and Y.Q. Zhang, "A new rate control scheme using quadratic rate distortion model," IEEE Trans. Circuits Syst. Video Technol., vol.7, pp: 246-250, 1997.
    [Clarke, 1984] R.J.Clarke, "Transform coding of image", New York." Academic Press, 1984.
    [Cox, 1995] I.J. Cox, "Dynamic histogram warping of images pairs for constant image brightness", In Proc.of IEEE Int. Conf. on Image Processing, pp: 2366-2369, 1995.
    [Cutting, 1995] J.E. Cutting, P.M. Vishton, "Perceiving layout and knowing Distances: the interaction, relative potency, and contextual use of different information about depth", Handbook, 2001 of Perception and Cognition:Perception of Space and Motion, Academic Press, vol. 5, pp: 69-117．1995．
    [陈，2004] 陈韩锋，戚飞虎，全局运动估计的阈值可变双迭代法，上海交通大学学报，vol．38(1)，PP：1-4，2004．
    [Daubechies, 1988] I. Daubechies, "Orthogonal bases of compactly supported wavelets", Communications on Pure and Applied Mathematics, vol.41, pp: 909-996, 1988.
    [Dinstein, 1989] I. Dinstein, et al., "Compression of stereoscopic images and the evaluation of its effects on 3D perception", SPIE Conf. Applications of Digital Image Processing Ⅻ, vol. 1153, pp: 522-530, 1989.
    [Dufaux, 2000] F. Dufaux and J. K. Efficient, "Robust and fast global motion estimation for video coding", IEEE Trans. on Image Processing, vol.9, no.3, pp: 497-501, 2000.
    [Everett, 1963] H. Everett Ⅲ, "Generalized lagrange multiplier method for solving problems of optimum allocation of resources", Operations Research, vol. 11, pp: 399-417, 1963.
    [Fehn, 2002] C. Fehn, P. Kauff, et al., "An evolutionary and optimized approach on 3D-TV", Proceedings of International Broadcast Conference, pp: 357-365, 2002.
    [Francesco, 2004] I. Francesco, T. Emanuele, S. Oliver, "Three-dimensional image processing in the future of immersive media [J]", IEEE Transaction on Circuit and Systems for Video Technology, vol.14(3), pp: 288-303, 2004.
    [Gibson, 1997] J.D. Gibson, T. Berger, T. Looabaugh, et al., "Digital compression for multimedia principles & standards", Morgan Kaufmann Publishers Inc., 1997.
    [Gilge, 1990] M. Gilge, "Motion estimation by scene adaptive block matching (SABM) and illumination correction", SPIE Syrup. Image Processing Algorithms and Tech., vol.1244, pp:355-366, 1990.
    [Guo, 2004] X. Guo, Q.M. Huang, "Multiview video coding based on global motion model [C]", Proceeding of pacific-rim conference on multimedia, pp: 665-672, 2004.
    [Hamagishi, 1995] G. Hamagishi, M. Sakata, A. Yamashita, et al., "New stereoscopic LC displays without special g lasses", Asia Display, pp: 791-794, 1995.
    [Hang, 1997] H. Hang and J. Chen, "Source model for transform video coder and its application. Ⅰ.fundamental theory," IEEE Trans. Circuits Syst. Video Technol., vol. 7(2), pp: 287-298, 1997.
    [Healey, 1994] G. Healey and R. Kondepudy, "Radiometric CCD camera calibration and noise estimation", IEEE Trans. Pattern Anal. Machine Intell, vol.16, no.3, pp.267-276, 1994.
    [He, 2001] Z. He, "ρ-domain rate-distortion analysis and rate control for visual coding and communications", Dissertation of University of California Santa Barbara, 2001.
    [Hirose, 1997] M. Hirose, "Image-based virtual world generation", IEEE Multimedia, vol.4, no.l,pp.27-33, 1997.
    [Hsu, 1997] C.-Y. Hsu, A. Ortega, and M. Khansari, "Rate control for robust video transmission over wireless channels", In Proc. Visual Communications and Image Processing (VCIP '97), SanJose, CA, 1997.
    [韩，2003] 韩军功，卢朝阳，立体图像序列的压缩方法，通信学报，vol．24(6)，PP：113-123，2004．
    [赫，1983] 赫葆源，张厚聚，陈舒永，《实验心理学》，北京大学出版社，pp．468-534，1983．
    [贺，2001] 贺玉文，钟玉琢，杨士强，一种快速全局运动补偿编码方法，电子学报，vol．29(2)，PP：1-3．2001．
    [ISO, 1991] ISO/IEC CD 11172, "Coding of moving pictures and associated audio for digital storage media at up to 1.5Mbits/sec—Part 2: Coding of moving pictures information", 1991
    [ISO, 1993a] ISO-IEC/JTC1/SC29/WG11, N0400, "Test Model 5", Draft,1993
    [ISO, 1993b] ISO/IEC JTCI IS 11172-2 (MPEG-1), "Information technology —Coding of moving pictures and associated audio for digital storage media up to about 1.5 Mbit/s", 1993.
    [ISO, 1994] ISO/IEC JTC1 IS 13818-2 (MPEG-2), "Information technology — Generic coding of moving pictures and associated audio", 1994.
    [ISO, 1995] ISO/IEC 13818-2, "Information technology—Generic coding of moving pictures and associated audio Part 2: Video", 1995.
    [ISO, 1996] ISO/IEC 13 818-2, AMD 3, "MPEG-2 multiview profile", ISO/IEC JTC1/SC29/WG11, document no. N1366, 1996.
    [ISO, 1998] ISO/IEC FD1S 14496-2, "Information technology —Generic coding of audio-visual objects Part 2: Visual", 1998.
    [ISO, 2001] ISO/IEC JTC 1/SC29/WG11, "MPEG-4 video verification model version 18.0", 2001.
    [ISO, 2004] ISO/IEC JTC1/SC29/WG11, "Call for evidence on multi-view video coding", MPEG document N6720, Palma de Mallorca, 2004.
    [ISO, 2005a] ISO/IEC JTC1/SC29/WG11, "Survey of algorithms for multi-view video coding", MPEG Document N6909, Hong Kong, 2005.
    [ISO, 2005b] ISO/IEC JTC1/SC29/WG11, "Requirements on multi-view video coding", MPEG Document N7282, Poznan, Poland, July 2005.
    [ITU, 1993] ITU-T Recommendation H.261, "Video codec for audiovisual services at 64 kbit/s", Proc. COM 15R 16-E, 1993.
    [ITU, 1996] ITU-T Recommendation H.263, "Video coding for low bitrate communication", 1996.
    [ITU, 1997] ITU-T/SG55, "Video codec test model, near-term, TMNS", ITU Study Group 16, Video Coding Experts Group, Document Q15-A-59, Portland, USA, 1997.
    [ITU, 2003] ITU-T Rec. H.264/AVC /ISO/IEC 11496-10, "Advanced Video Coding", Final Committee Draft, Document JVT-G050, 2003.
    [Izquierdo, 1997] E. Izquierdo, "Stereo matching for enhanced telepresence in three-dimensional video communications", IEEE Transactions on CSVT, vol. 7, no. 4, pp: 629-643, 1997.
    [Izquierdo, 1998] M.E. Izquierdo, "Stereo image analysis for multi-viewpoint telepresence applications", Signal Processing: Image Communication, vol.11, pp: 231-254, 1998.
    [Izquierdo, 1999] E. Izquierdo, X. H. Feng, "Modeling arbitrary objects based on geometric surface conformity", IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no.2, pp:336-352, 1999.
    [Jagmohan, 2003] A. Jagmohan, K. Ratakonda, "MPEG-4 One-Pass VBR Rate Control for Digital Storage", IEEE Trans. On Circuits and Systems for Video Technology, vol. 13, no.5, pp: 447-452,2003.
    [Jens-Rainer, 1999] O. Jens-Rainer. "Stereo/Multiview video encoding using the MPEG family of standards", SPIE 3639. San Jose California, pp: 242-253, 1999.
    [Jia, 2003] H.Z. Jia, W. Gao, Y. Lu, "Stereo video coding based on global displacement compensated prediction", Proceeding of pacific-rim conference on multimedia, pp: 61-65, 2003.
    [Julesz, 1971] B. Julesz, "Foundations of Cyclopean Perception", Chicago: The University of Chicago Press, 1971.
    [Kamikura, 1998] K. Kamikura, H. Watanabe, H. Jozawa, H. Kotera, and S. Ichinose, "Global brightness-variation compensation for video coding", IEEE Trans. Circuits and Systems for Video Technology, vol.8(8), 1998.
    [Kanade, 1997] T. Kanade, P. Rander, P. Narayanan, "Virtualized reality: Constructing virtual worlds from real scenes", IEEE Multimedia, Immersive Telepresence 4, pp: 34-47, 1997.
    [Kang, 1998] S. B. Kang, "Geometrically valid pixel reprojection methods for novel view synthesis", ISPRS Journal of Photogrammetry & Remote Sensing, vol.53, pp: 342-353, 1998.
    [Keller, 2003] Y. Keller and A. Averbuch, "Fast gradient methods based on global motion estimation for video compression", IEEE Transactions on Circuits and Systems for Video Technology, vol.13, no. 4, pp: 300 - 309, 2003.
    [Kim, 2003] S.H. Kim and R.H. Park, "Fast local motion compensation algorithm for video sequences with brightness variations", IEEE Trans. Circuits and Systems for Video Technology, vol. 13(4), 2003.
    [Kimata, 2004a] H. Kimata, M. Kitahara, "Preliminary results on multiple view video coding," ISO/IEC JTC1/SC29/WG11 Doc M10976, 2004.
    [Kimata, 2004b] H. Kimata, M. Kitahara, K. Kamikura, and Y. Yashima, "Multi-view video coding using reference picture selection for free-viewpoint video communication", Picture Coding Symposium 2004, San Francisco, California, USA, pp: 15-17, 2004.
    [Konrad, 2001] J. Konrad, "Visual communication of tomorrow: natural, efficient and flexible", IEEE Communication Magazine, vol. 39, no. 1, pp: 126-133, 2001.
    [Kost,1991] B. Kost and S. Pastoor, "Visibility thresholds for disparity quantization errors in stereoscopic displays", Proc. SID, vol. 32, no. 2, pp. 165-170, 1991.
    [Lakshman, 1998] T.V. Lakshman, A. Ortega and A.R. Reibman, "VBR video: tradeoffs and potentials", IEEE Proc., vol. 86(5), pp: 952-973, 1998.
    [Lambert, 2006] P. Lambert, D.N. Wesley, D.N. Philippe, M. Ingfid, D. Pier, "Rate-distortion performance of H.264/AVC compared to slate-of-the-art video codecs", IEEE Trans. on Circuits and Systems for Video Technology, vol. 16, no. 1, 2006.
    [Lee, 2000] H.J. Lee, T.H. Chiang, Y.Q. Zhang, "Scalable rate control for MPEG-4 video", IEEE Trans. Circuit Syst. Video Technology, vol. 10, pp: 878-894, 2000.
    [Li, 2003] G.P. Li, Y. He, "A novel multi-view video coding scheme based on H.264 [C]", ICICS-PCM, pp: 493-497, 2003.
    [Li, 2006] Z.G. Li, W. Gao, F. Pan, S.W. Ma, K.P. Lira, G.N. Feng, X. Lin, S. Rahardja, H.Q. Lub and Y. Lu. "Adaptive rate control for H.264", Journal of Visual Communication and Image Representation, vol.17, no.2, pp: 376-406, 2006.
    [Lim, 2003] J. Lira, J. Kim, K.N. Ngan, and K. Sohn, "Advanced rate control technologies for 3D-HDTV", IEEE Transactions on Consumer Electronics, vol.49, no.4, pp: 1498-1507, 2003.
    [Lim, 2004] J. Lim, K. Ngan, W. Yang and K. Sohn, "Muitiview sequence CODEC with view scalability", Signal Processing: Image Communication, vol. 19, no. 3, pp: 239-256, 2004.
    [Liu, 1993] J. Liu and R. Skerjanc, "Stereo and motion correspondence in a sequence of stereo images", Signal Processing: Image Commun., vol.5, pp: 305-318, 1993.
    [Lopez, 2004] J. Lopez, G. Chen, J.H. Kim and A. Ortega, "Illumination compensation for multi-view video compression", ISO/IEC JTC1/SC29/WG11 Doc M11132, 2004.
    [Lukacs 1986] M.E. Lukacs, "Predictive coding of multi-viewpoint image sets", Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, pp. 521-524, 1986.
    [Luo, 2003] Y. Luo, Z Y. Zhang, P. An, "Stereo video coding based on frame estimation and interpolation [J]", IEEE Transaction on Broadcasting, vol.49(1), pp: 14-21, 2003.
    [Ma, 2003] S. Ma, Z.G. Li, F. Wu, "Proposed draft of adaptive rate control", doc. JVT-H017, JVT 8th Meeting, Geneva, 2003.
    [Ma, 2005] S. Ma, W. Gao, Y. Lu, "Rate-distortion analysis for H.264/AVC video coding and its application to rate control", IEEE Trans. On Circuits and Systems for Video Technology, vol. 15, no.12, pp: 1533-1544, 2005.
    [Malassiotis, 1994] S. Malassiotis and M.G. Strintzis, "Joint motion/disparity estimation for stereoscopic image sequences", SPIE Conf Visual Commun. Image Processing, vol. 2308, pp: 614-625, 1994.
    [Matusik, 2004] W. Matusik, H. Pfister, "3D TV: A sealable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes", ACM Transaction on Graphics, vol. 23, no. 3, pp: 811-821, 2004.
    [Michael, 1992] G.P. Michael, "Data compression of stereopairs", IEEE Trans. on Communications, vol.40, no. 4, pp: 684-696, 1992.
    [Motoki, 1995] T. Motoki, H. Isono, and I. Yuyama, "Present status of three-dimensional television research", Proceedings of the IEEE, vol. 83, no. 7, pp. 1009-1021, 1995.
    [Naito, 1999] S. Naito and S. Matsumoto, "34/45Mbps 3D-HDTV digital coding scheme using a modified motion compensation with disparity vectors", VCIP, SPIE, vol.3653, pp: 1082-1089,1999.
    [Negahdaripoui, 1993] S. Negahdaripoui, C.H. Yu, "A generalized brightness change model for computing optical flow", In Proc. 4th Int. Conference on Computer Vision, Berlin, pp: 2-11, 1993.
    [Ngan, 2000] K.N. Ngan, M. Strintzis, M. Tanimoto and Y. Wang, "Special issue on 3-D video technology", IEEE Transactions on Circuits and Systems on Video Technology, vol. 10, no. 2, pp: 185-187, 2000.
    [Okoshi, 1976] T. Okishi, "Three-Dimensional imaging techniques", Academic Press. 1976.
    [Ortega, 1998] A.Ortega and K.Ramchandran, "Rate-distortion methods for image and video compression", IEEE Signal Processing Magazine, vol. 15(6), pp: 23-50, 1998.
    [Pastoor, 1989] S. Pastoor and K. Schenke, "Subjective assessments of the resolution of viewing directions in a multi-viewpoint 3D TV system", Proc. SID, vol. 30, no. 3, pp: 217-223, 1989.
    [Pastoor, 1997] S. Pastoor and M. Wopking, 3-D displays, "A review of current technologies",visplays 17, pp: 100-10, 1997.
    [Peng, 2005] Y. Peng, J. Boyce, A.M Tourapis, "Localized weighted prediction for video coding", IEEE International Symposium on Circuits and Systems (ISCAS), Vol. 5, pp: 4365-4368, 2005
    [Pollard, 2000] S. Pollard, M. Pilo, S. Hayes, A. Lorusso, "View synthesis by trinocular edge matching and transfer", Image and Vision Computing, vol. 18, pp: 749-757, 2000.
    [Puri, 1997] A. Puri, R.V. Kollarits, and B.B. Haskell, "Basics of stereoscopic video, new compression results with MPEG-2 and a proposal for MPEG-4", Signal Processing: Image Communication, vol. 10, pp: 201-234, 1997.
    [Rath, 1999] G.B. Rath, A. Makur, "Iterative least squares and compression based estimations for a four-parameter linear global motion model and global motion compensation", IEEE Transactions on Circuits and Systems for Video Technology, vol.9(7), pp: 1075-1099, 1999.
    [Rhee, 2000] L. Rhee, G.R. Martin, S. Muthukrishnan, R.A. Packwood, "Quadtree-structured variable-size block-matching motion estimation with minimal error", IEEE Transactions on Circuits and Systems for VideoTechnology, vol. 10 (1), pp: 42-50, 2000.
    [Ribas, 1999] J. Ribas-Corbera and S. Lei, "Rate control in DCT Video coding for low-delay communications", IEEE Trans. Circuit Syst. Video Technol., vol. 9(1), pp: 172 -185, 1999.
    [Riley, 1997] M.J. Riley, I.E.G. Richardson, "Digital Video Communications", Artech House Inc., 1997.
    [Rodrigues, 2001] N.M.M. Rodrigues, V.M.M. Silva and S.M.M. Faria, "Hierarchical motion compensation with spatial and luminance transformations", ICIP, pp: 518-521, 2001.
    [Scharstein, 2002] D. Seharstein and R. Szeliski, "A taxonomy and evaluation of dense two-frame stereo correspondence algorithms", International Journal of computer vision, vol.47(1/2/3), pp: 7-42, 2002.
    [Schertz, 1989] A. Schertz, "Source coding of stereoscopic television pictures", Third intl. conf. on image proc. and its applications, IEE Conf. Pub., no.307, pp: 462-464, 1989.
    [Schreer, 2005] O. Schreer, P. Kauff and T. Sikora, "3D video communication—algorithms, concepts and real-time systems in human centred communication", John Wiley & Sons Ltd., NewYork, 2005.
    [Sethuraman, 1994] S. Sethuraman, M.W. Siegel, and A.G. Jordan, "A multi-resolution framework for stereoscopic image sequence compression", International Conference on Image Processing, pp: 361-365, 1994.
    [Sethuraman, 1996] S. Sethuraman, "Stereoscopic image sequence compression using multi-resolution and quadtree decomposition based disparity and motion-adaptive segmentation",[Ph.D Thesis], Carnegie Mellon University, 1996.
    [Shen, 2004] Y.F. Shen, D.M. Zhang, C. Huang, J.T. Li, "Adaptive weighted prediction in video coding", IEEE International Conference on Multimedia and Expo (ICME) 2004, pp: 427-430, 2004
    [Shiwa, 1994] S. Shiwa, N. Tetsutani, K. Akiyama, S. Ichinose, T. Komatsu, "Development of direct-view 3Ddisplay for videophones using 15 inch LCD and lenticular sheet", IEICE Transactions Information andSystems E, pp: 940 - 948, 1994.
    [Shoham, 1988] Y. Shoham and A. Gersho, "Eficient bit allocation for an arbitrary set of quantizers", IEEE Trans. Acoust., Speech, Signal Processing, vol.36, pp: 1445-1453, 1988.
    [Shres, 1993] D. R. Shres, F.F. Holly, and P.G. Hamder, "High ratio bandwidth reduction of video imaging for teleoperation", SPIE lmage and Video Processing, vol. 1903, pp: 236-245, 1993.
    [Skerjane, 1991] R. Skerjanc and J.Liu "A three camera approach for calculating disparity and synthesizing intermediate pictures", Signal Proeessing: Image Commun., vol.4, no.1, pp: 55-64,1991.
    [Smolic, 2003] A. Smolic, H. Kimata, "Report on 3DAV exploration", ISO/IEC dTC1/SC29/WG11, Doc. N5878, 2003.
    [Smolic, 2004a] A. Smolicand and D. McCutchen, "3DAV exploration of video-based rendering technology in MPEG", IEEE Transaction on Circuits and Systems for Video Technology, voi. 14(3), pp: 348-356,2004.
    [Smolic, 2004b] A. Smolic, H. Kimata, "Requirements on Multi-view Video Coding", ISO/IEC JTC1/SC29/WG11, Doe. N6834, 2004.
    [Smolic, 2005] A. Smolic and P. Kauff, "Interactive 3D video representation and coding technologies", Proceedings of the IEEE, vol.93(1), pp: 98-110, 2005.
    [Song, 1996] Y.J. Song, "Improved disparity estimation algorithm with MPEG-2's scalability for stereoscopic sequences [J]", IEEE Trans. on Consumer Electronics, vol.42(3), pp: 306-311, 1996.
    [Strintzis, 1999] M.G. Strintzis and S.Malassions, "Object-based coding of stereoscopic and 3D image sequences", IEEE Signalprocessing, pp: 14-28, 1999.
    [Su, 2005] Y.P. Su, "Global motion estimation from coarsely sampled motion vector field and the applications student member", IEEE, Ming-Ting Sun, Fellow, IEEE, and Vincent Hsu, IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no.2, pp: 232-242, 2005.
    [Sullivan, 1991] G.J. Sullivan and R.L. Baker, "Rate-distortion optimized motion compensation for video compression using fixed or variable size blocks", GLOBECOM'91, pp: 85-90, 1991.
    [沈，1998) 沈兰荪，图像编码与异步传输，北京：人民邮电出版社，1998。
    [Tamtaoui, 1991] A. Tamtaoui and C. Labit, "Constrained disparity and motion estimators for 3DTV image sequence coding", Signal Processing: Image Commun., vol.4, pp: 45-54, 1991.
    [Tanimoto, 2004] M. Tanimoto, T. Fuji, "Comparison of temporal and spatial predictions for dynamic ray-space coding", ISO/IEC JTC1/SC29/WG11 Doc M10668, 2004.
    [Tappan, 1987] J.H. Tappan, M.E. Wright, and F.E. Sistler, "Error sources in a digital image analysis system", Comput. Electron. Agriculure, vol.2, pp: 109-118, 1987.
    [Tekalp, 1995] A.M.Tekalp, "Digital Video Processing", Englewood Cliffs: Printice Hall, 1995.
    [Tzovaras, 1994] D. Tzovaras, M.G. Strintzis, and H. Sahinoglou, "Evaluation of multiresolution block matching techniques for motion and disparity estimation", Signal Processing." Image Commun, vol.6, no.l, pp: 59-67, 1994.
    [Tzovaras, 1995] D. Tzovaras, N. Grammalidis, and M.G. Strintzis, "Object-based coding of stereoscopic image sequences using joint 3D motion/disparity segmentation", SPIE Conf. Visual Commun. Image Processing, vol.2501, pp: 1678-1689, 1995.
    [Tzovaras, 1996] D. Tzovaras, N. Grammalidis, M.G. Strintzis, "Joint three-dimensional motion/disparity segmentation for object-based stereo image sequence coding", Optical Engineering, vol.35, no.l, pp: 137-144, 1996.
    [Tzovaras, 1997] D. Tzovaras, N. Grammalidis and M.G. Stfintzis, "Object-Based coding of stereo image sequences using joint 3-D motion/disparity compensation", IEEE Trans. on Circuits and Systems for Video Technology, vol.7, no.2, pp: 312-327, 1997.
    [Tzovaras, 1999] D.Tzovaras, L. Kompatsiaris, M.G. Strintzis, "3D object articulation and motion estimation in model-based stereoscopic videoconference image sequence analysis and coding[J]", Signal Processing: Image Communication, vol. 14, pp:817-840, 1999.
    [Ulrich 2005a] F. Ulrich, K.Andre, "H.264/AVC compatible coding of dynamic light fields using transposed picture ordering", 2005 European Signal Processing Conference, Antalya, Turkey, Sep. 4-8, 2005.
    [Ulrich, 2005b] F. Ulrich, ISO/IEC JTC1/SC29/WG11, "Luminance and chrominance compensation for multi-view sequences using histogram matching", Nice, France, 2005.
    [Valyus, 1966] N. A. Valyus, "Stereos copy, London", The Focal Press, 1966.
    [Vetro 2004] A. Vetro, W. Matusik, H. Pfister, J. Xin, "Coding Approaches for end-to-end 3D TV Systems", Mitsubishi Electric Research Laboratories, Technical report 2004-137,, 2004,
    [Waldowski, 1991] M. Waldowski, "A new segmentation algorithm for videophone application based on stereo image pairs", IEEE Transactions on Communicaitons, vol.39, no. 12, pp: 1856-1868, 1991.
    [Wang, 1997] L. Wang, "Rate control for MPEG video coding", Signal Proc.: Image Communication, vol. 15, pp: 493-511, 2000.
    [Wang, 2000] R.S. Wang et.al, "Multi-view video sequence analysis, compression and virtual viewpoint synthesis", IEEE Trans. on CSVT, vol. 10(3), pp:397-410, 2000.
    [Wang, 2004] H. Wang, J. Lopez, G. Chen, N.-M. Cheung and A. Ortega, "Using inter-view prediction for multi-view video compression", ISO/IEC JTC1/SC29/WG11 Doc M10512, 2004.
    [Watson, 1993] A.B. Watson, "DCT quantization matrices visually optimized for individual images", Proceedings of the SPIE conference on human vision, visual processing and digital display Ⅳ, vol. 1913, pp: 202-216, 1993
    [Westerink, 1999] P.H. Westerink, R. Rajagopalan and C.A. Gonzales, "Two-pass MPEG-2 Variable-bit-rate Encoding", IBM J. RESDEVELOP, vol.43(4), pp: 471-488, 1999.
    [Wiegand, 2003a] T. Wiegand, H. Sehwarz, A. Joch, F. Kossentini, G.J. Sullivan, "Rate-constrained coder control and comparison of video coding standards [J]", IEEE Transaction on Circuit and Systems for Video Technology, vol.13(7), pp: 688-703, 2003.
    [Wiegand, 2003b] T.Wiegand, G.J. Sullivan, G. Bjontegarrd and A.Luthra. "Overview of the H.264/AVC Video Coding Standard", IEEE Transaction on circuits and systems, vol.13, no.7, pp: 560-576, 2003.
    [Wilkens, 1990] C.D. Wilkens, "Three-dimensional stereoscopic display implementation: Guidelines derived from human visual capabilities", Proc. Stereoscopic Displays and Applications, SPIE, vol.1256, pp: 2-11,1990.
    [Woontack, 1999] W. Woontack, "Rate-distortion based dependent coding for stereo images and video: disparity estimation and dependent bit allocation(D)", PH.D Thesis, Dept. of Electrical Engineering, Faculty of the graduate school, University of Southern California, 1999.
    [Woontack, 2000] W. Woontack and O. Antonio, "Overlapped block disparity compensation with adaptive windows for stereo image coding [J]", IEEE Trans. on Circuits and Systems for Video Technology, vol.10 (2), pp: 194-200, 2000.
    [Xu, 1999] L.Q. Xu, A. Loftier, P.J. Sheppard, D. Machin, "True-view videoconferencing system through 3-Dimpression of telepresence", BT Technology Journal vol. 17 (1), pp: 59-68, 1999.
    [Xu, 2004] J.F. Xu, Y. He, "A novel rate control for H.264", 1EEE Symposium on Circuit and System, pp: 809-812, 2004.
    [Yamaguchi,1989] H. Yamaguchi, Y. Tatehira, K. Akiyama and Y. Kobayashi, "Stereoscopic images disparity for predictive coding", Proc. IEEE Int. Conf. ,4coust., Speech, Signal Processing, pp: 1976-1979, 1989.
    [Yang, 1991] X.D Yang, Q.Xiao, and H. Raafat, "Direct mapping between histogram: An improved interactive image enhancement method", IEEE In. Conf. on Systems, Man and Cybernetics, pp: 243-247, 1991.
    [Yang, 2004] W.X. Yang, N.K. Ngan, "MPEG-4 based stereoscopic video sequences encoder [C]", IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.3, pp: 741-744,2004.
    [Yu, 2001] Y. Yu, J. Zhou, Y. Wang, and C.W. Chen, "A novel two-pass vbr coding algorithm for fixed-size storage application", IEEE Trans. Circuits Syst. Video Technol., vol. 11, pp: 345-356,2001.
    [院，1998] 院亮，立体电视的发展方向[J]，电视技术，1998 (7)：2．
    [Zhang, 1999] J. Zhang, M.O. Ahmad, "Quadtree structured region-wise motion compensation for video compression", IEEE Transactions on Circuits and Systems for Video Technology, vol.9(5), pp: 808-821, 1999.
    [Ziegler, 1995] M. Ziegler and S. Panis, "An object-based stereoscopic coder", Intl. workshop on Stereoscopic and Three Dimensional Imaging, pp: 40-45, 1995.
    [章，2000] 章毓晋，图象理解与计算机视觉(M)，北京：清华大学出版社，2000．
    [赵，2005] 赵波，吴成柯，一种新的低延时视频编码码速率控制算法，计算机学报，vol．28(1)，PP：53-59，2005。
    [周，1993] 周荫清，信息理论基础，北京：北京航空航天大学出版社，1993．

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700