IP可视电话的视频编码及码率控制的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
可视电话使人们在通话时能够看到对方影像,是一种发展前景看好的多媒体通信设备,它不仅适用于家庭生活,而且还可以广泛应用于各项商务活动、远程教学、安防监控、医院护理、医疗诊断、科学考察等不同行业的多种领域。随着科学技术的进步和人民生活水平的提高,打破了原先的技术条件和经济状况的限制,大规模发展可视电话成为可能。尤其是面对突如其来的“非典”疫情,各大通讯公司以最快的速度为省城各大医院开通了可视电话,协助全社会战胜“非典”病魔起着很大的作用。
     为此,我们进行了基于H.323和Trimedia TM1300处理器芯片的IP可视电话系统的设计和实现。H.323是ITU—T推出的用于IP分组网络的多媒体通信终端协议,Trimedia TM1300处理器芯片是Philips公司推出的一种基于多媒体应用的具有VLIW指令,含有128个通用寄存器,32位的高性能处理器,它能够通过编程实现通信协议,完成高质量的音频、视频处理和网络接口。我们设计的IP可视电话系统主要集中在音视频编解码的实时实现、H.323协议栈的实现、一些必要的网络通信协议如PPPoE、SMTP、POP3、IMAP、DHCP等实现,在保证压缩音视频的QoS质量的同时,尽可能地发展各种增值业务服务如邮件系统、浏览网页和组成家庭智能系统等。
     本文主要针对作者在实现IP可视电话系统中所从事的研究工作而展开的,将重点放在IP可视电话系统中TCP/IP协议栈在pSOS+上的移植、网络通信协议如PPPoE、PPP、SMTP、IMAP、POP3、DNS、DHCP等协议的实现、H.263视频编码及其码率控制算法研究、视频编码中基于程序结构和7M1300 DSP专用指令的优化方法,同时初步介绍H.323、SIP、H.264及嵌入式系统中的实时操作系统(RTOS)pSOS+。在第一章中先简介IP可视电话系统的发展历程、前景和软硬件框架结构,然后简要说明作者的研究工作。在第二章中阐述了ITU-T的H.323、IETF的SIP多媒体框架协议以及它们之间的比较。在第三章中阐述了基于IP的网络通信协议的实现,主要是OpenBSD中的TCP/IP协议栈在pSOS+上的移植和PPPoE、PPP、SMTP、IMAP、POP3、DNS、DHCP协议、嵌入式Web服务器的实现,然后介绍如何利用RTP/RTCP来检测网络有效带宽,其中的网络有效带宽就是在第六章提出的码率控制算法中确定目标帧率的依据之一。在第四章中主要阐述了H.261、H.263、H.263+标准的内容及其比较、H.264标准及其与H.263编码性能的比较。在第五章中阐述了H.263视频编码中基于程序结构和TM1300 DSP专用指令的优化方法。在第六章中详细阐述了视频编码中的关键技术之一的码率控制算法,其中提出了一种简单、有效、适合网络带宽抖动的自适应可变帧率的码率控制算法。
IP video phone provides face to face communication service and is a hopeful product. It is not only fit for the family life, but also widely applied in various business affairs, long-distance teaching, secret monitor, hospital nurse, medical treatment , science research and other fields. Now with the developing of modern science and technology and the improvement of living level, the technology and the finance status limits are overcomed and the time of developing video phone is coming. It was well known that various communication companys presented lots of hospitals video phones in time when sars arised suddenly, which played an important role in contributing to the overcoming of the disaster.
    So we design a video phone system based on H.323 and Trimedia TM1300 DSP. H.323 is the standard about multimedia communication released by ITU-T. TM1300 including a very powerful, general-purpose VLIW processor core (the DSPCPU) that coordinates all on-chip activities is a media processor for high-performance multimedia applications that deal with high-quality video and audio. The DSPCPU implements a 32-bit linear address space and 128, fully general-purpose 32-bit registers. The video phone system designed by us is mainly the real-time implement of A-V codec the implement of H.323 protocol stack and the implement of the network communication protocol for example PPPoE SMTP and POP3 etc. So we not only ensure that the QoS of the A-V codec's datum, but also develop the increment services for example E-mail system and browser etc.
    The paper will emphasis on the graft and realization of the internet protocols H.263 video codec rate control algorithms for video encoder and optimization methods of the codec based on TM1300. H.323 SIP, H.264 and pSOS+ are also simply introduced. In the first chapter, the developping history and foreground of IP video phone and the system framework of the video phone including hardware and software framework are introduced. Then the research work is also introduced in this chapter. In the next chapter, H.323 SIP the difference of H.323 and SIP are introduced. In the third chapter, the graft of TCP/IP in OpenBSD on pSOS+ and the realization of the internet protocols including PPPoE PPP SMTP POP3 IMAP DNS DHCP, embedded WEB server and how to estimate network state used by RTP/RTCP are introduced. In the fourth chapter, firstly H.261 , H.263 H.263+ standards and their differences are introduced. Secondly the improved H.264 standard and the performance comparison of the H.263 and H.264 are introduced. In the fifth chapter, some optimization methods about program structure and optimization methods based on TM1300 are introduced. In the sixth chapter, the research of rate control methods which is the most difficult in my study is introduced.
引文
[1] 浙江大学硕士学位论文 《基于IP的可视电话中H.263+编码器的实现及其优化》 杨俊智 2003年2月
    [2] 浙江大学硕士学位论文 《基于IP的可视电话设计及实现》 吴旭东 2003年2月
    [3] 浙江大学硕士学位论文 《基于媒体处理器的视频解码器设计》 刘峰 2003年2月
    [4] IP网络电话技术 糜正琨编著 中国通信学会主编 人们邮电出版社 2000北京
    [5] ITU-T Recommendation H.323 (1996), Terminal for low bit rate multimedia communication.
    [6] SIP协议的特点与应用 魏春城 电信科学 2002.9
    [7] RFC 2516-A Method for Transmitting PPP Over Ethernet (PPPoE) 1999
    [8] RFC 1661-The Point-to-Point Protocol (PPP) 1994
    [9] RFC 1172-Point-m-Point Protocol (PPP) initial configuration options 1990
    [10] RFC 1547-Requirements for an Intemet Standard Point-to-Point 1993
    [11] 《TCP/IP详解 卷1:协议》 W. Richard Stevens著 范建华等译 机械工业出版社 2001年
    [12] 由W3100A构成嵌入式网关的家庭智能系统 单片机与嵌入式系统应用 3003年第4期 51-53
    [13] RFC1889—RTP: A Transport Protocol for real-time applications (ftp://ftp.enst.fr/pub/rfc/all/1889.gz).
    [14] RFC1890---RTP Profile for Audio and Videoconferences with Minimal Control (ftp://ftp. enst. fr/pub/rfc/all/1890, gz).
    [15] 可靠的实时传输协议 梁丰 肖若贵 电视技术 2002年第5期 42-44
    [16] 《计算机系统结构》John L.Hennessy,David A.Patterson著 郑纬民等译 清华大学出版社 2002年
    [17] Philips Trimedia TM1300 Data Book 2000.3
    [18] Philips Trimedia SDE, Cook Book 2000.3
    [19] Philips Trimedia SDE, Software Tools Book 2000.3
    [20] Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVT-G050, March 2003.
    [21] T. Wiegand, G. J. Sullivan, G. Bjφntegaard, and A. Luthra: "Overview of the H.264/AVC Video Coding Standard", in IEEE Transactions on Circuits and Systems for Video Technology, this issue.
    [22] G. Cote, B. Erol, M. Gallant, and F. Kossentini. "H.263+: Video Coding at Low Bit Rates", IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, pp. 849-866, Nov. 1998.
    [23] ISO/IEC JTC1/SC29/WG11, "Test Model 5," Doc. MPEG-NO400, Apr. 1993.
    [24] ITU-T/SG 16/Q 15-G-16 rev 3, "Video Codec Test Model, Near Term, Version 11 (TMN-11)", Download via anonymous ftp to:standard.pictel.com/video-site/9804 Tam/Q 15-G-16 rev 3.doe, February, 1999.
    
    
    [25] D. Marpe, H. Schwarz, and T. Wiegand, "Context-Adaptive Binary Arithmetic Coding for H.264/AVC," in IEEE Transactions on Circuits and Systems for Video Technology, this issue.
    [26] G.J. Sullivan and R. L. Baker. "Rate-Distortion Optimized Motion Compensation for Video Compression Using Fixed or Variable Size Blocks," in Proc. GLOBECOM'91, pp. 85-90, Phoenix, AZ, USA, Dec. 1991.
    [27] T. Wiegand and B. Girod, "Lagrangian Multiplier Selection in Hybrid Video Coder Control," in Proc. ICIP 2001, Thessaloniki, Greece, Oct. 2001.
    [28] ITU-T/SG 16/VCEG (formerly Q. 15 now Q.6), "H.26L Test Model Long Term Number 8 (TML-8) draft 0," Doc. VCEG-N10, Jul. 2001.
    [29] T.Chiang andd Y.-Q. Zhang, "A new rate control scheme using quadratic rate distortion model",IEEE Trans. Circuit Syst. Video Technol., vol. 7, NO. 1, pp. 246-250, Feb. 1997.
    [30] G.J. Sullivan and T. Wiegand, "Rate-Distortion Optimization for Video Compression", IEEE Signal Processing Magazine, vol. 15,no. 6, pp. 74-90, Nov. 1998.
    [31] T. Wiegand and B. Girod, "Multi-frame Motion-Compensated Prediction for Video Transmission," Kluwer Academic Publishers, Sept. 2001.
    [32] "Low-Complexity Transform and Quanfization in H.264/AVC," in IEEE Transactions on Circuits and Systems for Video Technology, this issue.
    [33] ITU-T, Video codec test model, near-term, version 8 (tmn8), Technical Report, H.263 AdHoc Group, Portland; June 1997.
    [34] T. Wiegand, M. Lightstone, D. Mukherjee, T.G. Campbell, S.K. Mitra, Rate-distortion optimized mode for very low bit rate video coding and emerging H.263 standard, IEEE Trans. Circuits Systems Video Technol. 6 (2) (April 1996) 182-190.
    [35] Hwangjun Song and Kyoung Mu Lee, "Adaptive rate control algorithms for low-bit-rate video under the networks supporting bandwidth renegotiation," Signal Processing: Image Communication (Elsevier Science), Vol. 17, No. 10, pp. 759-779, Nov. 2002.
    [36] H. Song, C.C.J. Kuo, Rate control for low bit rate video via variable frame rates and hybrid DCT/wavelet I-frame coding,IEEE Trans. Circuits Systems Video Technol. Submitted for publication (In minor revision), 1998
    [37] Hwangjun Song, Jongwon Kim, C.-C.Jay Kuo, "Real-time encoding frame rate control for H.263+video over the Internet" Signal Processing: Image Communication 15 (1999) 127-148
    [38] 《信息论及其应用》 仇佩亮 编著 浙江大学出版社 1999年
    [39] Thomas Wiegand, Heiko Schwarz, Anthony Joch, Faouzi Kossentini, and Gary J. Sullivan: Rate-Constrained Coder Control and Comparison of Video Coding Standards, IEEE Transactions on Circuits and Systems for Video Technology, July 2003.
    [40] 基于率失真理论的视频编码码率控制 熊成玉 何芸 通信学报 1997年第5期
    [41] Heiko Schwarz and Thomas Wiegand:The Emerging JVT/H.26L Video Coding
    
    Standard,Proc. International Broadcasting Conference (IBC 2002), Amsterdam, NL, September 2002, invited paper.
    [42] Thomas Stockhammer and Thomas Wiegand: Optimized Transmission of H. 26L Video over Packet-Lossy Networks, Proc. IEEE International Conference on Image Processing (ICIP 2002), Rochester, NY, USA, September 2002.
    [43] Thomas Wiegand, Niko Frber, Klaus Stuhlmller, and Bernd Girod: Long-Term Memory Motion-Compensated Prediction for Robust Video Transmission, in Proc. of IEEE International Conference on Image Processing, Vancouver, Canada, Sep. 2000.
    [44] J. Kim, Y. Kim, Hwangjun Song, T. Kuo, Y. Chung and C.-C. Jay Kuo, TCP-friendly Internet video streaming employing variable frame rate encoding and interpolation, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 10, No. 7, pp. 1164-1177, Oct. 2000.
    [45] J. Pyun, Y. Kim, S. Ko and Hwangjun Song, "Scene adaptive frame-layer rate control for low bit rate video," accepted for publication in IEICE Trans. Fundamentals Of Electronics, Communications and Computer Sciences, Oct. 2003.
    [46] H.264标准的特点及其改进研究 楼剑 陆亮 虞露 电视技术 2003年第6期 13-15
    [47] H.264编码环中的去块效应滤波系统 陆亮 楼剑 虞露 电视技术 2003年第7期 12-14
    [48] H.264编码系统的特点及其应用前景 李宾 高平 电视技术 2003年第6期 19-21
    [49] Siwei Ma, Wen Gao, Peng Gao, and Yah Lu," Rate control for advance video coding(AVC) standard" 2003 IEEE
    [50] Thomas Wiegand, Niko Frber, Klaus Stuhlmfiller, and Bernd Girod: Error-Resilient Video Transmission Using Long-Term Memory Motion-Compensated Prediction, in IEEE Journal on Selected Areas in Communications, vol. 18, no. 6, pp. 1050-1062, June 2000