语音压缩编码G.723.1标准的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着卫星通信、数字移动通信和个人通信网的迅猛发展,日益增加的客户需求量与现有通信信道容量之间的矛盾日益突出。如何在现有的信道资源条件下,通过压缩信源以提高传输效率已成为当前急待解决的问题之一。相继出现的语音信号压缩标准为语音信号的高效传输提供了一种有效方法,其实质就是在相当的语音质量指标下,降低数字化语音的数码率。
     G.723算法是ITU-T建议的应用于低速率多媒体服务中语音或其它音频信号的压缩算法,例如:H.323,H.324系统。这种声码器具备两种比特率:5.3kbps,6.3kps。在帧边界处可以在两种速率之间进行切换。本算法提供对无声语音帧的检测以及在无声时进行舒适噪声填充的功能。如果优化系统,有限地提高其复杂度,将会得到更高的语音质量。G723.1算法同样适用于音乐或其它声音信号,但是处理效果不如语音。
     G723.1是一种参数编码,是目前制定的一系列采用混合编码技术的语音编码标准中编码效率最高的,它以其卓越的性能被广泛应用到各种领域,与G.711等波形编码标准相比,其复杂度提升了很多,相应地大大增加了实现的难度。G.723.1的高质量和低码率是以其高复杂度的编码算法、较高的延迟以及较大的存储空间换得的,这也大大增加了它的实时实现难度。将算法转化为应用的第一步是准确的理解算法,并编写成代码,转化为可运行的程序,这是先决条件。但是,算法的正确实现并不保证它的性能能满足用户的要求,这就要求根据实际的需求对算法进行改进。同时,在实现过程中,程序的编写还要受到硬件资源的限制,例如内存的大小,CPU的速度等。
     本论文首先对语音编码的基础技术进行了详细的描述,同时在深入研究G.723.1语音编码标准的基础上,给出了C语言的实现,并从码本搜索的角度,对ITU-T制定的标准提出了优化策略;同时,在介绍TMS320C541 DSP硬件环境的基础上,给出G.723.1实时实现的系统设计、硬件设计及软件设计的流程。
With the rapid development of satellite communications, digital mobile communications and personal communications, the conflict between user's demand and capability of communication channels is more and more outstanding. How to improve the usage efficiency of resources available is the focus to which people have put much attention. Now, the standards of speech compression coding provide a way of transporting speech signals efficiently. In fact, all of them are to reduce the baud rate of data under definite speech quality.
    G723 aerithemetic is a compressing arithemetic that proposed by ITU-T and applied in speech and other audio frequency signals of low velocity multimedia services,such as:H.323,H.324 system.This arithemetic provides inspection to silence speech frames and fills in comfortable noise when it is silence.If optimize system and increase the complexity limitedly,we can get higher quality of speech.G723.1 is also available in music or other voice signals,but the managing effect is not as good as speech's.
    Among the standards which are established using mixed coding technology, G723.1 has the highest efficiency, and is widely used in many fields. But, to gain the high quality and low baud rate, G723.1 pays out the costs, such as the high complex of coding arithmetic, high delay and large store space. Each of them brings up the difficulties of its real-time implementation. The first step of use the arithmetic is to understand it accurately and code,then change it into program can be run, this is the precondition, but we can't assure that the capability can meet the demand of customers, so we shoud improve the arithmetic according to the demand of practice. At the same time, in practice, the program also limited by the hardware such as:size of memory,speed of CPU,etc.
    Detailed the basic technology of speech coding and based on the in-depth study of G723.1, the paper implements it using C language and provides some optimizing policies with respect to code-book research. At the same time, based on the analysis of TMS320C5410 EVM environment, the paper gives system design, hardware design and the flow of software design.
引文
[1] 陈显治等,现代通信技术,电子工业出版社,2001
    [2] 谢希仁等,计算机网络,电子工业出版社,1994
    [3] 糜正琨,IP网络电话技术,人民邮电出版社,2000
    [4] 舒华英等,IP电话技术及其应用,人民邮电出版社,2001
    [5] 陈俊良,IP电话与IP电话技术基础,机械工业出版社,2001
    [6] Walter J.Goralski Matthew C.Kolon, IP TELEPHONY, McGraw-Hill Companies, Inc. 1999
    [7] ITU-T Rec. G.723.1, DUAL RATE SPEECH CODER FOR. MULTIMEDIA COMMUNICATIONS TRANSMITTING AT 5.3 AND 6.3 kbit/s, 1996
    [8] 吴资玉等,数字通信原理,中国物资出版社,1999
    [9] 程时端,综合业务数字网,人民邮电出版社,1999
    [10] 马立军,IP电话操作指南,科学出版社,2000
    [11] 章云等,DSP控制器及其应用,机械工业出版社,2001
    [12] 精英工作室,IP电话完全手册,中国电力出版社,2000
    [13] 杨行峻等,语音信号处理,电子工业出版社,1995
    [14] 陈永彬等,语音信号处理,中国科学技术出版社,1990
    [15] Texas Instruments, TMS320C3X USER'S GUIDE
    [16] Texas Instruments, TMS320C3X Float_Point Assembling Language
    [17] Texas Instruments, TMS320C3X C Compiler Reference Guide
    [18] 周志鹏等,IP电话网关中G.723.1双速率语音编码器的算法及实现研究,《电子计算机与外部设备》,2000.5 P123~124
    [19] 杨树堂等,G.723.1语音编码器算法的聚类优化及其应用,《通信学报》,2001.2 P113~117
    [20] 李学春等,IP电话技术的演讲,《通讯世界》,2000.4
    [21] 林峥等,基于TMS320C6201的G.723.1多通道语音编解码的实现,《电子技术应用》, 2000 12
    [22] UYLESS BLACK, Emerging Communications Technologies 2nd Edition,清华大学出版社,1998
    [23] J.N.Holmes. Influence of Glottal Waveform on the Naturelness of Speech From a Parallel Format Sythesizer. IEEE Trans. 1973,Vol.AE-21:298~305
    [24] Network functions for the application of multi-rote speech coding in digital cellular systems Yasuda, S.; Ummoto, K: Murase, A.; Vehicular Technology Conference, 1994 IEEE 44th, 8-10 June 1994 Page(s): 306-310 vol. 1
    
    
    [25] Phase modelling for low bit-rate speech coding Cheetham, B.M.G; Sun, X.Q.; Barrett, P.A.; Speech Coding for Algorithms for Radio Channels (Ref. No. 2000/012), IEE Seminar, 17 April 2000 Page(s): 5/1-5/5
    [26] IEE Colloquium on Low Bit-Rate Speech Coding for Future Applications'(Digest No. 1993/234)Low Bit-Rate Speech Coding for Future Applications, IEE Colloquium on, 6 Dec 1993
    [27] Alan V.McCree,Thomas P. Barnwell Ⅲ. A New Mixed Excitation LPC Vocoder.ICASSP 1991,593~596
    [28] Alan V.McCree,Thomas E Barnwell Ⅲ. A New Mixed Excitation LPC Vocoder Model for Low Bit Rate Speech Coding. IEEE Trans. Speech and Audio Processing, 1995,3(4)
    [29] Real-time implementation of 16 kb/s low-delay CELP speech coding algorithm on a TMS320C30 Hui Guanghui; Tian Wenshun; Ni Weizhen; Wang Dejun; TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering.1993 IEEE Region 10 Conference on Issue: 0,19-21 Oct. 1993 Page(s):283-286 vol.3
    [30] A high quality CELP for AUDETEL NICAM and Teletext applications Evans, B.G.; Suddle, M.R.; Kondoz, A.M.; Information Access for People with Disability, IEE Colloquium on , 5 May 1993 Page(s): 8/1-8/7
    [31] An overview of speech coding techniques Xydeas, C.; Speech Coding-Techniques and Applications, IEE Colloquium on, 14 Apr 1992 Page(s): 111-125
    [32] Standards for low bit rate speech coding applications Freeman, D.; Speech Coding, IEE Colloquium on, 9 Oct 1989 Page(s): 211-214