AVS音频编解码算法研究及其在DSP平台上的实现
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
AVS音频标准是我国具有自主知识产权的音视频编码标准AVS的第三部分,为高质量音频压缩领域提供了很好的方案,主要用于高分辨率数字广播、高密度激光数字存储媒体、无线宽带多媒体通讯和互联网宽带流媒体等业务。AVS音频编码标准中引入了许多先进的音频编码技术,研究AVS音频编码技术及其实时实现对AVS音频标准的推广和应用具有重大的现实意义。
     本文首先介绍了音频编码技术的发展情况和基本原理,对AVS音频编解码的框架和各个模块的算法进行了研究和分析。针对AVS音频编码算法中暂态信号处理的复杂度较高的缺陷,提出了两种新的暂态信号处理方法,并在AVS音频编解码器中进行了验证,实验证明,这两种方法的编码质量和原来编码器的质量相当,但计算复杂度却远远低于原来的编码器。
     然后,本文对AVS音频编解码器的各个模块进行了优化,使得AVS音频编解码器的计算复杂度大大降低。接下来,将AVS音频编码器移植到DM642 EVM平台上并进行优化,在该平台上实现音频的实时采集,实时编码以及编码比特流的实时发送;将AVS音频解码器移植到DSK5416平台上并进行优化,实现编码比特流的实时接收,实时解码和音频数据的实时播放。编解码器的优化结果是,在44.1kHz采样率下,实时编码需要150MHz时钟,实时解码需要25MHz时钟。
     最后,本文介绍了音频一致性测试的目的和必要性,并针对AVS音频标准,提出了AVS音频解码器需要测试的参数,以及一致性测试码流的生成方法。
AVS audio standard is the third part of AVS (Audio and Video coding Standard), which is of our own independent intellectual property, it provides a good way to the field of high quality audio compression. And it applies to the high-resolution digital broadcast, high-density laser digital storage medium, wireless wide-band multimedia communication, broad band internet streaming media application and so on. There are many advanced technology on audio coding in AVS audio standard. It has great significance on the spread and applications of AVS audio standard to do the research on the algorithms and realization in real time of AVS audio coding.
     The thesis introduces the development and the fundamental principle of audio coding technology. The standard modules of AVS audio and the algorithm of each module are studied and analyzed. It could be found that the computational complexity of AVS audio coding is very high when transient signals are processed. Thus, two new methods are presented in place of the original method. It is proved by experimentation based on AVS that the audio quality of new methods is the same as the quality of the original method. But the computational complexity of AVS audio encoder and decoder are greatly decreased in the new methods.
     After the analysis of the algorithms of AVS audio, the thesis optimizes all modules of AVS audio so as to decrease the computational complexity of encoder and decoder greatly. After that, the porting and optimization of AVS audio encoder is presented based on DM642 EVM, including audio capturing, encoding, and bit stream transmitting in real time. And the porting and optimization of AVS audio decoder is presented based on DSK5416, including bit stream receiving, decoding and playing in real time. The encoder can encode the audio samples in real time with the sampling rate of 44.1kHz at 150MHz cycles while the decoder at 25MHz cycles.
     Finally, the thesis introduces the purpose and necessity of audio conformance testing and sums up the parameters that are necessary to be tested. And the method to generating bit streams which are used to test decoder is given at last.
引文
[1]高文,多媒体数据压缩技术,电子工业出版社,1994:P1-30
    [2]房建,左涛,陈婷,数字音频压缩编码技术及其应用,信息技术,2004,28(2):P9-11
    [3]汪波,黄佩伟,钟幼平等,数字音频编码及其应用,信息技术,2006年第9期
    [4]杨俊,蔡宣平,颜飞翔,数字音频技术及其应用与发展(二),电声技术,2001年第6期
    [5]ISO/IEC 11172-3, Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s-Part3: Audio. Draft.November 1991
    [6]ISO/IEC 13818-3, Information Technology - Generic Coding of Moving Pictures and Associated Audio Information: Audio. November 1994
    [7]ISO/IEC 14496-3, Information Technology– Coding of Audio-Visual Objects. Part 3: Audio. 2001
    [8]Advanced Television System Committee [R].Digital Audio Compression Standard (AC-3).USA.20 Dec 95.
    [9]钱征海,吕宗伟,数字音频技术与杜比AC-3,电子技术应用,2005年,第9期:49-51
    [10]ISO/IEC 13818-7, Information Technology– Generic Coding of Moving Pictures and Associated Audio Information-Part 7: Advanced Audio Coding. 1997
    [11]高文,黄铁军,“信源编码标准AVS及其在数字电视中的应用”,电视技术,2003年第11期,P23-26
    [12]黄铁军,“AVS标准的背景、进展与产业应用展望”,信息技术与标准化,2003年第九期,P4-8
    [13]信息技术——先进音视频编码第三部分音频
    [14]OMAP-DM270,http://focus.ti.com/general/docs/wtbu/wtbuproductcontent.tsp?templateId=6123&navigationId=11998&contentId=4681, 2006-10-13
    [15]Texas Instrument,TMS320C64x Technical Overview,Jan., 2001
    [16]DM64x Digital Media DSPs,http://focus.ti.com/paramsearch/docs/parametricsearch.tsp?family=dsp§ionId=2&tabId=1857&familyId=749¶mCriteria=no,2006-10-13
    [17]TMS320DM644x DSPs,http://focus.ti.com/dsp/docs/dspplatformscontentnp.tsp?sectionId=2&familyId=749&tabId=1398,2006-10-13
    [18]TMS320DM643x DSPs,http://focus.ti.com/paramsearch/docs/parametricsearch.tsp?family=dsp§ionId=2&tabId=1863&familyId=1301¶mCriteria=no,2006-11-16
    [19]朱丽,郭从良,心理声学模型在数字音频中的应用,电声技术,2002年第8期
    [20]&&Ted Painter,“Perceptual Coding of Digital Audio”, Proceedings of the IEEE, 2000, 88(4), P451~504
    [21]何兵,徐盛,陈健,一种低码率音频压缩编码的窗切换方法,2001年第11期
    [22]潘兴德,朱晓明,EAC音频编解码技术,电声技术,2003年第2期
    [23]姜晔,吴镇扬,感知音频编码中预回声的产生机理与抑制方法,电声技术,2000年第11期
    [24]Peter Paint,“Perceptual Coding of Digital Audio”, PROCEEDINGS OF THE IEEE, VOL. 88, NO. 4, P451-513, APRIL 2000
    [25]Noll,P.,”MPEG digital audio coding”, Signal Processing Magazine,IEEE Volume 14,Issue 5,Sept.1997
    [26]RuiMin Hu,Yong Zhang,Haojun Ai,”Digital audio compression technology and AVS audio standard research”, Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings
    [27]Texas Instruments, TMS320DM642 Datasheet,2006-07
    [28]Texas Instrument, TMS320DM642 Digital Media Processor Product Preview2002-7
    [29]Texas Instruments, C6000 DSP McASP Reference Guide, 2006-04
    [30]Texas Instruments, C6000 DSP Cache User's Guide, 2003-05
    [31]Texas Instruments, C64x Two-Level Internal Memory Reference Guide,2006-02
    [32]Texas Instruments, TLV320AIC23B, 2004-02
    [33]Texas Instruments, TMS320VC5416 Fixed-Point Digital Signal Processor, 1999-03
    [34]Texas Instruments, PCM3002, 2000-01
    [35]丁贵广,郭宝龙,精细可伸缩视频编码中的增强层编码方法研究,计算机工程与应用,2003,21~26
    [36]Texas Instruments, C6000 CSL API Reference Guide, 2004-08
    [37]C6000 Optimizing Compiler User's Guide, 2004-05
    [38]Texas Instruments, CCS v3.0 Getting Started Guide, 2004-05
    [39]刘康,基于DM642的网络多媒体开发平台的设计与实现:[硕士学位论文],天津:天津大学,2005
    [40]付兴,基于DM642的多功能多媒体处理平台的研究与开发:[硕士学位论文],天津:天津大学,2007
    [41]W Li,“Fine granularity scalability in MPEG-4 for streaming video[C]”, InISCS 2000,Geneva,Switzerland, June 2005
    [42]侯兆荣,肖仲喆,窦维蓓,基于16bit内核DSP实现准双精度音频解码,电声技术,2001年第5期
    [43]马士超,王贞松,基于DSP的三角函数快速计算,计算机工程,2005,31(22):P12-14
    [44]陈洪光,林嘉宇,唐朝京,MPEG音频层III解码算法的仿真及定点化,国防科技大学学报,2001,23(2):P95-98
    [45]汪春梅,张崇明,李蕊等,TMS320DM642多媒体处理系统中高性能音频功能的实现,电声技术,2006-06:P53-55
    [46]张卫宁,赵子婴,定点DSP的定标及其运算方法,计算机工程,2002年3月,28卷第3期:223-225
    [47]周日贵,胡景春,叶水生,定点DSP的开发实践,南昌航空工业学院学报(自然科学版),2003年3月,第17卷第1期:59-62
    [48]数字音视频编解码标准工作组,数字音视频编解码技术标准AVS主观测试:音频,2006年7月

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700