音频编码频带扩展技术的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
高质量低码率的压缩技术一直语音和音频编码孜孜以求的目标。语音编码技术可以达到非常低的编码速率,但只能处理窄带语音信号。而音频编码技术虽然可以达到很好的音质效果,但是很难进一步提高压缩比。因此,语音和音频编码技术都在寻找新的技术来实现高质量低码率。
     频带扩展技术就是一种很好的解决方案。本文回顾了频带扩展技术的主要实现算法,包括非盲式和盲式频带扩展,并把最著名的一种频带扩展实现技术——频带复制技术作为一个独立研究单元,进行深入研究。在本文的结构安排上,首先对实现频带扩展的一些关键技术进行了介绍,并且从EAAC+编解码器中分离出独立的频带复制算法,作为进一步研究的平台。然后,通过对频带复制算法进行测试和评估,发现了频带复制算法的一些不足之处。接着,以频带复制算法为基础,针对其算法存在的缺点,设计出相应的改善策略和解决方法,进而形成一套新颖且完整的频带扩展改进方案,并完成相关测试评估。
     通过评估,这套新颖的改进方案体现出自身的优点,并在频带扩展的某些方面有明显的改善。虽然改进方案尚不能解决所有的问题,但其改进思想的潜在价值却不容忽视,对其进行进一步发掘和研究相信会很有意义。
High quality and low bit-rate has been the targets of both speech coding and audio coding. Speech coding offers very low bit-rate, but it focuses on narrow band speech signals. On the other hand, audio coding provides excellent quality, but it is hard to improve compression ratio. Therefore, both of them are searching for novel technology to help them realize the targets.
     Spectral band extension (SBE) technology is a good solution for the targets mentioned above. This paper reviews some algorithms of spectral band extension technology, including both non-blinding and blinding extensions, and spectral band replication (SBR). One of the most popular methods of spectral band extension is thoroughly studded. Firstly in the thesis, some key technologies related to SBE are described, a set of algorithms about SBR in EAAC+ software are studded. Secondly, based on tests and evaluations of the algorithms, some of the deficiencies of the SBR processing are addressed. Then some new strategies, methods and algorithms of the SBE technology, which will overcome the drawbacks, are designed and tested.
     The performances of the innovation schemes have shown some improvements and advantages for spectral band extensions. Even they are not always good in any cases, but still have some potential feathers, so further study and investigations about these issues are expected.
引文
[1] Painter T, Spanias A,“Perceptual coding of digital audio”, Proc IEEE, 2000, 88(4):451515.
    [2] Ted Painter,Andreas Spanias,“A review of algorithms for perceptual coding of digital audio signals”, Proceedings of International Conference on Digital Signal Processing(DSP),1997.179-205.
    [3] Lam Y.H,Stewart R.W,“Perception-based residual analysis-synthesis system”,Proc ICASSP 1999,USA:IEEE,1999.989-992.
    [4] Purnhagen H,“Advances in parametric audio coding”,1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,Published:1999, Page(s):31-34.
    [5] Nomura,T. Iwadare,M.Serizawa,“A bitrate and bandwidth scalable CELP coder”, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, 1998, Published:1998,Volume:1,Page(s):341-344.
    [6] ISO/IEC JTC1/SC29/WG11,“Coding of moving pictures and audio - MPEG-2 Advanced Audio Coding AAC”, ISO/IEC 13818-7 International Standard,1997.
    [7] 陈景铭, “利用高频重建之 MPEG 音讯编解码器设计及其系统实现”, 台湾国立云林科技大学,电子与咨讯工程系硕士班,硕士论文,2004
    [8] 潘政彦,陈文俭,“节省参数并提升音质之改良式高频重建方法”,大叶大学咨讯工程学系
    [9] AMR Wideband Speech Codec;General Description, 3GPP TS 26.171
    [10] Jari Makinen,Bruno Bessette,“AMR-WB+:A NEW AUDIO CODING STANDARD FOR 3RD GENERATION MOBILE AUDIO SERVICES”,ICASSP 2005,IEEE 0-7803-8874-7/05.
    [11] J.Seo,D.Jang,J.Hong,and K.Kang, “A Simple Method for Reproducing High Frequency Components at Low-Bit Rate Audio Coding”, the113th AES Convention 2002 October 5-8 Los Angeles,California,USA
    [12] 陈小平,胡泽,“听觉临界频带及其在声频信号处理中的应用”,北京广播学院学报(自然科学版),第 11 卷,June 1 2004
    [13] “PlusV Specification”, VLSI Solution PlusV,December 2001
    [14] AMR Wideband Speech Codec;Transcoding functions, 3GPP TS 26.290
    [15] 杨行峻,迟惠生,“语音信号数字处理”, 电子工业出版社,1995.
    [16] Martin Dietz,Lars Liljeryd,Kristofer Kj?rling,Oliver Kunz, “Spectral Band Replication, a novel approach in audio coding”, the 112th Convention 2002 May 10–13 Munich,Germany
    [17] 刘嘉,刘佩林,孙军,“频谱恢复技术综述”, 电声技术 2005-04,1002-8684.
    [18] Enhanced aacPlus general audio codec;General description,3GPP TS 26.401
    [19] Chi-Min Liu, Wen-Chieh Lee, and Han-Wen Hsu, “HIGH FREQUENCY RECONSTRUCTION FOR BAND-LIMITED AUDIO SIGNALS” , the 6 th Int. Conference on Digital Audio Effects (DAFX-03),London,UK,September 8-11,2003 DAFX-1
    [20] Erik Larsen,Ronald M. Aarts,Michael Danessis, “Efficient high-frequency bandwidth extension of music and speech”, the 112th Convention 2002 May 10–13 Munich,Germany
    [21] Enhanced aacPlus general audio codec;Encoder Specification SBR part, 3GPP TS 26.404 V6.0.0.
    [22] ISO/IEC 14496-3:2001/FPDAM1,Bandwidth Extesnion,with the simple editorial changes,listed in NB comments,incorporated
    [23] Coding Technologies Sweden AB, “Enhancing perceptual performance of sbr and related hfr coding methods by adaptive noise-floor addition and noise substitution limiting”,申请号:US 6708145,申请日期:2000.12.20
    [24] “Quadrature Mirror Filters”, DIGITAL SIGNAL PROCESSING,EECE UBC
    [25] Coding Technologies AB, “Source coding enhancement using spectral-band replication”, 申请号:US 6925116,申请日期:2003.10.8
    [26] 编码技术瑞典股份公司,“子带域中改进的频谱平移/折叠”,申请号:CN 1430777,申请日期:2001.05.23
    [27] 陆泱, “一种改进的频带复制方法”, 半导体技术,2004 年 29 卷 6 期,84-88 页.
    [28] ITU-R Recommendation BS.1387.“Method for objective measurements of perceived audio quality (PEAQ)” 1998-2001.
    [29] ITU-R Recommendation BS.1534-1 “Method for the subjective assessment of intermediate quality”,2001-2003.
    [30] ISO/IEC JTC 1/SC 29/WG 11N6009,“Report on the Verification Tests of MPEG-4 High Efficiency AAC”,October 2003, Brisbane, Australia

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700