基于LPC残差的语音定位算法设计
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Design of Speech Localization Technology Based on LPC Residue
  • 作者:李瑶天 ; 汤剑 ; 张霞 ; 刘丽娜 ; 胡剑凌
  • 英文作者:LI Yaotian;TANG Jian;ZHANG Xia;LIU Lina;HU Jianling;School of Electronic and Information Engineering,Soochow University;
  • 关键词:麦克风阵列 ; 声源定位 ; TDOA算法 ; 广义互相关算法 ; 线性预测
  • 英文关键词:microphone array;;sound source localization;;TDOA algorithm;;generalized cross-correlation algorithm;;linear prediction
  • 中文刊名:DZQJ
  • 英文刊名:Chinese Journal of Electron Devices
  • 机构:苏州大学电子信息学院;
  • 出版日期:2018-12-20
  • 出版单位:电子器件
  • 年:2018
  • 期:v.41
  • 基金:国家自然科学基金项目(61271359)
  • 语种:中文;
  • 页:DZQJ201806051
  • 页数:6
  • CN:06
  • ISSN:32-1416/TN
  • 分类号:260-265
摘要
主要采用TDOA算法,重点研究了基于麦克风阵列的声源定位技术。在对广义互相关时延估计算法分析的基础上,通过引入基于C_0复杂度的端点检测技术、线性预测和IIR平滑滤波,设计了基于线性预测残差的广义互相关算法。同时给出了平面四元十字麦克风阵列的设计及定位原理,完成了基于DM368的嵌入式语音定位系统设计。测试结果表明,所设计的系统有效改进了时延估计的准确性,提升了噪声环境下的定位性能。
        This work focuses the application of TDOA algorithm on microphone array based sound source localization.After the analysis of generalized cross-correlation phase transform( GCC-PHAT) algorithm,the GCC-PHAT algorithm based on LP residue is proposed,which adopts C0 complexity based endpoint detection technique,linear prediction( LP) and IIR smoothing filtering. A cross-array,with four microphones symmetrically located in the same plane is designed,also the source-locating equations are deduced using the estimated delay information. An embedded sound source localization system based on DM368 is designed. The test results show that the proposed system can improve the accuracy of delay estimation and enhance the performance of localization under noisy environments.
引文
[1]张霞.基于云服务的嵌入式人机语言交互系统[D].苏州:苏州大学,2017.
    [2]姜志鹏,唐加能,梁瑞宇.实时声源定位算法与实现[J].电子器件,2015,38(6):1406-1411.
    [3]肖栋,向阳,卓瑞岩,等.基于波束形成的多类型多声源定位研究[J].应用声学,2017(3):220-227.
    [4]杨祥清.声源定位算法及实现[D].中国科学技术大学,2007.
    [5] Huang Zhaoqiong,Zhan Ge,Ying Dongwen,et al. Robust Multiple Speech Source Localization Using Time Delay Histogram[C]//ICASSP,2016:3191-3195.
    [6]童娟娟,陈晓慧,束锋,等.基于最大似然网格搜索的TDOA和FDOA加权代价函数定位算法[J].电子器件,2015,38(1):209-213.
    [7] Hu Jie,Zhang Genbao. Research Transplanting Method of Embedded Linux Kernel Based on ARM Platform[C]//Information Science and Management Engineering,2010(2):35-38.
    [8] Zhou Yunfei,Zhong Liyun,Cai Hongzhi,et al. White Light Scanning Interfero-metry Based on Generalized Cross-Correlation Time Delay Estimation[J]. IEEE Photonics Journal,2017,9(5):1-11.
    [9]范影乐,武传艳,李轶,等.基于C_0复杂度的语音端点检测技术[J].传感技术学报.2006,19(3):751-753.
    [10] Abbas Keshavarz,Saeed Mosayyebpour,Mehrzad Biguesh,et al.Speech-Model Based Accurate Blind Reverberation Time Estimation Using an LPC Filter[J]. IEEE Transactions on Audio Speech and Language Processing,2012,20(6):1884-1893.
    [11] He Hongsen,Chen Jingdong,Jacob Benesty,et al. On Time Delay Estimation Based on Multichannel Spatiotemporal Sparse Linear Prediction[C]//ICASSP,2016:390-394.
    [12] Muhammad Omer,Ahmed A Quadeer,Tareq Y,et al. An L-Shaped Microphone Array Configuration for Impulsive Acoustic Source Localization in 2-D Using Orthogonal Clustering Based Time Delay Estimation[C]//ICCSPA,2013:1-6.