Speech Enhancement Method for Improving Phase Spectrum Compensation
  • English title: Speech Enhancement Method for Improving Phase Spectrum Compensation
  • Authors: JI Huifang; JIA Hairong; WANG Yan
  • Affiliation: College of Information and Computer, Taiyuan University of Technology
  • Keywords: phase spectrum compensation; power spectrum estimation; a priori signal-to-noise ratio; speech enhancement
  • Chinese journal title: JSGG
  • English journal title: Computer Engineering and Applications
  • Publication date: 2018-11-16 15:58
  • Publisher: Computer Engineering and Applications
  • Year: 2019
  • Issue: v.55; No.927
  • Funding: National Natural Science Foundation of China (No.61371193); Natural Science Foundation of Shanxi Province (No.201701D121058)
  • Language: Chinese
  • Page: JSGG201908008
  • Number of pages: 5
  • CN: 08
  • Classification number: 54-58
Abstract
Conventional single-channel speech enhancement methods reconstruct the time-domain signal using the noisy speech phase in place of the clean speech phase, which limits the improvement in subjective perceptual quality. To address this, a speech enhancement algorithm with improved phase spectrum compensation is proposed. First, a Sigmoid-type phase spectrum compensation function based on the input signal-to-noise ratio (SNR) of each speech frame is introduced, so that the phase spectrum of the noisy speech is compensated adaptively as the noise changes. Next, the noise power spectrum is estimated by combining an improved decision-directed (DD) a priori SNR estimate with the speech presence probability (SPP) algorithm. Finally, the new SPP-based noise power spectrum estimate and the phase spectrum compensation are combined within a Wiener filter to improve the enhancement. Compared with the traditional phase spectrum compensation (PSC) algorithm, the improved algorithm effectively suppresses various types of noise in the audio signal while improving the perceived quality and intelligibility of the speech.
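The abstract does not give the formulas, so the following is only a minimal sketch of the building blocks it names, under stated assumptions. `psc_frame` follows the conventional PSC idea of adding an antisymmetric, noise-scaled offset to the noisy spectrum and keeping only the resulting phase. `sigmoid_lambda` is a hypothetical Sigmoid mapping from per-frame input SNR to the compensation factor; its parameters (`lam_max`, `slope`, `midpoint_db`) are illustrative and are not the paper's values. `dd_wiener_gain` uses the standard decision-directed estimate, not the paper's improved variant, and the SPP noise estimator is omitted (the noise spectrum is assumed to be given).

```python
import numpy as np

def sigmoid_lambda(frame_snr_db, lam_max=4.0, slope=0.5, midpoint_db=5.0):
    """Hypothetical Sigmoid mapping: low frame SNR gives stronger compensation."""
    return lam_max / (1.0 + np.exp(slope * (frame_snr_db - midpoint_db)))

def psc_frame(noisy_spec, noise_mag, lam):
    """Conventional PSC on one two-sided DFT frame (noise_mag assumed given)."""
    n = noisy_spec.shape[0]
    psi = np.zeros(n)
    psi[1:(n + 1) // 2] = 1.0    # positive-frequency bins
    psi[n // 2 + 1:] = -1.0      # negative-frequency bins (DC and Nyquist stay 0)
    compensated = noisy_spec + lam * psi * noise_mag
    # Keep the noisy magnitude, replace the phase with the compensated phase.
    return np.abs(noisy_spec) * np.exp(1j * np.angle(compensated))

def dd_wiener_gain(noisy_power, noise_power, prev_clean_power, alpha=0.98):
    """Standard decision-directed a priori SNR estimate and Wiener gain."""
    gamma = noisy_power / noise_power                      # a posteriori SNR
    xi = (alpha * prev_clean_power / noise_power
          + (1.0 - alpha) * np.maximum(gamma - 1.0, 0.0))  # a priori SNR
    return xi / (1.0 + xi)

# Usage on one synthetic frame (random data, for illustration only).
rng = np.random.default_rng(0)
frame = rng.standard_normal(256)
spec = np.fft.fft(frame)
noise_mag = np.full(256, 0.5)            # assumed noise magnitude estimate
lam = sigmoid_lambda(frame_snr_db=0.0)   # assumed per-frame input SNR in dB
gain = dd_wiener_gain(np.abs(spec) ** 2, noise_mag ** 2, np.abs(spec) ** 2)
enhanced = np.real(np.fft.ifft(gain * psc_frame(spec, noise_mag, lam)))
```

Taking the real part after the inverse transform is what makes the antisymmetric offset useful: components whose phase has been pushed out of conjugate symmetry partially cancel, which is the noise-suppression mechanism PSC relies on.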
