Application of non-negative spectrogram decomposition with sparsity constraints to single-channel speech enhancement

详细信息查看全文

作者：Kyogu Lee
关键词：Single-channel speech enhancement ; Non-negative spectrogram decomposition ; Sparsity constraint ; Unsupervised source separation
刊名：Speech Communication
出版年：March, 2014
年：2014
卷：58
期：Complete
页码：69-80
全文大小：2146 K

文摘

We propose an algorithm for single-channel speech enhancement that requires no pre-trained models - neither speech nor noise models - using non-negative spectrogram decomposition with sparsity constraints. To this end, before staring the EM algorithm for spectrogram decomposition, we divide the spectral basis vectors into two disjoint groups - speech and noise groups - and impose sparsity constraints only on those in the speech group as we update the parameters. After the EM algorithm converges, the proposed algorithm successfully separates speech from noise, and no post-processing is required for speech reconstruction. Experiments with various types of real-world noises show that the proposed algorithm achieves performance significantly better than other classical algorithms or comparable to the spectrogram decomposition method using pre-trained noise models.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700