Speech enhancement for multimicrophone binaural hearing aids aiming to preserve the spatial auditory scene

详细信息查看全文

作者：Joachim Thiemann ; Menno Müller…
关键词：Hearing aids ; Binaural hearing aids ; Bilateral hearing aids ; Binaural MVDR
刊名：EURASIP Journal on Advances in Signal Processing
出版年：2016
出版时间：December 2016
年：2016
卷：2016
期：1
全文大小：733 KB
参考文献：1.AS Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound (MIT press, Cambridge, 1994).
2.EC Cherry, Some experiments on the recognition of speech, with one and with two ears. J. Acoust. Soc. Am. 25(5), 975–979 (1953).CrossRef
3.AW Bronkhorst, The cocktail party phenomenon: a review of research on speech intelligibility in multiple-talker conditions. Acta Acust. united Ac. 86(1), 117–128 (2000).
4.J Peissig, B Kollmeier, Directivity of binaural noise reduction in spatial multiple noise-source arrangements for normal and impaired listeners. J. Acoust. Soc. Am. 101(3), 1660–1670 (1997). doi:http://dx.doi.org/10.1121/1.418150 .CrossRef
5.ML Hawley, RY Litovsky, JF Culling, The benefit of binaural hearing in a cocktail party: effect of location and type of interferer. J. Acoust. Soc. Am. 115(2), 833–843 (2004). doi:http://dx.doi.org/10.1121/1.1639908 .CrossRef
6.R Beutelmann, T Brand, Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 120(1), 331–342 (2006). doi:http://dx.doi.org/10.1121/1.2202888 .CrossRef
7.J Blauert, Spatial Hearing: The Psychophysics of Human Sound Localization (MIT Press, Cambridge, 1996).
8.T Van den Bogaert, S Doclo, J Wouters, M Moonen, The effect of multimicrophone noise reduction systems on sound source localization by users of binaural hearing aids. J. Acoust. Soc. Am. 124(1), 484–497 (2008). doi:http://dx.doi.org/10.1121/1.2931962 .CrossRef
9.S Doclo, S Gannot, M Moonen, A Spriet, in Handbook on Array Processing and Sensor Networks, ed. by S Haykin, KJR Liu. Chapter 9: acoustic beamforming for hearing aid applications (Wiley-IEEE PressHoboken, 2010), pp. 269–302.CrossRef
10.J Wouters, S Doclo, R Koning, T Francart, Sound processing for better coding of monaural and binaural cues in auditory prostheses. Proc. IEEE. 101(9), 1986–1997 (2013). doi:http://dx.doi.org/10.1109/JPROC.2013.2257635 .CrossRef
11.S Doclo, W Kellermann, S Makino, SE Nordholm, Multichannel signal enhancement algorithms for assisted listening devices: exploiting spatial diversity using multiple microphones. IEEE Signal Process. Mag. 32(2), 18–30 (2015). doi:http://dx.doi.org/10.1109/MSP.2014.2366780 .CrossRef
12.V Hamacher, U Kornagel, T Lotter, H Puder, Binaural Signal Processing in Hearing Aids: Technologies and Algorithms (Wiley, New York, 2008). doi:http://dx.doi.org/10.1002/9780470727188.ch14 .
13.T Lotter, P Vary, Dual-channel speech enhancement by superdirective beamforming. EURASIP J. on Applied Sig. Proc. 2006:, 1–14 (2006). doi:http://dx.doi.org/10.1155/ASP/2006/63297 .CrossRef
14.T Rohdenburg, V Hohmann, B Kollmeier, in Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). Robustness analysis of binaural hearing aid beamformer algorithms by means of objective perceptual quality measures (IEEENew Paltz, 2007), pp. 315–318. doi:http://dx.doi.org/10.1109/ASPAA.2007.4393016 .
15.K Reindl, Y Zheng, W Kellermann, in Proceedings of the European Signal Processing Conference (EUSIPCO). Analysis of two generic Wiener filtering concepts for binaural speech enhancement in hearing aids (Aalborg, 2010), pp. 989–993.
16.JI Marin-Hurtado, DN Parikh, DV Anderson, Perceptually inspired noise-reduction method for binaural hearing aids. IEEE Trans. Audio, Speech, Language Process. 20(4), 1372–1382 (2012). doi:http://dx.doi.org/10.1109/TASL.2011.2179295 .CrossRef
17.B Cornelis, S Doclo, T Van dan Bogaert, M Moonen, J Wouters, Theoretical analysis of binaural multimicrophone noise reduction techniques. IEEE Trans. Audio, Speech, Language Process. 18(2), 342–355 (2010). doi:http://dx.doi.org/10.1109/TASL.2009.2028374 .CrossRef
18.D Marquardt, V Hohmann, S Doclo, in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). Coherence preservation in multi-channel Wiener filtering based noise reduction for binaural hearing aids (Vancouver, 2013), pp. 8648–8652. doi:http://dx.doi.org/10.1109/ICASSP.2013.6639354 .
19.A Kuklasinski, S Doclo, SH Jensen, J Jensen, in Proceedings of the European Signal Processing Conference (EUSIPCO). Maximum likelihood based multi-channel isotropic reverberation reduction for hearing aids (Lisbon, 2014).
20.J Bitzer, KU Simmer, in Microphone Arrays, ed. by M Brandstein, D Ward. Chapter 2: superdirective microphone arrays (SpringerBerlin, 2010), pp. 19–38.
21.BD Van Veen, KM Buckley, Beamforming: a versatile approach to spatial filtering. IEEE ASSP Mag. 5(2), 4–24 (1988). doi:http://dx.doi.org/10.1109/53.665 .CrossRef
22.S Gannot, D Burshtein, E Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Process. 49(8), 1614–1626 (2001). doi:http://dx.doi.org/10.1109/78.934132 .CrossRef
23.S Braun, EAP Habets, in Proceedings of the European Signal Processing Conference (EUSIPCO). Dereverberation in noisy environments using reference signals and a maximum likelihood estimator, (2013), pp. 1–5.
24.KU Simmer, J Bitzer, C Marro, in Microphone Arrays, ed. by M Brandstein, D Ward. Chapter 3: post-filtering techniques (SpringerBerlin, 2001), pp. 39–60.CrossRef
25.U Kjems, J Jensen, in Proceedings of the European Signal Processing Conference (EUSIPCO). Maximum likelihood based noise covariance matrix estimation for multi-microphone speech enhancement (Bucharest, 2012), pp. 295–299.
26.H Ye, RD DeGroat, Maximum likelihood DOA estimation and asymptotic Cramér-Rao bounds for additive unknown colored noise. IEEE Trans. Signal Process. 43(4), 938–949 (1995). doi:http://dx.doi.org/10.1109/78.376846 .CrossRef
27.M Cooke, A glimpsing model of speech perception in noise. J. Acoust. Soc. Am. 119(3), 1562–1573 (2006). doi:http://dx.doi.org/10.1121/1.2166600 .CrossRef MathSciNet
28.DS Brungart, PS Chang, BD Simpson, D Wang, Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation. J. Acoust. Soc. Am. 120(6), 4007–4018 (2006). doi:http://dx.doi.org/10.1121/1.2363929 .CrossRef
29.J Thiemann, M Müller, S van de Par, in Proceedings of the European Signal Processing Conference (EUSIPCO). A binaural hearing aid speech enhancement method maintaining spatial awareness for the user (Lisbon, 2014), pp. 321–325.
30.TJ Klasen, T Van den Bogaert, M Moonen, J Wouters, Binaural noise reduction for hearing aids that preserve interaural time delay cues. IEEE Trans. Signal Process. 55(4), 1579–1585 (2007). doi:http://dx.doi.org/10.1109/TSP.2006.888897 .CrossRef MathSciNet
31.D Marquardt, Development and evaluation of psychoacoustically motivated binaural noise reduction and cue preservation techniques. PhD thesis, Fakultät für Medizin und Gesundheitswissenschaften, Carl von Ossietzky Universität Oldenburg. (2015).
32.H Kayser, SD Ewert, J Anemüller, T Rohdenburg, V Hohmann, B Kollmeier, Database of multichannel in-ear and behind-the-ear head-related and binaural room impulse responses. EURASIP J. on Applied Sig. Proc (2009). doi:http://dx.doi.org/10.1155/2009/298605 .
33.K Wagener, V Kühnel, B Kollmeier, Entwicklung und Evaluation eines Satztests für die deutsche Sprache, I: Design des Oldenburger Satztests. Zeitschrift für Audiologie. 38(1), 4–15 (1999).
34.K Wagener, T Brand, B Kollmeier, Entwicklung und Evaluation eines Satztests für die deutsche Sprache, II: Optimierung des Oldenburger Satztests. Zeitschrift für Audiologie. 38(2), 44–56 (1999).
35.K Wagener, T Brand, B Kollmeier, Entwicklung und Evaluation eines Satztests für die deutsche Sprache, III: Evaluation des Oldenburger Satztests. Zeitschrift für Audiologie. 38(3), 86–95 (1999).
36.T Van den Bogaert, S Doclo, J Wouters, M Moonen, Speech enhancement with multichannel Wiener filter techniques in multimicrophone binaural hearing aids. J. Acoust. Soc. Am. 125(1), 360–371 (2009). doi:http://dx.doi.org/10.1121/1.3023069 .CrossRef
37.JE Greenberg, PM Peterson, PM Zurek, Intelligibility-weighted measures of speech-to-interference ratio and speech system performance. J. Acoust. Soc. Am. 94(5), 3009–3010 (1993). doi:http://dx.doi.org/10.1121/1.407334 .CrossRef
38.International Telecommunication Union, ITU-T Recommendation P.862, Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs, (Geneva, 2001).
39.M Dietz, SD Ewert, V Hohmann, Auditory model based direction estimation of concurrent speakers from binaural signals. Speech Comm. 53(5), 592–605 (2011). doi:http://dx.doi.org/10.1016/j.specom.2010.05.006 .CrossRef
40.D Marquardt, V Hohmann, S Doclo, Interaural coherence preservation in multi-channel wiener filtering-based noise reduction for binaural hearing aids. IEEE/ACM Trans. Audio Speech Lang. Process. 23(12), 2162–2176 (2015). doi:http://dx.doi.org/10.1109/TASLP.2015.2471096 .CrossRef
41.International Telecommunication Union, ITU-R Recommendation BS.1534-1, Method for the subjective assessment of intermediate quality level of coding systems, (Geneva, 2003).
42. International Organization for Standardization, ISO Standard 8253-3:2012 Acoustics-audiometric test methods-part 3: speech audiometry (2012).
43.U Kjems, MS Pedersen, JB Boldt, T Lunner, D Wang, in Proceedings of the European Signal Processing Conference (EUSIPCO). Speech intelligibility of ideal binary masked mixtures (Aalborg, 2010), pp. 1909–1913.
作者单位：Joachim Thiemann (1)
Menno Müller (1) (2)
Daniel Marquardt (1)
Simon Doclo (1)
Steven van de Par (1)

1. University of Oldenburg, Cluster of Excellence “Hearing4All”, Ammerländer Heerstr. 114-118, Oldenburg, 26129, Germany
2. Jade Hochschule, Ofener Str. 16/19, Oldenburg, 26121, Germany
刊物主题：Signal, Image and Speech Processing;
出版者：Springer International Publishing
ISSN：1687-6180

文摘

Modern binaural hearing aids utilize multimicrophone speech enhancement algorithms to enhance signals in terms of signal-to-noise ratio, but they may distort the interaural cues that allow the user to localize sources, in particular, suppressed interfering sources or background noise. In this paper, we present a novel algorithm that enhances the target signal while aiming to maintain the correct spatial rendering of both the target signal as well as the background noise. We use a bimodal approach, where a signal-to-noise ratio (SNR) estimator controls a binary decision mask, switching between the output signals of a binaural minimum variance distortionless response (MVDR) beamformer and scaled reference microphone signals. We show that the proposed selective binaural beamformer (SBB) can enhance the target signal while maintaining the overall spatial rendering of the acoustic scene. Keywords Hearing aids Binaural hearing aids Bilateral hearing aids Binaural MVDR

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700