Author affiliations: Lyudmila V. Savchenko (21), Andrey V. Savchenko (22)
21. Nizhniy Novgorod State Linguistic University, Russia
22. National Research University Higher School of Economics, Nizhniy Novgorod, Russia
ISSN:1611-3349
Abstract
A phoneme is defined as a fuzzy set of minimal speech units from the model database. Based on this definition and the Kullback-Leibler minimum information discrimination principle, a novel phoneme recognition algorithm is developed as an enhancement of the phonetic decoding method. Experimental results are presented for isolated vowel recognition and word recognition in Russian. The proposed method is shown to improve recognition accuracy and reliability compared with the phonetic decoding method.
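The general idea can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the membership function or the feature representation, so the sketch assumes that each model unit is a short non-negative spectrum treated as a discrete distribution, and it maps Kullback-Leibler divergences to membership grades with a hypothetical exp(-divergence) rule. The names `model_db`, `fuzzy_memberships`, and `recognize`, and all numeric values, are illustrative only.

```python
import math


def normalize(values):
    """Scale a non-negative vector so that it sums to 1."""
    total = sum(values)
    return [v / total for v in values]


def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence KL(p || q) of two discrete distributions."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def fuzzy_memberships(query, model_db):
    """Return a membership grade in [0, 1] for every model unit.

    ASSUMPTION: the exp(-divergence) mapping is a stand-in chosen for
    illustration; it is not taken from the paper.
    """
    q = normalize(query)
    weights = {
        label: math.exp(-kl_divergence(q, normalize(reference)))
        for label, reference in model_db.items()
    }
    total = sum(weights.values())
    return {label: w / total for label, w in weights.items()}


def recognize(query, model_db):
    """Pick the unit with the highest grade (equivalently, the minimum divergence)."""
    grades = fuzzy_memberships(query, model_db)
    best = max(grades, key=grades.get)
    return best, grades


# Hypothetical model database: one toy spectrum per vowel.
model_db = {
    "a": [8.0, 3.0, 1.0, 0.5],
    "o": [2.0, 7.0, 3.0, 0.5],
    "u": [1.0, 2.0, 6.0, 3.0],
}

label, grades = recognize([7.0, 3.5, 1.2, 0.6], model_db)
print(label, grades)
```

Keeping the full set of grades, rather than only the winning label, is what distinguishes a fuzzy decision from a plain nearest-neighbour rule: a later word-level stage could weigh competing units by their grades instead of committing to one unit early.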