Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals
详细信息    查看全文
  • 作者:Zulfiqar Ali ; Irraivan Elamvazuthi ; Mansour Alsulaiman…
  • 关键词:Voice pathology detection ; Wavelet transformation ; Fractal dimension ; Katz algorithm ; Higuchi algorithm ; MDVP parameters
  • 刊名:Journal of Medical Systems
  • 出版年:2016
  • 出版时间:January 2016
  • 年:2016
  • 卷:40
  • 期:1
  • 全文大小:1,828 KB
  • 参考文献:1.Mohan, B., Diseases of ear, nose and throat: Head and neck surgery, 1st edition. Jaypee Brothers Medical Publishers, New Delhi, India, 2013.
    2.Hecker, M. H. L., and Kreul, E. J., Descriptions of the speech of patients with cancer of the vocal folds. part I: Measures of fundamental frequency. J. Acoust. Soc. Am. 49:1275鈥?282, 1971.PubMed CrossRef
    3.Muhammad, G., Mesallam, T. A., Malki, K. H., Farahat, M., Mahmood, A., and Alsulaiman, M., Multidirectional regression (MDR)-based features for automatic voice disorder detection. J. Voice 26:817 e19-27, 2012.PubMed CrossRef
    4.Baken, R. J., and Orlikoff, R., Clinical measurement of speech and voice, 2nd edition. Singular, San Diego, CA, 2000.
    5.Lee, J. W., Kang, H. G., Choi, J. Y., and Son, Y. I., An investigation of vocal tract characteristics for acoustic discrimination of pathological voices. BioMed Res Int 2013:1鈥?1, 2013.
    6.Fontes, A. I. R., Souza, P. T. V., Neto, A. D. D., Martins, A. d. M., et al., Classification system of pathological voices using correntropy. Math. Probl. Eng. 2014:7, 2014.CrossRef
    7.Jung-Won, L., Kim, S., and Hong-Goo, K., Detecting pathological speech using contour modeling of harmonic-to-noise ratio, Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5969鈥?973, 2014.
    8.Panek, D., Skalski, A., and Gajda, J., Quantification of linear and non-linear acoustic analysis applied to voice pathology detection, information technologies in biomedicine. Adv Intell Syst Comput 284:355鈥?64, 2014.CrossRef
    9.Muhammad, G., and Melhem, M., Pathological voice detection and binary classification using MPEG-7 audio features. Biomed. Signal Proc. Control 11:1鈥?, 2014.CrossRef
    10.Muhammad, G., Ali, Z., Alsulaiman, M., and Al-Mutib K., Vocal fold disorder detection by applying LBP operator on dysphonic speech signal. Proc. Recent Adv. Intell. Control Model. Simul. pp. 222鈥?28, 2014.
    11.Lopes, R., and Betrouni, N., Fractal and multifractal analysis: A review. Med. Image Anal. 13:634鈥?49, 2009.PubMed CrossRef
    12.Katz, M. J., Fractals and the analysis of waveforms. Comput. Biol. Med. 18:145鈥?56, 1988.PubMed CrossRef
    13.Higuchi, T., Approach to an irregular time series on the basis of the fractal theory. Phys. D. Nonlinear Phenom. 31:277鈥?83, 1988.CrossRef
    14.Petrosian, A., Kolmogorov complexity of finite sequences and recognition of different preictal EEG patterns, Proc. of the Eighth IEEE Symposium on Computer-Based Medical Systems, pp. 212鈥?17, 1995.
    15.Maragos, P., Fractal aspects of speech signals: Dimension and interpolation, Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 417鈥?20, 1991.
    16.Senevirathne, T. R., Bohez, E. L. J., and Van Winden, J. A., Amplitude scale method: New and efficient approach to measure fractal dimension of speech waveforms. Electron. Lett. 28:420鈥?22, 1992.CrossRef
    17.Kim, Y. W., Krieble, K. K., Kim, C. B., Reed, J., and Rae-Grant, A. D., Differentiation of alpha coma from awake alpha by nonlinear dynamics of electroencephalography. Electroencephalogr. Clin. Neurophysiol. 98:35鈥?1, 1996.PubMed CrossRef
    18.Mishra, A. K., and Raghav, S., Local fractal dimension based ECG arrhythmia classification. Biomed. Signal Proc. Control 5:114鈥?23, 2010.CrossRef
    19.Esteller, R., Vachtsevanos, G., Echauz, J., and Litt, B., A comparison of waveform fractal dimension algorithms, circuits and systems I: Fundamental theory and applications. IEEE Trans. Circ. Syst. 48:177鈥?83, 2001.CrossRef
    20.Raghavendra, B. S., and Narayana Dutt, D., A note on fractal dimensions of biomedical waveforms. Comput. Biol. Med. 39:1006鈥?012, 2009.PubMed CrossRef
    21.Baljekar, P. N., and Patil, H. A., A comparison of waveform fractal dimension techniques for voice pathology classification, Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4461鈥?464, 2012.
    22.Accardo, A., Affinito, M., Carrozzi, M., and Bouquet, F., Use of the fractal dimension for the analysis of electroencephalographic time series. Biol. Cybern. 77:339鈥?50, 1997.PubMed CrossRef
    23.Accardo, A., Fabbro, F., and Mumolo, E., Analysis of normal and pathological voices via short-time fractal dimension, Proc. of 14th Annual International Conference of the IEEE on Engineering in Medicine and Biology Society, pp. 1270鈥?271, 1992.
    24.Massachusetts Eye & Ear Infirmary Voice & Speech LAB, Disordered voice database model 4337 (Ver. 1.03), ed. Boston, MA: Kay Elemetrics Corp, 1994.
    25.Little, M., McSharry, P., Roberts, S., Costello, D., and Moroz, I., Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomed. Eng. OnLine 6:23, 2007.PubMed PubMedCentral CrossRef
    26.Arjmandi, M. K., Pooyan, M., Mikaili, M., Vali, M., and Moqarehzadeh, A., Identification of voice disorders using long-time features and support vector machine with different feature reduction methods. J. Voice 25:e275鈥?9, 2011.PubMed CrossRef
    27.Cortes, C., and Vapnik, V., Support-vector networks. Mach. Learn. 20:273鈥?97, 1995.
    28.Vaziri, G., and Almasganj, F., Pathological Assessment of vocal fold nodules and polyp via fractal dimension of patients鈥?voices, Proc. of the 2nd International Conference on Bioinformatics and Biomedical Engineering, pp. 2044鈥?047, 2008.
    29.Farouk, M. H., Application of wavelets in speech processing: Springer, 2014.
    30.Godino-Llorente, J. I., G贸mez-Vilda, P., and Blanco-Velasco, M., Dimensionality reduction of a pathological voice quality assessment system based on gaussian mixture models and short-term cepstral parameters. IEEE Trans. Biomed. Eng. 53:1943鈥?953, 2006.PubMed CrossRef
    31.Markaki, M., and Stylianou, Y., Voice pathology detection and discrimination based on modulation spectral features. IEEE Trans. Audio Speech Lang. Process. 19:1938鈥?948, 2011.CrossRef
    32.Chang, C.-C., and Lin, C.-J., LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2:1鈥?7, 2011.CrossRef
  • 作者单位:Zulfiqar Ali (1) (2)
    Irraivan Elamvazuthi (2)
    Mansour Alsulaiman (1)
    Ghulam Muhammad (1)

    1. Digital Speech Processing Group, Department of Computer Engineering, King Saud University, Riyadh, 11543, Saudi Arabia
    2. Centre for Intelligent Signal and Imaging Research, Department of Electrical and Electronic Engineering, Universiti Teknologi PETRONAS, Tronoh, 31750, Perak, Malaysia
  • 刊物类别:Mathematics and Statistics
  • 刊物主题:Statistics
    Statistics for Life Sciences, Medicine and Health Sciences
    Health Informatics and Administration
  • 出版者:Springer Netherlands
  • ISSN:1573-689X
文摘
Voice disorders are associated with irregular vibrations of vocal folds. Based on the source filter theory of speech production, these irregular vibrations can be detected in a non-invasive way by analyzing the speech signal. In this paper we present a multiband approach for the detection of voice disorders given that the voice source generally interacts with the vocal tract in a non-linear way. In normal phonation, and assuming sustained phonation of a vowel, the lower frequencies of speech are heavily source dependent due to the low frequency glottal formant, while the higher frequencies are less dependent on the source signal. During abnormal phonation, this is still a valid, but turbulent noise of source, because of the irregular vibration, affects also higher frequencies. Motivated by such a model, we suggest a multiband approach based on a three-level discrete wavelet transformation (DWT) and in each band the fractal dimension (FD) of the estimated power spectrum is estimated. The experiments suggest that frequency band 1鈥?562 Hz, lower frequencies after level 3, exhibits a significant difference in the spectrum of a normal and pathological subject. With this band, a detection rate of 91.28 % is obtained with one feature, and the obtained result is higher than all other frequency bands. Moreover, an accuracy of 92.45 % and an area under receiver operating characteristic curve (AUC) of 95.06 % is acquired when the FD of all levels is fused. Likewise, when the FD of all levels is combined with 22 Multi-Dimensional Voice Program (MDVP) parameters, an improvement of 2.26 % in accuracy and 1.45 % in AUC is observed. Keywords Voice pathology detection Wavelet transformation Fractal dimension Katz algorithm Higuchi algorithm MDVP parameters
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.