Vowel- and Text-Based Cepstral Analysis of Chronic Hoarseness

Summary

Objectives/Hypothesis

Automatic voice evaluation is usually performed on stable sections of sustained vowels, which often cannot capture hoarseness properly. Unlike established perturbation-based measures, cepstral peak prominence (CPP) and smoothed CPP (CPPS) do not require exact determination of the fundamental-frequency cycles, and they can also be applied to text recordings. In this study, both measures were compared with perceptual evaluation of voice quality and with ratings on the German roughness-breathiness-hoarseness (RBH) scheme.
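To illustrate why a cepstral measure needs no cycle-by-cycle marking, here is a minimal sketch of a single-frame CPP computation. It is not the algorithm of Praat or of the "cpps" software used in the study; the function name `cepstral_peak_prominence`, the 60 to 330 Hz search range, the 1 ms lower bound for the trend line, the Hann window, and the synthetic test signals are all assumptions made for illustration. CPPS would additionally smooth the cepstra over time and quefrency, which is omitted here.

```python
import numpy as np

def cepstral_peak_prominence(frame, fs, f0_min=60.0, f0_max=330.0):
    """Return (cpp_db, f0_estimate_hz) for one frame.

    Hypothetical helper; the search range and trend-line range are assumed defaults.
    """
    n = len(frame)
    windowed = frame * np.hanning(n)
    # Log-magnitude spectrum in dB; the small constant avoids log(0).
    log_spectrum = 20.0 * np.log10(np.abs(np.fft.rfft(windowed)) + 1e-12)
    # Cepstrum: inverse transform of the log spectrum, magnitude expressed in dB.
    cepstrum_db = 20.0 * np.log10(np.abs(np.fft.irfft(log_spectrum, n)) + 1e-12)
    cepstrum_db = cepstrum_db[: n // 2]          # keep the non-mirrored half
    quefrency = np.arange(n // 2) / fs           # quefrency axis in seconds
    # Look for the cepstral peak where a plausible pitch period would lie.
    search = (quefrency >= 1.0 / f0_max) & (quefrency <= 1.0 / f0_min)
    peak = np.flatnonzero(search)[np.argmax(cepstrum_db[search])]
    # Straight-line trend through the cepstrum above 1 ms (assumed range).
    fit = quefrency >= 0.001
    slope, intercept = np.polyfit(quefrency[fit], cepstrum_db[fit], 1)
    cpp = cepstrum_db[peak] - (slope * quefrency[peak] + intercept)
    return float(cpp), float(1.0 / quefrency[peak])

# Synthetic demo signals (not patient data): a harmonic tone at 120 Hz and white noise.
fs = 16000
t = np.arange(2048) / fs
rng = np.random.default_rng(0)
voiced = sum(np.sin(2 * np.pi * 120.0 * k * t) / k for k in range(1, 31))
voiced = voiced + 0.01 * rng.standard_normal(t.size)
noise = rng.standard_normal(t.size)

cpp_voiced, f0_voiced = cepstral_peak_prominence(voiced, fs)
cpp_noise, _ = cepstral_peak_prominence(noise, fs)
print(f"voiced: CPP = {cpp_voiced:.1f} dB, peak at about {f0_voiced:.1f} Hz")
print(f"noise:  CPP = {cpp_noise:.1f} dB")
```

The peak is located directly in the cepstrum of each frame, so the sketch never has to mark individual glottal cycles; a strongly periodic signal yields a prominent peak, while an aperiodic one does not.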

Study Design

Retrospective data analysis.

Methods

Seventy-three hoarse patients (48.3 ± 16.8 years) uttered the vowel /e/ and read the German version of the text "The North Wind and the Sun". The text recordings were evaluated perceptually by five speech therapists and physicians according to the RBH scale. The criterion "overall quality" was measured on a 4-point scale and a visual analog scale. For the human-machine correlation, the automatic measures of the Praat program (vowels only) and the "cpps" software were compared with the experts' ratings. The experiments were repeated for speakers with jitter ≤5 % or shimmer ≤5 % (n = 47).
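The human-machine comparison described above can be sketched as a rank correlation between an automatic measure and the raters' scores. This is only an illustration under stated assumptions: it takes ρ to be Spearman's rank correlation and averages the five raters per speaker, neither of which the abstract specifies. All function names, field names, and numbers below are hypothetical and are not data from the study.

```python
from math import sqrt
from statistics import mean

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation of two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def select_low_perturbation(speakers, limit=5.0):
    """Keep speakers with jitter <= limit or shimmer <= limit (both in percent)."""
    return [s for s in speakers if s["jitter"] <= limit or s["shimmer"] <= limit]

# Made-up records: one CPPS value, jitter/shimmer in percent, five ratings on a 0-3 scale.
speakers = [
    {"cpps": 14.2, "jitter": 1.1, "shimmer": 3.0, "ratings": [0, 0, 1, 0, 0]},
    {"cpps": 11.8, "jitter": 2.4, "shimmer": 4.1, "ratings": [1, 1, 0, 1, 1]},
    {"cpps": 9.5, "jitter": 3.9, "shimmer": 6.2, "ratings": [2, 2, 2, 1, 2]},
    {"cpps": 7.9, "jitter": 4.8, "shimmer": 7.5, "ratings": [1, 2, 1, 1, 2]},
    {"cpps": 6.1, "jitter": 6.3, "shimmer": 4.9, "ratings": [3, 2, 3, 3, 2]},
    {"cpps": 4.3, "jitter": 8.7, "shimmer": 11.0, "ratings": [3, 3, 3, 3, 3]},
]

def rho_for(group):
    """Correlate CPPS with the mean rating across raters for one group."""
    cpps = [s["cpps"] for s in group]
    mean_ratings = [mean(s["ratings"]) for s in group]
    return spearman_rho(cpps, mean_ratings)

rho_all = rho_for(speakers)
selected = select_low_perturbation(speakers)
rho_selected = rho_for(selected)
print(f"all speakers (n = {len(speakers)}): rho = {rho_all:.2f}")
print(f"selected speakers (n = {len(selected)}): rho = {rho_selected:.2f}")
```

A negative ρ is expected in this setup because a higher CPPS goes with a lower (better) perceptual score, which is why the abstract reports absolute values |ρ|.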

Results

For the entire group (n = 73), the best human-machine results for most of the rating criteria were obtained for text-based CPP and CPPS (up to |ρ| = 0.73). For the 47 selected speakers, the correlation was markedly lower for all measures but still highest for text-based CPP and CPPS (|ρ| ≤ 0.50).

Conclusions

Cepstrum analysis should be performed on a text recording. In that setting, it outperforms all perturbation-based measures and can provide meaningful objective support for perceptual analysis.
