Analysis and extraction of LP-residual for its application in speaker verification system under uncontrolled noisy environment
Abstract
Sub-segmental analysis of the excitation source may yield significant speaker-specific information for speaker verification. In this paper, excitation source features are explored for the design of a speaker verification system (SVS). The system extracts speaker-specific information from LP-residual features by modelling the speakers with different supervised and unsupervised models, on the basis of which speakers are accepted or rejected. The direct LP-residual (DLR) and the DCT coefficients of the LP-residual (DCTLR) are used as approximations of the excitation source features. The models are evaluated at two levels of analysis, namely sentence level analysis and the voice-segment level approach (VSLA), with varying input frame sizes, and the effect of frame size on the input vectors is observed. The studies are carried out on a telephone speech database collected in a practical environment. A comparative analysis is presented for the combinations of models, features and the two levels of analysis on the given data. The experimental study suggests that applying VSLA to unsupervised models with DCTLR as input gives a performance 14.21% better than sentence level analysis of the same models.
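As an illustration of the two features named above, the following is a minimal sketch of how a direct LP-residual (DLR) vector and its DCT coefficients (DCTLR) could be computed per frame. It is not the authors' implementation: the abstract does not state the LP order, frame sizes, windowing or DCT normalisation, so the autocorrelation method with Levinson-Durbin recursion, non-overlapping unwindowed frames, an unnormalised DCT-II, and the function names `lp_coefficients`, `lp_residual`, `dct2` and `excitation_features` are all assumptions made for this example.

```python
import math


def lp_coefficients(frame, order):
    """Estimate predictor coefficients a[1..order] so that
    s[n] is approximated by sum_k a[k] * s[n-k].
    Assumption: autocorrelation method with Levinson-Durbin recursion."""
    n = len(frame)
    # Autocorrelation lags r[0..order] of the frame.
    r = [sum(frame[i] * frame[i + k] for i in range(n - k))
         for k in range(order + 1)]
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        # Reflection coefficient for model order i.
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]


def lp_residual(frame, order):
    """LP-residual e[n] = s[n] - sum_k a[k] * s[n-k], with samples
    before the start of the frame treated as zero."""
    a = lp_coefficients(frame, order)
    residual = []
    for n, s in enumerate(frame):
        pred = sum(a[k - 1] * frame[n - k]
                   for k in range(1, order + 1) if n - k >= 0)
        residual.append(s - pred)
    return residual


def dct2(x):
    """Unnormalised DCT-II of a sequence."""
    n = len(x)
    return [sum(x[i] * math.cos(math.pi / n * (i + 0.5) * k)
                for i in range(n))
            for k in range(n)]


def excitation_features(signal, frame_len, order):
    """Split the signal into non-overlapping frames and return two lists:
    DLR vectors (residual samples) and DCTLR vectors (DCT of residual).
    frame_len is the frame size that would be varied in an experiment."""
    dlr, dctlr = [], []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        res = lp_residual(signal[start:start + frame_len], order)
        dlr.append(res)
        dctlr.append(dct2(res))
    return dlr, dctlr
```

Under these assumptions, `excitation_features(signal, frame_len, order)` would be called once per utterance for a sentence level analysis, or once per voiced segment for a voice-segment level analysis, with `frame_len` changed between runs to study the effect of frame size.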
