Fractional Fourier transform based features for speaker recognition using support vector machine
Abstract
This paper presents a text-independent speaker recognition technique in which the conventional Fourier transform in the Mel-Frequency Cepstral Coefficient (MFCC) front end is replaced by the Fractional Fourier Transform (FrFT). A Support Vector Machine (SVM) maps these input features into a high-dimensional space and separates the classes with a hyperplane, giving enhanced discrimination capability. An SVM based on a mean-squared-error classifier produces a more accurate system. The FrFT reveals the mixed time and frequency components of the signal. Modelling speech signals as mixed time and frequency signals better represents the characteristics of speech production and perception. Processing time-varying signals in the fractional Fourier domain allows the signal to be estimated with the least Mean Square Error (MSE), making the technique more robust against additive noise than Fourier-domain processing while maintaining the same computational complexity. The feasibility of the proposed technique has been tested experimentally on the Texas Instruments and Massachusetts Institute of Technology (TIMIT) and Shri Guru Gobind Singhji (SGGS) databases. The experimental results show the superiority of the proposed method.
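
For reference, one widely used definition of the FrFT is sketched below. The abstract does not state which convention the paper adopts, so the normalization and the symbols x, t, u and the rotation angle α are assumptions made here for illustration only.

```latex
\[
X_\alpha(u) = \int_{-\infty}^{\infty} x(t)\, K_\alpha(t,u)\, dt,
\qquad
K_\alpha(t,u) = \sqrt{\frac{1 - j\cot\alpha}{2\pi}}\,
\exp\!\left( j\,\frac{t^{2} + u^{2}}{2}\cot\alpha \;-\; j\,u\,t\,\csc\alpha \right),
\quad \alpha \neq n\pi .
\]
```

At α = π/2 the kernel reduces to the ordinary Fourier kernel, and as α approaches 0 the transform approaches the identity, so intermediate angles give the mixed time and frequency representation that the abstract describes.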
