Fractional Fourier transform based features for speaker recognition using support vector machine

Authors: Pawan K. Ajmera (ajmera.pawan@gmail.com); Raghunath S. Holambe (rsholambe@sggs.ac.in)
Journal: Computers and Electrical Engineering
Publication year: 2013
Publication date: February 2013
Volume: 39
Issue: 2
Pages: 550-557
Full-text size: 530 K

Abstract

This paper presents a text-independent speaker recognition technique in which the conventional Fourier transform in the Mel-Frequency Cepstral Coefficient (MFCC) front-end is replaced by the Fractional Fourier Transform (FrFT). A Support Vector Machine (SVM) maps these input features into a high-dimensional space, where a hyperplane separates the classes with enhanced discrimination capability. An SVM based on a mean-squared-error classifier yields a more accurate system. The FrFT reveals the mixed time and frequency components of the signal. Modelling speech as a mixed time-frequency signal better represents the characteristics of speech production and perception. Processing time-varying signals in the fractional Fourier domain allows the signal to be estimated with the least Mean Square Error (MSE), which makes the technique more robust against additive noise than Fourier-domain processing while maintaining the same computational complexity. The feasibility of the proposed technique has been tested experimentally using the Texas Instruments and Massachusetts Institute of Technology (TIMIT) and Shri Guru Gobind Singhji (SGGS) databases. The experimental results show the superiority of the proposed method.
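
The front-end described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which discrete FrFT algorithm, fractional order, frame size, or filterbank the paper uses, so every such choice below is an assumption. In particular, the function names `frac_dft_matrix`, `mel_filterbank`, and `frft_mfcc` are hypothetical, the order `a=0.9` is an arbitrary placeholder, and the transform used here is the fractional power of the unitary DFT matrix (one simple discrete definition, which is not guaranteed to match the continuous FrFT or the paper's method). Applying a mel filterbank to the first half of the fractional-domain bins, as if they were frequency bins, is also an assumption.

```python
import numpy as np

def frac_dft_matrix(n, a):
    """Fractional power F**a of the unitary n-point DFT matrix (assumed FrFT stand-in)."""
    idx = np.arange(n)
    F = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
    powers = [np.linalg.matrix_power(F, j) for j in range(4)]  # F^0 .. F^3
    Fa = np.zeros((n, n), dtype=complex)
    for k in range(4):
        lam = np.exp(-1j * np.pi * k / 2)  # DFT eigenvalues: 1, -i, -1, i
        # Spectral projector onto the eigenspace of lam (valid because F^4 = I).
        P = sum(np.conj(lam) ** j * powers[j] for j in range(4)) / 4
        # lam**a evaluated on a fixed branch, so that F**a @ F**b == F**(a+b).
        Fa += np.exp(-1j * np.pi * k * a / 2) * P
    return Fa

def mel_filterbank(n_filters, n_fft, sr):
    """Triangular mel filters over the first n_fft // 2 + 1 bins."""
    hz2mel = lambda f: 2595 * np.log10(1 + f / 700)
    mel2hz = lambda m: 700 * (10 ** (m / 2595) - 1)
    pts = mel2hz(np.linspace(0, hz2mel(sr / 2), n_filters + 2))
    bins = np.floor((n_fft + 1) * pts / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        lo, mid, hi = bins[i], bins[i + 1], bins[i + 2]
        fb[i, lo:mid] = (np.arange(lo, mid) - lo) / max(mid - lo, 1)  # rising edge
        fb[i, mid:hi] = (hi - np.arange(mid, hi)) / max(hi - mid, 1)  # falling edge
    return fb

def frft_mfcc(signal, sr=16000, a=0.9, frame_len=256, hop=128,
              n_filters=20, n_ceps=12):
    """MFCC-style features with the DFT replaced by F**a (all defaults are assumptions)."""
    Fa = frac_dft_matrix(frame_len, a)
    fb = mel_filterbank(n_filters, frame_len, sr)
    window = np.hamming(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    spec = frames @ Fa.T                                # transform each frame
    power = np.abs(spec[:, :frame_len // 2 + 1]) ** 2   # keep first half of the bins
    log_energy = np.log(power @ fb.T + 1e-10)           # small offset avoids log(0)
    # Unnormalized DCT-II over the filterbank axis gives the cepstral coefficients.
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n_filters)[None, :]
    dct = np.cos(np.pi * k * (m + 0.5) / n_filters)
    return log_energy @ dct.T

# Usage on one second of synthetic noise (a placeholder, not TIMIT or SGGS data).
signal = np.random.default_rng(0).standard_normal(16000)
features = frft_mfcc(signal)  # one row of cepstral coefficients per frame
```

With `a=1.0` the matrix reduces to the ordinary unitary DFT, so the sketch falls back to a conventional MFCC-style front-end; with `a=0.0` it is the identity. The per-frame feature vectors could then be passed to an SVM classifier, for example scikit-learn's `SVC`, which is omitted here because the abstract gives no kernel or training details.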
