Speaker Discrimination Using Several Classifiers and a Relativistic Speaker Characterization

详细信息查看全文

关键词：Speaker discrimination ; Speaker verification ; Relativistic speaker characteristic ; PCA reduction ; Classification models
刊名：Lecture Notes in Computer Science
出版年：2016
出版时间：2016
年：2016
卷：9680
期：1
页码：203-212
全文大小：259 KB
参考文献：1.Rose, P.: Forensic speaker discrimination with Australian English vowel acoustics. In: ICPhS, XVI (2007)
2.Matrouf, D., Bonastre, J.F.: Accurate log-likelihood ratio estimation by using test statistical model for speaker verification. In: The Speaker and Language Recognition Workshop, Odyssey (2006)
3.Meignier, S., et al.: Step- by- step and integrated approaches in broadcast news speaker diarization. Comput. Speech Lang. 20, 303–330 (2006)CrossRef
4.Meignier, S.: Indexation en locuteurs de documents sonores: segmentation d’un document et Appariement d’une collection. Ph.D. thesis, LIA Avignon, France (2002)
5.Ouamour, S., Guerti, M., Sayoud, H.: A new relativistic vision in speaker discrimination. Can. Acoust. J. 36(4), 24–34 (2008). ISSN 0711-6659
6.Li, M., Xing, Y., Luo, R.: Hierarchical speaker verification based on PCA and kernel fisher discriminant. In : 4th International Conference on Natural Computation, pp. 152–156 (2008)
7.Zhao, Z.D., Zhang, J., Tian, J.F., Lou, Y.Y.: An effective identification method for speaker recognition based on PCA and double VQ. In: Proceedings of the Eighth International Conference on Machine Learning and Cybernetics, Baoding, pp. 1686–1689 (2009)
8.Jayakurnar, A., Vimal Krishnan, V.R., BabuAnto, P.: Text dependent speaker recognition using discrete stationary wavelet transform and PCA. In: International Conference on the Current Trends in Information Technology CTIT, pp. 1–4 (2009)
9.Zhou, Y., Zhang, X., Wang, J., Gong, Y.: Research on speaker feature dimension reduction based on CCA and PCA. In: International Conference on Wireless Communications and Signal Processing (WCSP), pp. 1–4 (2010)
10.Xiao-chun, L., Jun-xun, Y., A.: Text-independent speaker recognition system based on probabilistic principle component analysis. In: 3rd International Conference on System Science, Engineering Design and Manufacturing Informatization, pp. 255–260 (2012)
11.Contributors of Wikipedia: Linear discriminant analysis. https://en.wikipedia.org/wiki/Linear_discriminant_analysis . Accessed Nov 2015
12.Contributors of Wikipedia: Adaboost. https://en.wikipedia.org/wiki/AdaBoost . Accessed Nov 2015
13.Contributors of Wikipedia: Support vector machine. https://en.wikipedia.org/wiki/Support_vector_machine . Accessed Nov 2015
14.Sayoud, H.: Automatic speaker recognition – connexionnist approach. Ph.D. thesis, USTHB University, Algiers (2003)
15.Contributors of Wikipedia: linear discriminant analysis. https://en.wikipedia . Last Accessed Nov 2015, Wikipedia, “Linear regression”. http://en.wikipedia.org/wiki/Linear_regression . From Wikipedia, Last Accessed 28 Mar 2013
16.Huang, X., Pan, W.: Linear regression and two-class classification with gene expression data. Bioinformatics 19(16), 2072–2078 (2003)CrossRef
17.Contributors of Wikipedia, 2015. Generalized linear model. Last Accessed Nov 2015. https://en.wikipedia.org/wiki/Generalized_linear_model
18.Kohonen, T.: The self-organizing map. Proc. IEEE 78(9), 1464–1480 (1990). doi:10.1109/5.58325 . Invited PaperCrossRef
19.Tambouratzis, G., Hairetakis, G., Markantonatou, S., Carayannis, G.: Applying the SOM Model to Text Classification According to Register and Stylistic Content. Int. J. Neural Syst. 13(1), 1–11 (2003)CrossRef
20.Bimbot, F., Magrin-Chagnolleau, I., Mathan, L.: Second-Order Statistical Measures for text-independent Broadcaster Identification. Speech Commun. 17(1–2), 177–192 (1995)CrossRef
21.Reynolds, D.A.: Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17(1–2), 91–108 (1995)CrossRef
22.Shlens, J.: A tutorial on principal component analysis - Derivation, Discussion and Singular Value Decomposition. Version 1, (2003). www.cs.princeton.edu/picasso/mats/PCA-Tutorial-Intuition_jp.pdf
作者单位：Siham Ouamour (19)
Zohra Hamadache (19)
Halim Sayoud (19)

19. USTHB University, Algiers, Algeria
丛书名：Image and Signal Processing
ISBN：978-3-319-33618-3
刊物类别：Computer Science
刊物主题：Artificial Intelligence and Robotics
Computer Communication Networks
Software Engineering
Data Encryption
Database Management
Computation by Abstract Devices
Algorithm Analysis and Problem Complexity
出版者：Springer Berlin / Heidelberg
ISSN：1611-3349
卷排序：9680

文摘

Automatic Speaker Discrimination consists in checking whether two speech signals belong to the same speaker or not. It is often difficult to decide what could be the best classifier to use in some specific circumstances. That is why, we implemented nine different classifiers, namely: Linear Discriminant Analysis, Adaboost, Support Vector Machines, Multi-Layer Perceptron, Linear Regression, Generalized Linear Model, Self Organizing Map, Second Order Statistical Measures and Gaussian Mixture Models. Moreover, a special feature reduction was proposed, which we called Relativistic Speaker Characteristic (RSC). On the other hand we further intensified the feature reduction by adding a second step of feature transformation using a Principal Component Analysis (PCA). Experiments of speaker discrimination are conducted on Hub4 Broadcast-News. Results show that the best classifier is the SVM and that the proposed feature reduction association (RSC-PCA) is extremely efficient in automatic speaker discrimination.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700