Corpus construction for Tibetan voiceprint recognition

Original title: 面向藏语声纹识别的语料库建设
Authors: ZHOU Yan; Shereb Dorje (周雁; 西绕多吉)
Affiliation: Research Center of Tibetan Information Technology, Tibet University (西藏大学藏文信息技术研究中心)
Keywords: Tibetan; voiceprint recognition; corpus
Journal code: JSJK
Journal: Computer Engineering & Science (计算机工程与科学)
Publication date: 2018-11-15
Publisher: Computer Engineering & Science (计算机工程与科学)
Year: 2018
Issue: v.40; No.287
Funding: Natural Science Foundation of Tibet Autonomous Region (2015ZR-14-5); National Natural Science Foundation of China (61165010)
Language: Chinese
Record ID: JSJK201811024
Page count: 5
Issue number: 11
CN: 43-1258/TP
Pages: 178-182

Abstract

Research on Tibetan voiceprint recognition has only just begun, and a corpus built for this purpose is urgently needed. Taking the characteristics of the Tibetan language into account, we design and build a corpus for Tibetan voiceprint recognition. The corpus has two parts, text-dependent and text-independent. Its text material is drawn from newspapers and periodicals, literature, education, science and technology, Buddhist studies, history, and traditional culture (the five sciences, 五明). The recordings were made by 50 speakers from several Tibetan dialect regions and yield 9,500 speech utterances, laying a foundation for research on Tibetan voiceprint recognition.

References

[1] Huang Xiao-dan, Hong Qing-yang, Li Lin, et al. Discussions on construction for speech database of voiceprint recognition[C]//Proc of the 11th National Conference on Man-Machine Speech Communication, 2011: 1-4. (in Chinese)
[2] Cieri C, Liberman M. Issues in corpus creation and distribution: The evolution of the Linguistic Data Consortium[J]. Biochemical Pharmacology, 2000, 64(4): 711-721.
[3] Wang Jing-yun. Design of Hokkien speech corpus based on college students for speaker recognition[J]. Journal of Xiamen University of Technology, 2009, 17(3): 79-83. (in Chinese)
[4] Wang Hong, Li Xin, Gao Yang. Design of Chinese speech corpus based on college students for speaker recognition[J]. Journal of Changji University, 2008(6): 107-111. (in Chinese)
[5] Li Ai-jun, Wang Tian-qing, Yin Zhi-gang. RASC863-an annotated 4 regional accent speech corpus for Mandarin speech recognition[C]//Proc of the 7th Phonetics Conference of China, 2003: 274-277. (in Chinese)
[6] Zhou Hao-lang, Wang Lan, Chen Ke. A Chinese speech corpus for speaker recognition[C]//Proc of the 6th National Conference on Man-Machine Speech Communication, 2001: 329-332. (in Chinese)
[7] Yang Ying-chun, Yan Shi-feng, Wu Zhao-hui, et al. Speech corpus of speaker recognition for mobile communication (SRMC)[C]//Proc of the 7th National Conference on Man-Machine Speech Communication, 2003: 238-242. (in Chinese)
[8] Li Yong-hong, Yu Hong-zhi. Research of voice database for Tibetan speech synthesis[J]. Journal of Northwest University for Nationalities (Natural Science), 2006, 27(3): 36-39. (in Chinese)
[9] Zhao Li. Voice signal processing[M]. 2nd Edition. Beijing: China Machine Press, 2009. (in Chinese)
[10] Zhu Jie, Ngodrup, Gesang Dorje, et al. Establishment of a Tibetan syllable rule base and analysis of its applications[J]. Journal of Chinese Information Processing, 2013, 27(2): 103-111. (in Chinese)