面向藏语声纹识别的语料库建设
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Corpus construction for Tibetan voiceprint recognition
  • 作者:周雁 ; 西绕多吉
  • 英文作者:ZHOU Yan;Shereb Dorje;Research Center of Tibetan Information Technology,Tibet University;
  • 关键词:藏语 ; 声纹识别 ; 语料库
  • 英文关键词:Tibetan;;voiceprint recognition;;corpus
  • 中文刊名:JSJK
  • 英文刊名:Computer Engineering & Science
  • 机构:西藏大学藏文信息技术研究中心;
  • 出版日期:2018-11-15
  • 出版单位:计算机工程与科学
  • 年:2018
  • 期:v.40;No.287
  • 基金:西藏自治区自然科学基金(2015ZR-14-5);; 国家自然科学基金(61165010)
  • 语种:中文;
  • 页:JSJK201811024
  • 页数:5
  • CN:11
  • ISSN:43-1258/TP
  • 分类号:178-182
摘要
藏语声纹识别技术的研究刚刚起步,建设一个用于藏语声纹识别的语料库迫在眉睫。结合藏语特点,设计、建立了一个面向藏语声纹识别的语料库。语料库包含文本相关、文本无关两部分,文本语料来自新闻报刊、文学类、教育类、科技类、佛学类、历史类和传统文化五明类等文献资料,录音者由来自多个不同藏语方言地区的50人组成,产生了语音语料9 500条,为藏语的声纹识别研究奠定了一定的基础。
        Research on Tibetan voiceprint recognition technology has just started,and it is an urgent and necessary task to establish a corpus.We design and build a corpus based on the characteristics of Tibetan language,which consists of two parts:text-dependent part and text-independent part.Texts of the corpus are collected from a variety of materials,including newspaper,literature,education,science and technology,Buddhism,and history and traditional culture.As for the recording part,we invite 50 speakers from different regions of Tibet.The corpus contains 9500 speech files and it lays a certain foundation for Tibetan voiceprint recognition.
引文
[1]Huang Xiao-dan,Hong Qing-yang,Li Lin,et al.Discussions on construction for speech database of voiceprint recognition[C]∥Proc of the 11th National Conference on Man-Machine Speech Communication,2011:1-4.(in Chinese)
    [2]Cieri C,Liberman M.Issues in corpus creation and distribution:The evolution of the linguistic data consortium[J].Biochemical Pharmacology,2000,64(4):711-721.
    [3]Wang Jing-yun.Design of Hokkien speech corpus based on college students for speaker recognition[J].Journal of Xiamen University of Technology,2009,17(3):79-83.(in Chinese)
    [4]Wang Hong,Li Xin,Gao Yang.Design of Chinese speech corpus based on college students for speaker recognition[J].Journal of Changji University,2008(6):107-111.(in Chinese)
    [5]Li Ai-jun,Wang Tian-qing,Yin Zhi-gang.RASC863-an annotated 4regional accent speech corpus for Mandarin speech recognition[C]∥Proc of the 7th Phonetics Conference of China,2003:274-277.(in Chinese)
    [6]Zhou Hao-lang,Wang Lan,Chen Ke.A Chinese speech corpus for speaker recognition[C]∥Proc of the 6th National Conference on Man-Machine Speech Communication,2001:329-332.(in Chinese)
    [7]Yang Ying-chun,Yan Shi-feng,Wu Zhao-hui,et al.Speech corpus of speaker recognition for mobile communication(SRMC)[C]∥Proc of the 7th National Conference on ManMachine Speech Communication,2003:238-242.(in Chinese)
    [8]Li Yong-hong,Yu Hong-zhi.Research of voice database for Tibetan speech synthesis[J].Journal of Northwest University for Nationalities(Natural Science),2006,27(3):36-39.(in Chinese)
    [9]Zhao Li.Voice signal processing[M].2nd Edition.Beijing:China Machine Press,2009.(in Chinese)
    [10]Zhu Jie,Ngodrup,Gesang Dorje,et al.Establishment of a Tibetan syllable rule base and analysis of its applications[J].Journal of Chinese Information Processing,2013,27(2):103-111.(in Chinese)
    [1]黄晓丹,洪青阳,李琳,等.声纹识别语音数据库建设的探讨[C]∥第11届全国人机语音通讯学术会议,2011:1-4.
    [3]王静芸.大学生闽南语说话人识别语音库的设计[J].厦门理工学院学报,2009,17(3):79-83.
    [4]王宏,李鑫,高阳.基于大学生的汉语说话人识别语音库设计[J].昌吉学院学报,2008(6):107-111.
    [5]李爱军,王天庆,殷治纲.863语音识别语音语料库RASC863-四大方言普通话语音库[C]∥第7届全国人机语音通讯学术会议,2003:274-277.
    [6]周昊朗,王岚,陈珂.一个面向说话人识别的汉语语音数据库[C]∥第6届全国人机语音通讯学术会议论文集,2001:329-332.
    [7]杨莹春,颜时锋,吴朝晖,等.面向移动互联环境的说话人识别语音库SRMC[C]∥第7届全国人机语音通讯学术会议,2003:238-242.
    [8]李永宏,于洪志.安多藏语语音合成语料库的设计[J].西北民族大学学报(自然科学版),2006,27(3):36-39.
    [9]赵力.语音信号处理[M].第2版.北京:机械工业出版社,2009.
    [10]珠杰,欧珠,格桑多吉,等.藏文音节规则库的建立与应用分析[J].中文信息学报,2013,27(2):103-111.