中文命名实体识别方法研究
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Research on Named Entity Recognition
  • 作者:刘璟
  • 英文作者:LIU Jing;Hunan Vocational College of Science and Technology;
  • 关键词:中文实体识别 ; 条件随机场 ; 自然语言处理
  • 英文关键词:Chinese Name Entity Recognition;;Conditional Random Field;;Natural Language Processing
  • 中文刊名:DNZS
  • 英文刊名:Computer Knowledge and Technology
  • 机构:湖南科技职业学院;
  • 出版日期:2019-03-25
  • 出版单位:电脑知识与技术
  • 年:2019
  • 期:v.15
  • 语种:中文;
  • 页:DNZS201909078
  • 页数:2
  • CN:09
  • ISSN:34-1205/TP
  • 分类号:185-186
摘要
针对命名实体识别不具备良好的领域自适应性,大多研究对象是某个领域的命名实体识别,本文分析了当下流行的条件随机场模型、隐马尔科夫模型和最大熵模型的优劣对比,最后采用条件随机场与规则相结合,以词特征、词性特征作为特征模板训练模型结合规则提取命名实体,实验结果表明本文的方法能有效提高命名实体识别的准确率。
        Named entity recognition does not have good domain adaptability,Most of the research object is a field named entity recog-nition.This paper analyses the advantages and disadvantages of the current popular Conditional Random Field model, Hidden Mar-kov model and Maximum Entropy model.Finally, Conditional Random Fields and rules are combined to extract named entities byusing word features and part-of-speech features as feature template training models. The experimental results show that the pro-posed method can effectively improve the accuracy of named entity recognition.
引文
[1]吴金星,丽丽,杨振新.CRF和词典相结合的蒙古地名识别研究[J].计算机工程与科学,2016,38(5):1047-1051.
    [2]李丽双,何红磊,刘珊珊,等.基于词表示方法的生物医学命名实体识别[J].小型微型计算机系统,2016,37(2):302-305.
    [3]朱颢东,杨立志,丁温雪.基于主题标签和CRF的中文微博命名实体识别[J].华中师范大学学报(自然科学版),2018,52(3):317-319.
    [4]郑秋生,刘守喜.基于CRF的互联网文本命名实体识别研究[J].中原工程学报,2016,27(1):71-73.
    [5]姚霖,刘轶,李鑫鑫.词边界字向量的中文命名实体识别[J].智能系统学报[J].2016,11(1):38-40.
    [6]Lafferty J,Mccallum A,Pereira F,et al.Probabilistic Models for Segmenting and Labeling Sequence Data[J].Proc.international Conf.on Machine Learning,2002,53(2):282-289.
    [7]Sha F,Pereira F.Shallow Parsing with Conditional Random Fields[C].Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics
    [8]Rabiner L,Juang B.An Introduction to Hidden Markov Models[J].IEEE Assp Magazine,1986(3):4-16.
    [9]Rabiner L.A Tutorial on Hidden Markov Models and Selected Applications in Speech

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700