Extraction of CYP Chemical Interactions from Biomedical Literature Using Natural Language Processing Methods
详细信息    查看全文
  • 作者:Dazhi Jiao* ; David J. Wild
  • 刊名:Journal of Chemical Information and Modeling
  • 出版年:2009
  • 出版时间:February 23, 2009
  • 年:2009
  • 卷:49
  • 期:2
  • 页码:263-269
  • 全文大小:124K
  • 年卷期:v.49,no.2(February 23, 2009)
  • ISSN:1549-960X
文摘
This paper proposes a system that automatically extracts CYP protein and chemical interactions from journal article abstracts, using natural language processing (NLP) and text mining methods. In our system, we employ a maximum entropy based learning method, using results from syntactic, semantic, and lexical analysis of texts. We first present our system architecture and then discuss the data set for training our machine learning based models and the methods in building components in our system, such as part of speech (POS) tagging, Named Entity Recognition (NER), dependency parsing, and relation extraction. An evaluation of the system is conducted at the end, yielding very promising results: The POS, dependency parsing, and NER components in our system have achieved a very high level of accuracy as measured by precision, ranging from 85.9% to 98.5%, and the precision and the recall of the interaction extraction component are 76.0% and 82.6%, and for the overall system are 68.4% and 72.2%, respectively.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700