医学文献主题语义相似度计算方法研究
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:The Study on Method for Topic Semantic Similarity Based on Medical Literature
  • 作者:范少萍 ; 安新颖 ; 逯万辉
  • 英文作者:Fan Shaoping;An Xinying;Lu Wanhui;Institute of Medical Information & Library,Chinese Academy of Medical Sciences;Center of Chinese Social Science Evaluation,Chinese Academy of Social Sciences;
  • 关键词:语义相似度 ; MeSH词表 ; 主题语义相似度
  • 英文关键词:semantic similarity;;MeSH;;topic semantic similarity
  • 中文刊名:TSQB
  • 英文刊名:Library and Information Service
  • 机构:中国医学科学院医学信息研究所;中国社会科学院中国社会科学评价中心;
  • 出版日期:2017-06-05 10:46
  • 出版单位:图书情报工作
  • 年:2017
  • 期:v.61;No.573
  • 基金:国家自然科学基金项目“基于语义的医学领域前沿知识发现及演化机制研究”(项目编号:71303259);; 中央级公益性科研院所基本科研业务费“基于统计和语义的医学文献主题新颖性探测方法研究”(项目编号:2016RC330004)研究成果之一
  • 语种:中文;
  • 页:TSQB201708018
  • 页数:10
  • CN:08
  • ISSN:11-1541/G2
  • 分类号:97-106
摘要
[目的/意义]针对目前医学领域基于主题的语义相似度计算研究较少,尚不足以揭示主题间在语义层面的关系,提出一套用于主题间语义相似度计算的方法,进而从语义角度判断主题间关系,为主题新颖性判断、主题关联研究等提供参考。[方法/过程]以Me SH词表为语义计算的基础,剖析词表结构与现有研究成果,从入口词、语义距离、注释3个维度综合测度主题间的语义相似度,利用Pub Med中2011-2014年干细胞领域的文献进行实证研究。[结果/结论]利用通用验证主题词对,验证了本文所提3个测度维度的有效性。通过主题间语义相似度的计算,发现干细胞领域2011-2014年较为新颖的主题为未成年人干细胞研究。后续研究中还需融入基于统计的主题相似度,从而更加全面地揭示主题间的关系,发现语义层面领域的新颖性研究主题。
        [Purpose/significance]For there are less studies on topic semantic similarity in medical field,and can't reveal the relationship between topics on the semantic level,this paper proposed the semantic similarity calculation method,in order to get the method of judging semantic relationship between topics.[Method/process]We used Me SH as computing basis.Firstly,it analyzed the structure of Me SH.Then,it calculated topic semantic similarity from three dimensions of enty terms,semantic distance and annotation.Finally,it used the field of stem cell for empirical study.[Result/conclusion]The validity of three dimensions proposed is verified by using the common verification concept words.It is found that,the young stem cell research is more novel than others between 2011-2014 through the topic semantic similarity method.In the follow-up study,it is necessary to integrate statistics method for topic similarity calculation,so as to reveal the relationship between topics,and find the novelty research topic in the field.
引文
[1]刘宏哲,须德.基于本体的语义相似度和相关度计算研究综述[J].计算机科学,2012,(2):8-13.
    [2]RADA R,MILI H,BICKNELL E,et al.Development and application of a metric on semantic nets[J].IEEE Transactions onsystems,man and cybernetics,1989,19(1):17-30.
    [3]RICHARDSON R,SMEATON A F.Using Word Net in a knowledge-based approach to information retrieval[R].Working Paper,CA-0395,School of Computer Applications,Dublin City University,Ireland,1995.
    [4]LORD P W,STENENS R D,BRASS A,et al.Investigating semantic similarity measures across the Gene Ontology:the relationship between sequence and annotation[J].Bioinformatics,2003,19(10):1275-1283.
    [5]LIN D.Principle-based parsing without overgeneration[C]//Proceedings of the 31st annual meeting on Association for Computational Linguistics.Columbus,Ohio,USA,Association for Computational Linguistics,1993:112-120.
    [6]TVERSKY A.Features of similarity[J].Psychological review,1977,84(4):327-352.
    [7]LI Y,BANDAR Z A,MCLEAN D.An approach for measuring semantic similarity between words using multiple information sources[J].IEEE Transactions on knowledge and data engineering,2003,15(4):871-882.
    [8]ALVAREZ M A,LIM S,editors.A graph modeling of semantic similarity between words[C]//Semantic Computing,2007 ICSC2007 International Conference on,Irvine,CA,USA,IEEE,2007:355-362.
    [9]PEDERSEN T,PATWARDHAN S,MICHELIZZI J.Word Net:similarity:measuring the relatedness of concepts[C]//Demonstration papers at HLT-NAACL 2004.Boston,Massachusetts,USA:Association for computational linguistics,2004:38-41.
    [10]MENG L,HUANG R,GU J.A review of semantic similarity measures in Word Net[J].International journal of hybrid information technology,2013,6(1):1-12.
    [11]ZHENG R,ZHAO H,ZHANG X.A word similarity algorithm with sememe probability density ratio based on How Net[J].International journal of hybrid information technology,2015,8(10):417-426.
    [12]LIAO K,BI Y.An Improved semantic similarity Algorithm on How Net[J].Management Science and Engineering,2015,9(1):25-29.
    [13]Me SH[EB/OL].[2016-10-21].https://www.nlm.nih.gov/mesh/MBrowser.html.
    [14]SAHAMI M,HEILMAN T D.A web-based kernel function for measuring the similarity of short text snippets[C]//Proceedings of the 15th international conference on World Wide Web.Edinburgh,Scotland:ACM,2006:377-386.
    [15]RODRIGUEZ M A,EGENHOFER M J.Determining semantic similarity among entity classes from different ontologies[J].IEEEtransactions on knowledge and data engineering,2003,15(2):442-456.
    [16]孙海霞,钱庆,吴英杰,等.Me SH词表的语义相似度计算研究[J].现代图书情报技术.2010(6):12-16.
    [17]KUMAR N,BIBHU V,ISIAM M,et al.Approximate string matching algorithm[J].International journal on computer science and engineering,2010,2(3):641-644.
    [18]WU Z,PALMER M.Verbs semantics and lexical selection[C]//Proceedings of the 32nd annual meeting on Association for Computational Linguistics.Las Cruces,New Mexico,USA:Association for computational linguistics,1994:133-138.
    [19]Relationships in Medical Subject Headings[EB/OL].[2016-10-21].https://www.nlm.nih.gov/mesh/meshrels.html.
    [20]Semantic Relatedness of Medical Terms[EB/OL].[2017-02-10].http://www.intelligence.tuc.gr/mesh/.
    [21]《干细胞》创刊30年十大研究发现[EB/OL].[2017-02-15].http://www.biomart.cn/news/10/63724.htm.
    [22]王艳菲,李阳.干细胞与再生医学行业发展综述[J].科技创新与应用,2016(30):298.
    [23]BLEI D M,NG A Y,JORDAN M I.Latent dirichlet allocation[J].Journal of machine learning research,2003,3(1):993-1022.
    [24]KAYMAK-CIHAN M,TUKUN A,KUSKONMAZ B,et al.Chronic eosinophilic leukemia with monosomy 8 in a five-year-old girl:a rare case.[J].Turkish journal of pediatrics,2014,56(4):444-451.
    [25]SKOCZEN S,TOMASIK P J,GOZDZIK J,et al.Visfatin concentrations in children with leukemia before and after stem cell transplantation[J].Experimental hematology,2014,42(4):252-260.
    [26]MONGRE R K,SODHI S S,GHOSH M,et al.A new paradigm to mitigate osteosarcoma by regulation of microRNAs and suppression of the NF?κB signaling cascade[J].Balsaenggwa saengsig,2014,18(4):197-212.
    [27]BASUROY U,BASILICO C,MANSUKHANI A.Perspectives on cancer stem cells in osteosarcoma[J].Cancer letters,2011,338(1):158-167.
    [28]VITALE K M,VIOLAGO L,COFNAS P,et al.Impact of palifermin on incidence of oral mucositis and healthcare utilization in children undergoing autologous hematopoietic stem cell transplantation for malignant diseases[J].Pediatric transplantation,2014,18(2):211-216.
    [29]CZYZEWSKI K,DEBSKI R,KRENSKA A,et al.Palifermin in children undergoing autologous stem cell transplantation:a matched-pair analysis[J].Anticancer research,2014,34(12):7379-7382.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700