基于本体的中医文献诊疗信息的智能检索示范研究

英文题名：Demonstration Research on Ontology-based Intelligent Search System of Medical Information and TCM Literature
作者：王静
论文级别：博士
学科专业名称：中医基础理论
中文关键词：中医临床文献 ; 本体 ; 智能检索 ; 相似度 ; 中医药信息
英文关键词：Traditional Chinese Medicine ; literature of clinical research ; Ontology ; Intelligent search ; similarity ; Information on Traditional Chinese Medicine
学位年度：2012
导师：崔蒙
学科代码：100501
学位授予单位：中国中医科学院
论文提交日期：2012-05-28

摘要

现代中医临床诊疗研究开始重视从以往的研究和治疗经验得到论证和指导,期刊文献更新快,文献数量大,论文内容具有较高的学术价值、比较注意新的趋势和发展等优点,因此,对中医临床文献进行有效的检索至关重要。本研究旨在设计一个基于中医文献信息本体的智能检索模型,能够借助当前最新的语义网知识进行本体层次的设计和本体概念的研究,从而使检索模型具备基本的推理能力,提高查全率和查准率,能将输出结果按照相关性排序并进行统计,从检索到的批量文献中总结、发现和掌握所查专题内容的整体发展状况和遣方用药规律。
     1.研究背景
     现有的中医文献数据库大多是针对文献外部特征信息建立的,中医临床研究人员很难从检索到的批量文献中掌握所查专题内容的整体发展状况。传统的检索方式主要通过关键词、关键词串或者主题词来进行全文检索,用户很难表达其真正的检索意图,也很难从检索结果提供的大量文献中提取共性的规律。由于检索过程机械,不理解语义。导致一方面,检索结果中包含大量无关信息,另一方面,很多与关键词同义的信息却没能检索出来。其次,大量同领域或者相关领域的数据存储库往往使用不同的标识制度,不兼容的术语和不同的数据格式。目前中医药信息主要以自然描述为主,缺乏知识层次的科学表达,没有统一的术语定义,给数据的集成和共享造成了一定的困难。
     2.研究内容
     本研究以中医临床文献中的诊疗研究内容为对象,利用本体建设来实现中医文献信息的智能检索。本研究的研究过程主要包括确定研究目标、需求调研、数据采集、数据处理、建立本体、智能检索模型的设计和相关性排序算法、中医临床文献的智能检索模型、系统检索测试等八个部分。
     2.1确定研究目标
     本研究根据中医医生在临床实际研究中对于文献中所涉及的内容的检索需求,通过部分中医临床文献内容的语义分析和本体建设,实现基于中医临床文献信息的中医智能检索,提高查全率、查准率和输出结果相关性排序。能将输出结果按照相关性排序并进行统计,从检索到的批量文献中总结、发现和掌握所查专题内容的整体发展状况和遣方用药规律。
     2.2需求调查
     2.2.1对中医临床文献内容的调研
     为了基本了解中医临床文献内部诊疗信息中各项信息出现频率以及之间的关系。总结文献中可以为中医临床诊疗研究提供的检索服务的支撑信息。选取中医核心期刊排名前几名的期刊,对其所刊登的2006-2007年度的关于中医临床诊疗研究的文献进行文献诊疗过程中理、法、方、药等信息记录的调研,发现临床作者诊疗研究的各种具体方向,并抽样调查部分文献中体现的具体数据需求。
     2.2.2对中医临床医生和科研人员检索需求的调研
     为了真正实现基于中医临床文献提供临床诊疗研究需求的检索服务,以调查问卷的形式进行小规模的调查,调查对象为三级甲等医院中的初、中、高级临床医师,共回收有效问卷200份。总结中医临床医生在诊疗研究过程中对文献信息检索的具体需求。经统计分析后,认为临床医师的检索需求重点停留在病、证、症及方剂和针灸疗法上,并且总结确定了一些常用检索式。
     2.3数据采集
     本研究的目的是实现中医临床文献信息的智能检索,而智能检索之前必须先确定用户的检索需求和检索的方式、对象,建立检索对象的本体,确定各个体之间的语义关联关系。资源库是检索的最基础的设施,是实现智能检索的最基本的保障,要保证资源库中有充足的可供检索的信息。
     2.3.1资源库
     本研究选择了中国中医科学院中医药信息研究所研制的中医疾病临床研究数据库和中药药理实验数据库。
     2.3.2年代
     由于资源库中共收录1988年至今的中医临床文献二十七万余篇,数据量庞大,选择基本已经加工完成且年代相对靠近的2005-2007年发表于期刊中的中医临床文献中所涉及的诊疗内容。
     2.3.3原则
     临床文献篇数比较多的疾病。所研究的对象一定要有一定的文献量,能展现所选年度内的临床研究热点和多发疾病的诊疗情况。
     2.3.4内容
     选取诊疗研究过程中涉及的疾病、证候、症状、治则、方剂、中药、穴位、药理分类等内容。
     2.4数据处理
     本研究将实施的本体建设和智能检索将基于已有的资源库进行,因此要对资源库中的数据进行数据清洗和规范整理。
     2.4.1参考标准
     数据处理必须遵循一定的原则,本研究选取美国国立医学图书馆研制的Mesh主题词表和中国中医研究院中医药信息研究所编制的中医药学主题词表为最基本的词表,并选取部分国标、《中华人民共和国药典》、《中药大辞典》等等作为词表的补充。
     2.4.2规范流程
     首先将各字段内容与标准词表中的正名进行匹配,匹配失败的词语与异名进行匹配,仍然无法与异名匹配的词语将纳入相似度计算范围,与词表中的正异名分别用Jaccard公式进行相似度计算。根据计算结果,结合人为分析,进行初步的词语规范。由于各字段的采集状况不同,人为参与的程度和难度也不同,其中疾病、中药、腧穴、方剂相对规整,人为参与较少,而证候、症状、治则的规范过程相对复杂,主观判断较多。
     2.5建立本体
     本研究进行的本体建设主要是针对文献内部内容的实际关联来人工进行的,尊重临床研究的真实内容,并不像以往本体按照教科书和词典来导入建立。
     2.5.1确定各个概念间关系属性
     2.5.2确定具体概念之间的一一关联
     由于所选取的数据存在于不同的数据库表单中,而资源库表结构复杂,共有二十八张表相互关联,要确定两个字段之间一一对应的关系,需要分次进行匹配,逐一确定。
     2.5.3建立各表单的关系属性和数据属性
     2.5.4建立各实体的本体,逐一添加语义关联的具体实体
     选择由美国Stanford大学开发的本体编辑器Prot ege,本研究使用的是Prot ege4.1版本。
     2.6智能检索模型的设计和相关性排序算法
     2.6.1智能检索模型
     2.6.1.1前端系统
     1)检索页面：提供检索入口,分为简单检索和高级检索。高级检索选择相应入口后分别填入对应的关键词,并可进行自然语言检索。
     2)结果页面：展示结果,并对可统计结果进行频次统计。
     2.6.1.2用户意图分析系统
     1)自然语言分词：借助本体库中的概念以及正异名词表对用户输入的检索词和自然查询语句进行预处理,以保证检索的概念和组合概念的准确性。
     2)基于本体创建的索引库：将本体的三元组(主谓宾)建立索引,然后依据分词结果,匹配获取用户关键词对应的本体,将其转化为spar ql语句。
     2.6.1.3语义查询系统
     是整个系统的核心,利用本体服务器中概念和概念之间的语义关系,协助查询扩展与查询语义向量等关键模块的实施。Cart Query:接收spar ql语句,并使用本体和本体数据库映射文件,将spar ql转化为sql。Dart Mapping:在本体和数据库间建立映射关系,并生成映射文件,以便Dart Query使用。
     2.6.1.4日志系统
     日志记录：记录用户的行为日志,如检索关键词、检索结果等。日志分析：对日志进行分析、挖掘,提供如检索时的智能提示、用户推荐等等功能。
     2.6.1.5后台数据库：导航信息数据库,信息资源数据库,系统相关数据。
     2.6.1.6结果优化模块：对检索结果进行相关性排序,将最符合检索条件的结果放在最前面。因此相似度算法尤其重要。
     2.6.2相关性排序算法
     与以往的本体相似度计算方式不同,本研究不仅仅从语义关系上考虑本体相似度,因为领域本体的建设并不稳定和完善,给语义相似度计算造成了很大的困难,很难得到行业认可。本研究考虑的输出结果排序主要从两个方面考虑：
     ·用户查询语言和本体本身的相似度
     ·用户查询的词语所对应的本体在文献中的重要性
     最终结合两者的计算结果来确定最后的相似度排序。
     用户查询词语类似于文本对象。因此在相似度计算上选择了文本相似度算法中的夹角余弦算法
     除了基本的文本相似度外,还需要评估用户的查询词语对于一个本体文档的重要程度。一般而言,词语的重要性随着它在一篇文献或者一个检索主题中出现的次数成正比增加,但同时会随着它在整个资源库中出现的次数成反比下降。
     因此对于查询匹配度的计算,我们将主要采用TF/IDF方法。
     最终的检索结果排序公式参考两个计算结果获得,表达为：
     SIM=d*Sim+(1-d)/mat(O,t)
     其中,Sim计算的是用户查询语言和本体本身的相似度,mat(O,t)计算本体和查询关键字的匹配程度。d为权重因子。这里d暂时先设为0.7,以后可以经过测试不断调整和优化。
     2.7中医临床文献的智能检索模型
     该模型基于中医临床文献知识体系,引入了本体语义概念,遵循了建立中医药临床文献本体体系开展相关服务的思想。可以实现以下功能：
     2.7.1关键词检索
     在检索界面检索可以检的内容可以是：疾病、证候、症状、治则、方剂、单味药、腧穴等。
     2.7.2导航检索
     检索界面左边的导航列里有疾病等的分类导航,用户可以在导航里寻找自己所要查找的项目信息。
     2.7.3高级检索
     2.7.3.1检索词高级检索
     布尔逻辑可以进行是、或、非的检索,用以扩大和缩小检索结果。
     截词检索可以进行后截、前截、中截三种类型的检索。
     2.7.3.2检索框下拉提示
     当输入关键词时,检索框会展开下拉的提示框,向用户展示与所输入的关键词最接近的准确查询词。
     2.7.3.3自然语句检索
     由于采取了本体库为基础的语义分析,基于概念进行检索。例如,当输入“心绞痛的针灸疗法”,分词工具会将这句话切分成“心绞痛”、“针灸疗法”、“的”,系统会根据常用词表去掉“的”这个助词,然后根据语义关系将“针灸疗法”关联到“腧穴”,根据“被影响”关系,提取心绞痛相关的腧穴内容。
     2.7.4专业答疑
     可以向管理人员进行提问,管理人员对用户的问题进行答复时,答案会自动发送到用户的邮箱。
     2.7.5用户日志
     可以记录并向用户展示历次查询过的问题。通过对用户的不断了解、分析,使得个性化检索更符合每个用户的需求。
     2.7.6检索结果按照相关性排序
     用户可以根据对文献的需求选择按照相关性或者是文献发表日期来排序。
     2.7.7统计功能
     直接提供所检索文献内容的直观统计信息。
     2.8系统测试结果
     以“心绞痛的针灸疗法”为例进行检索,检出文献22篇。表明智能检索模型具备按照语义关系进行自然语言检索的功能。以检索“血瘀证”的相关文献为例,传统的检索方式会返回文献828篇,而本研究的智能检索系统为1313篇。智能检索比传统检索多查找出485篇文献,比原检索结果多出59%差别及其明显,证明此智能检索模型能很好的提高检索结果的查全率。由于采用知识(概念)检索技术,明确和缩小了检索范围,减少对无用信息范围的检索,提高了查准率。本研究的智能检索系统结合用户查询语言与本体本身的相似度和用户查询的词语所对应的本体在文献中的重要性两种算法,基于语义关联,可以实现按照检索条件相关性排序,也可以按照文献发表年代排序。因此,本研究已经基本可以达到提高中医临床文献检索服务的查全率、查准率和按照结果相关性排序的设想。
     3小结
     本文的贡献和创新主要有以下几个方面：
     ·该研究引入语义网技术,在关系数据库的上层建立本体的数据模型,并据此建立智能检索模型,对同领域异质异构数据库的资源共享提供了可参考的思路和方法,提高了检索服务的查全率、查准率。
     ·提出了两种算法相结合的查询结果排序算法公式,可以更准确,更贴切实际的对检索结果进行相关性排序。
     ·中医文献信息元数据规范研究。制定相应的核心元数据规范方法及其扩展原则,以规范中医文献信息的描述,指导相关信息数据库的建立。
     ·中医临床文献本体概念体系研究。利用文献信息中的理、法、方、药信息的本体,建立中医临床文献信息概念体系,确定概念的属性和各概念之间的语义关系。为中医文献信息和知识的组织以及基于中医文献信息和知识的智能检索提供先决条件。且检索结果可以进行统计,使得用户可以越过庞大的文献浏览,直接掌握检索主题的文献概况。
     4.展望
     4.1中医证候、症状、治则、中药等信息的标准化工作需要大力投入。
     4.2中医诊疗本体的建立需要大量的人力投入。prot ege适合小样本的或者框架表格式的本体建设,中医诊疗本体的细化建设应该考虑可以批量导入式的本体建立工具。本研究进行了更细致的本体建设,可以更直接、快速的提取检索的相关项,据此建立的智能检索模型,还需要更多的测试来证明此项工作的意义和作用。
     4.3中医临床医生的参考指导作用
     基于对中医医生临床诊疗研究对文献信息的实际需求而设计的智能检索模型,更符合临床医生的检索需求,能更全面的展现文献的内部信息,能够更直接、更贴切的满足临床中医医生对文献所包含信息的获取需求。
     4.4希望通过本研究的研究,可以建立一套对中医临床文献进行结构化和关系化建设的合理研究的方法或者操作流程；可以为以后数据库的建立和数据采集加工提供相对标准的操作流程参考。
     5结语
     中医药信息服务和知识服务,开始重视针对用户的实际需求来完善信息提供方式和方法。由本体概念的引入,使得中医信息检索服务更加快速、准确、全面。同时可以解决由于不同系统或者本领域内不同结构数据库难以共享资源等问题,可以实现理论上的数据资源共享。
Clinical researchers start to pay attention to the massive literatures of clinical research and the effective literature search to get experience and guidance. This study aims to design a Traditional Chinese Medicine (TCM) literature information ontology based on the intelligent search model, design the search model by using newest semantic web technologies to improve the recall ratio and precision ratio. The output will be sorted according to the correlation.
     1. Background
     The TCM literature database, which is mostly completed based on external characteristics about literatures, can provide full-text search services.It is urgent for the TCM researchers to find out the overall development situation of relevant literature and the regular pattern of clinical treatment.
     Traditionally, full-text search are mainly carried out through keywords, keyword strings, or subject headings. Users have difficulties to express their real aim for searching, and it is hard to extract common rules from a large number of literature search results. The meaning could not been understood in the inflexible searching process, which causesthe situation that the search results contains a lot of useless information and loses many information about synonymous of keywords.
     2. Content
     Our research object is to implement an intelligent retrieval system based on ontology in order to support the treatment related content of the TCM literature of clinical research.
     Target
     Based on the search requirements of doctors for theliterature of clinical research during the dailyclinical study and treatment, create intelligent search model, through the semantic analysis of content and ontology construction of some TCM literature of clinical research. Improve the recall ratio and precision, and the output will besorted according to the correlation.
     Requirement Investigation
     Research the content of the Traditional Chinese Medicine literature of clinical researchin2006-2007to find the frequency of the information in the internal diagnosis and treatment and the relationship among them. Summarize the information of the literature can serve for the doctor.
     Make a research questionnaire to investigatethe demand for search of Traditional Chinese Medicine clinical doctors and researchers about the database of Traditional Chinese Medicine literature of clinical research.Summarizesome commonly used search types.
     Data collection
     Research resource isthe database of TCM literature of clinical research which was made by Institute of Information on TCM.The literature which basically completed entry and published in journals in2005-2007about the clinical research information is used.
     The research includeddiseases, syndrome, symptoms, therapeutic principle, formula, Traditional Chinese Medicine, acupuncture points, pharmacological classification which involves in the course of the diagnosis and treatment.
     Data processing
     The research would be performed based on existing resources, and the data in the resource center should be sorted.
     Reference standards:Data processing must follow some principles, we select Mesh keywords list made in the U.S. national library of medicine of andThesaurus of Traditional Chinese Medicine, and use part of the national standard, the People's Republic of TCM dictionary andas supplement.
     Standard process:Records can't match with standard word list exactly the words and alias will carry outSimilarity algorithm. The similarity calculation formula is Jaccard. Normalize preliminary words according to the results of the similarity calculation and the analysis by researchers.
     Due to the difference of the collectionand the different level of participation and difficulty,disease, Traditional Chinese medicine, acu-points, formula are relative neat, and syndrome, symptoms, therapeutic principle are relatively complicated.
     Establishing of ontology
     Ontology established by researcher is mainly for the actual content of literatures, respect for clinical research real content, it's notimported fromthe textbooks and dictionariesas ever.
     Determine the relationship propertiesamong concepts.
     Determine the association between the entities.Because the research data are found in different tables in database, and the structure of the table is complex,28are related, two field one-to-one relationship need to matchfor many times.
     Set up the object propertiesand the data properties between each table. Make the column as data properties.
     Establish the ontology ofdifferent records, adding semantic association of every specific entity.We choose Protege4.1, developed by Stanford University in the USA, as the tool for ontology engineering.
     The design of intelligent retrieval model and relevance ranking algorithm
     Search page:provide search entrance, divided into simple search and advanced search.The results page:show the results and statisticthe frequency of the results.
     User intent analysis system
     Segmentation natural language:with the help of ontology concept and Standard word table to pretreatment the key words and nature language of the query is entered by the user, to improvethe search accuracy of the concept and the combination of concept.
     The index library based on ontology:three-dimensional group of ontology, basicon the results of word segmentation, matching keywords which user access corresponding ontology, and then turn it into sparql statement.
     Semantic query system is the core of the whole system, to help expand and inquires the semantic query vector and other key module implementation basic of the concept and the semantic relations between concepts ontology server.
     DartQuery:Receive SPARQL queries, and turn SPARQL queries into SQL queries using the ontology and ontology-database mapping files.
     DartMapping:create a mapping relationship between the ontology and database, and produce mapping files, so that DartQuery can use it.
     Log systemLog recordand log analysis.
     The backend databases include navigation information database, information resource database and storage information database.
     Optimize Modular Result:Sort the research results and put the most correct result in the most front, so theSimilarity algorithmis especially important.
     Relevance ranking algorithm
     In this paper, we consider the sort of the output results from two aspects:
     The similarity of the query language and ontology itself
     The importance of words which user inquires of corresponding ontology in the literature.
     Finally determine the final similarity ranks basic on both results of calculation.
     The similarity of the query language and ontology itself is calculated using the formula
     The importance of words which user inquires in corresponding with ontology in the literature is calculated by using the method of TF/IDF.
     Finallythe formula of search results sort formula reference to the two calculation results, expressed as:SIM=d*Sim+(1-d)*mat (O, t)."d" is weighting factor,can test to adjust and optimize the value. Intelligent retrieval model of TCM literature of clinical research
     Including the following functions:
     KeywordsearchInclude disease, syndrome, symptoms, therapeutic principle, formula, single herb medicine, acupuncture point, etc.
     Navigation searchBoolean Operatorsand Truncation Wildcard Symbols.A search box will opened to suggest the most accurate words to the key words which users entered.
     Natural language searchThe result of the semantic analysis of ontology foundation can achieve natural language search, because the search process based on concept search not the keywords,
     Professional answerThe user could ask questions to the administrator. The answer will be automatically sent back to the user via email after being answered.
     User's logRecord and show all previous inquires of users. Through the user's log constant understanding, analysis, make personalized search more accord with each user needs.
     SortedbyrelevanceThe search results will be precedence ordering according to the correlation. The user can customize the sort according to his own demand in accordance with the correlation or literature published.
     Statistical functions:Provide direct statistics of the contents of the literature information. System test
     For example, search with "Acupuncturefor angina pectoris" and we could get the22relevant literatures. Show that intelligent search model have the function of searching natural language according to semantic relationships.
     To search literature including the research on "blood stasis syndrome" as an example,1313literature could be searched for the intelligent search system,485articles more than traditional searchsystem, proved the intelligent search model can improve the recall ratio.Due to the knowledge (concept) search technology, clear and narrowed the scope of search, and reduce the search to the scope of useless information, improve the precision.
     In this study,the intelligent search system combine the two algorithm,we can order the search results according to the correlation, and also order the search results according to the yearwhen literatureswere published.
     Therefore, this research can achievethe goalthat improve the recall ratio and the precision rate and order the results according to thecorrelation about the TCM literature about clinical research.
     3. Conclusion
     The contribution and the innovation mainly include the following aspects:
     The subject adopts the semantic web technology, established the data model of ontology on the upper deck of the relational database, and then create intelligent search model, provides reference about the ideas and methods of sharing with heterogeneous resources of heterogeneous database in the same field. Improve the recall ratio, precision ratio when searching the same repository.
     In this research the similarity algorithm combined results of two algorithms, can be more accurate, more appropriate to order the search results by correlation, and is feasible and useful.
     Research on the standard of metadata information about TCM literature. Make method about core metadata standard and the principle about extension. Regulate the description of TCM literature information and instruct the construction of relational databases.
     Research the concept system of ontology about TCM literature about clinicalresearch. Using ontology of principles, methods, formulas and medicinals in literature of clinical research to establish the concept system of literature of TCM clinical research, define the properties and semantic relations between all kinds ofconcepts. Help organizing information and knowledge of TCM literature, help achieving Intelligent Search. Direct statistics retrieval results, user can over huge document browsing and know the theme situation well.
     4. prospect
     Boosted invest is required for the work of information standardization of syndromes, symptoms, therapeutic principles.
     The establishment of ontology of TCM diagnosis and treatment needs a lot of effort, protege is suitable for small sample research, ontologybuilding of TCM construction should consider thetool that supportbatch import.
     This research do much detailed ontology construction, can be more directly, and more faster to get the related information.Intelligent search model according to these ontology still need more tests to prove the significance and efficacy.
     Provide reference for the doctor of TCM. The intelligent search model based on the result of research questionnaire to doctor about their demands to literature of clinical research more accord with their search needs, can more completed and more directly show the interior of the literature information.
     Through this research, hope can create a reasonable research methodor the operation process in Traditional Chinese Medicine literature of clinical research structured and the construction,will be areference of standard of collection and operation about database established.
     5. Epilogue
     Information service of TCMgradually turns from the resource service to deepen information service and knowledge service, begins to value the user's actual demand to improve the methods of information providing.
     Because the introduction of the concept of ontology and semantic technologies, make the service of search more accurate and rapid. At the same time it can make the different system or different structure in the same field to share resources, achieve theoretical data resource sharing.

引文

[1]Studer Rudi, Richard Benjamins, Dieter Fensel. Knowledgeengi neering:principies and net hods[J]. Dat a and Knowledge Eigineering,1998,25(1/2):161-197.
    [2]刘听鹏.Ontology理论研究和应用建模一((Ontology研究综述、V(?)C Ontology研究组文档以及Jena编程应用总结http://gis.pku.edu.cn/Resources/TR(2005).
    [3]李健康,张春辉.本体研究及其应用进展[J].图书馆论坛,2004,24(6)：80-86.
    [4]刘春,黄定光.本体初探[J].现代情报,2008,1：38-39
    [5]蒋天民.本休学习综述[J].科技情报开发与经济,2008,18(33)：146-148
    [6]林丹红,钟伶OntOlogy在中医药概念表达中的研究[J].医学信息学杂志,2007,6：545-549
    [7]高成勉,包含飞,周强.本体构建原则及其在中医顶层本体构建中的应用[J].医学信息,2008,21(5)：581-583
    [8]李新霞,陆建峰,孟红梅.本体在中医脾胃病辅助诊断中的应用[J].江南大学学报(自然科学版),2010,9(2)：151-155
    [9]曾召.构建中医药学领域本体的实践[J].中华医学图书情报杂志,2009,18(4)1-5
    [10]曹宇峰,曹存根.基于本体的中医舌诊知识的获取[J].计算机应用研究,2006,(3)31-34
    [11]李毅,张梅奎,杜侃.中医脑病学本体的探讨及其构建世界科学技术—中医药现代化2007,9(6)96-100
    [12]易钢,罗尧岳.基于本体的中医知识库系统的研究[J].医学信息,2010,23(10)：3516-3518
    [13]刘和洋,曹宇峰,秦丽娜.基于本体的中医专家临床病案的知识获取方法[J].计算机系统应用,2005(8)：80-83
    [14]蒋宏潮,王大亮,张德政.基于领域本体的中医知识获取方法[J].计算机工程,2008,34(12)16-18,21
    [15]谢琪,崔蒙,曹存根.基于领域本体方法构建中医概念信息模型的思考[J].世界科学技术中医药现代化,2009,11(4)：621-624
    [16]刘耀,穗志方,周扬.中医药本体概念描述体系的自动构建研究[J].现代图书情报技术,2008(5)：21-26
    [17]李兵,裘俭,张华敏.中医药领域本体研究概述[J].中国中医药信息杂志,2010,17(3)：100-101
    [18]纪军,徐明曙,杨韵华.针灸学领域本体构建研究[J].医学信息学杂志,2008,29(5)：25-28.
    [19]林尔正.基于中医骨伤古籍本体的语义检索研究[日.福州：福建中医学院,2008.
    [20]高红艳,周强,包含飞.以肺痨病为例探讨中医证候本体的构建[J].医学信息,2007,20(11)：1876-1882
    [21]车立娟,王瑾德,周强.基于“肺阴虚证”本体的中医证候知识库构建方法研究上海中医药大学学报2009,23(4)：18-20
    [22]侯玉,张昌林,车立娟.构建中医治则治法本体的研究[J].数理医药学杂志,2010,23(5)：603-606
    [23]方安,王惠临,王军辉.临床疾病领域本体构建方法研究—以手足口病本体为例[J].情报杂志,2009,28(11)：180-184
    [24]王梅文.药学本体构建实践[J].数据库及信息管理,1520-1522
    [25]黄惠芬.浅议医院本体建模及技巧[J].科技信息(学术研究),2007,27：495-496
    [26]刘菊红,于建荣,缪有刚.基于Mesh词表和共词分析的疾病本体半自动构建方法研究[J].现代情报,2009,29(3)：208-211
    [27]朱利君,张友华,李绍稳.基于茶虫害本体的智能检索系统设计计算机技术与发展2010,20(6)：130-132,178
    [28]徐济成,李绍稳,张友华.面向茶学领域本体的概念自动提取方法研究[J].农业网络信息,2010(8)13-15,24
    [29]徐济成,李绍稳,张友华.农业本体及本体学习研究[J].计算机技术与发展,2009,19(8)：212-215,219
    [30]赵建勋,张震明,田锡天.本体及其在机械工程中的应用综述[J].计算机集成制造系统,2007,13(4)：727-737
    [31]FarquharA.FikesR,RieeJ.The ontolingua server:a tool for eol-Laborative ontology cons-truetion.In:10th Knowledge Acquisition for Knowledge-Based Systems Work-shop (KAW96).Berlin:Springer press,1996:1-19
    [32]Dieter Fensel. Developing tools forontology construction and interchange. [2004-07-18]. http://www.ontoknowledge.org/downl/ontol.pdf
    [33]SwartoutB.RameshP,Knight Ketal.Toward distributed use of Large-scale ontologies.In:AAAI Symposium on Ontological Engineering (AAAI96).Stanford:AAAIPress,1997:1-5.
    [34]) Domingue J.Tadzebao and Webonto:discussing,browsing and editing Ontologies on the Web.In:11th Knowledge Acquisition Workshop (KAW98).Berlin:Springer press,1998:25-32
    [35]DomingueJB.MottaE.AKT-Technologies.[2004-08-12],http://www. aktors.org/technologies/webonto
    [35]Malthew Horridge,Holger Knublaueh,Alan Rector etal.A practiCal guide to building OWL ontologies using the Protege-OWL pluGin and CO-ODE tools.[2004-07-18]. http://www.co-ode.org/resources/tutorials/ProtegeOWLTutorial.pdf
    [37]Noy N F.Fergerson R W,Musen M A.The knowledge model of Protege-2000:combining interoperahility and flexibility.In:12th International Conference in Knowledge Engineering and Knowledge Management(EKAW_00). Amsterdam:Kluwer Aeademic Publisher,2000(1937):17-32
    [38]Sure Y,Erdmann M,Angele Jetal.OntoEdit:collaborative ontology Engineering for the semantic Web.In:First International Semantic Web Conferenee(ISWC_02).Berlin:Springer Press,2002(2342):221-235
    [39]SureY,AngeleJ,Staab S.OntoEdit:guiding ontology development By methodology and inferencing.[2004-08-06].http://www.aifb.uni-karlsruhe.de/WBS su/publications/2002_odbase_ontoedit.pdf
    [40]ArPirez J C,Coreho O,Ferndndez-Lopez M.WebODE:a scalable Ontological engineering workbench. In,First International Conference On Knowledge Capture(KCAP_01). Victoria:ACM Press,2001:6-13
    [41]Bechhofer S,Horroeks I,Goble C etal.oILEd:a reasonable ontology editor for the semantic web.In:Joint German/Austrian conference on Artificial Intelligence(K1_01).Berlin:Springer Press,2001(2174):396-408
    [42]Kogut P,Crane·eld S,Hart L etal.UML for ontology development. The Knowledge Engineering Review,2002,17(1):61-64
    [43]徐国虎,许芳.本体构建工具的介析与比较[J].图书情报工作,2006,50(1)：44-48
    [44]李景,孟连生.构建知识本体方法体系的比较研究[J].现代图书情报技术,2004(7)：17-21
    [45]于娟,党延忠.本体集成研究综述[J].计算机科学,2008,35(7)：9-13,18
    [46]张卫国,王琰.综述本体构建工具的发展[J].计算机工程与应用,2005,增刊：48-50
    [47]侯玉,张昌林,周强Protege在构建中医治则治法本体中的运用[J].医学信息,2007,20(11)：1866-1871
    [48]李弘伟.本体开发平台及关键技术研究中国科学技术信息研究所[J].硕士学位论文
    [49]李景.本体技术标准化综述研究,现代图书情报技术,2007,8(154)：12-17
    [50]岳静,本体及其在多文化交流平台中的应用[J].西南大学,硕士学位论文
    [51]朱毅华,郭卫兵.基于本体的教学资源检索、导航平台的设计与实现[J].中国教育信息化,2008,11：68-71
    [52]于英林,王卫东,于宗江.基于本体的可重构知识管理平台[J].计算机集成制造系统,2003,9(2)：1136-1144.
    [53]张永妹,党德鹏.基于本体的应急平台数据集成的设计与实现[J].计算机应用与软件,2010,27(3)：62-64
    [54]吴步丹,金芝.一个基于环境本体的WebServices描述平台原型[J].计算机科学,2007,34(9)：340-341
    [55]伍伟,李兵,曾诚.基于本体的语义Web服务注册平台研究[J].微电子学与计算机,2008,25(10)：40-42
    [56]李寰,贾保先.基于RSS聚合和本体检索的数字图书馆个性化门户设计[J].情报杂志,2009(2)：47-49.
    [57]杨建林.基于本体的文本信息检索研究[J].信息系统,2006,29(5)：598-601
    [ 58]叶晓燕.异构本体知识库合并技术研究[J].福建电脑,2010,(1)：32,14
    [59]胡哲,朱强.基于本体的查询扩展研究[J].数据库与信息管理,2010,6(5)：1025-1026,1037
    [60]金灿.面向不同结构化程度数据源的本体学习方法研究[J].计算机时代,2010(8)：10-13
    [61]吴柏林,宋泽锋,杨炳儒.一种基于本体的垂直检索引擎系统模型[J].计算机工程与科学,2008,30(10)：5-7,14
    [62]张晶,丘峰.期刊栏目研究[J].图书馆理论与实践,2000,(6)：32-35
    [63]池爽.基于Agent的智能检索综述李志义[J].情报科学,2009,27(7)：1035-1039
    [64]朱明.一种互联网信息智能搜索新方法[J].《计算机研究与发展》,2001,38(1)：116-120
    [65]杨海南.基于语义概念树和局部上下文分析的查询扩展[J].武汉理工大学学报(信息与管理工程版),2011,33(1)：79-82
    [66]聂卉.基于规则的智能检索系统推理机制的研究[J].《情报学报》,2008,27(6)：832-837
    [67]徐国虎,董惠.基于语义的数字图书馆推理检索研究[J].中国图书馆学报,2006,(3)：50-53
    [68]责可荣,张彦铎.人工智能[M.北京：清华大学出版社,2006：253
    [69]国家药典委员会.中华人民共和国药典：2005版.一部[M].北京：北京大学出版社,2004
    [70]江苏新医学院编.中药大辞典[M.上海：上海科学技术出版社,2004
    [71]郭武斌,周宽久,苏振魁.基于词序方法的文本相似度计算模型[J].情报学报,2008,27(6)：857-862
    [72]金博,史彦军,滕弘飞.基于语义理解的文本相似度算法[J].大连理工大学学报,2005,45(2)：291-297
    [73]李桂林,陈晓云.关于聚类分析中相似度的讨论[J].计算机工程与应用,2004,26(31)：65-82
    [74]王皴.基于Trie结构的带通配符的相似字符串匹配算法[J].计算机应用,2004,24(10)：121-122
    [75]周文,徐国梁.翻译记忆中语句相似度计算方法的研究[J].计算机应用,2007,27(5)：1210-1211
    [76]吴兰成主编.《中国中医药学主题词表》2007版[S].北京：中医古籍出版社,2007
    [77]《中华人民共和国中医药行业标准-中医病证诊断疗效标准》[S].北京：国家中医药管理局,1994
    [78]李经纬主编.《简明中医辞典》[M.中国中医药出版社,2001
    [79]李时珍.《本草纲目》校点本[M.人民卫生出版社,2007
    [80]梁繁荣.《针灸学》[M.上海科学技术出版社,2009
    [81]杜文华.本体构建方法比较研究[J].情报方法,2005,(10)：24-25
    [82]孔敬.智能检索模型研究[J].现代图书情报技术,2005,(3)：37-41
    [83]徐海涛,汪方胜,蒋馥.基于本体的信息检索机制研究[J].情报杂志,2005,(10)：75-79
    [84]孔楠.基于本体的垂直搜索系统的设计与实现[日.北京交通大学,2008
    [85]贾雪峰,王建新,齐建东,朱礼军.基于领域本体的智能检索模型[J].计算机工程,2010,36(23)：171-173
    [86]贾西平,刘海珠.面向服务的文档智能检索平台[J].广东技术师范学院学报(自然科学版),2010,31(1)：39-42
    [87]张宇,刘雨东,计钊.向量相似度测度方法[J].声学技术,2009,28(4)：532-536
    [88]孙振球主编.医学综合评价方法及其应用[M].北京：化学工业出版社,2005
    [89]夏天.汉语词语本体概念相似度计算研究[J].计算机工程,2007,33(6)：191-194
    [90]管延勇,王洪凯,史开泉.集合的粗相似度量[J].模糊系统与数学,2006,20(1)：134-139
    [91]吴江宁,刘巧凤.基于最大公共子图的文本相似度算法研究[J].情报学报,2010,29(5)：785-791
    [92]黄承慧,印鉴,侯昉.一种结合词项语义信息和TF-IDF方法的文本相似度量方法[J].计算机学报,2011,34(5)：856-864
    [93]余丹.关于查全率和查准率的新认识[J].西南民族大学学报(人文社科版),2009,30(2)：283-285
    [94]沈艳红.信息检索中检索词的选择对查全率的影响[J].情报探索,2006,(11)：73-74
    [95]韩英杰,石磊,刘杨.Web预取性能指标准确率与查全率的关系[J].计算机工程,2010,36(3)：61-63

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700