基于语义处理技术的信息检索模型研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
信息爆炸是当今信息社会的一大特点,当前信息检索技术面临着互联网网络信息更新越来越快,用户检索结果要求越来越精确的严重挑战。如何在海量的信息中有效地找到所需信息因而成为了一个关键问题,语义检索技术是解决这一问题非常有潜力的方法。然而,在语义网还没有完全实现的情况下,研究过渡时期的语义检索技术已成为近年来一个快速发展的新兴研究课题。
     本文对信息检索中的若干关键问题进行了研究,提出了基于语义处理技术的信息检索模型——SPTIR(Semantic Processing Technology based InformationRetrieval)。该模型围绕查询扩展和检索结果重排序而展开,主要由四个部分构成,即:基于词义消歧的语义查询扩展、基于词汇语义相关性度量的查询优化、基于文档语义相关性的检索结果重排序和语义加强的个性化信息推荐。
     1.在基于关键字的搜索引擎中,一个构造良好的查询是用户主观信息需求的客观表现,也是信息检索服务质量的基本保证。本文以用户查询关键字之间的语义关联为切入点,辅以隐式反馈技术获取消歧上下文,使用无导词义消歧的方法实现了查询关键字到本体概念的映射,基于概念词语关联进行语义查询扩展。基于词义消歧的语义查询扩展解决了传统的信息检索系统不能很好理解用户查询意图的问题。
     2.针对部分消歧失败的查询关键字,本文提出使用隐式反馈技术从相关文档中直接提取候选扩展查询词的策略。为了进一步精简和优化反馈产生的扩展词汇,避免查询扩展的“主题偏移”现象,本文采用基于词汇语义相关性度量的方法对扩展查询词进行过滤来优化查询。
     3.由于传统关键字检索返回的数据量过大,检索结果相关性评价成为研究的焦点。本文根据查询消歧的具体情况(成功、失败),提出两种文档语义相关性度量的方法:基于语义向量空间模型的文档相关性和基于词汇向量空间模型的文档相关性。根据文档相关性对检索结果进行重新排序,优先返回与查询语义相关性强的文档供用户浏览。
     4.本文对如何满足不同用户的个性化查询需求进行了研究,提出了一种语义加强的个性化信息推荐方法。该方法综合利用语义数据源和历史评分数据进行混合推荐,语义数据源的引入解决了传统协同过滤系统的数据稀疏性和冷启动问题。另外,为了提高推荐系统的可扩展性和实时性,在数据的离线预处理阶段,本文使用数据挖掘方法对用户和项目进行了模糊聚类。
We are in an information age that mainly characterized by information explosion, and information retrieval techniques are now challenged a lot by more frequent Internet information updating, as well as increasing user demand for more precise search results. Semantic search technique, fortunately, is a hopeful way that leads to the key to the issue of finding exact information from mass number of them effectively. However, as a result of the incomplete realization of semantic web technique, recent study has been more focused on semantic retrieval technique in transition period, making it a hot topic of research.
     Several key problems in Information Retrieval (IR) domain are addressed and a novel Semantic Processing Technology based Information Retrieval (SPTIR) model is proposed in this dissertation. SPTIR is an extension on Query Expansion (QE) and Search Result re-Ranking, which consists of four parts, namely semantic query expansion based on Word Sense Disambiguation (WSD), query optimization based on word semantic relatedness, search results re-ranking based on document semantic relevance, and semantic enhanced personalized information recommendation.
     Firstly, in the context of keyword-based search engine, a well-structured and good-meaningful user query not only expresses user's personal needs precisely, but also guarantees the QS (Quality of Service) requirement for information retrieval. Starting with the issue of semantic associations of query keywords, supplemented by implicit feedback technique, and using unsupervised Word Sense Disambiguation, this dissertation presents a technique that maps query keywords to ontology concepts, and a semantic query expansion technique based on concept-word association. The WSD based semantic query expansion solves the problem of not well understanding user's query intension in traditional retrieval systems.
     Secondly, for those query keywords that fail to disambiguate, this dissertation presents a strategy that directly selects candidate expanded query keywords from the relevant documents using implicit feedback technique. In order to further condense and optimize the expansion keywords that generates from feedback, and to avoid the "topic shift" phenomenon in query expansion, this dissertation uses a semantic relatedness measurement between terms to filter expanded keywords to optimize the query.
     Thirdly, traditional keyword-based search always returns millions of search results, thus the relevance evaluation of retrieval results has become a hot topic of research. Based on the specific situation (success, failure) of Query Disambiguation, two distinct types of Document Semantic Relevance Measure, namely Semantic Vector Space Model based Document Relevance and Word Vector Space Model based Document Relevance, are proposed in this dissertation. With Semantic Relevance, the search results are re-ranked and the documents with a strong semantic correlation to query words are presented to user with high priority.
     Fourthly, the problem of how to meet the information needs of different users is studied, and a semantic-enhanced personalized information recommendation model is proposed. This model utilizes the semantic data sources and historical rating data to implement a hybrid recommendation. The introduction of semantic data sources solves the sparse problem and the cold start problem in traditional collaborative filtering system. In addition, in order to improve the system scalability and realize real-time recommendation, data mining method of fuzzy clustering is used to cluster the users and items in offline data pre-processing stage.
引文
[1]Tim Bemers-Lee.Semantic Web Road Map[R].World Wide Web Consortium,September,1998.
    [2]M.Eric,S.Ralph.An Overview of W3C Semantic Web Activity[EB/OL],Bulletin of the American Society for Information Science and Technology,April/May 2003.
    [3]R.Baeza-Yates,B.Ribeiro-Neto.Modem Information Retrieval[M].New York:Addison-Wesley-Longman,1999.
    [4]B.Sheth,P.Maes.Evolving Agents for Personalized Information Filtering[C].Proc.Ninth IEEE Conf.Artificial Intelligence for Applications,1993:345-352.
    [5]J.R.Wen,J.Y.Nie,H.J.Zhang.Clustering user queries of a search engine[C].Proceedings of the 10th International World Wide Web Conference(WWW10).New York:ACM Press.2001:162-168.
    [6]蔡柯柯.基于查询特征上下文的检索模型研究.博士学位论文,浙江大学,2007.
    [7]G.Salton,E.A.Fox,H.Wu.Extended Boolean Information Retrieval[J].Communications of the ACM,1983,26(11):1022-1036.
    [8]G.Salton,M.J.McGill,Introduction to Modem Information Retrieval[M].New York:MxGraw-Hill,1983.
    [9]C.J.van Rijsbergen.A New Theoretical Framework for Information Retrieval[C].In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval,1986:194-200.
    [10]J.Ponte,W.B.Croft.A Language Modeling Approach to Information Retrieval[C].In Proceedings of 1998 ACM SIGIR Conference on Research and Development in Information Retrieval,1998:275-281.
    [11]E.A.Fox.Characteristics of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts[R].ACM SIGIR Forum,1983,35(2):12-19.
    [12] C.P. Paice. Soft Evaluation of Boolean Search Queries in Information Retrieval Systems[J]. Information Technology, 1984.3(1):33-42.
    [13] G. Salton, E.A. Fox, H. Wu. Extended Boolean Information Retrieval[J]. Communications of the ACM, 1983.26(11): 1022-036.
    [14] G. Salton, C. Buckley. Term Weighting Approaches in Automatic Text Retrieval[J]. Information Processing and Management, 1998,24(5):513-523.
    [15] J.H. Lee. Combining Multiple Evidence From Different Properties of Weighting Schemes[C]. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1995:180-188.
    [16] A. Lavelli, F. Sebastiani, R. Zanoli. Distributional Term Representations: an Experimental Comparison[C]. In Proceedings of the 13~(th) ACM International Conference on Information and Knowledge Management, 2004:615-624.
    [17] R.R. Korfhage. Information Storage and Retrieval[M]. New York, John Wiley & Sons, Inc. 1997.
    [18] S.K.M. Wong, et al. Generalized Vector Spaces Model in Information Retrieval[C]. In Proceedings of the 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1985:18-25.
    [19] G.W. Furnas, et al. Information Retrieval Using a Singular Value Decomposition Model of Latent Semantic Structure[C]. In Proceedings of the 11~(th) Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1988:465-480.
    [20] S.E. Roberson, K.J. Sparck. Retrieval Weight of Search Terms[J]. Journal of the American Soeiety for Information Scienee, 1976, 27(3): 129-146.
    [21] S.E. Roberson. The Probability Ranking Principle in Information Retrieval[J]. Journal of Documentation, 1977, 33(4): 294-304.
    [22] H.R. Turtle, W.B. Croft. Inference Networks for Document Retrieval[C]. In Proceedings of the 13th Annual International ACM SIGIR Conference on Resarch and Development in Information Retrieval, 1990:1-24.
    [23]J.P.Callan,W.B.Croft,S.M.Harding.The INQUERY retrieval system[C].In Proceedings of the 3th International Conference on Database and Expert Sytems Application,1992:78-83.
    [24]D.Miller,T.Leek,R.Schwartz.A Hidden Markov Model Information Retrieval System.In Proceedings of Annual Informational ACM SIGIR Conference on Research and Development in Information Retrieval,1999:214-221.
    [25]A.Berger,J.Laferty.Information Retrieval as Statistical translation.In Proceedings of Annual Informational ACM SIGIR Conference on Research and Development in Information Retrieval,1999:222-229.
    [26]D.Hiemstra.A Linguistically Motivated Probabilistic Model of Information Retrieval.In Proceedings of the 2th European Conference on Research and Advance Technology for Digital Libraries,1998:569-584.
    [27]V.Lavremko,W.B.Croft.Relevance based Language Models.In Proceedings of the 24~(th) Annual Informational ACM SIGIR Conference on Research and Development in Information Retrieval,2001:120-127.
    [28]F.Song,W.B.Croft.A General Language Model for Information Retrieval.In Proceedings of the 22~(th) Annual Informational ACM SIGIR Conference on Research and Development in Information Retrieval,1999:279-280.
    [29]余传明,基于本体的语义信息系统研究——理论分析与系统实现.博士学位论文,武汉大学,2005.
    [30]梅翔.语义检索中若干关键问题的研究.博士学位论文,北京邮电大学,2007.
    [31]D.Busealdi,P.Rosso,E.s Arnal.A wordnet-based query expansion method for geographical information retrieval[R].In Working Notes for the CLEF Workshop,2005.
    [32]P.M.Kruse,A.Naujoks,D.Roesner,M.Kunze.Clever search:A wordnet based wrapper for internet search engines[C].In Proceedings of the 2~(nd)GermaNet Workshop,2005.
    [33]桑艳艳,刘培刚,李勇.基于语义计算的查询扩展优化研究.情报学报,2007,26(5):704-710.
    [34]田萱,杜小勇,李海华.语义查询扩展中词语-概念相关度的计算.软件学报,2008,19(8):2043-2053.
    [35]张敏,宋睿华,马少平.基于语义关系查询扩展的文档重构方法.计算机学报,2004,27(10):1395-1401.
    [36]R.Guha,R.MeCool,E.Miller.Semantic search[C].Proceedings of the 12~(th)International Conference on World Wide Web,ACM Press,2003:700-709.
    [37]C.Rocha,D.Sehwabe,M.P.de Aragao.A hybrid approach for searching in the semantic web[C].In Proceedings of the 13~(th) international conference on World Wide Web.2004:374-383.
    [38]A.Eija,et al.CIRI-An ontology-based query interface for text retrieval[C].In Proceeding of the 11~(th) Finnish Artificial Intelligence Conference.2004.
    [39]Q.Cheng,et al.Implementation of semantic query optimization techniques in DB2 universal database[C].In Proceedings of VLDB,1999:687-698.
    [40]J.Heflin,J.Hendler.Searching the web with SHOE[R].In Artificial Intelligence for Web Search,Papers from the AAAI Workshop,WS-00-01,AAAI Press,2000:35-40.
    [41]E.Makela,K.Viljanen,P.Lindgren,et al.Semantic yellow Page service discovery:The veturi portal[R].Semantic Computing Research Group(SeCo)Helsinki University of Technology(TKK),Laboratory of Media Technology University of Helsinki,Department of Computer Science,2005.
    [42]N.Guarino,C.Masolo,G.Vetere.OntoSeek:Content Based Access to the Web [M].IEEE Intelligent Systems,1999.
    [43]A.Maedehe,S.Staab,et al.Seal-a framework for developing semantic web portals[C].Proceedings of the 18~(th) British National Conference on Databases.2001:1-22.
    [44]E.Makela,E.HyVonen,T.Sidoroff.View-based user interfaces for information retrieval on the semantic web[R],Helisinki Institute for Information Technology(HIIT),Helsinki University of Technology,Media Technology,and University of Helsinki,2005.
    [45]D.Reynolds,P.Shabajee,S.Cayzer.Semantic Information Portals[C].In Proceedings of the 13th International World Wide Web Conference on Alternate track papers & posters. ACM Press. 2004:290-291.
    [46] E. Makela, E. Hyvonen, S. Saarela, K. Viljanen. OntoViews-A Tool for Creating semantic Web Portals[C]. In Proceedings of the Third Intemation Semantie Web Conference, Springer Verlag, 2004:805-819.
    [47] E. Hyvonen, E. Makela, M. Salminen, et al. MUSEUMFINLAND-finnish museums on the semantic web[C]. Web Semantics: Seienee, Services and Agents on the World Wide Web 3,2005: 224-241.
    [48] E. HyVonen, E. Makela. Semantic autocompletion[R]. Semantic Computing Research Group (SeCo) Helsinki University of Technology (TKK), Laboratory of Media Technology University of Helsinki, Department of Computer Science, 2005.
    [49] D.R. Karger, K. Bakshi, D. Huynh, et al. Haystaek: A general-Purpose information management tool for end users based on semi-struetured data[C]. In Proceedings of the CIDR Conference. 2005:13-26.
    [50] D. Quan, D. Huynh, D.R. Karger. Haystack: A platform for authoring end user semantic web applications[C]. In Proceedings of the Second International Semantie Web Conference. 2003:738-753.
    [51] J. Teevan, C. Alvarado, M.S. Ackerman, D.r. Karger. The Perfect search engine is not enough: a study of orienteering behavior in directed search[C]. In Proceedings of the Conference on Human Factors in Computing Systems, CHI. 2004:415-422.
    [52] N. Athanasis, V. Christophides, D. Kotzinos. Generating on the fly queries for the semantic web: The ics-forth graphical rql interface (GRQL)[C]. In Proceedings of the Third International Semantic Web Conference. 2004:486-501.
    [53] T. Catarci, P. Dongilli, T.D. Mascio, E. Franeoni, et al. An ontology based visual tool for query formulation support[C]. In Proceedings of the 16~(th) Eureopean Conference on Artificial Intelligence, 2004:308-312.
    [54] V.S. Uren, Y. Lei, E. Motta. SemSearch: Refining Semantic Search[C]. In ESWC,2008:874-878.
    [55]L.Zhang,Y.Yu,J.Zhou,et al.An enhanced model for search in semantic portals[C].In Proceedings of the 14~(th) international conference on World Wide Web,New York,NY,USA,ACM Press.2005:453-462.
    [56]余正涛,宋丽哲,樊孝忠.基于本体的个性化领域信息服务.2005,31(5):22-24.
    [57]张宏科,王建超等.扩展UDDI实现语义及个性化查询的方法及系统[P].北京交通大学,2006.
    [58]P.Alexander,G.Susan.Ontology Based Personalized Search[C].In Proceedings of 11~(th) IEEE International Conference on Tools with Artificial Intelligence,1999:391-398.
    [59]L.Kerschberg,K.Wooju,S.Anthony.A personalizable agent for semantic taxonomy-based web search[J].Lecture Notes in Artificial Intelligenee (Subseries of Lecture Notes in Computer Science),2003:3-31.
    [60]G.Susan,C.Jason.Ontology-Based Personalized Search and Browsing[J].Web Intelligence and Agent systems,2003,1(3-4):219-234.
    [61]S.Mireo,G.Susan.Personalized search based on user search histories[C].Proceeding of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence,2005:622-628.
    [62]K.Anyanwu,A.Sheth.P-Queries:enabling querying for semantic associations on the semantic web[C].Proceeding of the WWW 2003,New York,ACM Press,2003:690-699.
    [63]梅翔,孟祥武,陈俊亮,徐萌.一种基于语义关联的查询优化方法.北京邮电大学学报.2006.29(6):107-110.
    [64]A.M.Boanerges,H.W.Christian,S.S.Satya,et al.Template Based Semantic Similarity for Security Applications[C].Proceedings of the IEEE International Conference on Intelligence and Security Informaties(ISI-2005),2005:621-622.
    [65]A.M.Boanerges,H.W.Christian,A.Budak,et al.Ranking Complex Relationships on the Semantic Web[J].Interact Computing,May/June 2005,9(3):37-44.
    [66]王昊奋,俞勇.基于Web的语义搜索[R].上海交通大学APEX数据和知识管理实验室,2008.
    [67]I.Nancy,V.Jean.Introduction to the special issue on word sense disambiguation:The state of the art[J].Computational Linguistics,1998,24(1):1-40.
    [68]M.E.Lesk.Automated Sense Disambiguation Using Machine Readable Dictionaries:How to Tell a Pine Cone from all Ice Cream Cone[C].In Proceedings of the S1GDOC Conference,Association for Computing Machinery,New York,1986:24-26.
    [69]M.Wilks.Stevenson.The grammar of sense:Is word-sense tagging much more than part-of-speech tagging?[R]Technical Report,CS-96-05,University of Sheffield,1996.
    [70]Pook,L.Stuart,C.Jason.Making sense out of searching[EB/OL].In Information Online 88,Sydney.The Information Science Section of the Library Association of Australia.1988:48-157.
    [71]梅家驹,竺一鸣,高蕴琦等.同义词词林.上海:上海辞书出版社,1983.
    [72]Y.David.Decision Lists for Lexical Ambiguity Resolution:Application to Accent Restoration in Spanish and French[C].The 32~(nd) Annual Meeting of Association for Computational Linguistics.Las Cruces,NM:ACL.1994:88-95.
    [73]卢志茂,刘挺,李生.统计词义消歧的研究进展.电子学报,2006,34(2):333-343.
    [74]T.Geoffrey,M.V.Ellen.Disambiguating Highly Ambiguous Words[J].Computational Linguistics,1998,24(1):125-145.
    [75]R.Philip.Selection and Information:A Class-Based Approach to Lexical Relation[R].USA:University of Pennsylvania,1993:23-54.
    [76]陈浩,何婷婷,姬东鸿.基于k-means聚类的无导词义消歧.中文信息学报,2005,19(4):10-16.
    [77]陈浩,何婷婷,姬东鸿.基于MDL聚类的无导词义消歧.小型微型计算机 系统,2005,26(10):1846-1849.
    [78]李涓子,黄昌宁.基于转换的无指导词义标注方法.清华大学学报(自然科学版).1999,39(7):117-121.
    [79]鲁松,白硕,黄雄.基于向量空间模型中义项词语的无导词义消歧.软件学报,2006,38(6):1082-1089.
    [80]S.Banerjee,T.Pedersen.An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet[C].In proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics.2002:136-145.
    [81]Y.Chen,J.Yin.Sense Rank AALesk:A Semantic Solution for Word Sense Disambiguation[C].FSKD 2005,LNAI 3614.2005:710-717.
    [82]E.Agirre,G.Rigau.A proposal for Word Sense Disambiguation using Conceptual Distance[C].1st International Conference on recent Advances in NLP,Bulgaria.1995.
    [83]P.Rosso,F.Masulli,D.Buscaldi,F.Pla,A.Molina.Automatic Noun Disambiguation[C].Lecture Notes in Computer Science,Springer-Verlag,2003:273-276.
    [84]B.Davide,R.Paolo,M.Francesco.Integrating Conceptual Density with WordNet Domains and CALD Glosses for Noun Sense Disambiguation[C].LNAI 3230,2004:183-194.
    [85]M.Rada.Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling[C].Conference on Empirical Methods in Natural Language Processing(EMNLP),2005:411-418.
    [86]S.Ravi,M.Rada.Unsupervised Graph-based Word Sense Disambiguation Using Measures of Word Semantic Similarity[C].Proceedings of the IEEE International Conference on Semantic Computing(ICSC),Irvine,CA,September 2007:363-369.
    [87]A.Eneko,M.David,L.L.Oier,S.Aitor.Two graph-based algorithms for state-of-the-art WSD[C].In Proceedings of the.Conference on Empirical Methods in Natural.Language Processing(EMNLP).2006.
    [88] N. Roberto, L. Mirella. Graph Connectivity Measures for Unsupervised Word Sense Disambiguation[C]. IJCAI. 2007:1683-1688.
    [89] N. Roberto, V. Paola. Structural Semantic Interconnections: A Knowledge-Based Approach to Word Sense Disambiguation[J]. IEEE transactions on pattern analysis and machine intelligence, 2005, 27(7):1075-1086.
    [90] J. Cowie, J. Guthrie, L. Guthrie. Lexical disambiguation using simulated annealing[C]. Proc. of COLING, Nantes, France. 1992:359-365.
    [91] Y. David. Word sense disambiguation using statistical models of Roget's categories train on large corpora[C]. In COLING 14. Nantes, 1992:545-460.
    [92] B. Magnini, C. Strapparava. Experiments in word domain disambiguation for parallel texts[C]. Proc. SIGLEX Workshop on Word Senses and Multi-linguality, Hong-Kong, 2000:27-33.
    [93] P. Vossen. Extending, trimming and fusing WordNet for technical documents[C]. Proceedings of NAACL Workshop WordNet and Other Lexical Resources: Applications, Extensions and Customizations, Pittsburgh, 2001.
    [94] M. Bernardo, S. Carlo, P. Giovanni, G. Alfio. The role of domain information in Word Sense Disambiguation[J]. Natural Language Engineering. 2002, 8 (4):359-373.
    [95] R. Mihalcea. Using Wikipedia for automatic word sense disambiguation[C]. In Human Language Technologies: The Conference of the North American Chapter of the Association for Computational Linguistics, Rochester, New York, 2007.
    [96] H. Schutze. Automatic word sense discrimination[J]. Computational Linguistics, 1998. 24(1): 97-123.
    [97] D. Scott, T.D. Susan, W.F. George, K.L. Thomas, H. Richard. Indexing by Latent Semantic Analysis[J]. Journal of the American Society for Information Science, 1990,41(6):391-407.
    [98] K.L. Thomas, T.D. Susan. A solution to Plato's problem: The Latent Semantic Analysis theory of acquisition, induction and representation of knowledge[J]. Psychological Review, 1997(104): 211-240.
    [99] K.L. Thomas, W.P. Foltz, L. Darrell. An introduction to Latent Semantic Analysis[J]. Discourse Processes, 1998(25): 259-284.
    [100] D.K. Lin, P. Patrick. Concept discovery from text[C]. Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan. 2002:577-583.
    [101] P. Ted, B. Rebecca. Distinguishing word senses in untagged text[C]. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, Providence, U.S.A. 1997:197-207.
    [102] P. Ted, B. Rebecca. Knowledge lean word sense disambiguation[C]. Proceedings of the 15th National Conference on Artificial Intelligence, Madison, U.S.A. 1998: 800-805.
    [103] P. Amruta, P. Ted. Word sense discrimination by clustering contexts in vector and similarity spaces[C]. Proceedings of the Conference on Computational Natural Language Learning, Boston, U.S.A., 2004:41-48.
    [104] D. Ido, I. Alon, M. Shaul. Two Languages Are More Informative Than OneEB/OL. The 29th Annual Meeting of Association for Computational Linguistics, Berkeley, CA: ACL, 1991:130-137.
    [105] D. Ido, I. Alon. Word sense disambiguation using a second language monolingual corpus[J]. Computational Linguistics. 1994,20(4):563-596.
    [106] R. Philip, Y. David. A perspective on word sense disambiguation methods and their evaluation[C]. In Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What, and How. Washington. 1997:79-86.
    [107] G. Escudero, L. Marquez, G. Rigau. Boosting applied to word sense disambiguation[C]. In Proceedings of the 12th European Conference on Machine Learning. Barcelona. 2000:129-141.
    [108] N. Ide, T. Ejavec, D. Tufis. Sense discrimination with parallel corpora[C]. In Proceedings of the ACL SIGLEX Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, PA, 2002:54-60.
    [109] C. Li, H. Li. Word translation disambiguation using bilingual bootstrapping[C]. In Proceedings of the 40thAnnual Meting of the Association for Computational Linguistics. Philadelphia, PA. 2002:343-351.
    [110] H.N. Tou, B. Wang, Y.S. Chan. Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study[C]. In Proceedings of the 41th Annual Meeting of the Associlation for Computational Linguistics. Sapporo, Japan. 2003:455-462.
    [111] D. Mona, R. Philip. An unsupervised method for word sense tagging using parallel corpora[C]. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. Philadelphia, PA.2002:255-262.
    [112] D. Mona. Word Sense Disambiguation Within a Multilingual Framework[D]. University of Maryland College Park, USA. 2003.
    [113] D. Mona. An unsupervised approach for bootstrapping arabic word sense tagging[C]. Proceedings of Arabic Based Script Languages, COLING 2004. Geneva, Switzerland. 2004.
    [114] P.I. Klapaftis, M. Suresh. Google & WordNet based Word Sense Disambiguation[C]. Proceedings of the 22nd ICML Workshop on Learning & Extending Ontologies. Bonn, Germany. 2005.
    [115] C.Y. YANG. Word sense disambiguation using semantic relatedness measurement[J]. Journal of Zhejiang University SCIENCE A. 2006.7(10): 1609-1625.
    [116] G.W. Fumas, T.K. Landauer, L.M. Gomez, S.T. Dumais. The vocabulary problem in human-system communication[J]. Communication of ACM, 1987,30(11):964-971.
    [117] M. Rada, T. Paul, F. Elizabeth. PageRank on Semantic Networks with Application to Word Sense Disambiguation[C]. Proceedings of the 20th international conference on Computational Linguistics, 2004:1126-1132.
    [118] M.E. Maron, J.L. Kuhns. On Relevance, Probabilistic Indexing and Information Retrieval[J]. ACM, 1960, 7(3): 216-244.
    [119] J. Rocchio. Relevance Feedback in Information Retrieval[R]. In Salton: The SMART Retrieval System: Experiments in Automatic Document processing, Chapter 14,1971.
    [120] S. Patwardhan, S. Banerjee, T. Pedersen. SenseRelate::TargetWord-A generalized framework for word sense disambiguation[C]. In Proc. of AAAI-05, 2005.
    [121] L. Finkelstein, E. Gabrilovich, Y. Matias, et al. Placing search in context: The concept revisited[J]. ACM Transactions on Information Systems, 2002.20(1): 116-131.
    [122] S.N. Kim, T. Baldwin. Automatic interpretation of noun compounds using WordNet similarity[C]. In Proc. of IJCNLP-05. 2005:945-956.
    [123] G. Hirst, S.O. David. Lexical chains as representations of context for the detection and correction of malapropisms[DB/OL]. Christiane Fellbaum (editor). 1998.
    [124] M. Saif, H. Graeme. Distributional measures of concept-distance: A task-oriented evaluation[C]. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2006:35-43.
    [125] L. Lee. Measures of Distributional Similarity[C]. In Proceedings of the 37th conference on Association for Computational Linguistics. 1999:25-32.
    [126] J.E. Weeds. Measures and Applications of Lexical Distributional Similarity[D]. Ph.D. thesis, Department of Informatics, University of Sussex, Brighton, UK. 2003.
    [127] K. Church, P. Hanks. Word Association Norms, Mutual Information and Lexicography[J]. Computational Linguistics, 1989.16(1): 22-29.
    [128] P. Pantel, D. Lin. Discovering word senses from text[C]. In Proceedings of the 8~(th) Association of Computing Machinery SIGKDD International Conference On Knowledge Discovery and Data Mining. Edmonton, Canada, 2002:613-619.
    [129] H. Kozima, A. Kozima, H. Teiji. Similarity between Words Computed by Spreading Activation on an English Dictionary[DB/OL]. 1993.
    [130] K. Hideki. Text Segmentation Based on Similarity between Words[C]. Proceedings of the 31st annual meeting on Association for Computational Linguistics. 1993:286-288.
    [131] J. Morris, G. Hirst. Lexical cohesion computed by thesaural relations as an indicator of the structure of text[J]. Computational Linguistics. 1991,17(1):21-48.
    [132] M. Jarmasz, S. Szpakowicz. Roget's Thesaurus and Semantic Similarity[C]. Proceedings of Conference on Recent Advances in Natural Language Processing (RANLP 2003). September 2003:212-219.
    [133] B. Alexander, H. Graeme. Evaluating WordNet-based Measures of Lexical Semantic Relatedness[J]. Computational Linguistics.2006, 32(1): 13-47.
    [134] M. Sussna. Word sense disambiguation for free-text indexing using a massive semantic network[C]. In Proceedings of the Second International Conference on Information and Knowledge Management (CIKM), Arlington, Virginia. 1993:67-74.
    [135] Z.B. Wu, P. Marha. Verb semantics and lexical selection [C]. Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, New Mexico: ACM, 1994:133-138.
    [136] C. Leacock, C. Martin. Combining local context and WordNet similarity for word sense identification[M]. The MIT Press, London, 1998:265-283.
    [137] P. Resnik. Using information content to evaluate semantic similarity[C]. In Proceedings of the 14th International Joint Conference on Artificial Intelligence. Canada: IEEE, 1995:448-453.
    [138] J.J. Jiang, W. David. Conrath. Semantic similarity based on corpus statistics and lexical taxonomy[C]. In Proceedings of International Conference on Research in Computational Linguistics. Manchester: IEEE, 1997: 19-33.
    [139] D.K. Lin. An information-theoretic definition of similarity[C]. In Proceedings of the 15th International Conference on Machine Learning, Madison, Wisconsin USA: ACM, 1998:296-304.
    [140] M. Strube, S.P. Ponzetto. WikiRelate! Computing Semantic Relatedness Using Wikipedia[C]. In Proceedings of AAAI, Boston: IEEE, 2006:1419-1424.
    [141] E. Gabrilovich, S. Markovitch. Computing semantic relatedness of words and texts in Wikipedia-derived semantic space[R]. Computer Science Department, 2006.
    [142]M.David.Computing Semantic Relatedness using Wikipedia Link Structure[C].In Proceedings of the New Zealand Computer Science Research Student Conference.Hamilton,New Zealand:University of Waikato,2007.
    [143]Y.Wang,H.F.WANG,et al.Exploit Semantic Information for Category Annotation Recommendation in Wikipedia[J].Natural Language Processing and Information Systems,2007:48-60.
    [144]G.A.Miller,W.G.Charles.Contextual correlates of semantic similarity[J].Language and Cognitive Processes,1991,6(1):1-28.
    [145]H.Rubenstein,J.B.Goodenough.Contextual Correlates of Synonymy[J].Communications of the ACM,1965,8(10):627-633.
    [146]G.Jorge,M.Eduardo.Web-Based Measure of Semantic Relatedness[C].WISE 2008,LNCS 5175.2008:136-150.
    [147]L.Finkelstein,G.Evgeniy,et al.Placing Search in Context:The Concept Revisited[J].ACM Transactions on Information Systems,2002,20(1):116-131.
    [148]江娟,郑玲,海涛.搜索引擎的结果相关性排序的度量方法.2007年研究综述与技术论坛专刊.
    [149]B.Y.Ricardo,R.N.Berthier.Modem Information Retrieval[M].Addison-Wesley,1999.
    [150]N.Belkin,B.Croft.Information Filtering and Information Retrieval[J].Comm.ACM,1992,35(12):29-37.
    [151]M.Balabanovic,Y.Shoham.Fab:Content-Based,Collaborative Recommendation[J].Comm.ACM,1997,40(3):66-72.
    [152]M.Pazzani,D.Billsus.Learning and Revising User Profiles:The Identification of Interesting Web Sites[J].Machine Learning,1997,27(1):313-331.
    [153]D.S.W.Ngu,X.Wu.SiteHelper:A Localized Agent That Helps Incremental Exploration of the World Wide Web[J].Computer Networks,1997,29(8-13):1249-1255.
    [154]曾春,邢春晓,周立柱.基于内容过滤的个性化搜索算法.软件学报,2003,14(5):999-1004.
    [155]于洪涛,段军义.基于分类和聚类相结合的个性化检索方法研究.燕山大学学报,2007,31(6):489-492.
    [156]M.Baglioni,U.Ferrara,A.Romei,S.Ruggieri,F.Turini.Preprocessing and Mining Web Log Data for Web Personalization[J].In:AI~*IA2003,2003:231-249.
    [157]F.Joseph,K.W.Hing,S.F.Anthony.Online Analytic Mining for Web Access Patterns[J].Advanced Topics in Database Research,2004(3):294-326.
    [158]O.Nasraoui,H.Frigui,R.Krishnapuram,A.Joshi.Extracting Web user Profiles Using Relational Competitive Fuzzy Clustering[J].International Journal on Artificial Intelligence Tools,2000,9(4):509-526.
    [159]T.Joachims,D.Freitag,T.Mitohell.Webwatcher:A Tour Guide for the World Web[C].Proceedings of 15~(th) International Conference on Artificial Intelligence,Nagoya,Japan,1997:770-775.
    [160]T.Nakano,K.Harumoto,S.Shimojo,S.Nishio.User Adaptive Content Delivery Mechanism on the World Wide Web[C].In the 2002 ACM Sumposium on Applied Computing(SAC),Madrid,Spain,ACM,2002:1140-1146.
    [161]J.A.Konstan,B.N.Miller,D.Maltz,J.L.Herlocker,L.R.Gordon,J.Riedl.GroupLens:Applying Collaborative Filtering to Usenet News[J].Comm.ACM,1997,40(3):77-87.
    [162]D.Goldberg,D.Nichols,B.M.Oki,D.Terry.Using Collaborative Filtering to Weave an Information Tapestry[J].Comm.ACM,1992,35(12):61-70.
    [163]W.Hill,L.Stead,M.Rosenstein,G.Furnas.Recommending and Evaluating Choices in a Virtual Community of Use[C].Proc.Conf.Human Factors in Computing Systems,1995:194-201.
    [164]U.Shardanand,P.Maes.Social Information Filtering:Algorithms for Automating "Word of Mouth"[C].Proc.Conf.Human Factors in Computing Systems,1995:210-217.
    [165]L.Terveen,W.Hill,B.Amento,D.McDonald,J.Creter.PHOAKS:A System for Sharing Recommendations[J].Comm.ACM,1997,40(3):59-62.
    [166] K. Goldberg, T. Roeder, D. Gupta, C. Perkins. Eigentaste: A Constant Time Collaborative Filtering Algorithm[J]. Information Retrieval Journal, 2001, 4(2): 133-151.
    [167] M. Claypool, A. Gokhale, T. Miranda, P. Murnikov, D. Netes, M. Sartin. Combining Content-Based and Collaborative Filters in an Online Newspaper[C]. Proceedings of ACM SIGIR Workshop Recommender Systems: Algorithms and Evaluation, Aug. 1999.
    [168] M. Pazzani. A Framework for Collaborative, Content-Based, and Demographic Filtering[J]. Artificial Intelligence Rev., 1999: 393-408.
    [169] M. Balabanovic, Y. Shoham. Fab: Content-Based, Collaborative Recommendation[J]. Comm. ACM, 1997,40(3):66-72.
    [170] P. Melville, R.J. Mooney, R. Nagarajan. Content-Boosted Collaborative Filtering for Improved Recommendations[C]. Proc. 18th Nat'l Conf. Artificial Intelligence, 2002:187-192.
    [171] M.S. Ian, K.N. Charles. Combining Content and Collaboration in Text Filtering[C]. Proceedings of International Joint Conference, 1999.
    [172] C. Basu, H. Hirsh, W. Cohen. Recommendation as Classification: Using Social and Content-Based Information in Recommendation[C]. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, 1998:714-720.
    [173] P. Rin, H.U. Lyle, M.P. David, L. Steve. Probabilistic Models for Unified Collaborative and Content-Based Recommendation in Sparse-Data Environments[C]. Proceedings of 17th Conference Uncertainty in Artificial Intelligence, 2001:437-444.
    [174] Y.Y. Shih, D.R. Liu. Hybrid recommendation approaches: collaborative filtering via valuable content information[C]. Proceedings of the 38 Haiwaii International Conference on System Science, 2005:217-224.
    [175] S.S. Weng, M.J. Liu. Feature-based recommendations for one-to-one marketing[J]. Expert Systems with Applications. 2004(26):493-508.
    [176] Q. Li, H.M. Sung, M.K. Byeong. A probabilistic music recommender considering user opinions and audio features[J]. Information Processing and Management,2007,43(2):473-487.
    [177]H.C.Yoon,K.K.Jae.Application of Web usage mining and product taxonomy to collaborative recommendations in e-commerce[J].Expert Systems with Applications.2004,26(2):233-246.
    [178]S.Janusz.Implementations of web-based Recommender Systems Using Hybrid Methods[J].International Journal of Computer Science & Applications,2006,3(3):52-64.
    [179]Y.F.Li,N.Zhong.Capturing evolving patterns for ontology-based web mining[C].Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence,2004:256-263.
    [180]J.C.Bezdek.Pattern Recognition with Fuzzy Objective Function Algorithms[M].New York:Plenum Press,1981.
    [181]A.John,Campbell,Roberto Torres.Using Item Descriptors in Recommender Systems[R].Eliseo Reategui,American Association for Artificial Intelligence,2002.
    [182]秦国,杜小勇.基于用户层次信息的协同推荐算法.计算机科学,2004,31(101):138-140.
    [183]曾春,邢春晓,周立柱.个性化服务技术综述.软件学报,2002,13(10):1952-1961.
    [184]E.Volokh.Personalization and Privacy[J].Communications of the ACM,2000,43(8):84-88.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700