用户名: 密码: 验证码:
上下文感知的Web搜索关键技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着Internet的爆炸性增长,WWW已经发展成为包含多种信息资源、站点遍布全球的巨大动态信息服务网络,为用户提供了一个极具价值的信息源,实现了全世界人们信息共享的愿望。但是,也正是由于海量信息所造成的“信息过载”,刺激了对高效的Web信息检索技术的需求。2002年9月在美国麻省理工学院智能信息检索研究中心(CⅡR)召开的未来信息检索挑战的国际会议上,上下文检索(Contextual Retrieval)被一致认为是信息检索的长期挑战。2004年7月和2005年7月又先后两次召开了在上下文中的信息检索(IRⅰX)的国际会议。
     在信息检索活动中,无论是信息需求的用户,还是用户所需的信息,都是处于各自的上下文中。一方面,用户处于Task Context、User Context、QueryContext等上下文之中;另一方面,Web信息则处于Author Context、Link Context、Structural Context、Path Context等上下文之中。为了能向用户提供高质量的信息,信息检索模型必须将两方面的上下文有机地结合起来,建立上下文感知(Context-Aware)的信息检索模型。
     根据信息检索领域的战略目标以及Web search的现状,本文对上下文检索展开了深入的研究,提出了可以解决用户的信息查询和相似页面搜索的上下文感知的检索模型,并基于该模型主要完成了以下工作:
     1)感知或获取用户的查询意图或主题:将用户的查询基于上下文和参考本体获取一个参考本体中的局部子树,该子树反映了用户查询的真实意图或主题。本文给出了获取该子树的一系列相关算法。
     2)对主题子树的扩展:基于1)中获得的主题子树,将叶子节点分别基于参考本体中的ISA关系和非ISA关系进行扩展,从而得到一个以用户的查询词为中心的概念图,称为用户的个性化概念图。以个性化概念图中的关键词为特征项来表示Web页面,即Web页面的信息内容限制在该个性化概念图所张的信息子空间中,而个性化概念图中概念之间的度量关系将成为页面链接权重的度量依据。本文给出了这种个性化度量的一系列相关算法。
     3)感知Web页面作者的语义信息:Web页面作者是需求信息的诸多上下文之一,页面作者构成的社群网络的主题与页面构成的超链网络的主题具有很强的相关性甚至是同一主题,因此有必要对这个网络进行研究。本文引入“简单文档”的概念,简单文档通过一阶近邻构成平面式的“复合文档”,复合文档构成立体式的数据集,对数据集建立张量模型,通过张量分解,研究社群网络中成员之间的语义相似度。本文给出了这种相似度的一系列相关算法。
     4)感知Web页面之间的链接结构上下文:页面通过页面之间的超链接构成复杂的链接网络,从而构成需求信息的链接结构上下文。将1)和2)获得的用户的个性化概念图的拓扑结构应用于链接结构上下文:一方面,以个性化概念图中的概念(关键词)作为特征项将页面表达为向量,特征项的权重类似于TF-IDF的CF-IDF计算;另一方面,链接赋予权重,权重计算的依据是用户概念图中概念之间的个性化语义相似度。通过邻接权重矩阵计算页面的权威度量,从而按照权威度量对页面排序。本文给出了这种排序的一系列相关算法。
     显然这种排序随着个性化概念图的变化而变化,有效地克服了“作者欺骗”、“主题漂移”和“千人一面”的问题。
     5)感知Web页面之间的链接锚文本对链入页面的主题或语义指示:在4)的带权链接矩阵的基础上,增加链接锚文本作为第三轴或模式,从而建立了数据的张量模型。由于张量在数学理论及算法上还不成熟,本文将张量模型发展为三个矩阵表示的个性化模型,从而有效地利用了在数学理论上十分成熟的矩阵理论及其算法。
     本文的研究内容基于作者所参与的上海市科学技术委员会科技攻关项目(GrantNo.055115001)《面向语音服务的志愿者信息推送服务平台》的研究,该项目以2010年上海世博会为应用场景,实现了世博MIA系统。本文提出的算法在系统中得到了验证,结果都显示出它们能有效的解决相关问题,并具有较高的性能。因此,本文的研究成果对于提高网络搜索的准确性具有较大的实用价值。
With the explosive growth of Internet, WWW has been developed into a dynamic information service network which has many kinds of information resources, many worldwide websites, and provides users with an extremely valuable source of information. The aspirations of information sharing have become true. However, the "information overloaded" caused by the vast amounts of information stimulates efficient Web information retrieval technology. In September 2002, an international conference about information retrieval challenges of the future was held in CIIR in the Massachusetts Institute and contextual retrieval were identified as particularly important long-term challenges of information retrieval. Since 2004, every two years an international conference about information retrieval in the context have been held.
     In the information retrieval activities, users and the information of users' need are all in their own contexts. On the one hand, users are at the Task Context, User Context, Query Context and so on. On the other hand, the information need of users is in the Author Context, Link Context, Structural Context, Path Context and so on. In order to be able to provide users with high-quality information, information retrieval model must combine the context of two sides into a single framework, and form the context-aware information retrieval model.
     According to the strategic objectives of information retrieval and the status of Web search, we launched an in-depth study on Contextual Retrieval. A context-aware retrieval model was put forward, in order to solve the user's query and similar pages search. The main characteristics of this model are:
     Firstly, the model can be aware of the user's query intent or theme: A local sub-tree from a reference ontology can be obtained based on combining user's query and context. The sub-tree of a user's query reflects the real intention or theme. In this paper, a series of algorithms are put forward to obtain this sub-tree.
     Secondly, on the theme of the expansion of the tree Based on the trees proposed in 1), the leaves were based on the reference nodes in the body of the ISA and non-ISA expansion of relations. thus, get a user's query as the center of the concept map, called the user's personalized concept map.
     The Web pages were represent as vectors in term of key words of the personalized concept map, i.e. the content of pages is restricted in the concept of the information sub-space of personalized concept map. The measurement between concepts of personalized concept map will weight link measurement between pages. In this paper, a series of algorithms of measurement are put forward.
     Thirdly, the model can be aware of semantic information from author of pages: The authors of pages are the context of the information requirements. The topic of the authors' network and the topic of the link network are similar or the same. It is necessary to research about the authors' network. In this paper, "simple document" concept is introduced . "Compound document" are comprised of simple documents. Data sets are constitutes of compound document and model as a tensor. Through decomposition tensor, the semantic similarity between the members is defined and its algorithm is put forward.
     Fourthly, the model can be aware of the link structure context of the information requirement: The link network is comprised of the pages through link between pages and become the link structure context of the information requirement. The topology of the previous user's concept map based on 1) and 2) is applied to the context: on the one hand, it takes the concepts (keyword) of the user's concept map as the term and denote each page as a vector, and calculate the term's weight as CF-IDF like TF-IDF; On the other hand, it assigns a weight to the link. Weight calculation is based on the personalized semantic similarity. Through the adjacent weight matrix we can calculate the authority scores of pages and sort the pages in accordance with the authority scores. A series of algorithms are proposed in the paper.
     Obviously, the sequence of page which changed with user's concept map effectively overcome "Spamming", "Topic drift" and "One size fit all".
     Fifthly, the model can be aware of semantic information of anchor text of links: Add anchor text as the third axis or mode on the basis of the weight adjacent matrix of 4), so as to establish tensor model of the data. However, the mathematic theory of the tensor is unmature. The tensor model will be transformed into three-matrix model to avoid the tensor tool.
     This study content based on the Shanghai Science and Technology Committee on Science and Technology research project (Grant No. 055115001) "for voice services of volunteers push information service platform," participated by author. The Expo MIA system is realized about 2010 Shanghai World Expo based on this research project. The proposed algorithm in the system have been verified, the results show that they can effectively solve related problems and have high performance. Therefore, this paper's research results for improving the accuracy of Web search have great practical value.
引文
[Adini,02],Adini,Y.,Sagi,D.,Tsodyks,M.,Context-Enabled Learning in the Human Visual System,Nature,Feb 2002,415(2):790-793
    [Allan,02],Allan,J.,Croft,W.B.et al,Challenges in Information Retrieval and Language Modeling,Report of a Workshop held at the Center for Intelligent Information Retrieval,University of Massachusetts Amherst,September 2002
    [Andersen,03],Andersen,C.M.,Bro,R.,Practical aspects of PARAFAC modelling of fluorescence excitation emission data.J.Chemometrics,17,200 - 215.2003
    [Arasu,01],Arasu,A.,Cho,J.,Garcia-Molina,H.,Paepcke,A.,Raghavan,S.,Searching the Web.ACM Transactions on Internet Technology(TOIT),1(1):2 - 43,August 2001
    [Attardi,99],Attardi,G.,Gull,A.,Sebastiani,F.,Automatic Web Page Categorization by Link and Context Analysis.In:Proceedings of THAI-99,1st European Symposium on Telematics,Hypermedia and Artificial Intelligence,1999,105-119
    [Baeza,99],Baeza-Yates,R.,RIBEIRO-Neto,B.,Modern Information Retrieval.Addison-Wesley-Longman (1999)
    [Baldauf,07],Matthias Baldauf,A survey on context-aware systems,Int.J.Ad Hoc and Ubiquitous Computing,Vol.2,No.4,2007
    [Banerjee,03],S.Banerjee,T.Pedersen,Extended gloss overlaps as a measure of semantic relatedness.In:Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence(IJCAI-03),pages 805-810,Acapulco,Mexico,2003.
    [Bauer,02],Bauer,T.,Leake,D.B.,Exploiting Information Access Patterns for Context-Based Retrieval.In:Proceedings of 2002 International Conference on Intelligent User Interfaces,IUI'02,San Francisco,California,USA,Jan 2002,176-177
    [Belkin,92],Belkin,N.J.,Croft,W.B.,Information Filtering and Information Retrieval:two Sides of the Same Coin[J],Communications of ACM,1992,35(12):29-38
    [Berger,99],Berger,A.,Lafferty,J.,Information Retrieval as Statistical Translation.In Proceedings of SIGIR'99(Berkeley,CA,USA,Aug.15-19,1999).ACM Press,New York,NY,1999,222-229.
    [Berry,99],Berry,M.W.,Drmac,Z.,Jessup,E.R.,Matrices,Vector Spaces,and Information Retrieval.SIAM Review,41(2)(1999)335-362.
    [Bharat,98],Bharat,K.,Henzinger,M.,Improved algorithms for topic distillation in a hyperlinked environment.In:Voorhees E,Kirsch S,eds.Proceedings of the 21st ACMSIGIR International Conference on Research and Development in Information Retrieval.Melbourne:ACM Press,1998.104-111.
    [Billhardt,02],Billhardt,H.,Borrajo,D.,Maojo,V.,A Context Vector Model for Information Retrieval.Journal of the American Society for Information Science and Technology,2002,53(3):236-249
    [Borgman,02],Borgman,C.L.,Jonathan Furner,Scholarly Communication and Bibliometrics, Annual Review of Information Science and Technology:Vol.36,ed.B.Cronin.,2002
    [Brézillon,99],Brézillon,P.,Context in problem solving:A survey,The Knowledge Engineering Review,1999,14(1):1-34.
    [Brin,98],Brin,S.,Page,L.,The anatomy of a large-scale hypertextual Web search engine.In:Thistlewaite P,et al.,eds.Proceedings of the 7th ACM-WWW International Conference.Brisbane:ACM Press,1998.107-117.
    [Broder,00],Broder,A.,Kumar,R.et al,Graph Structure in the Web Experiments and Models[C],Proc.Of the 9th www Conference,Amsterdam,2000 309-320
    [Budzik,99],J.Budzik,K.Hammond,Watson:Anticipating and Contextualizing Information Needs,Proc.62nd Ann.Meeting Am.Soc.Information Science,1999.
    [Bystrom,05],Bystrom,K.,& Hansen,P.,Conceptual framework for tasks in information studies,Journal of the American Society for Information Science and Technology,Volume 56,Issue 10,Pages 1050 - 1061,2005.
    [Bystrom,95],Katriina Bystrom,Kalervo Jarvelin,TASK COMPLEXITY AFFECTS INFORMATION SEEKING AND USE,Information Processing and Management,Volume 31,Number 2,March 1995,pp.191-213(23)
    [Cai,07],Keke Cai,Chun Chen,Jiajun Bu,Peng Huang,Zhiming Kang,Exploration of Query Context for Information Retrieval,WWW 2007,May 8-12,2007,Banff,Alberta,Canada.
    [Cambazoglu,07],B.Barla Cambazoglu,Evren Karaca,Tayfun Kucukyilmaz,Ata Turk,Cevdet Aykanat,Architecture of a grid-enabled Web search engine,Information Processing and Management 43(2007) 609-623
    [Case,98],Case,J.,Jain,S.,Ott,M.,Sharma,A.,Stephan,F.,Robust learning aided by Context.In:Proceedings of Eleventh Annual Conference on Computational Learning Theory(COLT),ACM Press,New York,1998.44-55
    [Chakrabarti,01],Chakrabarti,S.,Integrating the document object model with hyperlinks for Enhanced topic distillation and information extraction.In:Vincent Y S,et al.eds.Proceedings of the 10th A CM-WWW International Conference.Hong Kong:ACM Press,2001.211-220.
    [Chakrabarti,97],Chakrabarti S,Dom B,Indyk P.,Enhanced hypertext classification using hyperlinks.In:Laura H,ed.Proceedings of the ACM SIGMOD International Conference on Management of Data.Washington:ACM Press,1998.307-318.
    [Chakrabarti,98],Chakrabarti,S.,Dom,B.,Gibson,D.,Kleinberg,J.,Raghavan,P.,Rajagopalan,S.,Automatic resource compilation by analyzing hyperlink structure and associated text.In:Thistlewaite P,et al.eds.Proceedings of the 7th ACM-WWW International Conference.Brisbane:ACMPress,1998.65-74.
    [Chen,00],Guanling Chen,David Kotz,A Survey of Context-Aware Mobile Computing Research,Dartmouth Computer Science Technical Report TR2000-381,2000,(6):68-7
    [Chert,02],Chen K-J,You J-M.,A Study on Word Similarity using Context Vector Models.Computational Linguistics and Chinese Language Processing,2002,7(2):37-58
    [Chen,98],L.Chen,K.Sycara,WebMate:A Personal Agent for Browsing and Searching,Proc.Second Int'l Conf Autonomous Agent:;and MultiAgent Systems,pp.132-139,1998.
    [Chen,99],Chen,C.,Visualising semantic spaces and author co-citation networks in digital libraries.Information Processing and Management,35(3),401 - 420,1999.
    [Claypool,01],Claypool,M.,Le,P.,Waseda,M.,et al,Implicit interest indicators.In:Campbell,M.,ed.Proceedings of the ACM Intelligent User Interfaces Conference(IUI).New York:ACM Press,2001.14-17.
    [Cleveland,76],Cleveland,D.B.,An n-dimensional retrieval method.Journal of the American Society for Information Science,27(5),1976,342 - 347.
    [CNNIC,08],CNNIC,中国互联网络发展状况统计报告,2008.1,http://www.cnnic.cn/index/0E/00/11/index.htm
    [Craswell,01],Nick Craswell,David Hawking,Stephen E.Robertson,Effective Site Finding Using Link Anchor Information.SIGIR 2001:250-257
    [Croft,93],W.B.Croft,Knowledge-Based and Statistical Approaches to Text Retrieval,IEEE Expert:Intelligent Systems and Their Applications,Volume 8,Issue 2(April 1993),Pages:8-12
    [Croft,95],W.B.Croft,What Do People Want from Information Retrieval?http://www.dlib.org/dlib/november95/11croft.html,1995
    [Davison,00],Davison,B.D.,Topical Locality in the Web[C],Proceeding of the 23rd Annual International Conference on Research and Development in Information Retrieval(SI-GIR 2000),Athens,Greece,2000 272-279.
    [Deerwester,90],Deerwester,S.,Dumais,S.T.,Furnas,G.W.,Landauer,T.K.,Harshman,R.,Indexing by Latent Semantic Analysis.Journal of the American Society for Information Science,41(6)(1990)391-407.
    [Dey,00],Dey,A.K.,Abowd,G.D.,Towards a better understanding of context and context-awareness,Proceedings of the Workshop on the What,Who,Where,When and How of Context-Awareness,ACM Press,New York.2000
    [Diligenti,00],Diligenti,M.,Coetzee,F.,Lawrence,S.,Giles,C.L.,Gori,M.,Focused Crawling using Context Graphs,Proceedings of 26th International Conference on Very Large Databases,VLDB 2000,Cairo,Egypt,pp.527-534,2000
    [Dunlavy,06],Daniel M.Dunlavy,Tamara G.Kolda,W.Philip Kegelmeyer,Multilinear algebra for analyzing data with multiple linkages,SANDIA REPORT,SAND2006-2079,Unclassified Unlimited Release,Printed April 2006
    [Liu,04],Fang Liu,Clement Yu,Senior Member,IEEE,Weiyi Merg,Personalized Web Search for Improving Retrieval Effectiveness,IEEE Transactions on Knowledge and Data Engineering,Vol.16.No.1,2004.
    [Finkesltein,02],Finkelstein,L.,Gabrilovich E.,Matias Y.,Rivtin E.,Solan Z.,Wolfman G.,Ruppin,E.,Placing search in context:the concept revisited.ACM Transactions on Information Systems,Jan 2002,20(1):116-131
    [Fischer,91],Fischer,G.Stevens,C.,Information Access in complex,poorly structured information spaces.In Human Factors in ComPuting Systems.CHI'91 Conference Proceedings,pp63-70,ACM,April 1991
    [Foltz,92],Foltz,P.W.,Dumais,S.T.,Personalized Information Delivery:An Analysis of Information Filtering Methods,Comminucations of the ACM,35(12),51-60,1992
    [Furnas,87],Furnas G.W.,The vocabulary problem in human-system communication[J].Comm. ACM,1987,30(11):964-971
    [Ganesan,03],P.Ganesan,H.Garcia-Molina,J.Widom.,Exploiting hierarchical domain structure to compute similarity.ACM Trans.Inf Syst.,21(1):64-93,2003.
    [Garfield,97],Eugene Garfield,A Tribute To Calvin N.Mooers,A Pioneer Of Information Retrieval,The Scientist,Vol:11,#6,p.9,March 17,1997
    [Gauch,03],Gauch,S.,Chaffee,J.,Pretschner,A.,Ontology-Based Personalized Search and Browsing,Web Intelligence and Agent System,Vol.1(3-4),pp.219-234,2003
    [Glover,01],E.J.Glover,G.W.Flake,S.Lawrence,W.P.Birmingham,A.Kruger,C.L.Giles,,D.M.Pennock,Improving Category Specific Web Search by Learning Query Modifications,SAINT,pp.23-34,2001.
    [Good,99],Good,N.,Schafer,J.,Konstan.J.,Borchers,A.,Sarwar,B.,Herlocker,J.,Riedl,J.,Combining Collaborative Filtering with Personal Agents for Better Recommendations.In Proceedings of AAAI'99.Menlo Park:AAAI Press,1999,439-446[Gudivada,97],Gudivada,V.N.,Ragluavan,V.V.,Information Retrieval on the World Wide Web[J].IEEE Internet Cumputing,1997;1(5):58-68
    [Harman,93],Donna Harman,Overview of the First Text Retrieval Conference (TREC-1),http://trec.nist.gov/pubs/trecl/papers/01.txt,1993
    [Hau,05],Jeffrey Hau,William Lee,John Darlington,A Semantic Similarity Measure for Semantic Web Services,WWW2005,May 10-14,2005,Chiba,Japan.[Havetiwala,03],Haveliwala,T.H.,Topic-Sensitive PageRank:A Context-Sensitive Ranking Algorithm for Web Search.(C) 2003 IEEE
    [Herlocker,99],Herlocker,J.L.,Joseph A.Konstan,Al Borchers,John Riedl,An algorithmic framework for performing collaborative filtering.Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval,U.S.1999:230-237.
    [Huang,00],Huang,L.,A Survey On Web Information Retrieval Technologies(2000),http://www.csee.umbc.edu/cadip/readings/IR.report.120600.book.pdf
    [Huang,04],Huang,Z.,CHEN,H.,ZENG,D.,Applying Associative Retrieval Techniques to Alleviate the Sparsity Problem in Collaborative Filtering,ACM,VOL22,No.1,January 2004:116-142
    [Ingwersen,04],R Ingwersen,N.Belkin.,Information retrieval in context-IRiX.SIGIR Forum,38(2),2004.
    [Ingwersen,05],P.Ingwersen,Selected variables for IR interaction in context:introduction to IRiX,SIGIR 2005 Workshop,
    [Jarvelin,04],Kalervo Jarvelin,Peter Ingwersen,Information seeking research needs extension towards tasks and technology,Information Research,Vol.10 No.1,October 2004,http://informationr.net/ir/10-1/paper212.html
    [Jose,05],Joemon Jose,C J van Rijsbergen,Workshop on Information Retrieval in Context:Report,Proceedings of the ACM SIGIR 2005 Workshop on Information Retrieval in Context (IRiX)
    [Kelly,03],D.Kelly,J.Teevan.,Implicit feedback for inferring user preference:A bibliography. SIGIR Forum,2003.
    [Kilmer,04],Misha Elena Kilmer,Carla D.Moravitz Martin,Decomposing a Tensor,SIAM News,Volume 37,Number 9,November 2004
    [Kim,02],Kim,W.,Kerschberg,L.,Scime,A.,Learning for automatic personalization in a semantic based meta-search agent.Electronic Commerce Research and Applications,2002,1:150-173
    [Kleinberg,97],Kleinberg,J.,Authoritative sources in a hyperlinked environment.In:Tarjan RE,Baecker T,,eds.Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms.New Orleans:ACM Press,1997.668-677.
    [Kolda,06],Kolda,T.,Bader,B.,The TOPHITS Model for Higher-Order Web Link Analysis,In:Proc.Workshop on Link Analysis,Counterterrorism and Security,SDM06,Apr.2006
    [Kolda,98],Kolda,T.G.,O'Leary,D.P.,A Semi-Discrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval,ACM Transactions on Information Systems,16(4):322-346,1998
    [Konstan,97],Konstan A et al.,GroupLens:applying collaborative filtering to Usenet news.Communication of the ACM,1997,40(3):77-87
    [Kurki,99],Kurki,T.,Sami Jokela,S.,Sulonen,R.,Agents in Delivering Personalized Content Based on Semantic Metadata,In Proc.1999 AAAI Spring Symposium Workshop on Intelligent Agents in Cyberspace,pages 84-93,Stanford,USA
    [Lau.06],Lau,E.P.,Goh,D.H.,In search of query patterns:A case study of a university OPAC.Information Processing and Management,42(5),1316-1329.2006
    [Lawrence,00],Lawrence,S.,Context in Web Search.Data Engineering,IEEE Computer Society,Vol.23,No.3,pp.25-32,September 2000.
    [Lempel,00],Lempel,R.,Moran,S.,The stochastic approach for link-structure analysis (SALSA) and the TKC effect.In:Proceedingsof the Ninth International World Wide Web Conference,pages 387-401,2000.
    [Lieberman,95],Lieberman,H.,Letizia:an agent that assists web browsing.In:Burke,R.,ed.Proceedings of the International Joint Conference on Artificial Intelligence.Menlo Park,CA:AAAI Press,1995.924-929.
    [Lin,98],D.Lin,An information-theoretic definition of similarity.In:Proceedings of the Fifteenth International Conference on Machine Learning,pp.296-304,1998.
    [Lu,01],W.Lu,J.Janssen,E.Milios,N.Japkowicz,Node similarity in networked information spaces.In:Proceedings of the Conference of the IBM Centre for Advanced Studies on Collaborative Research(CASCON'O1).IBM Press,2001.
    [Maguitman,05],Ana Maguitman,Filippo Menczer,Heather Roinestad et al,Algorithmic Detection of Semantic Similarity.Proceedings of WWW 2005,Chiba,Japan,May 2005.
    [Mark,95],Mark van Uden,Rocchio:Relevance Feedback in Learning Classification Algorithms,1995
    [Menczer,04],Menczer,F.,Combining Link and Content Analysis to Estimate Semantic Similarity.Proc.13th Intl.WWW Conf Alt.Track Papers and Posters,pp.452-453,2004.
    [Menczer,05],Filippo Menczer,Finding semantic needles in haystacks of Web text and links, IEEE Internet Computing,May/June 2005.
    [Menczer],Menczer,F.,Topological measures and maps of the Web.http://www.informatics.indiana.edu/fil/Web/
    [Middleton,04],Middleton,S.E.,Shadbolt,N.R.,Roure,D.C.,Ontological User Profiling in Recommender Systems.ACM Transactions on Information Systems,2004,22(1):54-88
    [Miller,93],Miller,G.,Beckwith,R.,Fellbaum,C.,Gross,D.,Miller,K.J.,Introduction to WordNet:An On-line Lexical database.International Journal of Lexicography,3:235-312,1990(Revised August 1993)
    [Mitra,96],Mitra,M.,Singhal,S.,Buckley,C.,Improving Automatic Query Expansion.In Proceedings of the 21~(th) Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(1996)4-11.
    [Mizuuchi,99],Mizuuchi,Y.,Tajima,K.,Finding Context Paths for Web Pages.In:Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia.ACM,1999,13-22
    [Mizzaro,02],Mizzaro,S.,Tasso,C.,Ephemeral and Persistent Personalization in Adaptive Information Access to Scholary Publications on the Web.In Paul De Bra,Peter Brusilovsky,and Ricardo Conejo,editors,Adaptive Hypermedia and Adaptive Web-Based Systems,Second International Conference AH2002,pages 306-316.Springer,2002
    [Mooney,00],Mooney,R.J.,Roy,L.,Content-based Book Recommending Using Learing for Text Categorization.In:Proceedings of the fifth ACM conference on Digital Libraries.New York:ACM Press,2000,195-204
    [Myaeng,86],Myaeng,S.H.,Korfhag,R.R.,Towards an Intelligent and Personalized Retrieval System,International Symposium on Methodologies for Intelligent Systems.roceedings of theACM SIGART international symposium onMethodologies for intelligent systems.1986:121-129
    [Nanopoulos,01],Nanopoulos,A.,Manolopoulos,Y.,Mining patterns from graph traversals.Data and Knowledge Engineering,2001,37(3):243-266.
    [Newman,98],Newman,M.E.J.,The structure of scientific collaboration networks.Proc.Natl Acad.Sci.USA 98,404 - 409(2001).
    [Noel,03],Noel,S,Chee-Hung Henry Chu,Vijay Raghavan,Co-Citation count vs correlation for influence network visualization,Information Visualization(2003) 2,160 - 170
    [Ntoulas,04],Ntoulas,A.,Junghoo Cho,Christopher Olston,What's new on the web?:the evolution of the web from a search engine perspective.In:Proceedings of the 13th conference on World Wide Web,pages 1 - 12,New York,NY,USA,May 2004.ACM Press.
    [Page,98],L.Page,S.Brin,R.Motwani,T.Winograd,The pagerank citation ranking:Bringing order to the web.Technical report,Stanford Digital Library Technologies Project,1998.
    [Pant,03],Pant,G.,Deriving Link-context from HTML Tag Tree.In:Proceedings of 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery,DMKD'03,San Diego,CA,USA,Jun 2003,49-55
    [Pazzani,97],Pazzani,M.,Billsus,D.,Learning and RevisingUser Profiles:The Identification of Interesting Web Sites.Machine Learning,1997,27:313-331
    [Pilkington,06],Pilkington,A.,Teichert,T.,A Citation/Co-citation of Research Policy, RESEARCH PAPER SERIES,Royal Holloway University of London,ISBN:1-905846-01-0,August 2006
    [Ponte,98],Ponte,J.M.,W.B.Croft,A language modeling approach to information retrieval,SIGIR98,Pages:275-281,1998
    [Resnik,99],Philip Resnik,Semantic Similarity in a Taxonomy:An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language,Journal of Artificial Intelligence Research,1999,11:95-130
    [Richardson,02],Richardson,M.,Domingos,P.,The Intelligent Surfer:Probabilistic Combination of Link and Content Information in PageRank,with Matt Richardson.Advances in Neural Information Processing Systems 14(pp.1441-1448),2002.Cambridge,MA:MIT Press
    [Risson,04],Risson J,Moors T.,Survey of research towards robust peer-to-peer networks:Search methods.Technical Report,UNSWEE-P2P-1-1,Sydney:University of New South Wales,2004.1-36.
    [Robertson,93],Robertson,S.E.,Steve Walker,Micheline Hancock-Beaulieu,Aarron Gull,Marianna Lau,Okapi at TREC,The First Text Retrieval Conference(TREC-1),Galthersburg,MD:NIST,1993.
    [Rouet,03],Rouet J.What was I looking for? The influence of task specificity and prior knowledge on students'search strategies in hypertext Interacting with Computers,2003,15(3):409-428
    [Rousseau,04],Rousseau,R.,Zuccala,A.,A Classification of Author Co-citations:Definitions and Search Strategies,JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY,55(6):513 - 529,2004
    [Salton,75],Salton,G.,Wong,A.,Yang,C.S.,A vector space model for automatic indexing,Communications of the ACM,Volume 18,Issue 11(November 1975),Pages:613 - 620
    [Salton,89],Gerard Salton,Automatic Text Processing.Addison-Wesley,Reading,Mass.,1989.
    [Salton,90],Salton,G.,Buckley,C.,Improving Retrieval Performance by Relevance Feedback.Journal of the American Society for Information Science,41(4)(1990)288-297
    [Sarwar,98],Sarwar,B.M.Joseph A.Konstan,A1 Borchers,Jon Herlocker,Brad Miller,John Riedl,Using filtering Agents to Improve Prediction Quality in the GroupLens Research Collaborative filtering System,Proceedings of the ACM 1998 conference on Computer supported cooperative work Seattle,Washington,United States,1998:345-354.
    [Schickel-Zuber,07],Vincent Schickel-Zuber,Boi Faltings,OSS:A Semantic Similarity Function based on Hierarchical Ontologies,IJCAI'07
    [Schilit,94],Schilit,B.,Theimer,M.,Disseminating Active Map Information to Mobile Hosts.IEEE Network,8(5).1994.pp 22-32.
    [Setten,01],M.van Setten,Personalised Information Systems,https://extranet.telin.nl/docuserver/dscgi/ds.py/ViewProps/File-16467,27 June 2001
    [Shisanu,03],Shisanu TONGCHIM,Hitoshi ISAHARA,Improving Search Performance:a Lesson Learned fi-om Evaluating Search Engines using Thai Queries,IEICE TRANS.INF.&SYST.,VOL.E86-D,NO.5 MAY 2003
    [Smyth,02],Smyth,B.,Bradley,K.,Rafter,R.,Personalized Techniques for Online Recruitment Services.Communications of the A CM,2002,45(5):39-40
    [Song,04],Song,D.,Bruza,P.D.,Cole,R.J.,Concept learning and information inferencing on a high-dimensional semantic space.ACM SIGIR 2004 Workshop on Mathematical/Formal Methods in Information Retrieval(MF/IR'2004),29 July 2004,Sheffield,UK
    [Spink,07],Amanda Spink,Frances Alvarado-Albertorio,Bhuva Narayan,Jean Brumfield,Minsoo Park,Multitasking information behaviour in public libraries.A survey study,Journal of Librarianship and Information Science,Vol.39,No.3,177-186,2007
    [Stefani,98],Stefani,A.,Strappavara,C.,Personalizing Access to Web Sites:The SiteIF Project.In:Proceedings of the 2ndWorkshop on Adaptive Hypertext and Hypermedia(HYPERTEXT'98).Pittsburgh,USA,1998
    [Stegeman,06],Alwin Stegeman,JOS M.F.TEN BERGE,SUFFICIENT CONDITIONS FOR UNIQUENESS IN CANDECOMP/PARAFAC AND INDSCAL WITH RANDOM COMPONENT MATRICES,PSYCHOMETRIKA-VOL.71,NO.2,219-229,JUNE 2006
    [Vakkari,03],Vakkari,P.,Task-based information searching.Annual Review of Information Science and Technology,37,413-464.2003
    [Vel,98],Vel,O.,Nesbitt,S.,A Collaborative Filtering Agent System for Dynamic Virtual Communities on the Web.Conference on Automated Learning and Discovery(CONALD-98).Pittsburgh:CMU,1998
    [Volokh,00],Volokh,E.,Personalization and privacy.Communications of the ACM,2000,43(8):84-88.
    [White,04],R.W.White,J.M.Jose,C.J.van Rijsbergen,I.Ruthven.,A simulated study of implicit feedback models.In Proceedings of ECIR 2004,pages 311-326,2004.
    [White,81],White,H.D.,Griffith,B.C.,Author cocitation:A literature measure of intellectual structure.Journal of the American Society for Information Science,32,163 - 172,1981.
    [White,95],White H,McCain K.,Visualizing a discipline:An author co-citation analysis of information science 1972-1995.Journal of the American Society for Information Science,1998,49(4):327-356.
    [Wu,06],Wu,H.,Luk,R.,Wong,K.,Kwok,K.,Probabilistic document-context based relevance feedback with limited relevance judgments.In:Proceedings of CIKM'06(Arlington,VA,USA,Nov.6-11,2006) ACM Press,New York,NY,2002,854-855
    [Xia,03],Xia Lin,White,H.D.Buzydlowski,J.,Real-time author co-citation mapping for online searching,Information Processing and Management 39(2003) 689 - 706
    [Xu,00],Xu,J.X.,Croft,W.B.,Improving the Effectiveness of Information Retrieval with Local Context Analysis.ACM Transactions on Information Systems,Jan 2000,18(1):79-112
    [Xu,98],Xu,J.,Croft,W.B.,Query Expansion Using Local and Global Document Analysis.In:Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(1998)206-214.
    [Xuehua,05],Xuehua Shen,Bin Tan,ChengXiang Zhai,Context Sensitive Information Retrieval Using Implicit Feedback,In:Proceedings of SIGIR 2005,pages 43-50,2005
    [何伟,03],何伟,LSI潜在语义信息检索模型.数学的实践与认识,Vol.33 No.9,Sep.,2003.
    [李蕾,00],李蕾,王楠,等,中文搜索引擎概念检索初探[J].计算机工程与应用,2000(6):1-11
    [刘俊平,03],刘俊平,李书振,张志毅,智能引擎实例分析[J].计算机应用研究,2003(1):82-84
    [史忠植,98],史忠植,多媒体信息检索研究动态,http://www2.ccw.com.cn/1998/3/164803.shtml
    [田永鸿,05],田永鸿,基于上下文的统计关系学习研究(博士毕业论文),2005
    [曾春,02],曾春,邢春晓,周立柱,个性化服务技术综述.软件学报,Vol 13,No.October,2002.
    [张智君,04],张智君,任衍具,宿芳,结构任务类型和导航对超文本信息搜索的影响,心理学报,2004,36(5):534-539
    [赵丹群,08]],赵丹群,现代信息检索,北京大学出版社,2008.1

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700