Research on Key Technologies of XML Information Retrieval
Abstract
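With the widespread adoption of XML technology, XML has become the standard format for representing and exchanging data on the Web, and large volumes of XML data keep emerging in every domain. How to retrieve this data effectively is a pressing research problem in the database and information retrieval fields. Traditional information retrieval has produced many effective results for unstructured data. XML data, however, is semi-structured: it carries both structure and content, which poses new challenges for information retrieval research. Researchers now broadly agree that combining database techniques with information retrieval techniques is a promising way to solve the XML retrieval problem.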
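     Building on an in-depth analysis of the state of XML retrieval research, this paper takes the mode of XML retrieval as its main thread and combines database and information retrieval techniques to study several key technologies: XML keyword search, XML content and structure retrieval with vague structural context, and XML full-text search based on a relational database. The specific innovations and contributions are as follows: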
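     An XML keyword search method based on candidate fragment semantics is proposed. The method first selects candidate nodes according to the number of attribute types each node of the XML document tree contains and the number of its descendant nodes. It then builds a candidate fragment around each candidate node and treats candidate fragments as the basic semantic unit for answering XML keyword queries. An inverted index is built over the candidate fragments. When answering a keyword query, the method returns either the set of candidate fragments that contain all keywords or the set of candidate fragments related as ancestor and descendant, depending on the characteristics of the XML dataset and the user's choice. Experimental results show that using candidate fragments as the basic semantic unit returns results of moderate granularity that are reasonably complete and meaningful, with satisfactory retrieval efficiency.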
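     An XML retrieval method with vague structural context is proposed. The method defines the structural constraints in queries and documents as structural contexts and represents XML queries and XML documents as sets of structured terms. For context similarity, it proposes a measure between query contexts and document contexts that jointly considers the longest matching part of the two contexts, the level weight of each element, and the level similarity between elements. To support XML content and structure retrieval, the vector space model is extended and a content and structure retrieval algorithm with vague structural context is designed. Experimental results show that the method performs well in both retrieval efficiency and result quality.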
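     ReXFT, an XML full-text search method based on a relational database, is proposed. ReXFT uses NXRel, a model-mapping-based XML storage scheme, which naturally expresses the logical model of XML data on top of the relational model. An XML full-text indexing scheme based on full-text element nodes is proposed, which lets users define their own full-text index paths. ReXFT accepts queries in the form defined by the W3C-recommended XML full-text search standard, so its query syntax conforms to the international standard. A result scoring method based on text cover density is proposed, which jointly considers the hierarchical nature of XML data and the logical relationships, distance, and frequency of the search terms. Experimental results show that ReXFT handles XML full-text search effectively.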
With the rapid spread of XML technology, XML has become the standard format for data representation and exchange on the Web, and a huge number of XML documents now exist in many domains. How to retrieve XML data efficiently and effectively has become a hot research topic in the database and information retrieval communities. Traditional information retrieval techniques offer rich solutions for unstructured data. XML data, however, is semi-structured, with both content and structure, and brings new challenges to information retrieval research. Combining database and information retrieval techniques to retrieve XML data has emerged as a new research direction.
     This paper analyzes the state of research on XML information retrieval, considers solutions that combine database and information retrieval techniques, and addresses several key problems in XML data retrieval: XML keyword search, XML content and structure search with vague structural context, and XML full-text search based on a relational database. The main contributions and innovations are:
     This paper proposes an approach to keyword search over XML documents based on Candidate Fragment semantics. The method first selects candidate nodes according to the number of attribute types and the number of descendants of each node in the XML tree, and then constructs candidate fragments centered on the candidate nodes. After indexing these candidate fragments with an inverted list, the method answers a user query with either the candidate fragments that contain all keywords or sets of candidate fragments in an ancestor-descendant relationship, depending on the characteristics of the XML dataset. Experiments show that Candidate Fragment semantics gives users compact, meaningful, and appropriately sized results, with good performance on XML keyword search.
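The candidate-fragment idea can be illustrated with a minimal Python sketch. It is not the thesis implementation: the thresholds `min_child_types` and `min_descendants`, the use of distinct child tags as a stand-in for "attribute types", and the sample document are all assumptions made for illustration, and only the "fragments containing all keywords" answer mode is shown.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

def tokenize(text):
    """Lowercase and split text into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())

def candidate_nodes(root, min_child_types=2, min_descendants=2):
    """Select nodes with enough distinct child tags and descendants.
    Both thresholds are hypothetical, chosen only for this example."""
    picked = []
    for node in root.iter():
        child_types = {child.tag for child in node}
        descendants = sum(1 for _ in node.iter()) - 1  # iter() includes the node itself
        if len(child_types) >= min_child_types and descendants >= min_descendants:
            picked.append(node)
    return picked

def build_index(fragments):
    """Inverted index: term -> set of fragment ids whose subtree text contains it."""
    index = defaultdict(set)
    for frag_id, node in enumerate(fragments):
        for term in tokenize(" ".join(node.itertext())):
            index[term].add(frag_id)
    return index

def search(index, keywords):
    """Return ids of fragments that contain every keyword."""
    postings = [index.get(k.lower(), set()) for k in keywords]
    return sorted(set.intersection(*postings)) if postings else []

DOC = """
<bib>
  <book><title>XML Retrieval</title><author>Wen</author></book>
  <book><title>Database Systems</title><author>Yuan</author></book>
</bib>
"""
root = ET.fromstring(DOC)
fragments = candidate_nodes(root)   # the two <book> subtrees
index = build_index(fragments)
hits = search(index, ["xml", "wen"])
```

Here each `<book>` subtree becomes one fragment, so a query is answered at the granularity of a book record rather than a single leaf or the whole document.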
     This paper proposes an approach to retrieving XML data with vague structural context. User queries and XML documents are represented as sets of structural terms. Context resemblance is computed from the level weight of each element in a context, the level similarity between elements of the longest matched context, and other factors. The Vector Space Model is extended to answer XML content and structure search. Experiments show that the method performs well on XML content and structure search.
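As a rough illustration of scoring structural terms in a vector space, the Python sketch below pairs each term with its path context and weights matches by a context-resemblance factor. The abstract does not give the thesis's own similarity formula, so this sketch substitutes a simpler, well-known function from the XML-fragments line of work: (1 + |q|) / (1 + |d|) when the query path is a subsequence of the document path, and 0 otherwise. It ignores the level weights and level similarity that the thesis's measure takes into account, and all names and sample data are hypothetical.

```python
from collections import Counter
from math import sqrt

def is_subsequence(short, long):
    """True if the tags of `short` appear in `long` in the same order."""
    it = iter(long)
    return all(tag in it for tag in short)

def context_resemblance(query_ctx, doc_ctx):
    """Substitute resemblance: (1 + |q|) / (1 + |d|) if q is a subsequence of d, else 0."""
    q = query_ctx.strip("/").split("/")
    d = doc_ctx.strip("/").split("/")
    if not is_subsequence(q, d):
        return 0.0
    return (1 + len(q)) / (1 + len(d))

def score(query_terms, doc_terms):
    """Both arguments map (context, term) -> weight.
    Matching terms contribute in proportion to how well their contexts agree."""
    norm = sqrt(sum(w * w for w in doc_terms.values()))
    if norm == 0:
        return 0.0
    total = 0.0
    for (q_ctx, q_term), q_w in query_terms.items():
        for (d_ctx, d_term), d_w in doc_terms.items():
            if q_term == d_term:
                total += q_w * context_resemblance(q_ctx, d_ctx) * d_w
    return total / norm

query = Counter({("book/title", "xml"): 1})
doc_a = Counter({("bib/book/title", "xml"): 2, ("bib/book/author", "wen"): 1})
doc_b = Counter({("bib/book/author", "xml"): 2})

score_a = score(query, doc_a)  # term matches in a compatible context
score_b = score(query, doc_b)  # term matches, but the context does not
```

The point of the vague-context design is visible in the example: `doc_a` still scores although its path is longer than the query path, while `doc_b` is not rewarded for containing the term under an unrelated element.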
     This paper proposes an XML full-text search method based on a relational database, named ReXFT. ReXFT maps XML data into relational storage using NXRel, which naturally reflects the logical model of XML data. ReXFT allows users to create XML full-text indexes on user-defined paths, based on full-text element nodes. Queries are submitted in the form defined by the W3C Recommendation for XML full-text search, so the query syntax conforms to the international standard. ReXFT scores search results with a cover density ranking scheme that takes into account the logical relationships between search terms, their distance, their frequency, and other factors. Experimental results show that ReXFT performs well in processing XML full-text search.
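To make cover density ranking concrete, here is a minimal Python sketch of the classic flat-text formulation: a cover is a shortest token span containing every query term, and each cover contributes 1 if its length is at most K and K / length otherwise. This is not the ReXFT scoring method, which according to the abstract also accounts for XML hierarchy and the logical relationships between terms. The value `k=4` and the sample text are assumptions made for illustration.

```python
def find_covers(tokens, terms):
    """Return (start, end) spans that contain every term and no shorter such span."""
    terms = set(terms)
    last_seen = {}      # term -> most recent position
    covers = []
    prev_start = -1
    for pos, tok in enumerate(tokens):
        if tok not in terms:
            continue
        last_seen[tok] = pos
        if len(last_seen) == len(terms):
            # Shortest window ending here starts at the oldest "latest occurrence".
            start = min(last_seen.values())
            # A window sharing its start with an earlier cover would contain that cover.
            if start > prev_start:
                covers.append((start, pos))
                prev_start = start
    return covers

def cover_density_score(tokens, terms, k=4):
    """Sum cover contributions: 1 for covers of length <= k, k / length for longer ones."""
    total = 0.0
    for start, end in find_covers(tokens, terms):
        length = end - start + 1
        total += 1.0 if length <= k else k / length
    return total

tokens = "xml full text search over xml data with text".split()
covers = find_covers(tokens, ["xml", "text"])
tight = cover_density_score(tokens, ["xml", "text"], k=4)
loose = cover_density_score(tokens, ["xml", "text"], k=2)
```

Short covers count fully and long covers are discounted, so a passage where the query terms occur close together and often scores higher than one where they are scattered.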
