海量多媒体数据库的高效查询处理
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着多媒体和网络技术的迅猛发展,互联网已经形成了一个巨大而复杂的多媒体信息空间。其所包含的海量多媒体信息资源具有以下的特点:1)数量巨大,增长迅速;2)内容丰富,形式多样;3)结构复杂,分布广泛;4)无序混乱,杂乱无章。面对这些互联网中浩翰的多媒体信息资源,如何对其进行快速准确地检索及高效地处理已经成为一个很重要的研究课题。本论文以数字图书馆作为目标应用,面向海量多媒体数据,提出并解决了海量数据高效查询处理的一系列问题。对海量高维多媒体信息的索引及查询技术进行深度和广度上的研究,重点解决了以下5个方面的问题:
     ●针对海量高维多媒体数据查询存在的“维数灾难”的问题,提出两种高维索引方法,即基于复合距离转换的高维索引(CDT)方法和基于编码的双距离树索引(EDD-Tree)方法,以提高海量多媒体检索效率;
     ●针对书法字数据特点,分别提出基于局部距离图(PDM)的交互式书法字索引方法及基于混合距离树(HD-Tree)的书法字索引方法;
     ●针对在单机环境下,海量多媒体数据查询性能低下的问题,进一步提出了基于数据网格的可扩展并行查询技术。该技术包括优化海量数据在网格结点中的分布、基于索引的快速高维数据集的缩减、并行流水线处理及高效的数据传输机制。理论和实验表明该技术能显著提高相似查询效率;
     ●针对频繁的用户查询请求,提出基于网格环境的高维相似查询的多重查询优化技术,进一步提高在查询密集条件下海量多媒体检索的并发性;
     ●针对海量跨媒体检索的特点,提出一种跨媒体数据的统一索引框架——CIndex。需要特别指出的是,目前国际国内学术界对海量跨媒体检索与索引的研究工作刚刚起步,相关研究还几乎没有。本文对该问题进行了系统而深入的研究,提出针对跨媒体检索与索引的一系列方法和理论成果,具有很大的理论和实际意义;
With the rapid growth of multimedia and Internet technologies,the Internet has become a very huge and complex multimedia information spaces.The characteristics of the multimedia resources in Web include:1).Huge amount of data;2).Heterogeneity and multiple modalities;3).Complex structure; 4).The Unordered.Facing these massive resources in the Web,how to fast and accurately retrieve and manage such large-scale multimedia information is a very important research topic.The work presented in this paper extends both the depth and broadness of the query,index and update over large-scale high-dimensional multimedia data.The work focus on the following five aspects:
     ●Due to the "Curse of Dimensionality" of the multimedia data,we propose two high-dimensional index schemes respectively,such as a composite-distance-transformation(CDT)-based high-dimensional index and an encoding-based dual distance tree(EDD-Tree)index,which can speed up the large-scale multimedia retrieval efficiency;
     ●For the characteristics of the Chinese calligraphic character,we propose the two index schemes such as a partial-distance-map(PDM)-based interactive character index and a hybrid-distance-tree(HD-Tree)-based character index;
     ●Due to the fact that the retrieval performance of large multimedia databases in a single-PC environment is not satisfactory,we peopose a grid-based retrieval algorithm to take advantage of the parallelism of grid computing,which can further speed up the retrieval efficiency.The technique includes the optimal data allocation policy in grid environment,the index-based vector set reduction, pipeline mechanism and high-efficient data transfer method;
     ●With the increase of query-intensive applications,we propose a multi-query optimization technique for similarity search in grid environment,which is to further speed up the parallelism of the query-intensive-based large-scale multimedia retrieval;
     ●To effectively support a large scale cross-media retrieval,we propose an integrated index structure,which is called the CIndex.To the best of our knowledge,this is the first work to study the cross-media retrieval and indexing. The experimental results show the effectiveness and efficiency of this method;
引文
[1]Google Inc.www.google.com.2007.
    [2]Yahoo Inc.www.yahoo.com.2007.
    [3]Yong Rui,Thomas S.Huang,Sharad Mehrotra:Content-Based Image Retrieval with Relevance Feedback in MARS.In Proc.of ICIP(2)1997:815-818
    [4]M Flicker,Harpreet Sawhney,Wayne Niblack,Johathan Ashley.Query by image amd video content:The QBIC System[J].IEEE Computer.1995
    [5]Virage Inc.www.virage.com.2005.
    [6]A Pentland,R.W.Picard and S.Sclarf.Photobook:Content-based manipulation of image databases[J].International Journal of Computer Vision.1996,18(3),233-254.
    [7]Christel,M.,Wactlar,H.,Steven,S.,Sirbu,M.,Reddy,R.,Mauldin,M.,Kanade,T.,Informedia Digital Video Library[J].Communications of the ACM,38(4),pp.57-58,1995.
    [8]Sharad Mehrota,Yong Rui,Kaushik Chakrabarti,Michael Ortega and Thomas S.Huang.Multimedia Analysis and Retrieval System.In Proc.of the 3~(rd)Int.Workshop on Multimedia Information Systems.Como,Italy September 25-27.1997.
    [9]H.Suzuki et al.,Storage hierarchy for video-on-demand systems,In Proc.of SPIE Storage and Retrieval for Image and Video Databases II(1994)2185:198-207.
    [10]S.Ghandeharizadeh et al.,On multimedia repositories,personal computers,and hierarchical storage systems,In Proc.of ACM Multimedia(1994)407-416.
    [11]Patterson D.A.,Gibson G.,Katz R.H..A case for redundant arrays of inexpensive disks(RAID).In Proc.of International Conference on Management of Data(SIGMOD),Chicago,Illinois,1988,109-116
    [12] Bayer, R. and McCreight, E. Organization and Maintenance of Large Ordered Indexes. Acta Informatica, (1972) 173-189.
    [13] Bohm C, Berchtold S, Keim DA. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases[J]. ACM Computing Surveys, 2001,33(3):322-373.
    
    [14] Bentley JL. Multidimensional binary search trees used for associative searching[J], Communications of the ACM, 18(9): pp. 509-517, 1975.
    [15] A. Guttman, R-tree: A dynamic index structure for spatial searching, In Proceedings of the ACM SIGMOD Conference, pp.47-54, 1984.
    [16] Timos K. Sellis, Nick Roussopoulos, Christos Faloutsos: The R~+-Tree: A Dynamic Index for Multi-Dimensional Objects. In In Proc. of the 22th VLDB Conference, pp. 507-518, 1987.
    [17] N. Beckmann, H.-P. Kriegel, R. Schneider, B. Seeger. The R*-tree: An Efficient and Robust Access Method for Points and Rectangles, In Proc. of ACM SIGMOD Conference, pp. 322-331, 1990.
    [18] King-Ip Lin, H.V. Jagadish and Christos Faloutsos, The TV-tree an index structure for high-dimensional data[J], VLDB Journal, 1994.
    [19] S. Berchtold, D.A. Keim and H.P. Kriegel. The X-tree: An index structure for high-dimensional data. In Proc. of the 22th VLDB Conference, pp. 28-37, 1996.
    [20] D. A. White and R. Jain. Similarity Indexing with the SS-tree, In Proc. of ICDE Conference, pp. 516-523, 1996.
    
    [21] N. Katamaya and S. Satoh. The SR-tree: An index structure for high- dimensional nearest neighbor queries. In Proc. of ACM SIGMOD Conference, pp. 32-42. 1997.
    [22] R. Weber, H. Schek, and S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In Proc. of the 24th VLDB Conference, pp. 194-205, 1998.
    [23] S. Berchtold, C. Bohm, H.P. Kriegel, J. Sander, and H.V. Jagadish. Independent quantization: An index compression technique for high- dimensional data spaces. In Proc. of the 16th ICDE Conference, pp. 577-588. 2000.
    [24]Y.Sakurai,M.Yoshikawa,S.Uemura,and H.Kojima.The A-tree:An index structure for high-dimensional spaces using relative approximation.In Proc.of VLDB Conference,pp.516-526,2000.
    [25]J.K.Lawder,P.J.H.King,Querying Multi-dimensional Data Indexed Using the Hilbert Space-Filling Curve[J],SIGMOD Record.2001
    [26]E.Chávez,G.Navarro,R.Baeza-Yates,and J.Marroquín,Searching in Metric Spaces[J],ACM Computing Surveys:33(3),pp.273-321,ACM Press,2001.
    [27]Uhlmann,J.Satisfying general proximity/similarity queries with metric trees[J].Information Processing Letters,1991,40:175-179.
    [28]Brin,S.New neighbor search in large metric space.In:Dayal,U.,Peter,P.M.D.,et al,eds.In Proc.of the VLDB'95.CA:Morgan Kaufmann Publishers,1995.574-584.
    [29]T.Bozkaya and M.Ozsoyoglu.Distance-based indexing for high-dimensional metric spaces.In Proc.of ACM SIGMOD Conference,pages 357-368.1997.
    [30]P.Ciaccia,M.Patella,and P.Zezula.M-trees:An efficient access method for similarity search in metric space.In Proc.of the 23rd VLDB Conference,pages 426-435.1997.
    [31]S.Berchtold,C.Bohm,and H.-P.Kriegel.The pyramid technique:Towards breaking the curse of dimensionality.In Proc.of SIGMOD Conference,1998.
    [32]Traina Jr.,C,Traina,A.,Seeger,B.,Faloutsos,Slim-trees:High Performance Metric Trees Minimizing Overlap Between Nodes,In Proc.of the EDBT Conference,Konstanz,Germany,2000.
    [33]Filho,R.F.S.,Traina,A.,and Faloutsos,C.Similarity search without tears:The Omni family of all-purpose access methods.In Proc.of ICDE Conference,pp.623-630.2001
    [34]M J.Fonseca and J A.Jorge.Indexing High-dimensional Data for Content-Based Retrieval in Large Databases.In Proc.of the 8th DASSFA Conference,Kyoto,Japan,pp.267-274,2003.
    [35]H.V.Jagadish,B.C.Ooi,K.L.Tan,C.Yu,R.Zhang.iDistance:An Adaptive B~+-tree Based Indexing Method for Nearest Neighbor Search[J],ACM Transactions on Data Base Systems,2005.30(2),pp.364-397.
    [36]Stefan Berchtold,Christian Bohm,Bernhard Braunmuller,Daniel A.Keim,Hans-Peter Kriegel.Fast parallel similarity search in multimedia databases.In Proc.of the ACM SIGMOD international conference on Management of data.Tucson,Arizona,United States.Pp.1-12,1997
    [37]Adil Alpkocak,Taner Danisman,Tuba Ulker.A Parallel Similarity Search in High Dimensional Metric Space Using M-Tree.In Proc.of the NATO Advanced Research Workshop on Advanced Environments,Tools,and Applications for Cluster Computin.Pp:166-171.2001
    [38]The International Virtual Data Grid Lab(IVDGL),www.ivdgl.org,2006
    [39]The Globus Toolkit,www.globus.org/toolkit,2006
    [40]The SRB package.http://www.sdsc.edu/srb/index.php/Main_Page,2006
    [41]Foster,I.,Kesselman,C.1998.The Grid:Blueprint for a New Computing Infrastructure San Francisco,CA:Morgan Kaufmann.
    [42]Chervenak,A.,Foster,I.,Kesselman,C,et al,2001.The data grid:Towards an architecture for the distributed management and analysis of large scientific datasets[J],Journal of Network and Computer Applications.Vol.23.pp.187-200
    [43]Hoschek,W.,Martinez,J.J.,Samar,A.,et al,2000.Data management in an international data grid project.In Proc.of the 1~(st)IEEE/ACM Int'l Workshop on Grid Computing.Berlin.Springer-Verlag,pp.17-20
    [44]Segal,B.,Grid Computing:The European data grid project,2000.In Proc.of the 2000 IEEE Nuclear Science Symposium and Medical Imaging Conference,Lyon,France.
    [45]Stockinger,H.,2001.Distributed database management systems and the data grid,2001.In Proc.of the 18~(th)IEEE Symposium on Mass Storage Systems and the 9th NASA Goddard Conference on Mass Storage Systems and Technologies,San Diego,CA.
    [46]Smith,J.,Gounaris,A.,Watson,P.,et al,2002.Distributed query processing on the grid.In Proc.of the 3~(rd)International Workshop on Grid Computing.Springer Verlag,pp.279-290.
    [47]杨东华,李建中,王卫平.基于数据网格的连接操作.计算机研究与发展,2004,41(10):1848-1855(in Chinese with English abstract).
    [48]Jagadish,H.V.,Ooi,B.C.,Vu,Q.H.,Zhang,R.,Zhou,A.Y.2004.VBI-Tree:A Peer-to-Peer Framework for Supporting Multi-Dimensional Indexing Schemes.In Proc.of International Conference on Data Engineering,2004.
    [49]Gribble SD,Halevy AY,Ives ZG,Rodrig M,Suciu D.What can databases do for peer-to-peer? In Proc.of the 4th Int'l Workshop on the Web and Databases(WebDB).Santa Barbara,2001.31-36.
    [50]凌波,陆志国,黄维维,钱卫宁,周敖英.PeerIS:基于Peer-to-Peer的信息检索系统[J].软件学报,2004,15(9):1375-1384.
    [51]Kalnis,P.,NG W.S.,Ooi,B.C.,Tan,K.L.2004.Answering Similarity Queries in Peer-to-Peer Networks[J].Information Systems.
    [52]M.Bawa,B.F.Cooper,A.Crespo,N.Daswani,P.Ganesan,H.Garcia-Molina,S.Kamvar,S.Marti,M.Schlosser,Q.Sun,P.Vinograd,and B.Yang.Peer-to-peer research at Stanford[J].SIGMOD Record,32(3):23-28,2003
    [53]Ng WS,Ooi BC,Tan KL.BestPeer:A self-configurable peer-to-peer system.In Proc.of the 18th ICDE.San Jose:IEEE Computer Society Press,2002.272.
    [54]Talia,D.,and Trunfio,P.2003.Toward Synergy Between P2P and Grids.IEEE Internet Computing,pp 94-96.
    [55]Skyserver(Sloan Digital Sky Survey).http://skyserver.sdss.org/,2007.
    [56]Terraserver.http://terraserver.microsoft.com/,2007.
    [57]Sam Roweis & Lawrence Saul.Nonlinear dimensionality reduction by locally linear embedding[J].Science,290(5500),Dec.22,2000.pp.2323-2326.
    [58]J.B.Tenenbaum,V.de Silva and J.C.Langford.A Global Geometric Framework for Nonlinear Dimensionality Reduction[J].Science,290(5500):2319-2323,22 December,2000.
    [59]David L.Donoho and Carrie Grimes.Hessian eigenmaps:Locally linear embedding techniques for high-dimensional data[J].In Proc.of the National Academy of Sciences
    [60]Xiaofei He,Wei-Ying Ma,and Hong-Jiang Zhang.Learning an Image Manifold for Retrieval,In Proc.of ACM conference on Multimedia 2004,Oct.10-16,2004,New York City.
    [61]吴佑寿,丁晓青.汉字识别——原理、方法与实现.北京:高等教育出版社清华大学出版社.1992.
    [62]Palmondon R,Srihari SN.On-Line and off-line handwriting recognition:A comprehensive survey[J].IEEE Trans,on Pattern Analysis and Machine Intelligence,2000,22(1):63-84.
    [63]Rath TM,Kane S,Lehman A,Partridge E,Manmatha R.Indexing for a digital library of George Washington's manuscripts:A study of word matching techniques.Technical Report,MM-36,Boston:University of Massachusetts,2002.
    [64]Yosef IB,Kedem K,Dinstein I,Beit-Arie M,Engel E.Classification of hebrew calligraphic handwriting styles:Preliminary results.In Proc.of the 1st Int'l Workshop on Document Image Analysis for Libraries.Palo Alto,2004.299-305.
    [65]施伯乐,张亮,王勇,陈智锋,基于视觉相似性的计算机古籍内容检索方法[J].软件学报,2001,12(9):1336-1342。
    [66]Zhuang YT,Zhang XF,Wu JQ,Lu XQ.Retrieval of Chinese calligraphic character image.In Proc.of the Pacific Rim Conf.on Multimedia(PCM 2004).Berlin,Heidelberg:Springer-Verlag,2004.63-84.
    [67]Zhang T,Ramakrishnan R,Livny M.BIRCH:An efficient data clustering method for very large databases.In Proc.of the ACM SIGMOD Int'l Conf on Management of Data(SIGMOD'96).New York:ACM Press,1996.103-114.
    [68]Cohen S,Guibas L.The earth mover's distance under transformation sets.In Proc.of the Int'l Conf.on Computer Vision(ICCV'99).New York:IEEE Computer Society Press,1999.173-187.
    [69]Zhuang,Y.,Zhuang,Y.T.,Li,Q.and Chen,L.2006.Towards interactive indexing for large Chinese calligraphic character databases.In Proc.of the 15~(th)ACM International Conference on Information and knowledge management.Arlington,pp.884-885.
    [70]The cadal project.http://www.cadal.zju.edu.cn.2005.
    [71] UCI KDD Archive, http://www.kdd.ics.uci.edu. 2002.
    
    [72] P.Roy, S. Seshadri, S. Sudarshan and S. Bhole. Efficient and extensible algorithm for multi-query optimization.
    [73] J. Zhang, N. Mamoulis, D. Papadisa and Y. Tao. All nearest neighbor queries in spatial databases. In Proc. of SSDBM, pp. 297-306, 2004.
    [74] Bhatia, R., Sinha, R. K., and Chen, C.-M. 2000a. Declustering using golden ratio sequences. In Proc. of the International Conference on Data Engineering.
    [75] Bhatia, R., Sinha, R. K., and Chen, C.-M. 2000b. Hierarchical declustering schemes for range queries. In Proc. of the International Conference on Extending Database Technology.
    [76] Chen, C. and Chang, C. 1992. On GDM allocation method for partial range queries[J]. Information Systems, 17(5), 381-394.
    [77] Chen, C.-M. and Cheng, C. T. 2003. Replication and retrieval strategies of multidimensional data on parallel disks. In Proc. of the International Conference on Information and Knowledge Management.
    [78] Chen, C.-M., Sinha, R., and Bhatia, R. 2001. Efficient disk allocation schemes for parallel retrieval of multidimensional grid data. In Proc. of the International Conference of Scientific and Statistical Database Management.
    [79] Chen, L. T. and Rotem, D. 1993. Declustering objects for visualization. In Proc. of the International Conference on Very Large Data Bases.
    [80] Chen, L. T., Rotem, D., and Seshadri, S. 1995. Declustering databases on heterogeneous disk systems. In Proc. of the International Conference on Very Large Data Bases.
    [81] Cheng, C. T. and Chen, C.-M. 2004. From discrepancy to declustering: near optimal multidimensional declustering strategies for range queries[J]. Journal of ACM 51,1, 46-73.
    [82] Du, H. and Sobolewski, J. 1982. Disk allocation for cartesian product files on multiple-disk systems[J]. ACM Transactions on Database Systems, (7)1, 82- 101.
    [83] Faloutsos, C. 1988. Gray codes for partial match and range queries[J]. IEEE Transactions on Software Engineering 14, 10, 1381-1393.
    [84] Faloutsos, C. and Bhaywat, P. 1993. Declustering using fractals. In Proc. of the International Conference on Parallel and Distributed Information Systems.
    [85] Fang, M., Lee, R., and Chang, C. 1986. The idea of declustering and its applications. In Proc. of the International Conference on Very Large Data Bases.
    [86] Ferhatosmanoglu, H., Agrawal, D., and El Abbadi, A. 1999. Concentric hyperspaces and disk allocation for fast parallel range searching. In Proc. of the International Conference on Data Engineering.
    [87] Ferhatosmanoglu, H., Tosun, A. S., and Ramachandran, A. 2004. Replicated declustering of spatial data. In Proc. of the ACM Symposium on Principles of Database Systems.
    [88] Frikken, K., Atallah, M. J., and Prabhakar, S. 2002. Optimal parallel I/O for range queries through replication. In Proc. of the International Conference on Database and Expert Systems.
    [89] Kim, M. and Pramanik, S. 1988. Optimal file distribution for partial match retrieval. In Proc. of the ACM SIGMOD International Conference on Management of Data.
    [90] Kuo, S., Winslett, M., Chen, Y., Cho, Y., Subramaniam, M., and Seamons, K. E. 1997. Parallel input/output with heterogeneous disks. In Proc. of the International Conference on Scientific and Statistical Database Management.
    
    [91] Kuo, S., Winslett, M., Cho, Y., and Lee, J. 1999. New GDM-based declustering methods for parallel range queries. In Proc. of the International Database Engineering and Applications Symposium.
    [92] Kuo, S., Winslett, M., Cho, Y., Lee, J., and Chen, Y. 1999. Efficient input and output for scientific simulations. In Proc. of the Workshop on Input/Output in Parallel and Distributed Systems.
    [93] Liu, D. and Shekhar, S. 1995. A similarity graph-based approach to declustering problems and its application towards parallelizing grid files. In Proc. of the International Conference on Data Engineering.
    [94] Prabhakar, S., Agrawal, D., and Abbadi, A. E. 1998. Cyclic allocation of two-dimensional data. In Proc. of the International Conference on Data Engineering.
    [95] Rotem, D., Seshadri, S., and Bernardo, L. M. 1998. Replication and clustering in heterogeneous disk systems. In Proc. of the Workshop on Distributed Data and Structures.
    [96] Salton, G., Wong, A., and Yang, C. S. 1975. A vector space model for automatic indexing[J]. Communications of the ACM 18, 11, 613-620.
    [97] Sarawagi, S. and Stonebraker, M. 1994. Efficient organization of large multidimensional arrays. In Proc. of the International Conference on Data Engineering.
    [98] H. McGurk, J. MacDonald. Hearing Lips and Seeing Voices[J]. Nature, 1976. 264:746-748
    [99] A.Calvert. Cross-Modal Processing in the Human Brain: Insights from Functional Neuron Imaging Studies, Cerebral Cortex,11(12):1120-1123, 2001.
    [100] Yong Rui, Thomas S. Huang, Shih-Fu Chang, Image Retrieval: Current Techniques, Promising Directions and Open Issues[J] , Journal of Visual Communication and Image Representation , Vol. 10, 39-62, March, 1999
    [101] S. F. Chang, W. Chen, H. J. Meng, H. Sundaram, D. Zhong. VideoQ: An automated content based video search system using visual cues, In Proc. of ACM International Multimedia Conference, 313-324, 1997
    [102] J. Foote. An overview of audio information retrieval, In Proc. of ACM Multimedia Systems, 2-10, 1999.
    [103] Xinjing Wang, Weiying Ma, Guirong Xue, Xing Li. Multi-Model Similarity Propagation and its Application for Web Image Retrieval. In Proc. of ACM Multimedia, 2004. 944-951.
    
    [104] Westerveld, T. Probabilistic Multimedia Retrieval. In Proc. of SIGIR, 2002.
    
    [105] Srihari, R.K., Rao, A.B., Han, B., Munirathnam, S., and Wu, X.Y. A Model For Multimodal Information Retrieval. In Proc. of lCME, 2000.
    [106] Zhou, X.S., and Huang T.S. Unifying Keywords and Visual Contents in Image Retrieval. IEEE MultiMedia 9(2) (2002), 23-33.
    [107] Jun Yang, Qing Li, Yueting Zhuang. Octopus: Aggressive Search of Multi-Modality Data Using Multifaceted Knowledge Base. In Proc. of 11th International Conference on World Wide Web (WWW 2002), USA, 2002. pp.54-64.
    [108] Fei Wu, Hong Zhang, Yueting Zhuang. Learning Semantic Correlations for Cross-media Retrieval. In Proc. of the 13th International Conference on Image Processing (ICIP), USA 2006.
    [109] Hong Zhang, Jianguang Weng. Measuring Multi-modality Similarities via Subspace Learning for Cross-media Retrieval. In Proc. of the Pacific-Rim Conference on Multimedia(PCM'06), 2006. pp.979-988.
    [110] Xueyan Zhao, Yueting Zhuang, Fei Wu, Audio Clip Retrieval with Fast Relevance Feedback based on Constrained Fuzzy Clustering and Stored Index Table, In Proc. of The Pacific-Rim Conference on Multimedia, 2002, 237-244.
    [111] Deng Cai, Xiaofei He, Ji-Rong Wen, and Wei-Ying Ma. Block Level Link Analysis, In Proc. of ACM Conference on Information Retrieval (SIGIR), Sheffield, United Kingdom, July 2004.
    [112] Jun Yang, Qing Li, Liu Wenyin, Yueting Zhuang, Searching for Flash Movies on the Web: a Content and Context Based Framework. World Wide Web Journal, 8(4), pp. 495-517, Kluwer Academic Publishers, 2005.
    [113] Bin Cui, Ben Chin Ooi, Jianwen Su and Kian-Lee Tan. Indexing High-Dimensional Data for Efficient In-Memory Similarity Search. IEEE Transaction on Knowledge and Data Engineering, 17(3), March 2005.
    [114] H.V. Jagadish, Beng Chin Ooi, Heng Tao Shen, Kian-Lee Tan, Towards efficient multi-feature query processing. IEEE Transaction on Knowledge and Data Engineering, 18(3):350-362, 2006
    [115] Heng Tao Shen, Xiaofang Zhou and Bin Cui. Indexing and Integrating Multiple Features for WWW images[J]. World Wide Web (WWW), 9(3):343-364,2006
    [116] Oracle Inc. www.oracle.com. 2006
    [117] Sybase Inc. www.svbase.com. 2006
    [118] The INGRESS Project, http://db.cs.berkelev.edu/oldprojects.php. 2002
    [119] The Paradise Project, http://www.cs.wisc.edu/paradise/. 2002
    [120] IBM DB2. http://www-306.ibm.com/software/data/db2/. 2006
    
    [121] J. Hammer, H. Garcia-Molina, J. Widom, W. J. Labio, Y. Zhuge. "The Stanford Data Warehousing Project." IEEE Data Engineering Bulletin, June 1995.
    
    [122] Microsoft Encarta, http://www.encarta.msn.com. 2006
    [123] Jie Shao, Zi Huang, Heng Tao Shen, et al, Dynamic Batch Nearest Neighbour Search in Video Retrieval. In Proc. of 23rd IEEE International Conference on Data Engineering (ICDE), 2007.
    [124] G.Salton and M.J.McGill, Introduction to Modern Information Retrieval. McGraw-Hill Book Company, 1983.
    [125] J.P.Callan, W.B.Croft, and S.M.Harding, The inquery retrieval system, in Proc. of 3rd Int. Conf. on Database and Expert System Application.
    [126] Mehta M, DeWitt DJ. Data placement in shared-nothing parallel database systems. VLDB Journal, 1997,6(1):53-72.
    [127] He Z, Yu JX. Declustering and object placement in parallel OODBMS. In: Roddick JF, ed. Proc. of the 10th Australasian Database Conf., ADC'99. Auckland, 1999. 18-21
    [128] Ghandeharizadeh S, Wilhite D, Lin K, Zhao X. Object placement in parallel object-oriented database systems. In: Agrawal R, Dittrich KR, eds. Proc. of the 10th Int'l Conf. on Data Engineering. Houston: IEEE Computer Society, 1994.253-262.
    [129] H. C. Du and J. S. Sobolewski. Disk allocation for Cartesian product files on multiple-disk systems. ACM Transactions of Database Systems, 7(1):82-101, March 1982.
    [130] C. Faloutsos and P. Bhagwat. Declustering using fractals. In Proceedings of the 2nd International Conference on Parallel and Distributed Information Systems, pages 18-25, San Diego, CA, Jan 1993.
    [131] H. Ferhatosmanoglu, D. Agrawal, and A. El Abbadi. Concentric hyperspaces and disk allocation for fast parallel range searching. Technical Report TRCS99-03, Univ. of California, Santa Barbara, Jan. 1999.
    [132] K. A. Hua and H. C. Young. A general multidimensional data allocation method for multicomputer database systems. In Database and Expert System Applications, pages 401-409, Toulouse, France, Sept. 1997.
    [133]M.H.Kim and S.Pramanik.Optimal file distribution for partial match retrieval.In Proc.ACM SIGMOD Int.Conf.on Management of Data,pages 173-182,Chicago,1988.
    [134]S.Prabhakar,K.Abdel-Ghaffar,D.Agrawal,and A.El Abbadi.Cyclic allocation of two-dimensional data.In International Conference on Data Engineering,pages 94-101,Orlando,Florida,Feb 1998.
    [135]S.Prabhakar,D.Agrawal,and A.E1 Abbadi.Efficient disk allocation for fast similarity searching.In 10th International Symposium on Parallel Algorithms and Architectures,SPAA'98,Puerto Vallarta,Mexico,June 1998.
    [136]Napster.www.napster.com,2002.
    [137]Gnutella.www.gnutella.com,2002.
    [138]KaZaA,www.kazaa.com,2002.
    [139]王国仁,汤南,于亚新,孙冰,于戈.一种并行XML数据库分片策略,软件学报,Vol.17,No.4.pp.770-781.
    [140]David J.DeWitt,Jim Gray:Parallel Database Systems:The Future of High Performance Database Systems.Commun.ACM 35(6):85-98(1992)
    [141]Apostolos N.Papadopoulos Yannis Manolopoulos.Similarity query processing using disk arrays.In Proc.ACM SIGMOD Int.Conf.on Management of Data,1998,pp.225-236.
    [142]Papadopoulos A.,Manolopoulos Y.Parallel processing of nearest neighbor queries in declustered spatial data.In:Proc.ACM-GIS,Rockville,MD,1996,35-43
    [143]Papadopoulos A.,Manolopoulos Y..Nearest neighbor queries in shared-nothing environments.Geoinformatica,1997,1(4):369-392.
    [144]Arturo Crespo,Hector Garcia-Molina:Routing Indices For Peer-to-Peer Systems.ICDCS 2002:23-33.
    [145]S.Ratnasamy,P.Francis,M.Handley,R.Karp,and S.Shenker.A scalable content-addressable network.In Proceedings of the 2001 ACM Annual Conference of the Special Interest Group on Data Communication,pages 161-172,2001.
    [146]C.Tang,Z.Xu,and S.Dwarkadas.Peer-to-peer information retrieval using self-organizing semantic overlay networks.In Proceedings,of the ACM SIGCOMM,2003
    [147] Kalnis P., Ng W.S., Ooi B.C., Tan K.L., Answering Similarity Queries in Peer-to-Peer Networks. Information Systems. 2005
    [148] Shen, H-T, Zhou, X-F and Bin, C, "Indexing and Integrating Multiple Features for WWW images". World Wide Web Journal, 2006. 9(3):343-364.
    [149] Tamura, H., Yokoya, N. Image database systems: A survey. Pattern Recognition, 17(1):29-43. 1984.
    [150] Zhou, Z-H., Dai, H-B., (2007): Exploiting Image Contents in Web Search. Proceeding of International Joint Conference on Artificial Intelligence. Pp. 2922-2927.
    [151] Foote, J. An overview of audio information retrieval, Multimedia Systems, 7(1): 2-10. 1999.
    [152] Yong Rui, Thomas S. Huang: A novel relevance feedback technique in image retrieval. ACM Multimedia (2) 1999: 67-70.
    [153] Kelly, D. and Teevan, J. Implicit feedback for inferring user preference. SIGIR Forum. 2003. 37(2): 18-28.
    
    [154] Wu, F., Zhang, H., Zhuang, Y-T., Learning Semantic Correlations for Cross-Media Retrieval. ICIP: 2006. pp.1465-1468.
    [155] S.-F. Chang, W. Chen, H.J. Meng, H. Sundaram, and D. Zhong, "VideoQ-An Automatic Content-Based Video Search System Using Visual Cues," ACM Multimedia Conference, Nov. 1997, Seattle
    [156] Rong Yan, Alexander G. Hauptmann: Probabilistic latent query analysis for combining multiple retrieval sources. SIGIR 2006: 324-331
    [157] Thomas A. Funkhouser, Michael M. Kazhdan, Patrick Min, Philip Shilane: Shape-based retrieval and analysis of 3d models. Commun. of ACM. 2005. 48(6): 58-64.
    [158] Hong Zhang, Fei Wu, Bridging the Gap between Visual and Auditory Feature Spaces for Cross-media Retrieval. The 13th International MultiMedia Modeling Conference, Singapore. 2007
    [159] Hong Zhang, Jianguang Weng. Measuring Multi-modality Similarities via Subspace Learning for Cross-media Retrieval. In Proceedings of 7th Pacific- Rim Conference on Multimedia, 2006.
    [160]Fei Wu Yi Yang Yueting Zhuang Yunhe Pan,Understanding Multimedia Document Semantics for Cross-Media Retrieval,LNCS 3767(PCM 2005),993-1004
    [161]Yueting Zhuang,Yi Yang,Fei Wu,Yunhe Pan,Manifold Learning Based Cross-media Retrieval:A Solution to Media Object Complementary Nature,Journal of VLSI Signal Processing,46(2-3):153-164,2007.
    [162]YueTing Zhuang,Yi Yang,Fei Wu,Graph-based Cross-Media Retrieval,to appear in IEEE Transactions on Multimedia.
    [163]Yi Yang,YueTing Zhuang,Fei Wu,Yunhe Pan,Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-media Retrieval,to appear in IEEE Transactions on Multimedia.
    [164]庄毅,庄越挺,吴飞.一种支持海量跨媒体检索的集成索引结构.软件学报.(已录用)
    [165]The NASA Skyview.www.skyview.gsfc.nasa.gov.2000.
    [166]Jim Gemmell,Gordon Bell,Roger Lueder,Steven Drucker and Curtis Wong MyLifeBits:Fulfilling the Memex Vision.In Proc.of ACM Multimedia 2002.
    [167]The YouTube.www.youtube.com.2005.
    [168]http://www.dbanotes.net/opensource/youtube_web_arch.html
    [169]The Lycos.www.Lycos.com.2000.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700