复杂网络与互联网个性化信息服务的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
继二十世纪末复杂网络的小世界效应及无标度性的发现之后,复杂网络的研究得到了越来越多的关注,来自各个学科的研究者们从各个层面对复杂网络展开了深入的研究,复杂网络已经成为一个充满生命力的交叉研究领域。目前,复杂网络的研究主要集中在两个方面:一方面是复杂网络理论性的分析与仿真,新的理论模型和新的分析方法不断涌现;另一方面是从现实网络中不断发现新结构与新现象,运用复杂网络理论来观察、理解和解决具体应用问题。
     随着信息技术的发展和互联网的普及,Web2.0已经成为新一代互联网应用的发展趋势。Web2.0系统中存在着大量的非线性、自组织和涌现现象。将复杂网络研究与Web2.0相结合,不仅有助于正确认识和理解Web2.0,以及对Web2.0的下一步发展有指导性意义,同时也将启发、推动复杂网络的理论研究工作。
     个性化是Web2.0的主要特点之一。个性化信息服务已经成为互联网应用的一个重要的研究热点,得到了越来越多研究者的关注。其中,用户建模,聚类、分类以及自动推荐技术又是个性化信息服务中的关键技术,这些技术的研究必将有力地推动互联网大规模的个性化信息服务。
     本论文围绕以上几个方面,将复杂网络的理论方法与互联网个性化信息服务相结合,进行了深入的研究和实践。论文的主要内容为:
     首先,将复杂网络的研究与Web2.0相结合。具体包括:一、用复杂网络的理论方法,研究Web2.0系统中存在的非线性机制,自组织和涌现现象。二、研究复杂网络中的社团(community)发现理论,提出具有交联结构的复杂网络中的可重叠社团发现算法。三、对Web2.0中一个具有交联结构的复杂网络中的社团结构进行统计分析。
     其次,将复杂网络特征应用于关键词抽取和聚类分析中,具体包括:一、研究了汉语语言所组成的自然语言网络中的“小世界”特性,提出基于复杂网络特征的关键词抽取算法。该算法综合考察单词在语言网络中的连接度和聚集性质,抽取复杂网络综合特征值高的节点作为关键词,旨在找到那些可能相对低频,但对文章主题表达起重要作用的单词。二、在对复杂网络重要特征深入研究的基础
After the discovery of small-world phenomena and scale-free characteristics of complex networks in the late 20~(th) century, the research of complex networks has been gotten more and more attention. Researchers from different fields studied complex networks from every level. Complex networks has become a vital topic crossed with many other research fields. Nowadays the research of complex networks has been mainly focused on the two aspects: One aspect is theoretical analysis and simulation, in which new theoretical models and methods were proposed continually; The other aspect is applied research, in which new structures and phenomena of real-world network were discovered continually.
    With the development of information technology and the popularity of Internet, Web2.0 has become an important trend in the application of Internet. There are many non-linear, self-organize, and emergence phenomena in Web2.0 systems. So it is important that the theoretical methods of complex networks are applied to Web2.0, which can not only benefit understanding Web2.0 and guide the further development of Web2.0, but also can accelerate the theoretical research of complex networks.
    Personalization is a major feature of Web2.0. Personalized information service has been one of the hottest research points in the applied research of Internet. User profiling, clustering, classification and automatic recommendation are the crucial techniques in personalized information service. So the research of these crucial techniques will promote the large-scale personalized information service on Internet efficiently.
    This dissertation focuses on the above aspects and combines the theoretical methods of complex networks with the research of personalized information service. The research work of this dissertation can be summarized as follows:
    Firstly, this dissertation combines the research of complex networks with Web2.0. Main works are summarized as follows: (1) The non-linear mechanism, self-organize and emergence phenomena in Web2.0 systems are studied using the theoretical methods of complex networks. (2) A novel algorithm for finding the overlapping community in the complex networks with intersection structure is proposed. (3) The overlapping community structure in the complex networks with intersection structure is analyzed statistically.
    Secondly, the features of complex networks are applied to the automatic keywords extraction and clustering analysis. Detailed works are summarized as follows:(l) The small-world structure in human language network of Chinese is studied. A novel automatic keywords extracting algorithm based on the features of complex networks is proposed. This algorithm extracts those words with higher degree and clustering coefficient in the language network as keywords. Its goal is to extract keywords which may be relatively low frequency, but do great contribute to the subject of the text. (2) The definitions of the weighted complex networks features are given after the deeply studying on the features of complex networks. A novel
引文
1 http://del.icio.us 美味书签
    2 http://www.boomyland.com/ 步一空间
    3 http://www.douban.com 豆瓣
    4 http://www.360doc.com 360doc
    5 http://rcsearch.microsoft.com/scg/Microsoft: Social Computing Group.
    6 http://domino.watson.ibm.com/cambridge/research.nsf/pages/cue.html IBM: The Collaborative User Experience(CUE) Research group.
    7 http://www.tianji.com 天际网
    8 http://www.cnnic.net.cn/uploadfiles/pdf/2006/7/19/103651.pdf
    9 http://www9.org/
    10 http://www.people.com.cn/GB/paper464/
    11 http://www.people.com.cn/GB/paper464/
    12 http://www.people.com.cn/GB/paper464/9567/883861.html
    13 http://www.nlp.org.cn/docs/docredirect.php?doc_id-295
    14 http://www.nlp.org.cn/
    15 http://www. keenage. com.
    16 http://www.nlp.org.cn/
    17 http://forums.us.dell.com/supportforums?~ck=mn
    [A. Clauset, et al, 2004]A. Clauset, M.E.J. Newman, C. Moore. Finding community structure in very large networks [J]. Phys Rev E, 2004, 70(6): 066111.
    
    [A. Joseph, et al., 1997] A. Joseph, Konstan, N. Bradley, Miller, M. David. GroupLens: Applying Collaborative Filtering to Usenet News. Communication of the ACM, 1997,40(3):77-87.
    
    [A. Hotho, et al., 2002]A. Hotho, S. Staab, A. Maedche. Ontology-based Text Clustering. KI 16, 2002.
    
    [A. Hotho, et al., 2003] A. Hotho, A. Maedche , S. Staab. Ontologies Improve Text Document Clustering, ICDM2003, 2003.
    
    [A. Hotho, et al., 2003] A.Hotho, S.Staab, G.Stumme. Text Clustering Based on Background Knowledge (Technical Report). University of Karlsruhe, Institute AIFB, 2003.
    
    [A.H. Tan, et al., 1998] A.H. Tan, C. Teo. Learning User Profiles for Personalized Information Dissemination. IEEE International Joint Conference on Neural Networks, 1998, 183-188.
    
    [A. L. Barabasi, et al.,1999] A. L. Barabasi, R. Albert. Emergence of scaling in random networks. Science, 1999,286:509-512.
    
    [A. L. Barabasi, et al.,1999] A. L. Barabasi, R. Albert, H. Jeong. Mean-field theory for scale-free random networks. Physica A, 1999, 272:173-187.
    
    [A. Pothen, et al., 1990] A. Pothen, H. Simon, K. P. Liou. Partitioning sparse matrices with eigenvectors of graphs. SIAM J Matrix Anal Appl, 1990, 11(3):430-452.
    
    [A. Pretschner, et al., 1999] A. Pretschner, S. Gauch. Ontology Based Personalized Search. In Proceedings of 11~(th) IEEE International Conference on Tools with Artificial Intelligence, 1999,391-398.
    
    [A. Stefani, et al., 1998] A. Stefani, C. Strappavara, Personalizing Access to Web Sites: The SiteIF Project. In Proceedings of the 2~(nd) Workshop on Adaptive Hypertext and Hypermedia( HYPERTEXT'98). Pittsburgh, USA, 1998.
    
    [B. Bollabas, 1981] B. Bollabas. Degree sequence of a random graphs. Discrete Math., 1981, 33:1-19.
    
    [B. Bollobas, 1998] B. Bollobas. Modern Graph Theory. New York: Springer, 1998.
    
    [B. Choudhary, et al., 2002] B. Choudhary, P. Bhattacharyya. Text Clustering using Semantics. WWW2002, Hawai, USA, May 2002.
    [B.Klmit, et al., 2004] B.Klmit, Y.Yang. The Enron Corpus: A New Dataset for Email Classification Research. ECML04. Pisa, Italy: 2004
    [B. Sarwar, et al., 2000] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Analysis of recommender algorithms for e-commerce. In Proceedings of the 2~(nd) ACM E-Commerce Conference, New York: ACM Press, 2000, 158-167.
    [B. Sarwar, et al.,2001] B. Sarwar, G. Karpis, J. Konstan, J. Riedl. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10~(th) International World Wide Web Conference. New York: ACM Press, 2001,285-295.
    [B.Smith, et al., 1995] B.Smith, M.T.Keane, Remembering to forget: A Competence-Preserving Case Deletion Policy for Case-Based Reasoning Systems[J]. In: Proc. 14th International Joint Conference on Artificial Intelligence,1995,337-382.
    [B. Smyth, et al., 2002] B. Smyth, K. Bradley, R. Rafter. Personalized Techniques for Online Recruitment Services. Communications of the ACM, 2002, 45(5): 39-40.
    [B. W. Kernighan, et al., 1970] B. W. Kernighan, S. Lin. A efficient heuristic procedure for partitioning graphs. Bell System Technical Journal, 1970, 49(2):291-307.
    [B. Xiao, et al., 2003] B. Xiao, E. Aimeur, J.M. Fernandez. PCFinder: an intelligent product recommendation agent for e-commerce. IEEE International Conference on E-Commerce. 2003,181-188.
    [C. Li, et al., 2003] C. Li, J.R. Wen, H.Li. Text classification using stochastic keyword generation. Proceedings of the 20~(th) International Conference on Machine Learning(ICML-2003). Washington DC: 2003.
    [C. Ordonez, 2003] C. Ordonez. Clustering binery data streams with K-means[C]. ACM DKMD Workshop, San Diego, California, 2003.
    [C. Ordonez, et al., 2004] C. Ordonez, E. Omiecinski. Efficient disk-based K-means clustering for relational databases[J]. IEEE Trans. Knowledge and Data Enginerring, 2004,16(8): 909-921.
    [C. Shahabi, et al., 2003] C Shahabi, Y.S Chen. Recommendation System without Explicit Acquisition of Use Relevance Feedback. Distributed and Parallel Databases, 2003, Springer, 14:173-192.
    [蔡庆生,2005] 蔡庆生,耿焕同,赵鹏.协调式人工智能研究.《2005中国人工智能进展》中国人工智能学会第11届全国学术年会,武汉,2005:131-135.
    [蔡智,2002] 蔡智,基于Web的中文信息智能获取研究,中国科学技术大学博士学位论文,2002。
    [陈宁,2002] 陈宁,陈安,周龙骧,贾维嘉,罗三定.基于模糊概念图的文档聚类及其在Web中的应用.软件学报,2002,13(8):1598—1605。
    [崔林,2005] 崔林,宋瀚涛,陆玉昌,基于语义相似性的资源协同过滤技术研究,北京理工大学学报,2005,25(5):402—405。
    [D.D. Lewis, 1992] D.D. Lewis, An evaluation of phrasal and clustered representations on a text categorization task. In Proceedings of SIGIR-92, 15~(th) ACM International Conference on Research and Development in Information Retrieval, Copenhagen, Denmark, 1992, 37-50.
    [D. Fell, et al., 2000] D. Fell, A. Wagner. The small world of metabolism. Nature Biotechnology 18, 2000, 1121-1122.
    [D. Harman, 1992] D. Harman. Relevance feedback revisited, In Proceedings of the 15~(th) annual international ACM SIGIR conference on Research and development in information retrieval. New York: ACM Press, 1992,1-10.
    [D.J. Watts, et al., 1998] D.J. Watts, S.H. Strogatz. Collective dynamics of small-world networks. Nature 393, 1998, 440-442.
    [D.L. Lee, et al.,1997] D.L. Lee, H. Chuang, K. Seamons. Document ranking and the vector-space model[J]. IEEE Software, 1997,14(2):67-75
    [D. Mladenic, 1996] D. Mladenic. Personal WebWatcher: Implementation and Design. Technical Report, US-DP-7472, Pittsburgh: Carnegie Mellon University, 1996.
    [邓爱林,2004] 邓爱林,左子叶,朱扬勇,基于项目聚类的协同过滤推荐算法,小型微型计算机系统,2004,25(9),1665—1670
    [E. Casasola, 1998] E. Casasola. ProFusion Personal Assistant: an agent for personalized information filtering on the WWW. Master's thesis, The University of Kansas, Lawrence, KS, 1998.
    [F. Asnicar, et al., 1997] F. Asnicar, C. Tasso, IfWeb: A Prototype of User Models Based Intelligent Agent for Document Filtering and Navigation in the World Wide Web. In Proceedings of UM'97. Sardinia: Chia Laguna, 1997.
    [F. Radicchi, et al., 2004] F. Radicchi, C. Castellano, F. Cecconi. Defining and identifying communities in networks. Proc Natl Acad Sci, 2004, 101(9):2658-2663.
    [F. Schastiant, 2002] F. Schastiant. Machine learning in automated text categorization, ACM Computing Surveys,2002,34(1):1-47
    [F. Sebastiani, 2002] F. Sebastiani. Machine learning in automated text categorization, ACM Computing Surveys, Vol. 34, No. 1, March 2002, 1-47.
    [E Wu, et a1.,2004] F. Wu, B.A. Huberman. Finding communities in linear time: a physics approach. Eur Phys J B, 2004,38(2):331-338.
    [樊国萍,2005] 樊国萍,我国个性化信息服务研究综述,新世纪图书馆,2005,5,22—25。
    [傅伟鹏,2002] 傅伟鹏,吴斌,何清,史忠植.一种概念空间自生成方法.计算机工程与应用,2002,7:63—65
    [G. Bianconi, et al., 2001] G. Bianconi, A.L. Barabasi. Competition and multiscaling in evolving networks. Europhys. Lett., 2001, 54:436-442.
    [G.B. West, et al., 1997] G.B. West, J.H. Brown, B.J. Enquist. A general model for the origin of allometric scaling laws in biology. Science 276, 1997, 122-126.
    [G. Escudero, et al., 2000] G. Escudero, L. Marquez, G. Rigau. Boosting applied to word sense disambiguation[C]. In Proceedings ofECML-00, 11~(th) European Conference on Machine Learning,2000, 129-141
    [G. Miller, 1995] G. Miller. WordNet: A lexcial database for English. CACM, 38(11): 39-41, 1995.
    [G. Palla, et al., 2005] G. Palla, I. Dernyi, I. Farkas, et al. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 2005,435(7043):814-818.
    [G. Szabo, et al., 2003] G. Szabo, M.J. Alava, J. Kertesz. Structural transitions. In scale-free networks. Phys. Rev. E, 2003, 67:056102.
    [耿焕同,2006] 耿焕同,范例推理与互联网文本信息处理研究,中国科学技术大学博士学位论文,2006。
    [宫秀军,2002] 宫秀军,史忠植.基于Bayes潜在语义模型的半监督Web挖掘.软件学报,2002,13(8):1508-1514。
    [H.J. Peat et al., 1991] H.J. Peat, P. Willet. The limitations of term co-occurrence data for query expansion in document retrieval systems. Journal of American Society for Information Science, 1991,42(5): 378~383.
    [H. Jeong, et al., 2000] H. Jeong, B. Tombor, R. Albert, Z. Oltvai, A. Barabasi. The large-scale organization of metabolic networks. Nature 406, 2000,651-654.
    [H. Lieberman,1995] H. Lieberman. Letizia: An agent that assists Web browsing. In Proc. Intl. Conf. on AI, Montreal, Canada, August 1995.
    [H. Sakagami, et al.,1998] H. Sakagami, T. Kamba, A. Sugiura. Effective personalization of push-type systems: visualizing information freshness. Computer network and ISDN systems, 1998(30):53-63.
    [H. Sorensen, et al., 1995] H. Sorensen, M. Mcelligot, PSUN: A Profiling System for Usenet News, In Proceedings of CKIM'95 Workshop on Intelligent Information Agents. Baltimore, USA, 1995.
    [韩靖,2002] 韩靖.多主体系统的AER模型研究.中国科学技术大学博士论文.2002。
    [J. Han, et al., 2000] J. Han, Micheline. Data Mining: Concepts and Techniques[M]. San Francisco: Morgan Kaufmann Publishers, 2000.
    [J.G.. White, et al., 1986] J.G.. White, E. Southgate, J.N. Thompson and S. Brenner. The structure of the nervous system of the nematode C. elegans, Philosophical Transactions of the Royal Society of London- Series B: Biological Sciences 314,1986, 1-340.
    [J.M. Kleinberg, et al, 2005] J.M. Kleinberg, P. Raghavan. Query Incentive Network. In FOCS'05: 46~(th) Annual IEEE Symposium on Foundation of Computer Science. Pittsburgh, 2005, 132-144.
    [J.R. Banavar, et al., 1999] J.R. Banavar, A. Maritan, A. Rinaldo, Size and form in efficient transportation networks, Nature399, 1999, 130-132.
    [J. Scott, 2000] J. Scott. Socail Network Analysis: A Handbook 2~(nd) edn, Sage, London, 2000.
    [L. A. Adamic, et al., 2000] L. A. Adamic, B. A. Huberman, Power-law distribution of the world wide web, Science, 2000, 287(5461), 2115.
    [L.A.N. Amaral, et al., 2000] L.A.N. Amaral, A. Scala, M. Barthelemy, H.E. Stanley. Classes of small-world networks. Proceedings of the National Academy of Sciences, USA, 2000, 97(21): 11149-11152.
    [L. Chen, et al.,1997] L. Chen, K. Sycara. WebMate: A Personal Agent for Browsing and Searching. In Proceedings of the 2~(nd) International Conference on Autonomous agents. New York: ACM Press, 1998, 132-139.
    [L.D. Baker, et al., 1998] L.D. Baker, A.K. McCallum. Distributional clustering of words for text classification[C]. In Proceedings of SIGIR-98, 21~(st) ACM International Conference on Researchand Development in Information Retrieval, 1998, 96-103
    [林鸿飞,2005] 林鸿飞,杨志豪,赵晶,基于内容和合作模式的信息推荐机制,中文信息学报,2005,19(1):48—55。
    [李振星,2004] 李振星,陆大珏,任继成,唐卫清,唐容锡,基于潜在语义索引的Web信息预测采集过滤方法,计算机辅助设计与图形学学报,2004,16(1),143—147。
    [路甬祥,2001] 路甬祥.路甬祥院长报告.中科院二00一年度工作会议,2001。
    [M.E.J. Newman, et al., 2001] M.E.J. Newman, S. Strogatz, D. Watt. Random graph with arbitrary degree distributions and their applications. Phys. Rev. E, 2001,64:026118.
    
    [M.E.J. Newman, 2001] M.E.J. Newman. The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, USA, 2001, 98: 404-409.
    
    [M.E.J. Newman, 2003] M.E.J. Newman. The structure and function of complex networks. SIAM Rev., 2003, 45:167-256.
    
    [M.E.J. Newman, 2004] M.E.J. Newman. Fast algorithm for detecting community structure in networks [J]. Phys Rev E, 2004, 69(6): 066133.
    
    [M.E.J. Newman, et al, 2004] M.E.J. Newman, M. Girvan. Finding and evaluating community structure in networks [J]. Phys Rev E, 2004, 69(2):026113.
    
    [M. Fiedler, 1973] M. Fiedler. Algebraic connectivity of graphs. Czech Math J, 1973, 23(98):298-305.
    
    [M.F. Caropreso, et al., 2001] M.F. Caropreso, S. Matwin, F. Sebastiani, A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In Text Databases and Document Management: Theory and Practice, A.G. Chin, ed. Idea Group Publishing, Hershey, PA, 78-102.
    
    [M. Montes, et al., 2001] M. Montes, Y. Gomez, A.Gelbukh. Text Mining with Conceptual Graphs. 2001.
    
    [M. Pazzani, et al.,1997] M. Pazzani, D. Billsus. Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning, 1997, 27:313-331.
    
    [M. Shepherd, et al., 2002] M. Shepherd, C. Watters, A.T. Marath. Adaptive user modeling for filtering electronic news. In Proceedings of the 35~(th) Annual Huwaii International Conference on System Sciences. 2002, 1180-1188.
    
    [M.Waldrop, 1997] M. Waldrop.著, 陈玲译. 复杂 :诞生于秩序与混沌边缘的科学. 北京三联 书店. 1997.
    
    [N. Fuhr, et al., 1991] N. Fuhr, S. Hartmann, G. Knorz, G. Lustig, M. Schwantner, K. Tzeras, AIR/X—a rule-based multistage indexing system for large subject fields[C]. In Proceeding of RIAO-91, 3~(rd) International Conference, 1991, 606-623.
    
    [N. Slonim, et al., 2001] N. Slonim, N. Tishby. The power of word clusters for text classification[C]. In Proceedings of ECIR-01, 23~(rd) European Colloquium on Information Retrieval Research. 2001.
    
    [P. Cunningham, et al.,2000] P. Cunningham, R. Bergmann, S. Schmitt, R. Traphoner, S. Breen, B. Smyth. WEBSELL: Intelligent Sales Assistants for the World Wide Web. Technical Report, TCD-CS-2000-42, Dublin: Trinity College, 2000.
    [P. Erdos, et al.,1959] P.Erdos, A. Renyi. On random graphs. Publications Mathematicae 6,1959,290-297.
    [P. Erdos, et al.,1960] P. Erdos, A. Renyi. On the evolution of random graphs. Publications Mathematicae Inst. Hungar. Acad. Sci, 1960,17-61.
    [P. Erdos, et al.,1961] P.Erdos, A. Renyi. On the strength of connectedness of a random graphs. Acta. Math. Acad. Sci. Hungar, 1961,12:261-267.
    [P.L. Krapivsky, et al., 2002] P.L. Krapivsky, S. Redner. A statistical physics perspective on web growth. Computer Networks, 2002, 39:261-276.
    [P. Pantel, et al., 2002] P. Pantel, D. Lin. Document Clustering with Committees. Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002
    [潘金贵,2001] 潘金贵,胡学联,李俊,张灵玲,一个个性化的信息搜索Agent的设计与实现,软件学报,2001,12(7):1074—1079。
    [钱学森,1990] 钱学森,于景元,戴汝为.一个科学的新领域——开放的复杂臣系统及其方法论.自然杂志.1990,13(1),3—10。
    [钱学森,1991] 钱学森.再谈开放的复杂巨系统.模式识别与人工智能.1991,4(1),5—8。
    [R. Agrawal, et al., 1993] R. Agrawal, T. Imieliski, A Swami. Mining Association Rules Betwecn Sets of Items in Large Database. Proc of ACM SIGMOD Intl Conf on Management of Data(SIGMOD'93),207-216, 1993.
    [R. Albert, et al., 2002] R. Albert, A.L. Barabasi. Statistical mechanics of complex networks. Rev. Mod. Phys., 2002, 74:47-97.
    [R. Burke, et al., 1997] R. Burke, K. Hammond, B. Young. The FindMe Approach to Assisted Browsing. IEEE Expert, 1997, 12(4):32-40.
    [R.D. Lawrence, et al., 2001] R.D. Lawrence, G.S. Almas, V. Kotlyar. Personalization of supermarket product recommendations. Data Mining and Knowledge Discovery, 2001, 5(1/2): 11-32.
    [R. Ferrer i Cancho, et al., 2001] R. Ferrer i Cancho and R. V. Sole, The small world of human language, Santa Fe Institute working paper ,2001.
    [R.M. Shiffrin, et al., 2004] R.M. Shiffrin, K. Borner. Mapping knowledge domain. Proc. Natl. Acad. Sci. USA 101,5183-5185, 2004.
    [R.J. Williams, et al., 2000] R.J. Williams, N.D. Martinez. Simple rules yield complex food webs. Nature 404, 2000, 180-183.
    [R.J. Williams, et al., 2001] R.J. Williams, E.L. Berlow, J.A. Dunne, A.L. Barabasi, N.D. Martinez. Two degrees of separation in complex food webs. Santa Fe Institute working paper; 2001-07-036.
    [R. Schank, 1982] R. Schank. Dynamic memory: A theory of learning in computers and people. New York: Cambridge University Press, 1982.
    [S. Brin, 1998] S. Brin, Lawrence Page. The Anatomy of a Large-Scale Hypertextual Web Search Engine. In 7~(th) International World Wide Web Conference, 1998.
    [S.E. Middleton, et al., 2004] S.E. Middleton, N.R. Shadbolt, D.C. Roure. Ontological User Profiling in Recommender Systems. ACM Transactions on Information Systems, 2004,22(1): 54-88.
    [S. Knudsen, 2004] S. Knudsen. A Guide to analysis of DNA Microarray Data 2~(nd) edn, Wiley-Liss, New York, 2004
    [S. Milgram, 1967] S. Milgram. The small world problem. Psychology Today 2, 1967, 60-67.
    [S.N. Dorogovtsev, et al., 2000] S.N. Dorogovtsev, J.F.F. Mendes. Scaling behaviour of developing and decaying networks. Europhys. Lett., 2000, 52:33-39.
    [S.N. Dorogovtsev, et al., 2002] S.N. Dorogovtsev, J.F.F. Mendes. Evolution of networks. Adv. Phys., 2002, 51:1079-1187.
    [S.T. Dumais, et al,,1988] S.T. Dumais, G.W. Frunas, T.K. Landauer, S. Deerwester. Using latent semantic analysis to improve information retrieval. In Proc. Of CHI'88,281-285.
    [S.T. Dumais, 1991] S.T. Dumais. Improving the retrieval of information from external sources. Behav.Res.Methods, Instr. Comput. 1991,23,229-236.
    [S.T. Dumais, 1995] S.T. Dumais. Latent Semantic Indexing(LSI):TREC-3 report. In The 3rd Text Retrieval Conference,1995, D.Harman Ed.219-230.
    [史定华,2005] 史定华,网络——探索复杂性的新途经,系统工程学报,2005.20(2):115—120.
    [史忠植,2002] 史忠植,知识发现,清华大学出版社,2002.
    [宋丽哲,2005] 宋丽哲,牛振东,宋瀚涛等,数字图书馆个性化服务用户模型研究.北京理工大学学报,2005,25(1):58—62
    [宋擒豹,2002] 宋擒豹,沈钧毅.基于关联规则的Web文档聚类算法.软件学报,2002.,13(3):417—423。
    [T. Joachims, et al., 1997] T. Joachims, D.Freitag, and T. Mitchell. WebWatcher: A Tour Guide for the World Wide Web. In Proc. IJCAI'97, August 1997.
    [T.R. Gruber, 1995] T.R. Gruber. Toward Principles for the Design of Ontologies Used for Knowledge Sharing. Int. Journal of Human and Computer Studies, 1995, 43(5):907-928.
    [T. Walsh, 1999] T. Walsh. Search in a small world. In Proc. IJCAI-99, 1999, 1172-1177.
    [W. Kim, et al., 2002] W. Kim, L. Kerschberg, A. Scime. Learing for automatic personalization in a semantic based meta-search agent. Electronic Commerce Research and Application, 2002,1:150-173.
    [W. S. Li, et al., 1999] W. S. Li, Q. Vu, D. Agrawal. PowerBookmarks: a system for personalizable web information organization, sharing, and management. In Proceedings of the 8~(th) International World Wide Web Conference, Toronto, 1999, 297-311.
    [吴斌,2002] 吴斌,傅伟鹏,郑毅,刘少辉,史忠植.一种基于群体智能的Web文档聚类算法.计算机研究与发展,2002.,39(11):1429—1435
    [吴丽花,2006]吴丽花,刘鲁,个性化推荐系统用户建模技术综述,情报学报,2006,25(1),55—62。
    [肖明军,2004] 肖明军,多策略的Web信息采集系统的研究,中国科学技术大学博士学位论文,2004。
    [解刍,2005] 解刍,汪小帆,复杂网络中的社团结构分析算法研究综述,复杂系统与复杂性科学,2005,2(3):1—12.
    [许欢庆,2004]许欢庆,王永成,基于加权概念的用户兴趣建模,上海交通大学学报,2004,38(1),34—38。
    [许国志,2000] 许国志.系统科学.上海:上海科技教育出版社,2000.
    [行小帅,2003] 行小帅,潘进,焦李成,基于免疫规划的K-means聚类算法.计算机学报,2003,26(5):605—610.
    [Y. H. Li, et al., 1998] Y. H. Li, A. K. Jain. Classification of text documents[J]. Comput. J. 1998, 41,8,537-546.
    [Y. Li, et al., 2003] Y. Li, N. Zhong. Ontology-based web mining model: representations of userprofiles. In Proceedings of IEEE/WIC International Conference on Web Intelligence.2003,96-103.
    [Y. Matsuo, et al., 2001] Y. Matsuo, Y. Ohsawa, M. Ishizuka. A document as a small world. JSAI 2001 Workshops, LNAI 2253, 444-448, 2001.
    [Y. Yang, et al., 1997] Y.Yang, J.O. Pedersen. A comparative study on feature selection in text categorization. 1997, In Proceedings of ICML-97, 14~(th) International Conference on Machine Learning, 412-420
    [Y. Yang, et al., 2002] Y. Yang, S. Slattery, R. Ghani. A study of approaches to hypertextcategorization. Intell. Inform. Syst. 2002,18,2/3, 219-241.
    [杨尔弘,2001] 杨尔弘,张国清,张永奎.基于义原同现频率的汉语词义排歧方法.计算机研究与发展,Vol.38,No.7,2001.
    [杨善林,2004] 杨善林,倪志伟.机器学习与智能决策支持系统.北京:科学出版社,2004
    [杨艳,2005] 杨艳,李建中,高宏,数字图书馆中基于Ontology的用户偏好模型,软件学报,2005,16(12),2080—2087。
    [姚欣,2005] 姚欣,复杂网络及其聚集度研究,清华大学博士学位论文,2005.
    [于琨,2005] 于琨,互联网半结构化信息抽取研究,中国科学技术大学博士学位论文,2005。
    [Zhu Mengxiao, 2003] Zhu Mengxiao, Cai Zhi, Cai Qingsheng; Automatic Keywords Extraction Of Chinese Document Using Small World Structure, 2003 IEEE International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE'03, Beijing, 2003. 10。
    [Z. Pawlak, 1991] Z. Pawlak. Rough Sets: Theoretical Aspects of Reasoning About Data. Dordecht: Kluwer Academic Publishers, 1991.
    [曾春,2002] 曾春,刑春晓,周立柱,个性化服务技术综述,软件学报,2002,13(10):1952—1961.
    [曾春,2003] 曾春,刑春晓,周立柱,基于内容过滤的个性化搜索算法,软件学报,2003,14(5):999—10004。
    [张钹,1990] 张钹,张铃.问题求解的理论及应用.北京:清华大学出版社,1990
    [张铃,2003] 张铃,张钹.模糊商空间理论(模糊粒度计算方法).软件学报,2003,14(4):770~776
    [张树人,2006] 张树人.从社会性软件、Web2.0到复杂适应信息系统研究.中国人民大学博士学位论文.2006
    [张燕平,2004] 张燕平,张铃,吴涛.不同粒度世界的描述法—商空间法.计算机学报,2004,27(3):328~333
    [朱孟潇,2004] 朱孟潇.复杂网络系统的智能涌现及其应用研究.中国科学技术大学硕士学位论文,2004.
    [周军锋,2004] 周军锋,汤显,郭景峰,一种优化的协同过滤推荐算法,计算机研究与发展,2004,41(10):1842—1847。
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.