搜索引擎FTP及其它协议资源的搜索算法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着互联网的不断发展,搜索引擎已经成为检索网络上信息的重要助手。世界各科研机构对于搜索引擎技术的研究和开发十分重视,北京化工大学也将搜索引擎的研究纳入学校211工程建设的子项目。文章所研究的“搜索引擎FTP及其它协议资源的搜索算法研究”,正是此研究项目的重要组成部分,主要的研究内容和成果如下。
     对FTP搜索引擎技术进行了深入的研究,了解了FTP协议规范、分析了其协议资源的搜索原理,在此基础上设计和实现了FSearch系统。这是一个面向校园网用户,自主研发的FTP搜索引擎系统。系统的设计和研发,综合运用了.NET平台开发、XML技术、C#编程语言、数据库开发、网络应用开发等相关平台、技术和编程语言。FSearch搜索引擎系统提供了支持复合查询的用户界面,实现了网站快照等功能。系统在经过了测试和试运行之后,目前已经正式发布运行,在化工大学有了实际的应用。
     对P2P协议资源的搜索进行了研究,分析了4种典型的P2P系统拓扑结构,得出了其结构的特性。在此基础之上,根据搜索引擎系统的特点,设计了基于超级节点方式的P2P搜索引擎系统,并实现了相关搜索算法。
     综合全分布式非结构化拓扑、全分布式结构化拓扑和半分布式拓扑结构的优缺点之后,文章提出了一种P2P系统的架构模型,即“复合模型的P2P系统架构”,对其设计和实现方法做了阐述和说明,并提出了一些改进策略。
Today Search Engineer becomes the most import assistant, with the growing of Internet. Scientific research organization attaches importance to the research on Search Engine Technology. Beijing University of Chemical Technology bring the research on Search Engine Technology into sub-project of the 211 Project. It is the most import part of this project, which this paper research content "Search Engineer: researching the resource search arithmetic about ftp and others protocol".
     Studied the FTP Search Engine technology, acquired the knowledge of FTP protocol, and analyzed search theory about FTP protocol resource. Designed and developed FSearch system on base of this work. This system is self developed and face the campus people. Apply lots of technology, such as .NET, XML, C#, Database, Network and so on, in the course of developing system. The FSearch Search Engineer system provide user interface which support complex search, and completed host snap function. Now, system has completed testing, and using in Beijing University of Chemical Technology.
     Studied the peer to peer protocol. Analyzed four typical peer to peer system topology, and found out the characteristic of this topology. On this base, design a peer to peer search engineer system which depend on super node, finally implement the search arithmetic.
     Give out a peer to peer system structure, a complex peer to peer system structure; by synthesize Decentralized Unstructured Topology Decentralized Structured Topology and Partially Decentralized Topology. Finally explain the method of design and realize this system structure, and give some improve advice.
引文
[1]方志坚,张瑞林,童小素.搜索引擎综合分析[J].计算机工程与设计,2007,28(16):4038-4041
    [2]彭建荣,罗永会.搜索引擎的基本原理及发展趋势[J].电脑知识与技术,2006,(1):84-85
    [3]Mahesh S.Raisinghani.Search Engine Technology:A Closer Look at Its Future[J].Information Resources Management Journal,2005,18(2):1-7
    [4]张兴华.搜索引擎技术及研究[J].现代情报,2004,24(4):142-145
    [5]杨先明.网络信息搜索及未来趋势[J].现代情报,2006,26(3):48-49
    [6]原福永,梁顺攀.元搜索引擎的现状与发展[J].计算机工程与设计,2005,26(12):3278-3280
    [7]张兴华.搜索引擎未来技术试探[J].情报杂志,2004,23(8):95-96
    [8]RUSSELL KAY.Search Engine Optimization[J].Computerworld,2007,41(23):40-40
    [9]Christine Churchill.Search Engine Algorithms[J].Software World,2005,36(3):11-12
    [10]王丽华,何利娟,田文英.浅析FTP工作原理及应用技术[J].石家庄职业技术学院学报,2007,19(4):28-29
    [11]Alexander Gladshtein.Exploring FTP in.NET 2.0[J].Net Developer's Journal,2006,4(9):28-32
    [12]Jay Oswal,Chuck Lundgren.6 FTP Tips for the iSeries[J].iSeries News,2005,3(12):31-37
    [13]史艳,李伟生.基于XML的搜索引擎技术的研究与设计[J].计算机工程与设,2004,25(9):1488-1491
    [14]瞿裕忠,张剑锋,陈峥等.XML语言及相关技术综述[J].计算机工程,2000,26(12):3-5
    [15]戴蓓洁,余双,金蓓弘.基于DOM解析器的XML编辑器研究[J].计算机工程与设计,2007,28(22):5334-5337
    [16]刘芳.XML的标准规范[J].电脑知识与技术,2007,4(20):309-311
    [17]王昌.HTML页面加载XML文档的几种方法[J].电脑开发与应用,2007,20(12):75-75
    [18]姚树宇,赵少东.一种使用分布式技术的搜索引擎[J].计算机应用与软件,2005,22(10):127-129
    [19]窦天芳,李健,张成昱.基于P2P技术的搜索引擎[J].情报科学,2006,24(3):417-420
    [20]刘淑娴,李晓华.基于Peer-to-Peer的搜索引擎的发展[J].喀什师范学院学报,2005,26(6):62-64
    [21]冯国富,张金城,姜玉泉等.无结构P2P覆盖网络的拓扑优化[J].软件学报,2007,18(11):2820-2829
    [22]Sai Ho Kwok.P2P searching trends:2002-2004[J].Information Processing &Management,2006,42(1):237-247
    [23]Cullen Jennings,David A.Bryan.P2P For Communications:Beyond File Sharing[J].Business Communications Review,2006,36(2):36-40
    [24]Cade Metz.P2P GOES PRIVATE[J].PC Magazine,2005,24(2):96-98
    [25]Yatin Chawathe,Sylvia Ratnasamy,Lee Breslau etc.Making Gnutella-like P2P Systems Scalable[J].Computer Communication Review,2003,33(4):407-418
    [26]Andrew Herbert.What Happened to Pastry[J].Operating systems review,2007,41(2):10-16

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700