构件智能获取系统的设计与实现
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着大型软件的发展和广泛应用,软件重用技术被视为解决软件重复开发,减少软件设计费用,提高软件开发效率的关键技术。构件挖掘技术能为软件重用提供大量封装好的素材。虽然网络上分散着大量的构件,但是目前构件挖掘技术仍然局限在本地数据库或者局域网范围内,而现存的网络搜索引擎基本上还只能获取文本信息,不能有效地进行构件搜索和挖掘,所以本文主要讨论基于网络的构件搜索,构件信息获取问题。
     本文首先深入分析了目前构件挖掘技术存在的问题,现有解决方法的局限性,介绍了构件智能挖掘系统结构。然后详细讨论了Java Bean构件的存在形态和在网络上的发布方式,并且对Java Bean构件发布时,嵌入在applet小程序中的大量参数进行了仔细分析,提出了根据参数设置分析构件性质的方法。同时针对Java Bean构件的特性,在详细分析了用户对构件挖掘的需求之后,设计实现了构件智能获取原型系统(CIRS)。
     此原型系统主要实现了网络上构件的实时快速搜索,包括用户、搜索和信息描述三个模块,通过三个模块的协同合作提供用户构件详细描述和重用信息。
     用户模块分析用户用自然语言描述的需求,通过词性和语法结构分析,提取能够代表需求的关键字,递交给挖掘模块作为搜索依据。在搜索模块递交初步搜索结果时,提出暗示条件供用户选择,利用和用户之间的交流机制,最终确定结果过滤条件,完成对大量结果的细化和过滤,并且提交最终结果给用户。
     构件的搜索是结合了专业的搜索引擎技术和搜索Agent进行快速的二层搜索,首先是利用搜索引擎遍历整个网络,缩小搜索范围,提高搜索浓度,然后利用搜索Agent在搜索引擎结果基础上进行有限深度的精确搜索。针对构件的不同形态,不同发布方式,制定不同的搜索策略,不仅可以搜索到以Jar文件出现的构件,还可以搜索到直接以class文件发布的构件,基本涵盖了网络上的所有构件,实现了快速,全面,精确的搜索。根据构件网上信息的特点,信息提取和描述模块将参数提取的和网页上的文本相结合给出构件的完整描述和重用信息。最后对全文进行了总结,分析了原型系统的不足,并对今后的工作进行了展望。
With the development of the large software system, software reuse is considered as the key to deal with the repetitive design of software, the waste of money and the improvement of the efficiency of software engineering. Technique of component mining can provide a lot of packed source. There are lots of software components on the Internet, but until now, component mining is still limited on local database or network, meanwhile, searching engines on the Internet can not search and retrieve components efficiently because they are based on the text information retrieval. So this paper is concentrated on component search and information retrieval on the Internet.
    Firstly, this paper introduces the problems existed in component mining, the limitation of the current solutions and the architecture of the intelligent component mining system. Then publish of Java Bean components and the parameters embedded in the applet programs on Internet are discussed. We proposed a new method to retrieve information on the features of components by analysing the parameters in applet program. Based on the characters of Java Bean and analysis of users' requirement, component intelligent retrieval system is designed and implemented.
    This system which includes three parts: user agent, search agent and information retrieval agent realizes the real-time and quick component search on Internet. Components and description information can be provided to users by the cooperation of these three parts.
    User Agent can analyse and specify some keywords to present users' requirements which are described by natural language by semantic analysis and syntactical analysis. These keywords are transferred to search agent as the search requirements. When the initial search results are returned to user agent, it can give some clues to filter the redundant information according to users' requirements and negotiate with users to definite the filtering conditions. The final results can be provided to users directly after filtering the redundant results.
    Search Agent combines searching engines and spider to implement the quick and tow level search. Search agent makes use of searching engines to traverse the whole Internet and minimize the search fields. Then the spider gets the results from searching engines automatically by API and implements the limited depth search based on the results. In order to cover almost all the Java Bean on Internet, the search
    
    
    agent has different search strategy to different kinds of Java Bean.
    Information retrieval agent provides complete description of components to users by integrating the parameter information and text information. At the end of the paper, the conclusion, the deficiency of the system and some future work direction are proposed.
引文
[1]. Stan Franklin and Art Graesser: Is it an Agent ,or Just a Program?: A Taxonomy for Autonomous Agents, Lecture Notes in artificial Intelligence Intelligent Agent 3 Agent Theories, Architectures and Languages ECAI'96 Workshop(ATAL)Budapest, Hungary, August 1996,Proceedings
    [2] 陈松乔,任胜兵,王国军.现代软件工程[M].北京:清华大学出版社,2002年.
    [3] 郝克刚.软件构件技术综述[R].http://www.iturls.com/Articles/2-hao4.asp
    [4]. Lucas Introna, Helen Nissenbaum: Defining the Web: The Politics of Search Engines, IEEE Computer, 54-62, January 2000.
    [5]. Yue-Shan Chang, Hsin-Chun Hsieh, Shyan-Ming Yuan, Winston Lo: An Agent-based Search Engine based on the Internet Search Service on the CORBA, Distributed Objects and Applications, 1999. Proceedings of the International Symposium on, 1999, Page(s): 26-33.
    [6] 郭国强,张尧学,魏子忠,程序挖掘过程,常德师范学院学报(自然科学版),2001,13(4):45-48
    [7]. Nikos Bogonikolos, Dimitris Fragoudis, Spiros Likothanassis: "ARCHIMIDES": An Intelligent Agent for Adaptive-Personalize Navigation within a Web Server, Proceedings of the 32nd Hawaii International Conference on System Sciences-1999.
    [8]. Robert C. Seacord, Scott A. Hissam, Kurt C. Wallinau: AGORA: A Search Engine for Software Components, IEEE Internet November-December 1998 62-70.
    [9]. Steve Lawrence and C.Lee Giles: Searching the Web: General and Scientific Information Access, IEEE Communications Magazine, January 1999, 116-121.
    [10] 张驰,张尧学,魏子忠,一种分布式构件目录库的设计与实现,计算机工程与应用,2001,23:146-150,171.
    [11] 徐正权.软件复用方法与技术[M].武汉:华中理工大学出版社,1998.
    [12] 王少峰,何志均,王克宏.软件重用技术研究[J].计算机工程与设计,2000,21(5):10-15.
    [13]. Konstantions Raptis, Diomidis, Spinellis, Sokratis Katiskas: Distributed Object Bridge and Java-based Object Mediator, Informatik Informatique 2/2000.
    
    
    [14]. Diomidis, Spinellis: Explore, Excogitate, Exploit: Component mining, IEEEComputer, 32(9): 114-116,(Sept. 1999)
    [15]. Diomidis, Spinellis and Konstantions Raptis: Component Mining:A Process and its Pattern Language, Information and Software Technology,42(9):609-617,June 2000.
    [16] 北大青鸟.青鸟构件库(JBCL)概念模型[R].http://www.jbcase.com.
    [17]. Nierstrasz O, Gibbs S, and Tsichritzis D: Component-Oriented Software Development. Commun. ACM, 35 (9): 160-165,(Sept. 1992).
    [18]. Brown A. W and Wallnau K. C: The Current State ofCBSE. IEEE Software, 15(5): 37-46,(September/October 1998).
    [19]. Enric Motta, Dieter Fensel, Mauro Gaspari, and Richard Benjamins: Specification of Knowledge Components for Reuse, the Eleventh International Conference on Software Engineering SEKE 1999.
    [20]. Benjamins, R, Plaza, E.Motta, E. Fensel, D. Stider, R. Wielinga, B.J.Schreiber, G.Zdrahal.Z: An Intelligent Brokering Service for Knowledge -Component Reuse on the World-Wide-Web. Proceedings of the 11th Banff Knowledge Acquisition for Knowledge-based System Workshop(KAW'98), Banff, Canada.
    [21] 王斌,张尧学,陈松乔,Agent通信模型,中南工业大学学报(自然科学版),2002,33(2):209-213.
    [22] 王斌,张尧学,陈建二,陈松乔,基于Internet的信息挖掘的Agent通信协议,计算机科学,2002,29(8):85-87,84.
    [23]. Fafcgamps D: Organizational Factors and Reuse. IEEE Software, 11(5), pp.31-41;
    [24]. Fensel D and Greoboom.R(1997) Specifying Knowledge-based Systems with Reusable Components, Proceedings of the 9th International Conference on Software Engineering and Knowledge Engineering (SEKE-97), Madrid, Spain
    [25] 陈宁江,苏德富,基于Agent的CSCW系统及其通讯实现,广西大学学报(自然科学版),2000,25(2):94-97。
    [26]. Pozewaunig Heinz : Mining component behavior to Support Software Retrieval, Dissertation University Klagenfurt, October 2001
    
    
    [27] Kazuhiko: An approach to mobile Software Robots for the WWW, IEEE Transactions on Knowledge and Data Engineering.
    [28] Philip Heller等编著,邱仲潘等译.Java高级开发指南[M].北京:电子工业出版社.1999.06.
    [29] 沈达阳,林作铨,陈智健.Internet信息收集Agent及搜索方法[J].计算机系统应用,1998,5:18-21.
    [30] 李宏,陈松乔.基于多Agent的Web构件挖掘技术研究[A].第二届中国Rough集与软计算学术会议论文集.2002,10.
    [31] Hyunsuk Seung, Doo-Hwan Bae: A Client-Oriented Distribution Architecture foe web Search Agents, Autonomous Decentralized Systems, 1997. Proceedings. ISADS 97, Third International Symposium on, 1997 ,Page(s): 299 -308
    [32] Pogacnik, M., Tasic, J.F: Agent based retrieval of information from Internet, Electrotechnical Conference, 2000. MELECON 2000. 10th Mediterranean, Volume: 1,1998 ,Page(s): 399 -402 vol.1
    [33] 徐振宁,姚莉,张维明,智能通信主体的设计与实现,小型微型计算机系统,1999,20(8):619-622
    [34] Bradford, C.; Marshall, I.W.: A bandwidth friendly search engine, Multimedia Computing and Systems, 1999. IEEE International Conference on, Volume: 2, 1999 Page(s): 720 -724 vol.2
    [35] Uan Tong, Sheng Haolin: A component search engine model on Internet, Technology of Object-Oriented Languages, 1997. TOOLS 24. Proceedings, 1998 Page(s): 393 -396
    [36] 刘斌,王兰邵,王浩军,基于CORBA的分布式Agent通信构架,小型微型计算机系统,2001,22(6):728-731
    [37] Daniel Jurafsky, James H. Martin: Speech and Language Processing
    [38] Sato, N.; Uehara, M.; Sakai, Y.; Mori, H: A distributed search engine for fresh information retrieval, Database and Expert Systems Applications, 2001. Proceedings. 12th International Workshop on, 2001 Page(s): 211 -216.
    [39] Lambrix, P.; Shahmehri, N.; Wahllof, N.: A default extension to description logics for use in an intelligent search engine, System Sciences, 1998.,
    
    Proceedings of the Thirty-First Hawaii International Conference on, Volume: 5, 1998 ,Page(s): 28-35 vol.5
    [40] Szymanski, B.K.; Ming-Shu Chung: A method for indexing Web pages using Web bots, Info-tech and Info-net, 2001. Proceedings. ICII 2001 - Beijing. 2001 International Conferences on, Volume: 3,2001 Page(s): 1-6 vol.3
    [41] 黄林鹏,孙永强,面向Agent的多规范语言Ⅰ的设计与实现,上海交通大学学报,2000,34(7):963-966
    [42] Ming-Hsuan Yang; Yang, C.C.; Yi-Ming Chung: A natural language processing based Internet agent, Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation, 1997 IEEE International Conference on, Volume: 1, 1997 ,Page(s): 100 -105 vol.1
    [43] Motta,C.L.R.; Borges: A cooperative approach for information recommendation and filtering, M.R.S. Groupware, 2000. CRIWG 2000. Proceedings. Sixth International Workshop on, 2000 Page(s): 42 -49
    [44] 吴刚,吴泉源.ActiveX和CORBAIIOP的比较[J].计算机工程与应用,1999,01,pp:9-10
    [45] Young-Woo Park; Eun-Seok Lee: A new generation method of a user profile for information filtering on the Internet, Information Networking, 1998. (ICOIN- 12) Proceedings, Twelfth International Conference on, 1998 Page(s): 261 -264
    [46] Rabelo, J.C.B.; Silva, E.F.A.; Fernandes, F.B.; Meira, S.R.L.; Barros, F.A.: ActiveSearch: an agent for suggesting similar documents based on user's preferences, Systems, Man, and Cybernetics, 2001 IEEE International Conference on, Volume: 1,2001 ,Page(s): 549 -554 vol.1.
    [47] 郑谨,王斌,陈松乔,Java类文件解析Agent的设计原理与实现,计算机工程与应用,2002,38(18):127-129.
    [48] 王伟平,王斌,陈松乔,Internet上智能搜索多Agent体系结构及相关技术,计算机工程,2002,28(3):38-40.
    [49] Robert FIlman, Feniosky Pena-Mora: SEEK, AND YE SHALL FIND, IEEE Internet Computing, July. August 1998, Pages: 78-83
    [50] Harith Alani, Sanghee Kim, David E. Millard, Mark J. Weal, Wendy Hall, Paul H. Lewis, and Nigel R. Shadbbolt: Automayic Ontology-Based Knowledge
    
    Extraction from Web Documents, IEEE Intelligent Systems, January/February, 2003, Pages: 14-21.
    [51] Yiming Yang (Carnegie Mellon University), Jan Pedersen (Infoseek): Intelligent Information Retrieval, IEEE Intelligent Systems, July/August 1999, pages: 30-31
    [52] Jeff Heaton: Programming Spiders, Bots, and Aggregators in Java, Chapter 8: Building a spider, 248-250
    [53] Wojetek Kozaczynski, Grady Booch: Component-Based Software Engineering, IEEE Software, September/October 1998, pages: 34-36.
    [54] 赵东,周明天,分布对象技术述评,计算机应用,2000,20(12):7-9
    [55] Zacharis Z. Nick, Panayiotopoulos Themis: Web search Using a Genetic Algorithm, IEEE Internet Computing, March/April 2001, Pages: 18-26
    [56] Monika R. Henzinger: HyperIink Analysis for the Web, IEEE Internet Computing, January/February 2001, Pages: 45-50
    [57] Dunja Mlasdenic: Text-Learning and Related Intelligent Agents: A Survey, IEEE Intelligent systems, July/August, 1999, pages:44-54
    [58] 王少峰,何志均,王克宏.软件重用技术研究[J].计算机工程与设计,2000,21(5):10-15.
    [59] 魏子忠.网络环境下程序挖掘方法的研究[R].清华大学计算机科学与技术系.(博士论文).2001.
    [60] 窦郁宏,陈松乔.程序挖掘中需求描述的研究[J].计算机工程与应用.2002,38(10):53~56.
    [61] 张灵玲,潘金贵等.基于Internet的课件信息发现和收集Agent的研究[J].计算机研究与发展.1999,Vol.36,No.4,pp.465-471.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700