Research on Component Retrieval Techniques and Their Application in a Credit-Domain Component Library
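Abstract
Software reuse is regarded as a practical way to address the software crisis and to raise the efficiency and quality of software production. Applying software reuse to build credit evaluation systems from components makes it possible to construct adaptable, highly reliable systems quickly, so that the credit standing of enterprises can be assessed accurately while development cost is effectively reduced. A key problem in component-based development of credit evaluation software is how to find the required component among a large body of reusable software assets.
     A credit component library provides storage, management and retrieval of credit components, together with browsing and maintenance of the library itself, and is an important piece of infrastructure for reuse. Organizing and retrieving the components in the library both rest on an effective representation of the components. Based on an analysis of the credit evaluation domain, this dissertation proposes a method for representing credit component information and, on that basis, focuses on the retrieval techniques adopted by the credit component library.
     The research covers the following aspects:
     1. By examining the advantages and principles of faceted classification of components, and by applying domain engineering methods to analyze the credit evaluation domain, a faceted classification scheme for credit components is proposed that fits the characteristics of component extraction, giving credit components a sound faceted representation.
     2. Building on the faceted representation of credit components, a tree matching model is combined with hierarchical encoding so that faceted retrieval of credit components is reduced to string matching, which yields a relatively relaxed match. Similarity is computed with the vector space model (VSM), improving recall while maintaining a given level of precision.
     3. Lucene full-text search is used to match keywords and attribute values in the credit component library, which improves retrieval efficiency and meets the need for multi-level auxiliary queries.
     4. Based on the above work, the component retrieval subsystem of the credit component library is implemented, verifying the feasibility and practicality of the approach.
     By discussing retrieval techniques for the credit component library, this dissertation adopts an implementation with good overall retrieval performance, which helps support the development of credit evaluation systems based on component reuse.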
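Item 2 can be illustrated with a minimal sketch. The abstract does not give the actual encoding scheme, term weights, or threshold, so everything below is an assumption made for illustration: facet terms carry hypothetical dotted hierarchical codes such as `1.2.1`, each code is expanded into itself plus its ancestor prefixes, and the resulting token sets are compared with cosine similarity in a vector space model. Sibling terms share ancestor prefixes, so they receive partial credit instead of failing to match, which is one way a relaxed match can raise recall. The facet names, component names, and the `threshold` value are invented.

```python
import math
from collections import Counter

def prefix_tokens(facets):
    """Expand each facet's hierarchical code into the code plus all ancestor prefixes."""
    tokens = Counter()
    for facet, code in facets.items():
        parts = code.split(".")
        for depth in range(1, len(parts) + 1):
            tokens[f"{facet}:{'.'.join(parts[:depth])}"] += 1
    return tokens

def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values()) * sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def rank(query, library, threshold=0.5):
    """Return (name, similarity) pairs at or above the threshold, best first."""
    q = prefix_tokens(query)
    scored = [(name, cosine(q, prefix_tokens(facets))) for name, facets in library.items()]
    kept = [(name, score) for name, score in scored if score >= threshold]
    return sorted(kept, key=lambda pair: -pair[1])

# Hypothetical library: component name -> {facet: hierarchical code}
LIBRARY = {
    "ratio_scoring": {"function": "1.2.1", "object": "2.1"},
    "trend_scoring": {"function": "1.2.2", "object": "2.1"},
    "report_export": {"function": "3.1", "object": "2.2"},
}

for name, score in rank({"function": "1.2.1", "object": "2.1"}, LIBRARY):
    print(name, round(score, 2))
```

With this toy data the exact match scores 1.0, the sibling `trend_scoring` scores 0.8 because it shares four of five prefix tokens with the query, and `report_export` falls below the assumed threshold. Lowering the threshold trades precision for recall.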
Software reuse is widely regarded as a realistic way to increase programmer productivity and improve the quality of the software produced. Building a credit evaluation system from reusable components makes it possible to develop an adaptable and reliable system quickly, so that the credit status of enterprises can be evaluated accurately while the cost of software development is effectively reduced. Consequently, finding the needed components among a large number of reusable software assets has become the key problem in developing credit evaluation software based on component reuse.
     A credit component library provides storage, management, retrieval, browsing and maintenance of credit components, and is an important piece of infrastructure supporting software reuse. The organization and retrieval of the components in the library depend on an effective representation of those components. Based on an analysis of the credit evaluation domain, this dissertation proposes a method for representing credit component information and then focuses on the retrieval techniques adopted by the credit component library.
     This dissertation carries out research in the following areas:
     1. Drawing on the advantages and principles of faceted classification and on domain engineering methods, the credit evaluation domain is analyzed and a faceted classification scheme for credit components is proposed that matches the characteristics of component extraction.
     2. Building on this faceted representation of credit components, a tree matching model is combined with hierarchical encoding so that the retrieval of component facets is converted into string matching, which achieves a relaxed matching effect. Similarity is computed with the vector space model (VSM), which improves recall while maintaining a certain level of precision.
     3. Lucene full-text search is adopted to match keywords and attribute values in the credit component library. This improves retrieval efficiency and meets the requirement for multi-level auxiliary queries.
     4. Based on the above research, the component retrieval subsystem of the credit component library is implemented, which verifies the feasibility and practicality of the approach.
     By discussing the retrieval techniques applied to the credit component library, this dissertation adopts an implementation with good overall retrieval performance, which helps advance the development of credit evaluation systems based on component reuse.
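The keyword and attribute-value matching in item 3 relies on Lucene, which is a Java library. The sketch below does not use the Lucene API. It is a toy inverted index in plain Python, offered only to show the general idea behind field-scoped term lookup in a full-text engine. The class `TinyIndex`, the field names, and the component records are all hypothetical, and real engines add analysis, scoring, and persistence that are omitted here.

```python
from collections import defaultdict

class TinyIndex:
    """Toy inverted index: maps (field, term) to the set of component ids containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, fields):
        """Index every whitespace-separated, lowercased term of every field."""
        for field, text in fields.items():
            for term in text.lower().split():
                self.postings[(field, term)].add(doc_id)

    def search(self, **criteria):
        """Return sorted ids of components containing every given term in its field (AND)."""
        result = None
        for field, text in criteria.items():
            for term in text.lower().split():
                ids = self.postings.get((field, term), set())
                result = set(ids) if result is None else result & ids
        return sorted(result or [])

# Hypothetical component records with a free-text name and one attribute
index = TinyIndex()
index.add("C001", {"name": "financial ratio scoring", "language": "Java"})
index.add("C002", {"name": "credit report export", "language": "Java"})
index.add("C003", {"name": "ratio trend analysis", "language": "C++"})

print(index.search(name="ratio"))
print(index.search(name="ratio", language="java"))
```

A lookup touches only the posting sets for the queried terms and never scans every record, which is the main reason an index-based search stays fast as the library grows.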
