Research on Dynamic Web Page Resource Description and Collection Techniques for Authorization Management
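Abstract
Achieving fine-grained authorization and access control for dynamic web page resources, and with it a fundamental improvement in web page security, first requires solving the problem of collecting and describing those resources. Describing resources comprehensively, collecting them precisely, and organizing them sensibly makes authorization management considerably easier and lays the foundation for fine-grained authorization and access control.
     Taking authorization management as its setting and addressing the special properties of dynamic web page resources, this dissertation examines how to describe and collect such resources at fine granularity, and thereby offers an approach to fine-grained access control over them. The main work is as follows:
     1. A thorough, systematic analysis of the state of research on dynamic web page resource management. Building on a study of dynamic web page development technologies, the dissertation analyzes in depth the authorization and access control problems of web page resources, focuses on existing methods for describing web page resources and for collecting dynamic web page resources, and identifies the main problems that dynamic web page resource management faces in authorization management.
     2. A new definition of dynamic web page resources from the standpoint of authorization and access control, together with a unified description method suited to them. Based on the general-purpose Resource Description Framework (RDF) specification, description vocabularies are defined for dynamic web pages and for page elements, reflecting their dynamic characteristics and their relationships. The vocabularies capture the hierarchical structure of dynamic web page resources and can describe their characteristic attributes comprehensively and at fine granularity, supporting flexible, simple authorization and fine-grained access control.
     3. A system model for collecting dynamic web page resources for authorization management. The model uses Robot technology to traverse dynamic web pages and obtains their special attributes through data analysis and computation. The work concentrates on methods for obtaining characteristic attributes of dynamic web pages, such as page interaction parameters and the set of effective dynamic change factors. An extraction algorithm for the page elements of dynamic web pages is also designed; it goes inside the page file to obtain the characteristic attributes of page elements, laying the foundation for fine-grained, comprehensive management of dynamic web page resources.
     4. An in-depth study of how the proposed resource management method applies to authorization and access control. To meet the need for resource identification in authorization and access control, a method for identifying dynamic web page resources based on dynamic change factors is proposed, and on that basis a preliminary solution for access control of dynamic web page resources is given.
     Application shows that the proposed description method makes authorization more convenient and more flexible, simplifies authorization operations, and supports the formulation of high-precision, fine-grained authorization policies. Solving the identification problem for dynamic web page resources directly yields an effective approach to their access control.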
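     The collection step in item 3 can be illustrated with a minimal sketch. The abstract does not give the dissertation's actual algorithm, so everything below is an assumption made for illustration: the names `ElementExtractor`, `extract_elements`, and `interaction_parameters` are hypothetical, page elements are taken to be forms, form fields, and links, and interaction parameters are taken to be the query-string parameter names of a page URL. Only the Python standard library is used.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qsl, urlsplit


class ElementExtractor(HTMLParser):
    """Collect forms, form fields, and links found inside one HTML page."""

    # Assumed set of tags that carry interaction or navigation.
    TAGS = {"form", "input", "select", "textarea", "a"}

    def __init__(self):
        super().__init__()
        self.elements = []  # (tag, attribute dict) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.TAGS:
            self.elements.append((tag, dict(attrs)))


def extract_elements(html):
    """Return the page elements of an HTML document as (tag, attrs) pairs."""
    parser = ElementExtractor()
    parser.feed(html)
    parser.close()
    return parser.elements


def interaction_parameters(url):
    """Return the sorted, de-duplicated query parameter names of a URL."""
    query = urlsplit(url).query
    return sorted({name for name, _ in parse_qsl(query, keep_blank_values=True)})
```

     A crawler in this style would call `extract_elements` on each fetched page and `interaction_parameters` on each discovered URL, then record the results as attributes of the page resource. How the dissertation's own model selects elements and computes attributes is not stated in the abstract.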
To improve the security of web pages and to achieve fine-grained authorization and access control for dynamic web page resources, the collection and description of resources are the first problems to be solved. Collecting resources precisely and organizing them properly makes authorization management much more convenient and forms the basis of fine-grained authorization and access control.
     Based on the special characteristics of dynamic web pages, this dissertation studies the fine-grained description and collection of dynamic web page resources in authorization management. The main work of this dissertation is as follows:
     1. The current state of research on dynamic web page resource management is analyzed in depth. Based on a study of the development techniques for dynamic web pages, the problems in authorization and access control of web page resources are studied. Existing methods of describing and collecting web page resources are analyzed, and the main problems of managing dynamic web pages in authorization management are identified.
     2. From the standpoint of authorization and access control, a new definition of dynamic web page resources is given and a general method of describing them is proposed. Based on the Resource Description Framework (RDF), the method defines description schemas for both dynamic pages and page elements, reflecting their dynamic characteristics and their relationships with each other. It supports flexible and convenient authorization and fine-grained access control of resources.
     3. A model for collecting dynamic web pages in authorization management is proposed. Based on the Robot technique, the model extracts the special characteristics of dynamic web pages through analysis and calculation. Algorithms for obtaining the parameters and valid changing factors of pages are provided, and a method of extracting the resource elements contained in pages is discussed.
     4. The application of the proposed method in authorization and access control is analyzed. To meet the need for resource identification, this dissertation designs a method of identifying the content of dynamic web pages, and a preliminary solution for access control of dynamic web pages is proposed.
     The application demonstrates that the proposed description method for dynamic web pages makes authorization much more convenient and simplifies authorization operations. It supports fine-grained and precise authorization. Solving the identification of dynamic web pages directly provides a solution to the access control problem.
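     The idea in item 4 of identifying a page resource by its change factors can be sketched as follows. This is an illustrative assumption, not the dissertation's method: the function name `resource_identifier` is hypothetical, and the set of effective change factors is assumed to be already known as a set of query parameter names whose values actually alter the page content. Parameters outside that set, such as a session id, are ignored so that requests for the same content map to the same identifier.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit


def resource_identifier(url, change_factors):
    """Identify a dynamic page by host, path, and effective change factors.

    `change_factors` is an assumed, precomputed set of parameter names
    whose values change what the page shows.
    """
    parts = urlsplit(url)
    # Keep only effective parameters, sorted so that the order in which
    # parameters appear in the URL does not affect the identifier.
    effective = sorted(
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if name in change_factors
    )
    identifier = parts.netloc + parts.path
    if effective:
        identifier += "?" + urlencode(effective)
    return identifier
```

     Under these assumptions, `http://example.com/news.jsp?id=7&sid=abc` and `http://example.com/news.jsp?sid=xyz&id=7` receive the same identifier when `change_factors` is `{"id"}`, so one authorization rule can cover both requests, while `id=8` yields a different identifier.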
