一种基于XML的Web地震信息提取的实现
详细信息 本馆镜像全文    |  推荐本文 | | 获取馆网全文
摘要
开发一种通用化的处理程序,它可以自动从指定的Web页面中提取地震事件公报,采用XSLT将数据转换为指定格式的XML文档,存入地震信息数据库,实现了Web数据的清理与集成。
A generalized processing program is developed.This program can automatically extract seismic event bulletin data from specific Web pages.XSLT is used to convert useful datas in other format into a specified XML format.These extracted data are stored into a seismic information database.This program realizes Web data cleaning and data integration.
引文
[1]Madria S K,et al.Research Issue in Web Data Mining[C].Proc.OfData Warehousing and Knowledge Discovery,First Intel.Conf.,DaWak 99,1999:303 312.
    [2]Incorporated Research Institutions for Seismology[EB/OL].www.iris.edu.
    [3]USGS National Earthquake Information Center(NEIC)[EB/OL].ht-tp://neic.usgs.gov.
    [4]CTBT Prototype International Data Centre[EB/OL].http://www.pidc.org/.
    [5]International Seismological Centre,United Kingdom[EB/OL].http://www.isc.ac.uk/.
    [6]Jussi Myllymaki.Effective Web Data Extraction with Standard XMLTechnologies[EB/OL].IBM,http://www.research.ibm.com/peo-ple/j/jussi/papers/ANDES/ANDES.pdf,May 2001.
    [7]HTML Tidy[EB/OL].http://www.w3.org/People/Raggett/tidy/.
    [8]XSL Transformations(XSLT)[EB/OL].W3C Recommendation,No-vember 1999.http://www.w3.org/TR/xslt.html.
    [9]XML Path Language(XPath)[EB/OL].W3C Recommendation,No-vember 1999.http://www.w3.org/TR/path.html.
    [10]Oracle document.XML Developer s Kits Guide-XDK Release 2(9.2).No.A96621-01.March 2002.

版权所有:© 2023 中国地质图书馆 中国地质调查局地学文献中心