中文文本中事件时空与属性信息解析方法研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
本文依托国家“863”课题“泛在空间信息关联更新与面向主题时空信息挖掘研究”,较为系统地探索中文文本中事件时空与属性信息解析方法,为泛在空间信息动态关联更新,全球统一时空框架下的空间信息与知识服务提供数据和技术支持,同时为事件时空模式挖掘奠定数据基础,进而为事件风险评估、公共安全等重大问题提供决策服务。本文针对中文文本中事件时空与属性信息描述的非结构化、定性化和不确定性等特点,围绕“文本描述-规范化表达-结构化抽取-可视化重构”的技术主线,重点研究事件时空与属性信息解析方法。主要研究内容与结论包括以下几个方面:
     (1)事件时空与属性信息的结构化表达:通过归纳总结中文文本中事件时空与属性信息描述的语言特征和语义结构,设计了事件时空与属性信息的知识表达框架和标注体系;以突发公共事件为例,以网络文本为数据源,基于GATE平台构建了中文文本中事件时空与属性信息标注语料库,为事件时空与属性信息抽取研究提供了标准化训练和测试数据。
     (2)事件时空与属性信息抽取:分析中文文本中时间信息描述的规律性,实现了基于触发词和规则模型结合的时间信息抽取、推理和规范化解析,准确率、召回率和F值分别达到75.00%、88.24%和40.54%;利用条件随机场模型和规则模型,实现了事件名称识别和空间位置(包括地名和空间关系)信息抽取,其中事件名称识别准确率、召回率和F值分别为82.08%、80.18%和81.12%;设计了基于Bootstrapping的事件属性信息抽取算法,量词性的属性信息抽取准确率和召回率达到80.80%和85.16%。
     (3)时空驱动的事件分类方法:通过分析事件时空认知和表达特性,提出一种融合时间、空间、属性、事件名称、触发词汇等多种上下文语义和语境信息的事件分类方法。按照句子、段落、篇章三个语言单元等级,探讨了事件替代性名称的推理方法。实验结果表明,事件分类准确率在封闭和开放测试中分别达到92.30%和80.60%。
     (4)事件时空信息匹配与可视化:以地名数据库为空间数据源,提出了定性时空信息(地名、空间关系和时间信息)的匹配和可视化表达方法,探索了基于“时间-空间-概念类型”多重一致性约束的主题事件判断和时空过程重构方法,实现了事件信息在时空信息系统中有机的、直观的可视化表达,并对事件时空信息分布模式进行了聚类分析。
     研究结果表明,采用规则模型和统计模型结合的方式可以有效实现中文文本中事件时空与属性信息抽取,但是特征项的设置在统计模型的学习过程中起到举足轻重的作用;不同类型事件的时间、地名、空间关系、事件名称和类型等信息抽取模型具有通用性和可移植性,而属性信息存在较大差异,需要针对具体类型事件构建相应知识库和学习模型;事件类型判断存在灵活、复杂、语义模糊、不确定性特点,且属于多标记分类,融合词性、触发词汇、时间、空间、属性和事件名称等多种上下文语义和语境信息,可以有效提高事件分类效果;空间数据的覆盖范围和数据质量,以及空间关系解析模型,对事件时空与属性信息匹配、时空过程重构性能具有较大的影响。
This thesis is supported by the national "863" project "The Research of Associated Updating of the Ubiquitous Spatial Information and Mining of Subject-oriented Spatio-temporal Information". An interpretation approach of event spatio-temporal and attribute information in Chinese Text is explored in this thesis. The contributions will provide a data and technology support for the associated updating of the ubiquitous spatial information, the spatial information and knowledge services under an unified spatio-temporal framework, and the spatio-temporal mining analysis of event information. Furthermore, it will provide decision-making services for the event risk assessment, public safety, and the other major issues. In Chinese text, descriptions of event spatio-temporal and attribute information are unstructured, qualitative and uncertain. According to the above description characters, this research is carried out according to the main idea of "text description, normalization expression, structured extraction, visualization reconstruction" of event information in Chinese text. The main research contents and results are described as follows:
     (1) Structured expression of event spatio-temporal and attribute information in Chinese text
     With an analysis of the linguistic features and semantic structures of event, spatial, temporal, attribute information described in Chinese text, a representation framework and annotation schema are identified and specified. Moreover, GATE (General Architecture for Text Engineering) is introduced as an annotation platform, and an annotated corpus based on the Web data source is developed in case of events of public emergencies. The annotation schema and annotated corpus will provide a standard training and testing data support for the extraction of event information.
     (2) Extraction of event spatio-temporal and attribute information in Chinese text
     Based on description regularities of temporal information in Chinese text, a interpretation approach is illustrated for extraction, reasoning and standardization of temporal information, which combines trigger words and rule-based model. The values of precision, recall and F-measure are75.00%,88.24%and40.54%respectively. Place names and event names are recognized with a Condition Random Field model, and spatial relations are extracted with a rule-based model. For the recognition of event names, the values of precision, recall and F-measure were respectively82.08%,80.18% and81.12%. Moreover, A Bootstrapping method is explored for the extraction of event attributes. For the quantitative attribute information, the values of precision, recall and F-measure can reach80.80%,85.16%respectively.
     (3) Automatic event classification based on spatio-temporal information
     Every event has temporal, spatial and attribute properties. A classification method of event information is developed which integrates contextual and semantic information. It emphasizes the spatial and temporal elements for event tracking, and discovers that feature items of trigger words, part of speech, place names, temporal information, event names and attributes have an important contribution for event classification. Moreover, some special phenomenons of abbreviation and alias are reasoned according to different language units, i.e. sentence, paragraph and chapter. The experiment results show that it can reach a classification accuracy of92.30%and80.60%in a closed and open testing respectively.
     (4) Matching and visualization of event spatio-temporal information
     Based on the spatial data source of national gazetteer, a matching and visualization method for event information is presented. With a hierarchical matching of place names, spatial relations and temporal information, event information are expressed in a GIS spatio-temporal framework. Moreover, with a consistency constraint of "temporal information-spatial information-concept type", a judgement method of theme event, and the reconstruction of spatio-temporal process are presented. Finally, a clustering analysis of the spatio-temporal pattern for event information is finished.
     The studies proposed in this thesis suggest that the combination of rule-model and statistical model can effectively extract event information from Chinese text, however, reasonable and effective feature items play an important role in the learning process of statistical models. For different types of events, the extraction models of temporal information, place names, spatial relations, event names and event types are universal and transplantable, however, their attribute information are with many differences. Therefore, the knowledge base and learning model need to be modified for specific types of events. The judgement of event type is flexible, complex, semantic ambigous and uncertain, in other words it is a multi-label classification problem. This paper integrates the contextual and semantic information of part of speech, place names, temporal information, event names, attributes and trigger words, which can effectively improve the event classification performance. Among the Matching and visualization of spatio-temporal information, the coverage and quality of spatial data, as well as the interpretation model of spatial relations have a large impact on the performance. Overall, the proposed approach in this dissertation for the interpretation of spatio-temproal information, attributes and event classification in Chinese text is effective, but its integration with GIS is greatly depended on the mapping spatial data.
引文
[1]Goodchild M F. Citizens as Sensors:The World of Volunteered Geography[J]. GeoJournal,2007,69(4):211-221.
    [2]Palkowsky B, MetaCarta I. A New Approach to Information Discovery-Geography Really Does Matter[C]. In:Proceedings of the SPE Annual Technical Conference and Exhibition, United States,2005:3231-3234.
    [3]冯志伟.标准通用置标语言SGML及其在自然语言处理中的应用[J].当代语言学(试刊),1998,4:1-11.
    [4]冯志伟.基于经验主义的语料库研究[J].术语标准化与信息技术,2007,0l:29-39.
    [5]丁信善.语料库语言学的发展及研究现状[J].当代语言学(试刊),1998,01:4-12.
    [6]李晗静,李生,赵铁军等.基于自然语言理解的实体自动摆放的研究[J].电子与信息学报,2007,29(8):1845-1849.
    [7]冯志伟.自然语言处理的学科定位[J].解放军外国语学院学报,2005,28(3):1-8.
    [8]Francis W N, Kucera H. Brown Corpus Manual[J]. Letters to the Editor,1979,5(2): 7-15.
    [9]Charles E, Kahn J. Standard Generalized Markup Language for Self-defining Structured Reports[J]. International journal of medical informatics,1999,53(2): 203-211.
    [10]Maeda K, Strassel S. Annotation Tools for Large-scale Corpus Development: Using AGTK at the Linguistic Data Consortium[C]. In:Proceedings of the Fourth International Conference on Language Resources and Evaluation,2004: 2077-2080.
    [11]俞士汶,朱学锋.大规模现代汉语标注语料库的加工规范[J].中文信息学报,2000,14(6):58-64.
    [12]冯志伟.中国语料库研究的历史与现状[J]. Journal of Chinese Language and Computing,2002,11(2):127-136.
    [13]黄昌宁,张小凤.自然语言处理技术的三个里程碑[J].外语教学与研究:外国语文双月刊,2005,34(3):180-187.
    [14]俞士汶,段慧明,朱学锋等.北京大学现代汉语语料库基本加工规范[J].中文 信息学报,2002,16(5):49-64.
    [15]张霄军,陈小荷.双语平行语料的预处理[J].外语教育,2007,027:145-149.
    [16]Baker C F, Fillmore C J, Lowe J B. The Berkeley Framenet Project[C]. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics,1998: 86-90.
    [17]Mani I, Wilson G. Robust Temporal Processing of News[C]. In:Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, Association for Computational Linguistics,2000:69-77.
    [18]Pustejovsky J, Hanks P, Sauri R, et al. The Timebank Corpus[C]. In:Proceedings of Corpus Linguistics,2003:647-656.
    [19]Gildea D, Hockenmaier J. Identifying Semantic Roles Using Combinatory Categorial Grammar[C]. In:Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing Association for Computational Linguistics,2003:57-64.
    [20]Shen Q, Zhang X, Jiang W. Annotation of Spatial Relations in Natural Language[C]. In:Proceedings of International Conference on Environmental Science and Information Application Technology,2009, (3):418-421.
    [21]Doddington G, Mitchell A, Przybocki M, et al. The Automatic Content Extraction (ACE) Program-tasks, Data, and Evaluation[C]. In:Proceedings of LREC,2004, (4):837-840.
    [22]张先飞,郭志刚,李弼程等.自动内容抽取中的中文事件标注[J].情报学报,2011,30(1):61-68.
    [23]Leidner J L. Toponym Resolution in Text:Annotation, Evaluation and Applications of Spatial Grounding of Place Names [D]. Edinburgh:University of Edinburgh,2008.
    [24]Lieberman M D, Samet H, Sankaranarayanan J. Geotagging With Local Lexicons to Build Indexes for Textually-specified Spatial Data[C]. In:Proceedings of the 26th International Conference on Data Engineering (ICDE),2010:201-212.
    [25]Blaylock N, Swain B, Allen J. TESLA:A Tool for Annotating Geospatial Language Corpora[C]. In:Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics,2009: 45-28.
    [26]Mani I, Doran C, Harris D, et al. SpatialML:Annotation Scheme, Resources, and Evaluation [J]. Language Resources and Evaluation,2010,44(3):263-280.
    [27]张雪英,张春菊,朱少楠.中文文本的地理空间关系标注[J].测绘学报,2012,41(3):468-474.
    [28]张雪英,朱少楠,张春菊.中文文本的地理命名实体标注[J].测绘学报,2012,41(1):115-120.
    [29]Allen J F. Towards a General Theory of Action and Time [J]. Artificial Intelligence, 1984,23(2):123-154.
    [30]郭宏蕾,姚天顺.时间语义层次结构及理解[J].中文信息学报,1997,11(001):11-19.
    [31]蔡华利,刘鲁,刘志明等.突发事件Web新闻中时间信息分析及抽取[J].计算机工程与应用,2010,46(034):107-110.
    [32]Mani I, Verhagen M, Wellner B, et al. Machine Learning of Temporal Relations[C]. In:Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, 2006:753-760.
    [33]Li W J, Wong K F, Cao G H, etc. Applying Machine Learning to Chinese Temporal Relation Resolution[C]. In:Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics,2004:582-588.
    [34]Jones C B, Purves R S. Geographical Information Retrieval[J]. International Journal of Geographical Information Science,2008,22(3):219-228.
    [35]乐小虬,杨崇俊,于文洋.基于空间语义角色的自然语言空间概念提取[J].武汉大学学报(信息科学版),2005,30(12):1100-1103.
    [36]俞鸿魁,张华平,刘群等.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94.
    [37]李丽双,党延忠,廖文平等.CRF与规则相结合的中文地名识别[J].大连理工大学学报,2012,52(002):285-289.
    [38]钱晶,张玥杰,张涛.基于最大熵的汉语人名地名识别方法研究[J].小型微型计算机系统,2006,27(009):1761-1765.
    [39]蒋文明.面向中文文本的空间方位关系抽取方法研究[D].南京:南京师范大学,2010.
    [40]张晓艳,王挺,陈火旺.基于混合统计模型的汉语命名实体识别方法[J].计算机工程与科学,2006,28(006):135-139.
    [41]Li L, Ding Z, Huang D. Recognizing Location Names from Chinese Texts Based on Max Margin Network[C]. In:Proceedings of International Conference on Natural Language Processing and Knowledge Engineering,2008:325-331.
    [42]黄德根,孙迎红.中文地名的自动识别[J].计算机工程,2006,32(3):220-222.
    [43]冯元勇,孙乐,张大鲲等.基于小规模尾字特征的中文命名实体识别研究[J].电子学报,2008,36(9):1833-1838.
    [44]谭红叶,郑家恒,刘开瑛.基于变换的中国地名自动识别研究[J].软件学报,2001,12(11):1608-1613.
    [45]周俊生,戴新宇,尹存燕等.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809.
    [46]唐旭日,陈小荷,张雪英.中文文本的地名解析方法[J].武汉大学学报(信息科学版),2010,35(8):930-938.
    [47]Purves R S, Clough P, Jones C B, et al. The Design and Implementation of SPIRIT: A Spatially Aware Search Engine for Information Retrieval on the Internet[J]. International Journal of Geographical Information Science,2007,21(7):717-745.
    [48]Buscaldi D, Rosso P. A Conceptual Density-Based Approach for the Disambiguation of Toponyms [J]. International Journal of Geographical Information Science,2008,22(3):301-313.
    [49]Garbin E, Mani I. Disambiguatiing Toponyms in News[C]. In:Proceedings of Conference on Human Language Technology and Empirical Methods in Natural Language Processing,2005:363-370.
    [50]刘瑜,袁一泓,张毅.基于认知的模糊地理要素建模—以中关村为例[J].遥感学报,2008,(002):370-377.
    [51]Abdelmoty A I, Smart P D, Jones C B, et al. A Critical Evaluation of Ontology Languages For Geographic Information Retrieval on the Internet[J]. Journal of Visual Languages & Computing,2005,16(4):331-358.
    [52]杜萍,刘勇.中文地名识别与歧义消除-以中国县级以上行政区划地名为例[J].遥感技术应用,2012,26(6):868-873.
    [53]王宇.基于网络文本的地名空间模糊建模[D].南京:南京师范大学,2012.
    [54]车万翔,刘挺,李生.实体关系自动抽取[.J].中文信息学报,2005,19(2):1-6.
    [55]Coyne B, Sproat R. WordsEye:An Automatic Text-to-scene Conversion System[C]. In:Proceedings of the 28th annual conference on Computer Graphics and Interactive Techniques,2001:486-497.
    [56]Reinbergerr M L. Automatic Extraction of Spatial Relations[C]. In:Proceedings of the IEEE Conference on Artificial intelligence,2005:331-337.
    [57]陆汝钤,张松懋.从故事到动画片—全过程计算机辅助动画自动生成[J].自动 化学报,2002,28(3):322-348.
    [58]乐小虬,杨崇俊.非受限文本中深层空间语义的识别方法[J].计算机工程,2006,32(4):35-38.
    [59]马林兵,龚健雅.空间信息自然语言查询接口的研究与应用[J].武汉大学学报(信息科学版),2003,28(3):301-305.
    [60]钱程扬,龙毅,徐震等.基于Web文本的开放式空间信息查询[J].武汉大学学报(信息科学版),2010,35(001):83-87.
    [61]廖楚江,杜清运.GIS空间关系描述模型研究综述[J].测绘科学,2006,29(4):79-83.
    [62]郑玥,龙毅,明小娜.多种空间关系组合的地理位置自然语言描述方法[J].地理信息科学学报,2011,13(4):465-471.
    [63]朱少楠,张雪英,张春菊.地理空间关系描述的句法模式识别[C]. In: Proceedings of International Conference on Broadcast Technology and Multimedia Communication,2010, (4):1172-1176.
    [64]Zhang C J, Zhang X Y, Jiang W M, et al. Rule-Based Extraction of Spatial Relations in Natural Language Text[C]. In:Proceedings of International Conference on Computational Intelligence and Software Engineering (CiSE),2009: 1-4.
    [65]Egenhofer M J, Shariff R. Metric Details for Natural-language Spatial Relations [J]. ACM Transactions on Information Systems,1998,16 (4):295-321.
    [66]Mark D M, Egenhofer M J. Calibrating the Meanings of Spatial Predicates from Natural Language:Line-Region Relations[C]. In:Proceedings of Spatial Data Handling,1994:538-553.
    [67]Shariff R, Egenhofer M J, Mark D M. Natural-Language Spatial Relations Between Linear and Areal Objects:The Topology and Metric of English to Language Terms[J]. International Journal of Geographical Information Science,1998,12 (3): 215-246.
    [68]许珺,张晶,司望利等.线状物体空间关系的自然语言理解的双语比较[J].遥感学报,2008,12(2):362-369.
    [69]杜冲,司望利,许珺.基于地理语义的空间关系查询和推理[J].地球信息科学,2010,12(1):48-58.
    [70]杜清运.空间信息的微观语言学概念模型[J].地理信息世界,2004,2(6):5-8.
    [71]杜世宏,王桥,杨一鹏.一种定性细节方向关系的表达模型[J].中国图象图形学报(A辑),2004,9(012):1496-1503.
    [72]张雪英,闾国年.自然语言空间关系及其在GIS中的应用研究[J].地球信息科学,2007,9(6):77-81.
    [73]Hall M, Smart M, Jones C B. Interpreting Spatial Language in Image Captions[J]. Cognitive processing,2011,12(1):67-94.
    [74]陈雨田.中文文本中空间关系词汇的语义解析模型研究[D].南京:南京师范大学,2012.
    [75]Ying Y. and Wang X L. Information Extraction for Chinese Free Text Based on Pattern Match Combine With Heuristic Information[C]. In:Proceedings of International Conference on Machine Learning and Cybernetics,2002:214-218.
    [76]Ghani R, Probst K, Liu Y, et al. Text Mining for Product Attribute Extraction[Jj. ACM SIGKDD Explorations Newsletter,2006,8(1):41-48.
    [77]Tezuka T, Tanaka K. Temporal and Spatial Attribute Extraction from Web Documents and Time-Specific Regional Web Search System[M]. Kwon Y J, Bouju A, Claramunt C, Editors. Web and Wireless Geographical Information Systems, Lecture Notes of Computer Science,2005:14-25.
    [78]Probst K, Ghani R, Krema M, et al. Semi-supervised Learning of Attribute-Value Pairs From Product Descriptions[C]. In:Proceedings of the 20th International Joint Conference on Artificial Intelligence,2007:2838-2843.
    [79]Balahur A, Montoyo A. A Feature Dependent Method for Opinion Mining and Classification[C]. In:Proceedings of International Conference on Natural Language Processing and Knowledge Engineering,2008:1-7.
    [80]Zhao Q, Sui Z. To Extract Ontology Attribute Value Automatically Based on WWW[C]. In:Proceedings of International Conference on Natural Language Processing and Knowledge Engineering,2008:1-7.
    [81]Ravi S, Pasca M. Using Structured Text for Large-Scale Attribute Extraction[C]. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management,2008:1183-1192.
    [82]胡国晴,李建华.一种基于可信度分析的Web页面新属性发现方法[J].计算机技术与发展,2009,19(1):56-59.
    [83]Zhang C J, Zhang X Y, Chen Y T, Wang Y. Extraction of Geographical Attribute-Values in Natural Language Text[J]. Advances in Intelligent and Soft Computing,2012,142:51-59.
    [84]杨尔弘.突发事件信息提取研究[D].北京:北京语言大学,2005.
    [85]赵妍妍,秦兵,车万翔等.中文事件抽取技术研究[J].中文信息学报,2008, 22(1):3-8.
    [86]Grishman R, Sundheim B. Design of the MUC-6 Evaluation[C]. In:Proceedings of a Workshop on Association for Computational Linguistics, Vienna, Virginia,1996: 413-422.
    [87]Surdeanu M, Harabagiu S, Williams J, et al. Using Predicate-Argument Structures for Information Extraction[C]. In:Proceedings of the 41st Annual Meeting on Association for Computational Linguistics,2003:8-15.
    [88]姜吉发,王树西.一种自举的二元关系和二元关系模式获取方法[J].中文信息学报,2005,19(2):71-77.
    [89]郑家恒,王兴义,李飞.信息抽取模式自动生成方法的研究[J].中文信息学报,2004,18(1):48-54.
    [90]姜吉发.一种事件信息抽取模式获取方法[J].计算机工程,2005,31(15):96-98.
    [91]梁晗,陈群秀,吴平博.基于事件框架的信息抽取系统[J].中文信息学报,2006,20(2):40-46.
    [92]吴平博,陈群秀,马亮.基于事件框架的事件相关文档的智能检索研究[J].中文信息学报,2003,17(6):25-30.
    [93]张学工.关于统计学习理论与支持向量机[J].自动化学报,2000,26(1):32-42.
    [94]Fine S, Singer Y, Tishby N. The Hierarchical Hidden Markov Model:Analysis and Applications [J]. Machine Learning,1998,32(1):41-62.
    [95]张宇,刘挺,文勖.基于改进贝叶斯模型的问题分类[J].中文信息学报,2005,19(2):100-105.
    [96]李素建,刘群,杨志峰.基于最大熵模型的组块分析[.J].计算机学报,2003,12:110-119.
    [97]郭辉,刘贺平,王玲.最小二乘支持向量机参数选择方法及其应用研究[J].系统仿真学报,2006,18(7):2033-2036.
    [98]冯二波.领域实体属性及事件抽取技术研究[D].哈尔滨:哈尔滨工业大学,2008.
    [99]于江德,樊孝忠,庞文博.事件信息抽取中语义角色标注研究[J].计算机科学,2008,35(3):155-157.
    [100]Naughton M. Kushme N, Carthy J. Event Extraction from Heterogeneous News Sources[C]. In:Proceedings of the Workshop Event Extraction and Synthesis, American National Conference in Artificial Intelligence,2006:1-6.
    [101]许红磊,陈锦秀.自动识别事件类别的中文事件抽取技术研究[J].心智与计算,2010,4(1):34-44.
    [102]Chen Z, Ji H. Language Specific Issue and Feature Exploration in Chinese Event Extraction[C]. In:Proceedings of Human Language Technologies:The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics,2009:209-212.
    [103]付剑锋,刘宗田,付雪峰.基于依存分析的事件识别[J].计算机科学,2009,36(11):217-219.
    [104]Lodha S K, Verma A K. Spatio-temporal Visualization of Urban Crimes on a GIS Grid [C]. In:Proceedings of the 8th ACM International Symposium on Advances in Geographic Information Systems, ACM,2000:174-179.
    [105]Girardin F, Calabrese F, Fiore F D, et al. Digital Footprinting:Uncovering Tourists With User-generated Content[J]. Pervasive Computing,2008,7(4):36-43.
    [106]Shaw S L, Yu H, Bombom L S. A Space-Time GIS Approach to Exploring Large Individual-Based Spatio-Temporal Datasets[J]. Transactions in GIS,2008,12(4): 425-441.
    [107]石建军,许国华,何民等.交通地理信息系统数据模型的研究进展[J].北京工业大学学报,2004,30(3):318-322.
    [108]李勇,陈少,陈少沛等.基于基态距优化的改进基态修正时空数据模型研究[J].测绘科学,2007,32(1):26-29.
    [109]Langran G, Chrisman, N R. A Framework for Temporal Geographical Information[J]. Cartographica,1988,25(3):11-14.
    [110]龚健雅.GIS中面向对象时空数据模型[J].测绘学报,1997,26(4):289-298.
    [111]石云.空间数据采掘的研究与进展[J].计算机研究与发展,1999,26(11):1301-1309.
    [112]王春波,张军,蒋涛.基于事件的时空数据模型应用研究[J].测绘科学,2005,30(2):67-69.
    [113]冯文娟,杜云艳,苏奋振.台风时空过程的网络动态分析技术与示例[J].地球信息科学,2007,9(5):57-63.
    [114]李昭,黄克玲,刘仁义等.海洋-大气二氧化碳通量三维时空可视化研究[J].浙江大学学报(理学版),2011,38(2):229-233.
    [115]Forer P C, Kivell H. Space Time Budgets, Public Transport, and Spatial Choice[J]. Environment and Planning,1981,13(4):497-509.
    [116]林广发,黄永胜.GIS在时间地理学中的应用初探[.J].人文地理,2002,17(5):69-72.
    [117]Kwan M P. Interactive Geovisualization of Activity-travel Patterns Using Three-dimensional Geographical Information Systems:A Methodological Exploration With a Large Data Set [J]. Transportation Research Part C:Emerging Technologies,2000,8(1):185-203.
    [118]黄潇婷.基于时间地理学的景区旅游者时空行为模式研究—以北京颐和园为例[J].旅游学刊,2009,24(6):82-88.
    [119]Jones C B, Purves R S, Clough P D, et al. Modelling Vague Places With Knowledge From the Web[J]. International Journal of Geographical Information Science,2008,22(10):1045-1065.
    [120]Mehler A, Bo Y, Li X, et al. Spatial Analysis of New Sources[J]. Visualization and Computer Graphics, IEEE Transactionson,2006,12(5):765-772.
    [121]Kurashima Y, Tezuka T. Blog Map of Experiences:Extracting and Geographically Mapping Visitor Experiences from Urban Blogs[C]. In:Proceedings of International Conference on Web Information Systems Engineering, Springer Berlin Heidelberg,2005:496-503.
    [122]Akerberg O, Svensson H, Schulz B. CarSim:An Automatic 3D Text-to-Scene Conversion System Applied to Road Accident Reports[C]. In:Proceedings of the Tenth Conference on European chapter of the Association for Computational Linguistics,2003:191-194.
    [123]Johansson R, Berglund A, Danielsson M, Nugues P. Automatic Text-to-Scene Conversion in the Traffic Accident Domain[C]. In:Proceedings of the 19th International Joint Conference on Artificial Intelligence,2005:1073-1078.
    [124]Damianos L E, Bayer S, Chisholm M A, et al. MiTAP for SARS Detection[C]. In: Proceedings of Association for Computational Linguistics,2004:12-20.
    [125]Careem M, Silva C D, Silva R D. Sahana:Overview of a Disaster Management System[C]. In:Proceedings of International Conference on Information and Automation,2006:361-366.
    [126]Kameda H. Keynote Presentation, Information Sharing for Technology and Knowledge Based on Implementation Strategies-Disaster Reduction Hyperbase (DRH) Project[C]. In:Proceedings of Sixth DPRI-IIASA Forum on Disaster Risk Management, Risk and Challenges for Business and Industry,2006:120-128.
    [127]Ma Y, Kalashnikov D V, Hariharan R, et al. On-demand Information Portals for Disaster Situations[C]. In:Proceedings of 1st EEE International Conference on Intelligence and Security Informatics Location,2007:224-230.
    [128]李克莉,冯子健.突发公共卫生事件及其监测系统[J].疾病监测,2007,22(4): 282-284.
    [129]Hale J E, Dulek R E, Hale D P. Crisis Response Communication Challenge: Building Theory from Qualitative Data [J]. Journal of Business Communication, 2005,42(2):112-134.
    [130]Lucas C, Mueller M, Pete, B. Integration of Language in GIS:Models in Ownership Cadastre and Disaster Management [J].Photogrammetrie, Fernerkundung, Geoinformation,2008,3:217-225.
    [131]Schuffert S, Richter D, Wiesel J. Investigation of Uncertainty in Spatial Descriptions and its Modeling[C]. In:Proceedings of 5th International Workshop on Digital Approaches,2010:795-804.
    [132]Goodchild M F. Geographical Information Science [J]. International Journal of Geographical Information Systems,1992,6(1):31-45.
    [133]许荣华,吴刚,李培峰等.基于事件框架的主题事件融合研究[J].计算机应用研究,2009,26(12):4542-4545.
    [134]方经民.空间方位参照的认知结构[J].世界汉语教学,1999,50(4):32-38.
    [135]王际桐.地名学概论[M].北京:中国社会出版社,1993,1-10.
    [136]Cohn A G, Bennett B, Gooday J, et al. Qualitative Spatial Representation and Reasoning With the Region Connection Calculus[J]. Geoinformatica,1997,1(3): 275-316.
    [137]胡鹏,黄杏元,华一新.地理信息系统教程[M].武汉大学出版社,2008,75-89.
    [138]邬桐,周雅倩,黄萱菁等.自动构建时间基元规则库的中文时间表达式识别[J].中文信息学报,2010,24(004):3-10.
    [139]张春菊,张雪英,朱少楠等.基于网络爬虫的地名数据库维护方法[J].地球信息科学学报,2011,3(4):1-8.
    [140]温艳霞,谭红叶,郑家恒.基于规则的时间规范化研究[J].计算机科学,2009,36(4B):45-47.
    [141]黄杏元,马劲松,汤勤.地理信息系统概论(修订版)[M].高等教育出版社,2008,70-95.
    [142]伍星,何中市,黄永文.基于弱监督学习的产品特征抽取[J]. Computer Engineering,2009,35(13):1101-1107.
    [143]程涛,施水才,王霞等.基于同义词词林的中文文本主题词提取[J].广西师范大学学报(自然科学版),2007,25(2):145-148.
    [144]Abney S. Bootstrapping[C]. In:Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics,2002:360-367.
    [145]Salton G, Lesk M E. Computer Evaluation of Indexing and Text Processing[J]. Journal of the ACM (JACM),1968,15(1):8-36.
    [146]丁国栋,白硕,王斌.文本检索的统计语言建模方法综述[J].计算机研究与发展,2006,43(5):769-776.
    [147]车万翔,刘挺,秦兵,等.基于改进编辑距离的中文相似句子检索[J].高技术通讯,2004,14(7):15-19.
    [148]苏金树,张博锋,徐听.基于机器学习的文本分类技术研究进展[J]. Journal of Software,2006,17(9):1848-1859.
    [149]Vapnik V, Levin E, Cun Y L. Measuring the VC-Dimension of a Learning Machine [J]. Neural Computation,1994,6(5):851-876.
    [150]邓乃扬,田英杰.支持向量机—理论、算法与拓展[M].科学出版社,2009,81-101.
    [151]Amari S, Wu S. Imporving Support Vector Machine Classifiers by Modifying Kernel Functions [J]. Neural Networks,1999,12(6):783-789.
    [152]郭丽娟,孙世宇,段修生.支持向量机及核函数研究[J].科学技术与工程,2008,8(2):487-490.
    [153]Manning C D, Schutze H.统计自然语言处理基础[M].电子工业出版社,2005,230-254.
    [154]刘健,张维明.基于互信息的文本特征选择方法研究与改进[J].计算机工程与应用,2008,44(10):135-137.
    [155]刘刚,胡四泉,范值华等.神经网络在文本分类上的一种应用[J].计算机工程与应用,2003,39(6):73-76.
    [156]张健沛,徐华.支持向量机主动学习方法研究与应用[J].计算机应用,2004,24(1):1-3.
    [157]丁效,宋凡,秦兵等.音乐领域典型事件抽取方法研究[J].中文信息学报,2011,25(2):15-20.
    [158]王勇.香农信息定义分析与改进[J].情报杂志,2008,27(8):57-60.
    [159]林永民,吕震宇,赵爽等.文本特征加权方法TF-IDF的分析与改进[J].计算机工程与设计,2008,29(11):2923-2925.
    [160]Pred A. The Choreography of Existence:Comments on Hagerstrand's Time Geography and Its Usefulness [J]. Economic Geography,1977 (2):207-221.
    [161]柴彦威.时间地理学的起源、主要概念及其应用[J].地理科学,1998,18(1):65-73.
    [162]柴彦威,赵莹.时间地理学研究最新进展[J].地理科学,2009,29(4):593-601.
    [163]Egenhofer M J, Robert D F. Point-set Topological Spatial Relations [J]. International Journal of Geographical Information System,1991,5(2):161-174.
    [164]Rosenblatt M. Remarks'on Some Nonparametric Festinates of a Density Function[J]. Annals of Mathematical Statistics,1956,27 (6):832-837.
    [165]Parzen E. Chestirrlation of a Proability Density Function and Mode [J]. Annals of Mathematical statistics,1962,33(8):1065-1076.
    [166]Tomoki N, Yano K J. Visualizing Crime Clusters in a Space-time Cube:An Exploratory Data-analysis Approach Using Space-time Kernel Density Estimation and Scan Statistics[J]. Transactions in GIS,2010,14(3):223-239.
    [167]Chen J, Shaw S L, Yu H, et al. Exploratory Data Analysis of Activity Diary Data:a Space-time GIS Approach[J]. Journal of Transport Geography,2011,19(3): 394-404.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700