情感倾向分析中的结构化方法

英文题名：Structured Methods for Opinion Mining
作者：吴苑斌
论文级别：博士
学科专业名称：计算机应用技术
中文关键词：情感倾向分析 ; 结构化机器学习 ; 短语依存句法树 ; 树核函数 ; 倾向性的图表示 ; 解码算法 ; 整数线性规划
英文关键词：Opinion mining ; Structured learning ; Phrase dependency tree ; Tree kemel ; Graph-based sentiment representation ; Inference algorithm ; Integer
英文关键词：linear programming
学位年度：2012
导师：黄萱菁
学科代码：081203
学位授予单位：复旦大学
论文提交日期：2012-04-09

摘要

近年来情感倾向分析在自然语言处理领域引起了广泛的关注.它可以帮助分析文本中与情感相关的信息,从而提供直接的应用结果或者为其他的自然语言处理任务服务.结构化方法是自然语言处理的各个任务中广泛使用的一类机器学习方法,它通过利用结构化的信息提高分类器的性能.本文中主要研究倾向性信息抽取任务中的结构化方法.
     首先,对于倾向性信息抽取中的评价词,评价对象的关系抽取任务,过去的关系抽取方式要么为简单的将相邻评价词,评价对象的关联在一起,要么依靠手工制定的模板,都没有充分利用句法树上的信息.同时,也忽略了评价词.评价对象的短语结构.本文提出了短语依存句法树,将短语结构引入了依存句法树中,较好的处理了短语间的依存关系.在短语依存句法树上,首次提出了依赖于短语结构的树核函数.它能够区别对待不同类型的依存关系,很大的提高了树核函数在关系抽取中的辨识能力.在5个不同领域的在线评论语料上的实验证明了短语依存句法树能够很好的处理短语类型的评价词,评价对象；同时,新的树核函数能够有效的提高关系抽取的各方面性能.
     其次、传统的文本倾向性信息表示忽略了文本中许多与倾向性相关的信息.这使得最终的抽取结果可能是不准确,不完整的.针对这样的问题,本文提出了基于图的倾向性表示.其中除了传统的评价词,评价对象等要素外,还包括了对评价词的限制隐含的评价对象,以及评价词之间的关系.它极大的丰富了倾向性信息抽取的结果,也扩充了倾向性任务处理的对象能够提供更加精确,更加完备的抽取结果.本文使用了一种新的结构化方法将一个句子的倾向性信息转化成对应的图表示.它通过整数线性规划,有力的整合了图上的各类结构化约束,同时有较强的扩展能力和稳定性.在中文在线评论语料库上的实验证明,基于图的倾向性表示有较强的表示能力,同时结构化方法能构明显的提高倾向信信息抽取系统的各方面性能.
Sentiment analysis and opinion mining have received much attention in re-cent years. A number of automatic methods have been proposed to identify and extract opinions, emotions, and sentiments from text. It will facilitate both opin-ion related application and other natural language processing(NLP) tasks. Struc-tured learning, which utilize structure information to improve machine learning approaches, has successed in many NLP field and is considered to be one of the most effective methods.
     This work is focusd on structured learning methods in opinion mining. First we present a novel approach for mining opinions from product reviews, where it converts opinion mining task to identify product features, expressions of opin-ions and relations between them. Previous works on this topic are either simply relate adjacent opinion expression and product feature, or use hand-written pat-terns to extract relation directly. By taking advantage of the observation that a lot of product features are phrases, a concept of phrase dependency parsing is introduced, which extends traditional dependency parsing to phrase level. This concept is then implemented for extracting relations between product features and expressions of opinions by a newly designed tree kerenl. Experimental eval-uations show that the mining task can benefit from phrase dependency parsing and the new tree kernel function.
     Second, Based on analysis of on-line review corpus we observe that most sentences have complicated opinion structures. Existing methods, such as frame-based and feature-based ones, igore a lot of useful information. In this work, a novel graph-based representation for sentence level sentiment is proposed. An structured learning method with integer linear programming-based inference al-gorithm is then introduced to produce the graph representations of input sen-tences. Experimental evaluations on a manually labeled Chinese corpus demon-strate the effectiveness of the proposed approach.

引文

[1]Choi Yejin and Cardie Claire, Learning with Compositional Semantics as Structural Inference for Sub-sentential Sentiment Analysis, in Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2008.
    [2]Yessenalina Ainur, Yuo Yisong and Cardie Claire, Multi-level Structured Models for Document-level Sen-timent Classification, in Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2010,
    [3]Yessenalina Ainur and Cardie Claire, Compositional Matrix-Space Models for Sentiment Analysis, in Pro-ceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2011, 172-182.
    [4]Yessenalina Ainur, Choi Yejin and Cardie Claire, Automatically generating annotator rationales to improve sentiment, classification, in Proceedings of the ACL 2010 Conference Short Papers, (ACL), 2010, 336-341.
    [5]Veselin Stoyanov and Claire Cardie. Topic Identification for Fine-Grained Opinion Analysis, in Proceedings of the 22nd International Conference on Computational Linguistics, (COLINC), 2008. 817-824.
    [0]Yejin Choi, Eric Brock and Claire Cardie, Joint Extraction of Entities and Relations for Opinion Recognition, iu Proceedings of the 2006 Conference on Empirical Mtlhods in Natural Language Processing (EMNLP), 2006, 431-439
    [7]Yejin Choi and Claire Cardie, Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification, in Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2009. 590-598
    [8]Bo Pang, Lillian Lee and Shivakumar Vaithyanathan. Thumbs up? Sentiment Classification using Machine Learning Techniques, in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), 2002 79-86.
    [9]Pang Bo and Lee Lillian, A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts, in Proceedings of the 42th Annual Meeting of the Association for Computational Linguistics (ACL), 2004 271-278.
    [10]Yejin Choi. Claire Cardie. Ellen Riloff, and Siddharth Patwardhan. Identifying sources of opinions with conditional random fields and extraction patterns, in Proceedings of Human Language Technology Conference and. Conference on Empirical Methods in Natural F,avguage Processing (HLT/EMNLP), 2005. 355-362.
    [11]Alistair Kennedy and Diana Inkpen, Sentiment Classification of Movie Reviews Using Contextual Valence Shifters. Computational Intelligence, 22(2):110-125, May 2000.
    [12]Nakagawa Tetsuji, Inni Kentaro and Kurohashi Sadaoa, Dependency Tree-based Sentiment Classification using CRFs with Hidden Variables Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. (HLT/NAACL), 2010. 786-794.
    [13]McDonald Ryan. Haunani Kerry, Neylon Tyler, Wells Mike and Reynar Jeff, Structured Models for Fiue-to-Coarse Sentiment Analysis, in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL), 2007 ,432-439.
    [14]Ellen RilofF and Janyce Wiebe. Learning extraction patterns for subjective expressions in Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2003, 105-112.
    [15]Gamon Michael, Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis, in Proceedings of the 18nd International Conference on Computational Linguistics, (COLIN G), 2004, 841-847
    [16]Shotaro Matsumoto, Hiroya Takamura and Manabu Okumura, Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees, in ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MIN-ING PROCEEDINGS., 2005, 301-311
    [17]Ng Vincent, Dasgupta Sajib and Arifin S. M. Niaz, Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews. in Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, 2006, 611-618
    [18]Mullen Tony and Collier Nigel, Sentiment analysis using support vector machines with diverse information sources, in Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2004, 412-418
    [19]Ivan Titov and Ryan McDonald, Modeling Online Reviews with Multi-grain Topic Models. in Proceeding of the nth international conference on World Wide Web (WWW), 2008, 111-120
    [20]Ivan Titov, Ryan McDonald A Joint Model of Text, and Aspect. Ratings for Sentiment Summarization. in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL). 2008, 308-316
    [21]Qiaozhu Mei, Xu Ling, Matthew Wondra, Hang Su, Chengxiang Zhai, Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs. in Proceeding of the 16th international conference on World Wide Web (WWW), 2007, 171-180
    [22]David M Bloi. Jon D McAuliffe. Supervised topic models. Advances in Netural Information Processing Systems (NIPS) . 2010. l-8
    [23]Yue Lu, Chengxiang Zhai. Opinion Integration Through Semi-supervised Topic Modeling in Proceeding of the 17th international conference on World Wide Web (WWW), 2008. 121-130
    [24]Wei Jin, Hung Hay Ho, A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining, in Proceedings of the 26th Annual International Conference on Machine Learning. (ICML), 2000.
    [25]Erie Breck, Yejin Choi. Claire Cardie, Identifying expressions of opinion in context. in Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (I.ICAI). 2007
    [26]Alekh Agarwal. Pushpak Bhattacharyya, Sentiment Analysis: A New Approach for Effective Use of Lin-guistic Knowledge and Exploiting Similarities in a Set of Documents to be Classified in Proceedings of the International Conference on Natural Language Processing ICON, 2005
    [27]John Blitzer. Mark Dredze, Fernando Pereira. Biographies. Bollywood. Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification, in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL). 2007, 440-447
    [28]Anthony Aw, Michael Gamon, Customizing Sentiment, Classifiers to New Domains: a Case Study in Pro-ceedings of Recent Advances in Natural Language I'rocessing (HA NLP), 2005
    [29]Theresa Wilson. Jauyce Wiebe, Paul Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. in Proceedings of the conference on Human Language Technology and Empiricat Methods in Notural Language Processing HLT/EMNLP, 2005, 347-354
    [30]Janyce Wiebe, Theresa Wilson, Rebecca Bruce, Matthew Bell, Melanie Martin, Learning Subjective Lan-gurage Computational Linguistics, CL, Volume: 30, Issue: 3. 2004, 277-308
    [31]Janyce Wiebe, Theresa Wilson, and Claire Cardie, Annotating expressions of opinions and emotions in language, in Language Resources and Evaluation, 2005, 39(2/3)
    [32]Bo Pang and Lillian Lee, Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales in Proceedings of the 43th Annual Meeting of the Association for Computational Linguistics (ACL), 2005, 115-124
    [33]Benjamin Snyder and Regina Barzilay, Multiple aspect ranking using the Good Grief algorithm in Proceed-ings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL). 2007, 300-307.
    [34]Andrew Goldberg and Xiaojin Zhu, Seeing stars when there aren' t many stars: Graph-based semi-supervised learning for sentiment categorization in Proceedings of HLT-NAACL 2006 Workshop on Textgraphs: Graph-based Algorithms for Natural Language Processing, 2006
    [35]Yi Mao, Joshua Dillon, Isotonic conditional random fields and local sentiment flow Advances in Neural Information Processing Systems (NIPS) , 2007, 1208-1215
    [36]Jun Zhao. Kang Lin and Gen Wang, Adding Redundant Features for CRFs-bascd Sentence Sentiment Classification, in Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2008, 117-126
    [37]Choi Yejin and Cardie Claire, Hierarchical Sequential Learning for Extracting Opinions and their Attributes in Proceedings of the ACL 2010 Conference Short Payers, 2010, 269-274
    [38]Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Vol1. Manfred Sl.ede, Lexicon-Based Methods for Sentiment Analysis Computational Linguistics. CL, Volume: 37, Issue: 2, 277-308
    [39]Soo-Min Kim. Eduard Hovy. Determining the sentiment of opinions in Proceedings of the 20th international conference. on Computational Linguislics (COLINC), 2004,
    [40]Jaap Kamps, Maurten Marx. Robert J. Mokkeu and Maarten Do Rijke. Using wovdnet to measure seman-tic orientations of adjectives, in Proceedings of 4th International Conference on Language Resources and Evaluation. LRRC, 2004, 1115-1118
    [41]Hassan Ahmed and Radev Dragomir R. Identifying Text Polarity Using Random Walks in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, (ACL), 2010, 395-403
    [42]Vasileios Hatzivassiloglou and Kathleen R. McKeown, Predicting the semantic orientation of adjectives in Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), 1997. 174-181
    [43]Ellen Riloff, Janyce Wiebe and Theresa Wilson Learning subjective nouns using extraction pattern boot-strapping, in Proceedings of Conference on Computational Natural Language Learning. CoNLL, 2003, 25-32
    [44]Satoshi Moriuaga. Ken.ji Yamanishi. Kenji Tateishi, Toshikazu Fukushima. Mining product reputations on the web. in Proceedings of the eighth ACM SICKDD international conference on Knowledge discovery and data mining (KDD). 2002. 341-349
    [45]Steven Bcthard, Hong Yu, Ashley Thornton. Vasileios Hatzivassiloglou, Dan Jurafsky, Automatic extraction of opinion propositions and their holders in Proceedings of the A A AI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, 2004
    [46]Philip Beineke, Trevor Hastie, Christopher Manning, Shivakumar Vaithyanathan, Exploring sentiment sum-marisation in Proceedings of the A A AI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, 2004
    [47]Bo Pang and Lillian Lee. Using very simple statistics for review search: An exploration, in Proceedings of the 24th international conference on Computational Linguistics, Companion volume: Posters(COLFNG), 2008, 73-76
    [48]Yue Lu, Malu Castellanos, Umeshwar Dayal, ChengXiang Zhai. Automatic Construction of a Context-Aware Sentiment, Lexicon: An Optimization Approach, in Proceeding of the 20th international conference on World Wide Web (WWW), 2011, 347-356
    [49]Minqing Hu, Bing Liu, Mining and Summarizing Customer Reviews, in Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. KDD. 2004,
    [50]Ana-Maria Popescu, Oren Etzioni, Extracting product features and opinions from reviews in Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP). 2005, 339-346
    [51]Nit.in Jindal, Bing Liu, Identifying comparative sentences in text documents in Proceedings of the 29th annual international ACM SICIR conference, on Research and development in information retrieval SICUR, 2006,
    [52]Ramanathan Narayanan. Bing Liu, Alok Choudhary, Sentiment Analysis of Conditional Sentences, in Pro-ceedings of the 2009 Conference on Empirical Methods in Natural Language Processing EMNLP, 2009.
    [53]Guang Qiu, Bing Liu. Jiajun Bu, Chun Chen, Expanding Domain Sentiment Lexicon through Double Propagation in PROCEEDINGS 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL IN-TELLIGENCE (IJCAl),'2000
    [51]Nitin Jindal and Bing Liu. Opinion spam and analysis, in Proceedings of the international conferenet on Web starch and web data mining (WSDM). 2008
    [55]Yu Hong. Hatzivassiloglou Vasileios. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2003
    [56]Min Zhang, Xingyao Ye, A generative model to unify topic relevance and lexicon-based sentiment for opinion retrieval, in Proceedings of The 31st Annual International ACM SIGIR Conference (SICIR). 2008
    [57]Xuanjing Huang. VV. Bruce Croft. A unified relevance model for opinion retrieval, in Proceedings of The 18th ACM International Conference, on Information and Knuwlcdyc Management (CIKM) . 2009
    [58]Michael Collins. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms, in Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing EMNLP, 2002. 1-8
    [59]Ioannis Tsochantaridis, Thomas Hof'mann, Thorsten Joachims, Yasemin Altun, Support Vector Machine Learning for Interdependent and Structured Output Spaces in Proceedings of 21th international conference on Machine learning (ICML). 2004
    [60]Chun-Nam Johm Yu. Thorsten Joachims . Learning Structural SYMs with Latent Variables in Procedings of the 26th Annual International Conference on Machine. Learning (ICML). 2009
    [61]Thorsten Joachims, Structured Output Prediction with Structural SVMs Invited talk. the 6th International Workshop on Mining and Learning with Graphs (MLG). 2008
    [62]Koby Crammer, Yoram Singer, On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines, in Journal of Machine Learning Research (JMLR),2002,265-292
    [63]Koby Crammer, Ofer Dekel, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Online Passive-Aggressive Algorithms in Jounal of Machine Learning Research (JMLR),2006,551-585
    [64]Koby Crammer, Ryan Mcdonald and Fernando Pereira, Scalable Large-MarginOnline Learning for Struc-tured Classification, Technical Report. Department of Computer and Information Science, University of Pennsylvania,2005.
    [65]Thorsten Joachims, Thomas Finley, Chun-Nam John Yu, Cutting-Plane Training of Structural SVMs, in Machine Learning,2009,27-59
    [66]Ben Taskar and Vassil Chatalbashev and Daphne Koller, Learning associative Markov Networks, in Pro-ceedings of the twenty-first international conference on Machine learning,2004
    [67]Ben Taskar, Carlos Guestrin, Daphne Koller, Max-margin Markov Networks, in Neural Information Processing Systems Conference, (NIPS),2003
    [68]Ben Taskar, Vassil Chatalbashev, Daphne Koller and Carlos Guestrin, Learning Structured Prediction Models:A Large Margin Approach, in Proceedings of the 22th international conference on Machine learning, 2005
    [69]Ben Taskar, Structured Prediction:A Large Margin Approach. Neural Information Processing Systems Conference (NIPS), Tutorial,2007
    [70]Ryan Mcdonald, Keith Hall, Gideon Mann, Distributed Training Strategies for the Structured Perceptron, in Human Language Technologies:The 2010 Annual Conference of the North American Chapter of the ACL, HLT/NAACL,2010,456-464
    [71]John D. Lafferty, Andrew McCallum and Fernando C. N. Pereira, Conditional Random Fields:Probabilis-tic Models for Segmenting and Labeling Sequence Data in Proceedings of the 18th Annual International Conference on Machine learning (ICML),2001
    [72]Christopher M. Bishop, Pattern Recognition and Machine Learning,Springer,2006
    [73]L. Tesni re. 1959. El'ments de syntaxe structurale. Editions Klincksieck.
    [74]Nozomi Kobayashi, Kentaro Inui, and Yuji Matsumoto Extracting aspect-evaluation and aspect-of relations in opinion mining, in Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL,2007.
    [75]Theresa Wilson, Janyce Wiebe, and Paul Hoffmann, Recognizing contextual polarity in phrase level senti-ment analysis, in Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing HLT/EMNLP,2005,317-351
    [76]Aron Culotta and Jeffrey Sorensen, Dependency tree kernels for relation extraction, in Proceedings of the 42th Annual Meeting of the Association for Computational Linguistics (ACL),2004
    [77]Dmitry Zelenko, Chinatsu Aone, Anthony Richardella. Kernel methods for relation extraction in Jounal of Machine Learning Research (JMLR),2003.1083-1106
    [78]Dan Klein and Christopher D. Manning. Fast exact inference with a factored model for natural language parsing. In Advances in Neural Information Processing Systems. NIPS,2002
    [79]E. Riloff and W. Phillips. An introduction to the sundance and autoslog systems. In University of Ulah School of Computing Technical Report UUCS-04-015..2004
    [80]R. Mcdonald and F. Pereira Identifying gene and protein mentions in text using conditional random fields. BMC Bioinformatics, 2005
    [81]R. Karp. Reducibility among combinatorial problems. In R. Miller and J. Thatcher, editors, Complexity of Computer Computations. 1972, 85-103. Plenum Press.
    [82]Sebastian Riedel and James Clarke. Incremental integer linear programming for non-projective dependency parsing, in Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2006
    [83]Riedel Sebastian and McCallum Andrew. Fast and Robust Joint Models for Biomedical Event Extraction, in Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2011 ,1-12
    [8-1]Micheal Collins, Structured Prediction Problems in Natural Language Processing, Invited talks. The 25th International Conference on Machine Learning (ICML). 2008
    [85]Noah Smith, Structured Prediction for Natural Language Processing Tutorial, the 20th international con-ference on Machine learning, (ICML), 2009
    [86]Michael I. Jordan, An Introduction to Probabilistic Graphical Models, unpublished book.
    [87]McClosky David, Surdeanu Mihai and Manning Christopher. Event. Extraction as Dependency Parsing, in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics. 2010, 1626-1635
    [88]Andre Martins, Xoah Smith, and Eric Xing. Concise integer linear programming formulations for depen-dency parsing. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, (ACL-IJCNLP), 2009, 342-350
    [89]Ryiin Mcdonald and Fernando Pereira Online learning of approximate dependency parsing algorithms. In Proceedings of the European Chapter of the ACL. (EACL). 2006, pages 81—88
    [90]Thomas I,. Magnauti and Laurence A. Wolsey. 1991. Optimal trees
    [91]赵军,许洪波,黄萱菁.谭松波.刘康,张奇,中文倾向性分析评测技术报告,第一届中文倾向性分析评测(COAE2008),2008
    [92]D. Roth and W, Yih, Integer Linear Programming Inference for Conditional Random Fields , in Proceedings of the International Conference on Machine Learninq (ICML). 2005
    [93]Pascal Denis and Jason Baldridge. Joint determination of anaphoricity and coreference resolution using integer programming, in Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics, HLT/NAACL. 2007, 236-243
    [94]Bo Pang and Lillian Lee. Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval 2(1-2). pp. 1-135, 2008.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700