Natural language text carries two kinds of information: objective and subjective. Subjective information expresses a person's attitude, standpoint, and opinion toward a specific object. Text sentiment analysis targets this subjective information: it recognizes, classifies, extracts, and annotates expressions of sentiment, opinion, and affect in text.
     With the rapid growth of internet use, more and more subjective information appears on social media such as forums, communities, blogs, and shopping websites. Both individuals and organizations have come to rely heavily on online reviews when making decisions. However, because of the sheer volume of information available, a person or organization has to search, check, and judge reviews one by one before reaching a decision. In this situation it is very useful to first summarize the large body of relevant information; such a summary is valuable to both customers and manufacturers. This task is called opinion-based multi-document summarization. Furthermore, customers can obtain information far more efficiently if the original text is analyzed automatically, for example to determine which reviews express a positive attitude, which express a negative one, and to what extent. This task is called sentiment classification.
     This thesis focused on opinion-based multi-document summarization and sentiment classification, two areas of text sentiment analysis. It consists of the following three parts:
     1) Developed a new method for opinion-based multi-document summarization
     Current opinion-based multi-document summarization is mainly based on the features or aspects of the reviewed object and is called feature/aspect-based opinion summarization. It depends heavily on accurate recognition of opinion features and opinion words; in practice, however, the opinion feature or opinion word often does not appear explicitly in the sentence. Feature/aspect-based opinion mining therefore misses opinions that are only implied, which degrades the subsequent summarization. Accurate feature/aspect recognition also requires domain knowledge, which makes the approach domain dependent. Finally, the feature/aspect-based method concentrates on recognizing and evaluating individual features, so it cannot provide a summary of the main topics and central ideas that span all the opinions.
     To overcome these problems, this thesis proposed a general, domain-independent multi-document opinion summarization method. The method builds on traditional extractive multi-document summarization and combines the probabilistic topic model Latent Dirichlet Allocation (LDA) with semantic orientation. It first models the set of sentences from the documents with LDA to uncover the latent topics, and uses Gibbs sampling to obtain the sentence-topic and topic-word probability distributions; in parallel, it performs part-of-speech analysis on the sentences and computes the semantic orientation of their words with WordNet and SentiWordNet. It then computes topic importance and word importance in turn and, on that basis together with the semantic orientation of the words, computes sentence importance. Finally, it extracts sentences in order of importance and removes redundant sentences according to their topics to produce the extractive summary. In this way the method uses LDA to identify the important topics in the review text and uses semantic orientation to find the strongly subjective opinions on those topics. Experimental results show that the summaries produced by this method are closer to expert summaries.
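The abstract names the scoring stages but not their formulas, so the following Python sketch is only one plausible reading of the pipeline. It assumes the LDA distributions have already been estimated (no Gibbs sampling is run here). All function names, the averaging formulas, the `1 + |orientation|` boost, and the "one sentence per dominant topic" redundancy rule are illustrative assumptions, not the thesis's actual definitions.

```python
from typing import Dict, List


def topic_importance(theta: List[List[float]]) -> List[float]:
    """Average each topic's probability over all sentences (assumed formula)."""
    n_topics = len(theta[0])
    return [sum(row[k] for row in theta) / len(theta) for k in range(n_topics)]


def word_importance(phi: List[Dict[str, float]], t_imp: List[float]) -> Dict[str, float]:
    """Sum each word's topic-word probability, weighted by topic importance."""
    scores: Dict[str, float] = {}
    for k, dist in enumerate(phi):
        for word, p in dist.items():
            scores[word] = scores.get(word, 0.0) + t_imp[k] * p
    return scores


def sentence_importance(tokens: List[str], w_imp: Dict[str, float],
                        orientation: Dict[str, float]) -> float:
    """Mean word importance, boosted by the strength of each word's orientation."""
    if not tokens:
        return 0.0
    total = sum(w_imp.get(w, 0.0) * (1.0 + abs(orientation.get(w, 0.0)))
                for w in tokens)
    return total / len(tokens)


def summarize(sentences: List[List[str]], theta: List[List[float]],
              phi: List[Dict[str, float]], orientation: Dict[str, float],
              max_sentences: int) -> List[int]:
    """Return indices of extracted sentences, highest importance first."""
    w_imp = word_importance(phi, topic_importance(theta))
    ranked = sorted(range(len(sentences)),
                    key=lambda i: sentence_importance(sentences[i], w_imp, orientation),
                    reverse=True)
    chosen: List[int] = []
    covered = set()
    for i in ranked:
        if len(chosen) >= max_sentences:
            break
        dominant = max(range(len(theta[i])), key=lambda k: theta[i][k])
        if dominant in covered:  # assumed redundancy rule: one sentence per topic
            continue
        covered.add(dominant)
        chosen.append(i)
    return chosen


# Toy inputs (hypothetical): three tokenized sentences and two topics.
sentences = [["battery", "great"], ["battery", "lasts"], ["screen", "terrible"]]
theta = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]          # sentence-topic distribution
phi = [{"battery": 0.5, "great": 0.3, "lasts": 0.2},  # topic-word distribution
       {"screen": 0.6, "terrible": 0.4}]
orientation = {"great": 0.8, "terrible": -0.9}        # e.g. derived from SentiWordNet

print(summarize(sentences, theta, phi, orientation, max_sentences=2))  # [0, 2]
```

In this toy run the second battery sentence is skipped because its dominant topic is already covered by a higher-scoring sentence, which is the kind of topic-based redundancy removal the abstract describes.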
     2) Developed a new ensemble-learning-based method for sentiment classification of unbalanced data
     Work on binary sentiment classification has concentrated on improving classification performance while neglecting unbalanced data, in which one category has several times as many samples as the other. Most studies of sentiment classification use balanced data, so their methods perform well on balanced data but fail to maintain that performance in practical applications. It is therefore necessary to study and develop methods that handle unbalanced data and improve the performance of sentiment classification in practice.
     To this end, this thesis proposed a sentiment classification method that combines unbalanced-data classification with ensemble learning. As a hybrid method, it works at both the algorithm level and the data level. Within the ensemble learning framework, it integrates three techniques for processing the training set: under-sampling, bootstrap re-sampling, and random feature selection. Combining the three yields training subsets that are more diverse in both sample space and feature space, which in turn yields more diverse base classifiers and ultimately a stronger ensemble classifier. Experiments on unbalanced sentiment classification data show that this approach significantly improves classification performance.
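The way the three techniques can feed one ensemble is sketched below in Python. This is a minimal illustration under stated assumptions, not the thesis's implementation: the abstract does not name the base classifier or the combination rule, so a hand-written nearest-centroid classifier and majority voting stand in for them. The bootstrap is drawn per class so that every subset contains both classes, and all function names and the toy data are hypothetical.

```python
import random
from typing import Dict, List, Tuple

Model = Tuple[Dict[str, List[float]], List[int]]  # (class centroids, feature indices)


def make_subset(X: List[List[float]], y: List[str], n_features: int,
                rng: random.Random) -> Tuple[List[int], List[int]]:
    """Pick training rows and feature columns for one base classifier."""
    minority_label = min(sorted(set(y)), key=y.count)
    minority = [i for i, label in enumerate(y) if label == minority_label]
    majority = [i for i, label in enumerate(y) if label != minority_label]
    # Under-sampling: keep as many majority samples as there are minority samples.
    kept = rng.sample(majority, len(minority))
    # Bootstrap re-sampling: draw with replacement inside each class.
    rows = ([rng.choice(minority) for _ in minority]
            + [rng.choice(kept) for _ in kept])
    # Random feature selection: train on a random subset of the columns.
    feats = sorted(rng.sample(range(len(X[0])), n_features))
    return rows, feats


def train_centroids(X: List[List[float]], y: List[str], rows: List[int],
                    feats: List[int]) -> Dict[str, List[float]]:
    """Stand-in base learner: the mean vector of each class on the chosen features."""
    sums: Dict[str, List[float]] = {}
    counts: Dict[str, int] = {}
    for i in rows:
        acc = sums.setdefault(y[i], [0.0] * len(feats))
        for j, f in enumerate(feats):
            acc[j] += X[i][f]
        counts[y[i]] = counts.get(y[i], 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def train_ensemble(X: List[List[float]], y: List[str], n_models: int,
                   n_features: int, seed: int = 0) -> List[Model]:
    rng = random.Random(seed)
    models: List[Model] = []
    for _ in range(n_models):
        rows, feats = make_subset(X, y, n_features, rng)
        models.append((train_centroids(X, y, rows, feats), feats))
    return models


def predict(models: List[Model], x: List[float]) -> str:
    """Majority vote over the base classifiers (assumed combination rule)."""
    votes = []
    for centroids, feats in models:
        votes.append(min(sorted(centroids), key=lambda label: sum(
            (x[f] - centroids[label][j]) ** 2 for j, f in enumerate(feats))))
    return max(sorted(set(votes)), key=votes.count)


# Toy unbalanced data (hypothetical): 8 "pos" samples and 2 "neg" samples.
X = [[1.0, 0.9, 1.1], [0.9, 1.0, 1.0], [1.1, 1.1, 0.9], [1.0, 1.0, 1.0],
     [0.8, 0.9, 1.0], [1.2, 1.0, 1.1], [1.0, 1.1, 1.0], [0.9, 0.9, 0.9],
     [0.0, 0.1, 0.0], [0.1, 0.0, 0.1]]
y = ["pos"] * 8 + ["neg"] * 2

models = train_ensemble(X, y, n_models=5, n_features=2)
print(predict(models, [0.05, 0.05, 0.05]))  # neg
```

Each base classifier sees a different balanced sample and a different pair of features, which is where the diversity in sample space and feature space comes from.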
     3) Developed a fine-grained sentiment classification method and analyzed the effect of text pre-processing on sentiment classification
     Most studies of sentiment classification address binary classification, which categorizes subjective text as positive or negative. In reality, however, subjective text cannot always be classified simply as positive or negative. For example, reviews on many shopping websites carry ratings from 1 star to 5 stars, and classifying them only as positive or negative does not meet practical needs. To solve this problem, this thesis proposed fine-grained sentiment classification, which considers not only the positive or negative polarity of the review text but also the strength of its rating. The thesis further analyzed the essential difference between fine-grained sentiment classification and traditional multi-class categorization.
     Because sentiment classification differs from traditional topic-based categorization, and in order to study fine-grained sentiment classification more thoroughly, this thesis used supervised machine learning to analyze the factors that affect sentiment classification. Specifically, it compared the performance of combinations of the number of features, stop-word list, feature selection method, feature weighting scheme, and text categorization method. These studies indicated that stop-word lists and feature selection affect sentiment classification differently from topic-based classification. Finally, to study fine-grained sentiment classification of Chinese text, the thesis ran machine-learning experiments on reviews of Chinese scientific literature. In these experiments, the rating attached to each review was used as its category label, which removed the need for manual annotation. The experiments show that fine-grained sentiment classification not only differs from topic-based multi-class categorization but is also harder than both traditional multi-class categorization and binary sentiment classification.