Stylized Facts of Linguistic Corpora: Exploring the Lexical Properties of Affect in News
详细信息    查看全文
  • 关键词:Text analysis ; Corpus linguistics ; Linguistic properties ; Stylized facts ; Sentiment analysis
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9937
  • 期:1
  • 页码:494-502
  • 全文大小:153 KB
  • 参考文献:1.Cont, R.: Empirical properties of asset returns: stylized facts adn statistical issues. Quant. Finance 1, 223–236 (2001)CrossRef
    2.Taylor, S.J.: Asset Price Dynamics, Volatility, and Prediction. Princeton University Press, Princeton (2011)CrossRef MATH
    3.Shiller, R.J., Perron, P.: Testing the random walk hypothesis: power versus frequency of observation. Econ. Lett. 18(4), 381–386 (1985)CrossRef MATH
    4.Tetlock, P.C.: Giving content to investor sentiment: the role of media in the stockmarket. J. Finance 62(3), 1139–1168 (2007)CrossRef
    5.Garcia, D.: Sentiment during recessions. J. Finance LXVIII 3, 1267–1300 (2013). doi:10.​1111/​jofi.​12027 CrossRef
    6.Antweiler, W., Frank, M.Z.: Is all that talk just noise? the information content of internet stock message boards. J. Finance 59(3), 1259–1294 (2004)CrossRef
    7.Ahmad, K.: Being in text and text in being: notes on representative texts. In: Andeman, G., Rogers, M. (eds.) Incorporating Corpora, pp. 60–91. Multilingual Matters, Clevedon (2008)
    8.Loughran, T., McDonald, B.: The use of word lists in textual analysis. J. Behav. Finance 16(1), 1–11 (2015)CrossRef
    9.Davies, M., The corpus of contemporary american english: 450 million words, 1990-present (2008)
    10.British National Corpus. Oxford University, Humanities Computing Unit, New York (2000)
    11.Kelly, S.: Signs of irrational exuberance: an investigation into the role of news and sentiment in finance. Ph.D. thesis, Trinity College, University of Dublin (2015)
    12.Zhao, Z., Ahmad, K.: Qualitative and quantitative sentiment proxies: interaction between markets. In: Jackowski, K., Burduk, R., Walkowiak, K., Woźniak, M., Yin, H. (eds.) IDEAL 2015. LNCS, vol. 9375, pp. 466–474. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-24834-9_​54 CrossRef
    13.Zhao, Z., Ahmad, K.: A computational account of investor behaviour in chinese and US market. Int. J. Econ. Behav. Organ. 3(6), 78–84 (2015)
    14.Stone, P.J., Dunphy, D.C., Smith, M.S., Olgilvie, D.M., with associates: The General Inquirer: A Computer Approach to Content Analysis. The MIT Press, Cambridge (1966)
    15.Esuli, A., Sebastiani, F.: Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol. 6, pp. pp. 417–422. Citeseer (2006)
    16.Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retrieval 2(1–2), 1–135 (2008)CrossRef
    17.Loughran, T., McDonald, B.: When is a liability not a liability. J. Finance 66, 35–65 (2011)CrossRef
    18.Cook, J.A., Ahmad, K.: Behaviour and markets: the interaction between sentiment analysis and ethical values? In: Jackowski, K., Burduk, R., Walkowiak, K., Woźniak, M., Yin, H. (eds.) IDEAL 2015. LNCS, vol. 9375, pp. 551–558. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-24834-9_​64 CrossRef
    19.Kelly, S., Ahmad, K.: The impact of news media and affect in financial markets. In: Jackowski, K., Burduk, R., Walkowiak, K., Woźniak, M., Yin, H. (eds.) IDEAL 2015. LNCS, vol. 9375, pp. 535–540. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-24834-9_​62 CrossRef
  • 作者单位:Jason A. Cook (21)
    Zeyan Zhao (21)
    Khurshid Ahmad (21)

    21. School of Computer Science and Statistics, Trinity College, Dublin, Ireland
  • 丛书名:Intelligent Data Engineering and Automated Learning ¨C IDEAL 2016
  • ISBN:978-3-319-46257-8
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9937
文摘
Investors are often said to be driven by emotions, and studies in sentiment analysis claim that there is a causal relationship between negative affect in text and prices in financial markets. The text collections used in these studies tend to be of varying sizes and sources, with little justification of their design criteria. This is a classic data engineering problem, which requires specification of the data sources and design of the data repositories and retrieval facilities. In this paper, we explore the statistical properties of negative affect expressed in various textual corpora, differing in specification, size and provenance. The question we ask is whether there are any stylized facts of negative affect that are universal across all texts. We observed two main findings: (1) The frequency distribution of negative terms is generally stable across different corpus sizes and (2) The frequency of negative terms accounts for a relatively small proportion of the total terms in the corpus.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700