Speech Acts Annotation of Everyday Conversations in the ORD Сorpus of Spoken Russian
详细信息    查看全文
  • 关键词:Corpus linguistics ; Speech corpus ; Pragmatics ; Spoken Russian ; Everyday dialogues ; Speech acts ; Discourse ; Annotation
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9811
  • 期:1
  • 页码:627-635
  • 全文大小:112 KB
  • 参考文献:1.Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T.: The ORD speech corpus of Russian everyday communication “One Speaker’s Day’’: creation principles and annotation. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 250–257. Springer, Heidelberg (2009)CrossRef
    2.Sherstinova, T.: Macro episodes of Russian everyday oral communication: towards pragmatic annotation of the ORD speech corpus. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 268–276. Springer, Heidelberg (2015)CrossRef
    3.Sherstinova, T.: Approaches to Pragmatic Annotation in the ORD Corpus: Microepisodes and Speech Acts. In: Proceedings of the International Conference on “Corpus linguistics-2015”, pp. 436–446 (2015)
    4.Weisser, M.: Speech act annotation. In: Aijmer, K., Rühlemann, C. (eds.) Corpus Pragmatics: a Handbook, pp. 84–111. CUP, Cambridge (2014)
    5.Potapova, R.K.: Speech: Communication, Information, Cybernetics. URSS, Moscow (2003) (In Russian)
    6.Austin, J.L.: How To Do Things With Words. Oxford University Press, Oxford (1962)
    7.Searle, J.R.: A classification of illocutionary acts. Lang. Soc. 5(1), 1–23 (1976)CrossRef
    8.Qadir, A., Riloff, E.: Classifying sentences as speech acts in message board posts. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP-2011), pp. 748–758 (2011)
    9.Bakhtin, M.M.: Speech Genres and Other Late Essays. University of Texas Press, Austin (1986). Edited by Caryl Emerson and Michael Holquist, Translated by Vern W. McGee
    10.Jurafsky, D.: Pragmatics and computational linguistics. In: Horn, L., Ward, G. (eds.) The Handbook of Pragmatics, pp. 578–604. Blackwell, Oxford (2006)CrossRef
    11.Allen, J., Core, M.: Draft of DAMSL: dialog act markup in several layers (1997). https://​www.​cs.​rochester.​edu/​research/​speech/​damsl/​RevisedManual/​
    12.Leech, G., Weisser, M.: Generic speech act annotation for task-oriented dialogues. In: Proceedings of the Corpus Linguistics 2003 Conference, vol. 16. UCREL Technical Papers, Lancaster University (2003)
    13.Weisser, M.: SPAACy: a semi-automated tool for annotating dialogue acts. Int. J. Corpus Linguist. 8(1), 63–74 (2003)CrossRef
    14.Carletta, J., Isard, A., Isard, S., Kowtko, J.S., Doherty-Sneddon, G., Anderson, A.H.: The reliability of a dialogue structure coding scheme. Comput. Linguist. 23, 13–32 (1997)
    15.Blum-Kulka, S., Olshtain, E.: Requests and apologies: a cross-cultural study of speech act realization patterns (CCSARP). Appl. Linguist. 5(3), 196–215 (1984)CrossRef
    16.Stiles, W.: Describing Talk: A Taxonomy of Verbal Response Modes. Sage, Newbury Park (1992)
    17.Borisova, I.N.: Russian spoken dialogue. Structure and Dynamics. KomKniga, Moscow (2009) (In Russian)
    18.Hellwig, B., Van Uytvanck, D., Hulsbosch, M., et al.: ELAN — Linguistic Annotator. Version 4.9.3 (2014). http://​tla.​mpi.​nl/​tools/​tla-tools/​elan/​
  • 作者单位:Tatiana Sherstinova (16)

    16. Saint Petersburg State University, 7/9 Universitetskaya Nab., St. Petersburg, 199034, Russia
  • 丛书名:Speech and Computer
  • ISBN:978-3-319-43958-7
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9811
文摘
The paper describes annotation principles developed for tagging of speech acts in the “One Day of Speech” (ORD) corpus of Russian everyday speech, with special attention being paid to categories and subcategories of speech acts distinguished in the ORD. Annotation of speech acts is a part of pragmatic annotation of the corpus, which includes as well the tagging of macro- and microepisodes of verbal communication. Speech acts are annotated on four levels: (1) the orthographic transcript with information on syntagmatic and phrasal boundaries, (2) the speakerʼs code, (3) the main category of a speech act, and (4) its subcategory. Practical approbation of the proposed annotation scheme has been made on the material of 6 macroepisodes of everyday communication, in which 2250 speech acts have been discerned. Pragmatic annotation of the ORD corpus provides an opportunity to study everyday discourse in terms of speech acts and to study linguistic properties and patterns of speech acts of different types.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700