Comparing Two Strategies for Query Expansion in a News Monitoring System
详细信息    查看全文
  • 关键词:Query analysis ; Query expansion ; Web IR and social media search
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9612
  • 期:1
  • 页码:267-275
  • 全文大小:315 KB
  • 参考文献:1.Arguello, J., Diaz, F., Callan, J., Crespo, J.: Sources of evidence for vertical selection. In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 315–322 (2009)
    2.Cao, G., Nie, J., Gao, J., Robertson, S.: Selecting good expansion terms for pseudo-relevance feedback. In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pp. 243–250 (2008)
    3.Cilibrasi, R., Vitanyi, P.M.: The Google similarity distance. IEEE Trans. Knowl. Data Eng. 19, 370–383 (2007)CrossRef
    4.Habibi, M., Popescu-Belis, A.: Using crowdsourcing to compare document recommendation strategies for conversations. In: Workshop on Recommendation Utility Evaluation, Held in Conjunction with ACM RecSys (2012)
    5.Manning, C., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)CrossRef MATH
    6.Ponte, J.M., Croft, B.: A language modeling approach to information retrieval.In: Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pp. 275–281 (1998)
    7.Tablan, V., Bontcheva, K., Roberts, I.: Mímir: an open-source semantic search framework for interactive information seeking and discovery. Web Semant. Sci. Serv. Agents World Wide Web 30, 52–68 (2015)CrossRef
    8.Zhao, L., Callan, J.: Term necessity prediction. In: Proceedings of the ACM International Conference on Information and Knowledge Management (CIKM), pp. 259–268 (2010)
  • 作者单位:Parvaz Mahdabi (18)
    Andrei Popescu-Belis (18)

    18. Idiap Research Institute, Martigny, Switzerland
  • 丛书名:Natural Language Processing and Information Systems
  • ISBN:978-3-319-41754-7
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9612
文摘
In this paper, we study query expansion strategies that improve the relevance of retrieved documents in a news and social media monitoring system, which performs real-time searches based on complex queries. We propose a two-step retrieval strategy using textual features such as bi-gram word dependencies, proximity, and expansion terms. We compare two different methods for query expansion: (1) based on word co-occurrence information; (2) using semantically-related expansion terms. We evaluate our methods and compare them with the baseline version of the system by crowdsourcing user-centric tasks. The results show that word co-occurrence outperforms semantic query expansion, and improves over the baseline in terms of relevance and utility.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700