Using query logs of USPTO patent examiners for automatic query expansion in patent searching
详细信息    查看全文
  • 作者:Wolfgang Tannebaum (1)
    Andreas Rauber (1)
  • 关键词:Patent searching ; Query expansion ; Query log analysis
  • 刊名:Information Retrieval
  • 出版年:2014
  • 出版时间:October 2014
  • 年:2014
  • 卷:17
  • 期:5-6
  • 页码:452-470
  • 全文大小:366 KB
  • 参考文献:1. Alberts, D., Yang, C., Fobare-DePonio, D., Koubek, K., Robins, S., Rodgers, M., Simmons, E., & De Marco, D. (2011). Introduction to patent searching. In M. Lupu, K. Mayer, J. Tait, & A. J. Trippe (Eds.), / Current challenges in patent information retrieval. The information retrieval series (Vol. 29, pp. 3鈥?3). Springer.
    2. Amitay, E., & Broder, A. (2008). Introduction to special issue on query log analysis: Technology and ethics. In / ACM Trans. Web 2, Article 18.
    3. Azzopardi, L., Vanderbauwhede, W., & Joho, H. (2010). Search system requirements of patent analysts. In / Proceeding of the 33rd international ACM SIGIR conference on research and development in information retrieval (SIGIR 2010). Geneva, Switzerland, pp. 775鈥?76.
    4. Bashar, A., & Myaeng, S. (2011). Query phrase expansion using wikipedia in patent class search. In / Proceedings of the 7th Asia conference on information retrieval technology (AIRS鈥?1). Dubai, United Arab Emirates, pp. 115鈥?26.
    5. Clough, P., & Berendt, B. (2009). Report on the treble CLEF query log analysis workshop 2009. / SIGIR Forum, / 43, 71鈥?7. CrossRef
    6. De Marco, D. (2011). / Plumbing the depths of examiner search (il)-logic: A patent searching perspective. Presentation given at PIUG 2011 Northeast Conference, New Brunswick (New Jersey), USA. http://www.demarcoip.com/wordpress1/wp-content/uploads/2013/07/ExaminerSearch.pdf.
    7. Fujita, S. (2007). Technology survey and invalidity search: An comparative study of different tasks for Japanese patent document retrieval. / Information Processing and Management, An International Journal, / 42(5), 1154鈥?172. CrossRef
    8. Garside, R., & Smith, N. (1997). A hybrid grammatical tagger: CLAWS4. In R. Garside, G. Leech, & A. McEnery (Eds.), / Corpus annotation: Linguistic information from computer text corpora (pp. 102鈥?21). London: Longman.
    9. Hang, C., Ji-Rong, W., Jian-Yun, N., & Wei-Ying, M. (2002) Probabilistic query expansion using query logs. In / Proceedings of the 11th international conference on world wide web (WWW 2002). Hawaii, USA, pp. 325鈥?32.
    10. Herbert, B., Szarvas, G., & Gurevych, I. (2009). Prior art search using international patent classification codes and all-claims-queries. In / Proceedings of the 10th cross-language evaluation forum conference on multilingual information access evaluation (CLEF 2009). Corfu, Greece, pp. 452鈥?59.
    11. Hunt, D., Nyugen, L., & Rodgers, M. (2007). / Patent searching: Tools & techniques. Hoboken: Wiley.
    12. Jochim, C., Lioma, C., Sch眉tze, H. (2011). Expanding queries with term and phrase translations in patent retrieval. In / Proceedings of the second international conference on multidisciplinary information retrieval facility (IRFC 2011). Vienna, Austria, pp. 16鈥?9.
    13. Jochim, C., Lioma, C., Sch眉tze, H., Koch, S., & Ertl, T. (2010). Preliminary study into query translation for patent retrieval. In / Proceedings of the patent information retriveal workshop (PaIR 2011). Toronto, Canada, pp. 57鈥?6.
    14. J眉rgens, J., Hansen, P., & Womser-Hacker, C. (2012). Going beyond CLEF-IP: The 鈥榬eality鈥?for patent searchers? In / Proceedings of the third international conference of the CLEF initiative (CLEF 2012). Rome, Italy, pp. 30鈥?5.
    15. Kato, M., Sakai, T., & Tanaka, K. (2013). When do people use query suggestion? A query suggestion log analysis. / Information Retrieval, / 16(6), 1鈥?2.
    16. Konishi, K. (2005). Query terms extraction form patent documents for invalidity search. In / Proceedings of NTCIR 2005: NTCIR-5 workshop meeting. Tokyo, Japan.
    17. Kunpeng, Z., Xiaolong, W., & Yuanchao, L. (2009). A new query expansion method based on query logs mining. / International Journal on Asian Language Processing, / 19, 1鈥?2.
    18. Magdy, W., & Jones, G. J. F. (2011). A study of query expansion methods for patent retrieval. In / Proceedings of PaIR 2011. Glasgow, Scotland, pp. 19鈥?4.
    19. Mahdabi, P., & Crestani, F. (2011). Learning-based pseudo-relevance feedback for patent retrieval. In / Proceedings of the second international conference on multidisciplinary information retrieval facility (IRFC 2011). Vienna, Austria, pp. 1鈥?1.
    20. Mahdabi, P., Keikha, M., Gerani, S., Landoni, M., & Crestani, F. (2011). Building queries for prior-art search. In / Proceedings of the second international conference on multidisciplinary information retrieval facility (IRFC 2011). Vienna, Austria, pp. 3鈥?5.
    21. Miller, G. (1995). WordNet: A lexical database for english. / Communications of the ACM, / 38(11), 39鈥?1. CrossRef
    22. Russo, D. 2011. Knowledge extraction from patent: Achievements and open problems. A multidisciplinary approach to find functions. In / Proceedings of the 20th CIRP design conference (CIRP Design 2012). Nantes, France, pp. 567鈥?76.
    23. Sekine, S., & Suzuki, H. (2007). Acquiring ontological knowledge from query logs. In / Proceedings of the 16th international conference on world wide web (WWW 2007). Banff, Canada, pp. 1223鈥?224.
    24. Silvestri, F. (2010). Mining query logs: Turning search usage data into knowledge. / Foundations and Trends in Information Retrieval, / 4(1鈥?), 1鈥?74. CrossRef
    25. Tannebaum, W., & Rauber, A. (2012). Acquiring lexical knowledge from query logs for query expansion in patent searching. In / The IEEE sixth international conference on semantic computing (IEEE ICSC 2012), Italy, Palermo, pp. 336鈥?38.
    26. Tannebaum, W., & Rauber, A. (2012). Analyzing query logs of USPTO examiners to identify useful query terms in patent documents: A preliminary study. In / Proceedings of the information retrieval facility conference (IRFC 2012). Vienna, Austria, pp. 127鈥?36.
    27. Xue, X., & Croft, W. (2009). Automatic query generation for patent search. In / Proceedings of CIKM 2009. Hong Kong, China, pp. 2037鈥?040.
    28. Xue, X., Croft, W. (2009). Transforming patents into prior-art queries. In / Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval. Boston, USA, pp 808鈥?80.
    29. Zhang, J., Xiong, M., & Yu, Y. (2006). Mining query log to assist ontology learning from relational database. In / Proceedings of the 8th Asia Pacific web conference (APWeb 2006). Harbin, China, pp. 437鈥?48.
  • 作者单位:Wolfgang Tannebaum (1)
    Andreas Rauber (1)

    1. Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
  • ISSN:1573-7659
文摘
In the patent domain significant efforts are invested to assist researchers in formulating better queries, preferably via automated query expansion. Currently, automatic query expansion in patent search is mostly limited to computing co-occurring terms for the searchable features of the invention. Additional query terms are extracted automatically from patent documents based on entropy measures. Learning synonyms in the patent domain for automatic query expansion has been a difficult task. No dedicated sources providing synonyms for the patent domain, such as patent domain specific lexica or thesauri, are available. In this paper we focus on the highly professional search setting of patent examiners. In particular, we use query logs to learn synonyms for the patent domain. For automatic query expansion, we create term networks based on the query logs specifically for several USPTO patent classes. Experiments show good performance in automatic query expansion using these automatically generated term networks. Specifically, with a larger number of query logs for a specific patent US class available the performance of the learned term networks increases.
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.