Evidence-based uncertainty sampling for active learning
详细信息    查看全文
  • 作者:Manali Sharma ; Mustafa Bilgic
  • 关键词:Active learning ; Uncertainty sampling ; Classification
  • 刊名:Data Mining and Knowledge Discovery
  • 出版年:2017
  • 出版时间:January 2017
  • 年:2017
  • 卷:31
  • 期:1
  • 页码:164-202
  • 全文大小:
  • 刊物类别:Computer Science
  • 刊物主题:Data Mining and Knowledge Discovery; Artificial Intelligence (incl. Robotics); Information Storage and Retrieval; Statistics for Engineering, Physics, Computer Science, Chemistry and Earth Sciences;
  • 出版者:Springer US
  • ISSN:1573-756X
  • 卷排序:31
文摘
Active learning methods select informative instances to effectively learn a suitable classifier. Uncertainty sampling, a frequently utilized active learning strategy, selects instances about which the model is uncertain but it does not consider the reasons for why the model is uncertain. In this article, we present an evidence-based framework that can uncover the reasons for why a model is uncertain on a given instance. Using the evidence-based framework, we discuss two reasons for uncertainty of a model: a model can be uncertain about an instance because it has strong, but conflicting evidence for both classes or it can be uncertain because it does not have enough evidence for either class. Our empirical evaluations on several real-world datasets show that distinguishing between these two types of uncertainties has a drastic impact on the learning efficiency. We further provide empirical and analytical justifications as to why distinguishing between the two uncertainties matters.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700