Multiple kernel and multi-label learning for image categorization.
详细信息   
  • 作者:Bucak ; Serhat Selcuk.
  • 学历:Doctor
  • 年:2014
  • 毕业院校:Michigan State University
  • Department:Computer Science - Doctor of Philosophy.
  • ISBN:9781303879937
  • CBH:3619202
  • Country:USA
  • 语种:English
  • FileSize:2811194
  • Pages:227
文摘
One crucial step towards the goal of converting large image collections to useful information sources is image categorization. The goal of image categorization is to find the relevant labels for a given an image from a closed set of labels. Despite the huge interest and significant contributions by the research community,there remains much room for improvement in the image categorization task. In this dissertation,we develop efficient multiple kernel learning and multi-label learning algorithms with high prediction performance for image categorization. There are many image representation methods in the literature. However,it is not possible to pick one as the best method for image categorization,since different representations work better in different scenarios. Multiple kernel learning MKL),a natural extension of kernel methods for information fusion,is often used by researchers to improve image representation by integrating it to the learning step for selecting and combining different image features. MKL is mostly considered as a binary classification tool,and it is difficult to scale up MKL when the number of labels is large. We address this computational challenge by developing a framework for MKL that aims to learn a single kernel combination,which benefits all classes,by combining a worst-case analysis with stochastic approximation. Another contribution of this dissertation is presenting efficient multi-label learning algorithms. Multi-label learning is arguably the most suitable formulation for the image categorization task. Many researchers have employed decomposition methods,particularly one-vs-all framework,with SVM support vector machines) as a base classifier for addressing the image categorization problem. However,the decomposition methods have several shortcomings,such as their inability to exploit label correlations. Further,they suffer from imbalanced data distributions when the number of labels is large. Our contribution is to address multi-label learning via a ranking approach,termed multi-label ranking. Given a test image,multi-label ranking algorithms aim to order all the image classes such that the relevant classes are ranked higher than the irrelevant ones. The advantage of the proposed multi-label ranking approach,termed MLR-L 1 multi-label ranking with L1 norm),over other multi-label learning methods is its computational efficiency and high prediction performance. Image categorization is a supervised learning task,thus requiring a large set of training images annotated by humans. Unfortunately,labeling is an expensive process,and it is often the case that the annotators provide a limited set of labels,meaning that they only give a small subset of relevant tags for an image. One of the contributions of this dissertation is defining the problem of multi-label learning with incomplete class assignments and presenting a robust multi-label ranking algorithm,termed MLR-GL multi-label ranking with group lasso norm),that addresses the challenge of learning from incompletely labeled data. Finally,we present a multiple kernel multi-label ranking algorithm to simultaneously address two essential factors for improving the performance of image categorization: Heterogeneous information fusion,and exploiting label correlation of multi-label data. We propose a multiple kernel multi-label ranking method that learns a shared sparse kernel combination that benefits all image classes. This way,we not only improve the training and prediction efficiency,but also improve the accuracy,particularly for classes with a small number of samples,by enabling information sharing between classes. We integrate the proposed MLR-L1 algorithm with an efficient semi-infinite linear programming SILP) based MKL solver and develop a computationally efficient wrapper algorithm,termed MK-MLR multiple kernel multi-label ranking).

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700