An extensive experimental comparison of methods for multi-label learning

详细信息	查看全文 \| 推荐本文 \|

作者：Gjorgji Madjarov^a ; ^b ; ^{gjorgji.madjarov@finki.ukim.mk} ; [Author Vitae] ; Dragi Kocev^b ; ^{Dragi.Kocev@ijs.si} ; [Author Vitae] ; Dejan Gjorgjevikj^a ; ^{dejan.gjorgjevikj@finki.ukim.mk} ; [Author Vitae] ; Sa&scaron ; o D啪eroski^b ; ^{Saso.Dzeroski@ijs.si} ; [Author Vitae]
关键词：Multi-label ranking ; Multi-label classification ; Comparison of multi-label learning methods
刊名：Pattern Recognition
出版年：2012
期刊代码：100_00313203
类别：cp
出版时间：September, 2012
卷：45
期：9
页码：3084-3104
文件大小：568 K

摘要

Multi-label learning has received significant attention in the research community over the past few years: this has resulted in the development of a variety of multi-label learning methods. In this paper, we present an extensive experimental comparison of 12 multi-label learning methods using 16 evaluation measures over 11 benchmark datasets. We selected the competing methods based on their previous usage by the community, the representation of different groups of methods and the variety of basic underlying machine learning methods. Similarly, we selected the evaluation measures to be able to assess the behavior of the methods from a variety of view-points. In order to make conclusions independent from the application domain, we use 11 datasets from different domains. Furthermore, we compare the methods by their efficiency in terms of time needed to learn a classifier and time needed to produce a prediction for an unseen example. We analyze the results from the experiments using Friedman and Nemenyi tests for assessing the statistical significance of differences in performance. The results of the analysis show that for multi-label classification the best performing methods overall are random forests of predictive clustering trees (RF-PCT) and hierarchy of multi-label classifiers (HOMER), followed by binary relevance (BR) and classifier chains (CC). Furthermore, RF-PCT exhibited the best performance according to all measures for multi-label ranking. The recommendation from this study is that when new methods for multi-label learning are proposed, they should be compared to RF-PCT and HOMER using multiple evaluation measures.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700