Image classification by multimodal subspace learning

Authors: Jun Yu^a (yujun@xmu.edu.cn); Feng Lin^b; Hock-Soon Seah^b; Cuihua Li^a; Ziyu Lin^a
Keywords: Subspace; Image classification; Semi-supervised learning; Multimodality
Journal: Pattern Recognition Letters
Year: 2012
Journal code: 101_01678655
Category: cp
Publication date: 1 July 2012
Volume: 33
Issue: 9
Pages: 1196-1204
File size: 1087 K

Abstract

Recent years have seen a surge of interest in subspace learning for image classification. However, previous methods lack high accuracy because they do not consider multiple features of the images. For instance, a color image can be represented by a set of visual features that describe its color, texture, and shape. Building on the "Patch Alignment" Framework, we developed a new subspace learning method, termed Semi-Supervised Multimodal Subspace Learning (SS-MMSL), which encodes different features from different modalities to build a meaningful subspace. In particular, the new method uses the discriminative information in the labeled data to construct local patches and aligns these patches to obtain the optimal low-dimensional subspace for each modality. During local patch construction, the data distribution revealed by the unlabeled data is used to enhance the subspace learning. To find a low-dimensional subspace in which the distribution of each modality is sufficiently smooth, SS-MMSL adopts an alternating, iterative optimization algorithm that exploits the complementary characteristics of the different modalities. The iterative procedure reaches the global minimum of the criterion owing to the strong convexity of the criterion. Our experiments on image classification and cartoon retrieval demonstrate the validity of the proposed method.
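
The alternating optimization mentioned in the abstract can be illustrated with a minimal sketch. It is not the SS-MMSL algorithm itself: the abstract does not give the objective, so the code assumes a generic criterion of the form sum_i alpha_i^r * tr(Y^T L_i Y) with Y^T Y = I and the weights alpha_i summing to 1, and it uses plain unsupervised k-nearest-neighbor graph Laplacians as stand-ins for the paper's semi-supervised patch alignment matrices. The function names (`knn_laplacian`, `alternating_embedding`) and the parameters `k`, `r`, `dim`, and `n_iter` are hypothetical.

```python
import numpy as np

def knn_laplacian(X, k=5):
    """Unnormalized Laplacian of a symmetrized k-nearest-neighbor graph.

    Stand-in for a per-modality alignment matrix (an assumption of this sketch).
    """
    n = X.shape[0]
    # Pairwise squared Euclidean distances between samples (rows of X).
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d2, np.inf)  # a sample is never its own neighbor
    W = np.zeros((n, n))
    for i in range(n):
        W[i, np.argsort(d2[i])[:k]] = 1.0
    W = np.maximum(W, W.T)  # make the graph undirected
    return np.diag(W.sum(axis=1)) - W

def alternating_embedding(laplacians, dim=2, r=2.0, n_iter=20):
    """Alternate between the shared embedding Y and the modality weights alpha."""
    m = len(laplacians)
    alpha = np.full(m, 1.0 / m)  # start with equal weights
    for _ in range(n_iter):
        # Step 1: with alpha fixed, Y is given by the eigenvectors of the
        # weighted Laplacian that have the smallest eigenvalues.
        L = sum(a ** r * Li for a, Li in zip(alpha, laplacians))
        _, vecs = np.linalg.eigh(L)  # eigenvalues in ascending order
        Y = vecs[:, :dim]
        # Step 2: with Y fixed, alpha has a closed form from the Lagrangian
        # of the constraint sum(alpha) = 1 (requires r > 1).
        costs = np.array([np.trace(Y.T @ Li @ Y) for Li in laplacians])
        costs = np.maximum(costs, 1e-12)  # guard against division by zero
        w = (1.0 / costs) ** (1.0 / (r - 1.0))
        alpha = w / w.sum()
    return Y, alpha

# Two hypothetical modalities (e.g. color and texture features) of 30 samples.
rng = np.random.default_rng(0)
X_color = rng.normal(size=(30, 4))
X_texture = rng.normal(size=(30, 6))
Y, alpha = alternating_embedding([knn_laplacian(X_color), knn_laplacian(X_texture)])
```

In this sketch a modality whose graph is smoother under the current embedding (lower trace cost) receives a larger weight, which is one simple way to let modalities complement each other; the exponent `r` controls how strongly the weights concentrate on a single modality.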
