Local visual feature fusion via maximum margin multimodal deep neural network

Authors: Zhiquan Ren; Yue Deng; Qionghai Dai
Keywords: Image categorization; Deep learning; Feature fusion; Discriminative learning
Journal: Neurocomputing
Publication year: 2016
Publication date: 29 January 2016
Volume: 175
Issue: Part A
Pages: 427-432
Full-text size: 749 K

Abstract

In this letter, we aim to improve image categorization performance by exploiting multiple local descriptors extracted from each image. To this end, we propose a novel deep learning architecture, the maximum margin multimodal deep neural network (3mDNN), which learns a joint feature representation from different data views. The local feature representations encoded by 3mDNN have two significant advantages: (1) they incorporate information from multiple descriptors, and (2) they are discriminative. The whole deep architecture is trained with the standard back propagation (BP) method, and its performance is verified on three benchmark image datasets.
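
The abstract does not give the layer layout or the exact loss, so the following is only a minimal sketch of the general idea, not the authors' 3mDNN. It assumes two descriptor views with hypothetical dimensions (128 and 59), one view-specific hidden layer per descriptor, a shared joint layer over the concatenated hidden activations, and a Crammer-Singer style multiclass hinge loss standing in for the maximum margin objective. All function names, layer sizes, and the choice of tanh are illustrative assumptions; only the forward pass and the loss are shown, with gradients (the BP step) left out.

```python
import numpy as np

def init_params(dims_in, dim_hidden, dim_joint, n_classes, seed=0):
    """Random weights for a toy multimodal network (all sizes are assumptions)."""
    rng = np.random.default_rng(seed)
    return {
        # One (W, b) pair per descriptor view.
        "view": [(rng.normal(0, 0.1, (d, dim_hidden)), np.zeros(dim_hidden))
                 for d in dims_in],
        # Joint layer over the concatenated view-specific activations.
        "joint": (rng.normal(0, 0.1, (dim_hidden * len(dims_in), dim_joint)),
                  np.zeros(dim_joint)),
        # Linear scoring layer, one score per class.
        "out": (rng.normal(0, 0.1, (dim_joint, n_classes)), np.zeros(n_classes)),
    }

def forward(params, views):
    """Encode each view, fuse them, and return (joint features, class scores)."""
    hidden = [np.tanh(x @ W + b) for x, (W, b) in zip(views, params["view"])]
    Wj, bj = params["joint"]
    joint = np.tanh(np.concatenate(hidden, axis=1) @ Wj + bj)
    Wo, bo = params["out"]
    return joint, joint @ Wo + bo

def multiclass_hinge_loss(scores, labels, margin=1.0):
    """Mean over samples of max(0, margin + best wrong score - true score)."""
    n = scores.shape[0]
    true_scores = scores[np.arange(n), labels][:, None]
    violations = np.maximum(0.0, scores - true_scores + margin)
    violations[np.arange(n), labels] = 0.0  # the true class never counts
    return violations.max(axis=1).mean()

# Toy batch: 4 local patches, two hypothetical descriptor views, 3 classes.
rng = np.random.default_rng(1)
views = [rng.normal(size=(4, 128)), rng.normal(size=(4, 59))]
labels = np.array([0, 1, 2, 1])
params = init_params([128, 59], dim_hidden=32, dim_joint=16, n_classes=3)
joint, scores = forward(params, views)
loss = multiclass_hinge_loss(scores, labels)
```

In this sketch, `joint` plays the role of the fused local feature: it depends on both descriptor views, and minimizing the hinge loss with respect to all weights would push it toward being discriminative.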
