Weakly Supervised Learning of Objects, Attributes and Their Associations


Authors: Zhiyuan Shi (19)
Yongxin Yang (19)
Timothy M. Hospedales (19)
Tao Xiang (19)
Keywords: Weakly supervised learning; object attribute associations
Publication: Lecture Notes in Computer Science
Year of publication: 2014
Volume: 8690
Issue: 1
Pages: 472-487
Full-text size: 2,279 KB
Author affiliation:
19. Queen Mary, University of London, London, E1 4NS, UK
ISSN: 1611-3349

Abstract

When humans describe images, they tend to use combinations of nouns and adjectives, corresponding to objects and their associated attributes respectively. To generate such a description automatically, one needs to model objects, attributes and their associations. Conventional methods require strong annotation of object and attribute locations, which limits their scalability. In this paper, we model object-attribute associations from weakly labelled images, such as those widely available on media sharing sites (e.g. Flickr), where only image-level labels (objects or attributes) are given, without their locations or associations. This is achieved by introducing a novel weakly supervised non-parametric Bayesian model. Once learned, the model can describe a new image, including its objects, attributes and their associations, as well as their locations and segmentation. Extensive experiments on benchmark datasets demonstrate that our weakly supervised model performs on par with strongly supervised models on tasks such as image description and retrieval based on object-attribute associations.
