Dog Breed Classification Using Part Localization

详细信息查看全文

作者：Jiongxin Liu (21)
Angjoo Kanazawa (22)
David Jacobs (22)
Peter Belhumeur (21)
刊名：Lecture Notes in Computer Science
出版年：2012
出版时间：2012
年：2012
卷：7572
期：1
页码：186-199
全文大小：2162KB
参考文献：1. Spady, T.C., Ostrander, E.A.: Canine behavioral genetics: Pointing out the phenotypes and herding up the genes. AJHG聽82(1), 10鈥?8 (2008) CrossRef
2. Branson, S., Wah, C., Schroff, F., Babenko, B., Welinder, P., Perona, P., Belongie, S.: Visual Recognition with Humans in the Loop. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol.聽6314, pp. 438鈥?51. Springer, Heidelberg (2010) CrossRef
3. Nilsback, M.E., Zisserman, A.: Automated flower classification over a large number of classes. In: Proc. 6th Indian Conf. on Computer Vision, Graphics and Image Processing, pp. 722鈥?29 (2008)
4. Farrell, R., Oza, O., Zhang, N., Morariu, V., Darrell, T., Davis, L.: Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance. In: Proc. ICCV (2011)
5. Belhumeur, P.N., Chen, D., Feiner, S.K., Jacobs, D.W., Kress, W.J., Ling, H., Lopez, I., Ramamoorthi, R., Sheorey, S., White, S., Zhang, L.: Searching the World鈥檚 Herbaria: A System for Visual Identification of Plant Species. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol.聽5305, pp. 116鈥?29. Springer, Heidelberg (2008) CrossRef
6. Belhumeur, P.N., Jacobs, D.W., Kriegman, D.J., Kumar, N.: Localizing parts of faces using a consensus of exemplars. In: Proc. CVPR (2011)
7. Csurka, G., Dance, C.R., Fan, L., Willamowski, J.: Visual categorization with bags of keypoints. In: Work. on Stat. Learning in Comp. Vis., ECCV, pp. 1鈥?2 (2004)
8. Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: Proc. ICCV (2005)
9. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. CVPR, pp. 2169鈥?178 (2006)
10. Gehler, P., Nowozin, S.: On feature combination for multiclass object classification. In: Proc. CVPR (2009)
11. Wang, Z., Hu, Y., Chia, L.-T.: Image-to-Class Distance Metric Learning for Image Classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol.聽6311, pp. 706鈥?19. Springer, Heidelberg (2010) CrossRef
12. Deselaers, T., Ferrari, V.: Visual and semantic similarity in imagenet. In: Proc. CVPR (2011)
13. Sadeghi, M.A., Farhadi, A.: Recognition using visual phrases. In: Proc. CVPR (2011)
14. Yao, B., Khosla, A., Fei-Fei, L.: Combining randomization and discrimination for fine-grained image categorization. In: Proc. CVPR (2011)
15. Bourdev, L., Maji, S., Brox, T., Malik, J.: Detecting People Using Mutually Consistent Poselet Activations. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol.聽6316, pp. 168鈥?81. Springer, Heidelberg (2010) CrossRef
16. Parkhi, O., Vedaldi, A., Zisserman, A., Jawahar, C.: Cats and dogs. In: Proc. CVPR (2012)
17. Viola, P., Jones, M.: Robust real-time object detection. IJCV聽57, 137鈥?54 (2001) CrossRef
18. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. CVPR, vol.聽1, pp. 886鈥?93 (2005)
19. Parkhi, O., Vedaldi, A., Jawahar, C.V., Zisserman, A.: The truth about cats and dogs. In: Proc. ICCV (2011)
20. Cristinacce, D., Cootes, T.: Feature detection and tracking with constrained local models. In: Proc. BMVC, pp. 929鈥?38 (2006)
21. Milborrow, S., Nicolls, F.: Locating Facial Features with an Extended Active Shape Model. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol.聽5305, pp. 504鈥?13. Springer, Heidelberg (2008) CrossRef
22. Saragih, J.M., Lucey, S., Cohn, J.F.: Face alignment through subspace constrained mean-shifts. In: Proc. ICCV (2009)
23. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 20 (2004)
24. Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: Proc. ICCV (2009)
25. Yin, Q., Tang, X., Sun, J.: An associate-predict model for face recognition. In: Proc. CVPR, pp. 497鈥?04 (2011)
26. Arca, S., Campadelli, P., Lanzarotti, R.: A face recognition system based on automatically determined facial fiducial points. Pattern Recognition聽39, 432鈥?43 (2006) CrossRef
27. Campadelli, P., Lanzarotti, R., Lipori, G.: Precise eye localization through a general-to-specific model definition. In: Proc. BMVC (2006)
28. Vidaldi, A., Zisserman, A.: Image classification practical (2011), http://www.robots.ox.ac.uk/~vgg/share/practical-image-classification.htm
29. Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proc. ICCV, pp. 606鈥?13 (2009)
30. Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: Proc. CVPR, pp. 3360鈥?367 (2009)
作者单位：Jiongxin Liu (21)
Angjoo Kanazawa (22)
David Jacobs (22)
Peter Belhumeur (21)

21. Columbia University, USA
22. University of Maryland, USA

文摘

We propose a novel approach to fine-grained image classification in which instances from different classes share common parts but have wide variation in shape and appearance. We use dog breed identification as a test case to show that extracting corresponding parts improves classification performance. This domain is especially challenging since the appearance of corresponding parts can vary dramatically, e.g., the faces of bulldogs and beagles are very different. To find accurate correspondences, we build exemplar-based geometric and appearance models of dog breeds and their face parts. Part correspondence allows us to extract and compare descriptors in like image locations. Our approach also features a hierarchy of parts (e.g., face and eyes) and breed-specific part localization. We achieve 67% recognition rate on a large real-world dataset including 133 dog breeds and 8,351 images, and experimental results show that accurate part localization significantly increases classification performance compared to state-of-the-art approaches.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700