High-level semantic image annotation based on hot Internet topics
详细信息    查看全文
  • 作者:Xiaoru Wang ; Junping Du ; Shuzhe Wu ; Xu Li ; Haiming Xin…
  • 关键词:Abstract semantics ; Complex graph ; High ; level semantics ; Hot Internet topic ; Hypergraph ; Image annotation
  • 刊名:Multimedia Tools and Applications
  • 出版年:2015
  • 出版时间:March 2015
  • 年:2015
  • 卷:74
  • 期:6
  • 页码:2055-2084
  • 全文大小:1,703 KB
  • 参考文献:1. Blei, DM, Ng, AY, Jordan, MI (2003) Latent dirichlet allocation. J Mach Learn Res 3: pp. 993-1022
    2. Chang, E, Goh, K, Sychay, G, Wu, G (2003) Cbsa: content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Trans Circuits Syst Video Technol 13: pp. 26-38 CrossRef
    3. Chua, TS, Tang, J, Hong, R, Li, H, Luo, Z, Zheng, Y (2009) Nus-wide: a real-world web image database from national university of singapore. Proceedings of the ACM international conference on image and video retrieval, CIVR -9. ACM, New York, pp. 1-48 CrossRef
    4. Cusano C, Ciocca G, Schettini R (2003) Image annotation using svm, pp 330-38. doi:10.1117/12.526746
    5. Dai W, Chen Y, Xue GR, Yang Q, Yu Y (2009) Translated learning: transfer learning across different feature spaces. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) Advances in neural information processing systems, 21, pp 353-60
    6. Deschacht K, Moens MF (2007) Text analysis for automatic image annotation. In: Proceedings of the 45th annual meeting of the association of computational linguistics. Association for Computational Linguistics, Prague, pp 1000-007
    7. Duygulu, P, Barnard, K, Freitas, J, Forsyth, D Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In: Heyden, A, Sparr, G, Nielsen, M, Johansen, P eds. (2006) Computer vision ECCV 2002. Springer, Berlin Heidelberg, pp. 97-112 CrossRef
    8. Eakins JP (1996) Automatic image content retrieval—are we getting anywhere? De Montfort University, Milton Keynes, pp 123-35
    9. Feng S, Manmatha R, Lavrenko V (2004) Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer vision and pattern recognition, 2004. CVPR 2004, vol?2, pp II-002–II-009. doi:10.1109/CVPR.2004.1315274
    10. Griffiths, TL, Steyvers, M (2004) Finding scientific topics. Proc Natl Acad Sci USA 101: pp. 5228-5235 CrossRef
    11. Jeon J, Lavrenko V, Manmatha R (2003) Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in informaion retrieval, ACM, New York, SIGIR -3, pp 119-26. doi:10.1145/860435.860459
    12. Kang F, Jin R (2005) Symmetric statistical translation models for automatic image annotation. In: Proceedings of the 2005 SIAM international conference on data mining, pp 21-3
    13. Kennedy, LS, Chang, SF, Kozintsev, IV (2006) To search or to label?: predicting the performance of search-based automatic image classifiers. Proceedings of the 8th ACM international workshop on multimedia information retrieval, MIR -6. ACM, New York, pp. 249-258 CrossRef
    14. Lavrenko, V, Manmatha, R, Jeon, J A model for learning the semantics of pictures. In: Thrun, S, Saul, L, Sch?lkopf, B eds. (2004) Advances in neural information processing systems 16. MIT Press, Cambridge, MA, pp. 553-560
    15. Li, J, Wang, J (2003) Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans Pattern Anal Mach Intell 25: pp. 1075-1088 CrossRef
    16. Li, X, Snoek, CG, Worring, M (2008) Learning tag relevance by neighbor voting for social image retrieval. Proceedings of the 1st ACM international conference on multimedia information retrieval, MIR -8. ACM, New York, pp. 180-187 CrossRef
    17. Liu, J, Wang, B, Li, M, Li, Z, Ma, W, Lu, H, Ma, S (2007) Dual cross-media relevance model for image annotation. Proceedings of the 15th international conference on multimedia, MULTIMEDIA -7. ACM, New York, pp. 5-614
    18. Long B, Zhang M, Yu PS, Xu T (2008) Clustering on complex graphs. In: Proceedings of the 23rd national conference on artificial intelligence, vol 2, AAAI-8. AAAI Press, pp 659-64
    19. Lowe, DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60: pp. 91-110
文摘
Images are complex multimedia data that contain rich semantic information. Currently, most of image annotation algorithms are only annotating the object semantics of images. There are still many challenges on high-level semantic image annotation. The major issues are the lack of effective modeling method for the high-level semantics of images and the lack of efficient dynamic update mechanism for the training set. To address these issues, we propose a high-level semantic annotation method based on hot Internet topics in this paper. There are two independent sub tasks in our method: dynamic update of the training set based on hot Internet topics and search-based image annotation. In the first sub task, we propose to model the abstract semantics of images based on three relationships: image–to–image similarity relationship, topic–to–topic co-occurrence relationship, and image–to–topic relevance relationship. Through the complex graph clustering, the hot Internet topics are extracted for images with consistent visual and semantic contents. Then the dynamic update mechanism will update the original training set with the new topics and images. It avoids the huge computing cost in traditional update methods and does not need to re-calculate the whole mapping relationship between the semantic concepts and visual features. In the second sub task, given a query image, it first searches for similar candidates in the annotated training set via visual features. Then the hypergraph modeling and spectral clustering are exploited to filter out the images with irrelevant semantics. The keywords will be extracted for annotation from the remaining images according to an annotation probability. Extensive experiments have been conducted and the results demonstrate that our algorithm could achieve better annotation performance than the state-of-the-art algorithms. And the update mechanism could extend the training set efficiently so that the coverage of the semantics in the training set wouldn’t be obsolete.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700