基于特征的图像检索技术研究

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

基于特征的图像检索技术研究

详细信息本馆镜像全文| 推荐本文 | | 获取CNKI官网全文

英文题名：Study on Image Retrieval Based on Feature Representation
作者：高彦彦
论文级别：博士
学科专业名称：信号与信息处理
中文关键词：图像检索 ; 特征表示 ; 显著点 ; CNETRIST ; LBP ; 特征包
英文关键词：image retrieval ; feature representation ; salient point ; CENTRIST ; LBP ; Bag-of-Features
学位年度：2012
导师：郭军
学科代码：081002
学位授予单位：北京邮电大学
论文提交日期：2012-09-12

摘要

伴随着网络和多媒体获取技术的快速发展,数字图像的数量急剧增长。如何从海量数字图像集合中为用户快速检索目标图像,已成为信息领域亟待解决的关键问题。在此背景下,图像检索技术近年来迅猛发展,引起了学术界和产业界的高度关注。现有的搜索引擎普遍采用传统的基于文本标注的图像搜索。这种方法具有人工标注耗时费力、主观多义的局限,已经难以适用海量网络数据库的检索需求。为此,研究人员提出了基于内容的图像检索来克服这一局限。基于内容的图像检索直接从图像本身出发考虑,尽可能地从与人类视觉相似的角度去描述图片的固有属性,查找与查询图片相似的图像集合提供给用户。在这一过程中,图像的特征表示是核心问题,直接影响基于内容的检索系统的性能。
     本文从全局和局部特征两方面研究分析了常用的图像特征表示,并在以下几方面开展了创新性工作。本文的主要工作包括以下几点：
     1.基于颜色特征和纹理特征的图像检索算法
     单一特征的检索由于特征本身对图像描述的局限性,在复杂图像库上完成检索时,不能得到较好的检索效果。本文综合颜色特征和LBP纹理特征的实现了图像检索。实验结果表明,基于单一特征的检索算法,在完成特征内部归一化和特征间归一化后,其结果优于未进行特征归一化的算法,表明归一化的必要性；在结合多特征后,其检索效率明显上升。
     2.提出了一种基于双树复数小波的显著点检测算法,并结合图像的局部和全局特征实现了图像检索
     本文提出的基于双树复数小波变换的显著点检测算法是在图像变换后的多尺度空间上完成的。利用同尺度层内多个方向上的复小波分解系数得到能量图,提取能量图中的局部极值点作为图像显著点。在此基础上,对显著点所在区域进行环形分割,统计每个环形区域内的显著点邻域的颜色直方图；计算显著点的离散度作为其几何特征；并结合图像复小波变换多尺度层上的幅值和相位特征完成图像描述,实现图像检索。实验部分先分析了Harris角点检测、基于高斯差分的显著点检测与本文提出的检测算法的时间复杂度；继而在Oxford数据集上利用重复率对三种检测算法的性能进行评价；并在Corel库上实现图像检索。
     3.不同局部纹理特征的理论比较和在图像检索上的比较
     本文从理论方面分析了CENTRIST和LBP两种相似的纹理特征表示的同异。同时,结合具有空间分布信息的多尺度金字塔分割,实现了基于CENTRIST的图像检索,并在两个数据库上完成了实验。实验结果先是表明直方图交叉核比欧式距离更适用于CENTRIST特征表示。为了确定两者差异的重要性,本文提出了与CENTRIST更接近的逆LBP的特征表示。实验验证三种纹理特征中CENTRIST的性能最优,也揭示了CENTRIST与LBP的最大区别在于其在图像相邻像素之间具有约束性和传导性。
     4.提出了一种对特征包表示进行动态加权的算法
     图像的特征包表示利用视觉词的无序组合表示一幅图像,属于中层语义特征,已被成功地应用于大尺度图像库的检索和分类。对特征包的改进涉及特征描述,视觉词典的建立,特征量化和后处理等多个方面。本文通过分析相似图像之间特征包表示,提出了一种基于相关反馈的动态加权的算法,并在两个数据集上定量地分析了改进算法,同时与基本的基于特征包的图像检索算法相比较。实验结果表明本算法有效提升了前N个返回图像的准确率。
With the development of Internet and the acquisition technology of multimedia, the amount of digital images is increasing tremendously. How to retrieve the target images quickly and correctly from large scale image database for users is the key problem to be solved exigently in the domain of information. Under the circumstance, image retrieval has been developed and studied widely. Text-based image retrieval is popular in the web search engine, which is based on textual labels by human annotating. The disadvantages are time-consuming, subjectivity and ambiguity. And it is hard to meet the requirements of retrieval on large web database. To handle this problem, content-based image retrieval is proposed which aims to find the similar images with the query based on the inherent property of images. This techonolgy analyzes the images directly justlike the human visual system. Feature representation of image is the key process, and it plays a direct role on retrieval.
     This paper studied feature representations of image from two aspects of local and global. The main work including:
     1. Image retrieval based on color and texture features.
     Single feature-based image retrieval can not achieve higher precision and recall on complex image database because of its limitation. This paper exploits color histogram and LBP texture feature for image retrieval. The experiments show that internal and external normalizations' of features are efficient for retrieval based on single feature, and multi-features can bring higher precision and recall than single feature.
     2. A novel keypoint detector based on the dual tree complex wavelet transform is proposed. Based on this detector, local and global fetures are combined for image retrieval.
     This algorithm is performed in complex wavelet pyramid space of an image. It uses the intra-scale coefficients'product to obtain the energy map. And then extracts the local extrema of the map as the salient keypoints. Based on this detector, the image is divided into several concentric circles according to the distribution of salient points. And then, the annular color histogram is exploited to describe the local color information, the divergence is computed as the geometry feature of an image, and magnitude and phase features at different scales of the complex wavelet transform decomposition of image are extracted for image retrieval. The comparison of complexity among Harris detector, DoG-based detector and the detector proposed in this paper is analyzed. And the experiments on Oxford affine database are performed with the evaluation of repeatability. The results of image retrieval show the effectiveness of our proposed algorithm.
     3. Theory and empirical Comparisons between two local textural features on CBIR.
     In this paper, the differences between CENTRIST and LBP are analyzed on theory. And they are integrated with the spatial information by multi scale spatial pyramid for content-based image retrieval. The experimental results firstly show that the similarity of two images computed by histogram intersection can achieve better result than computed by Euclidean distance for CENTRIST descriptor. For analyzing the impact of the differences, reverse LBP is performed, which is more similar with CENTRIST. The results demonstrate the most important of differences between CENTRIST and LBP is that whether the constraints and the transitivity among neighbored pixels exist.
     4. A dynamically weighting scheme for Bag-of-features based image retrieval.
     Bag-of-Features (BoF) representation of image, which is a middle-level semantic feature, is consisted of an orderless collection of visual words, and successfully used in retrieval and classification of large scale image repository. Several extensions have been proposed that involve feature description, dictionary building, feature encoding and post-query process, etc. This paper proposes a dynamically weighting scheme for BoF-based image retrieval based on feedback. We quantitatively evaluate the proposed method on two different databases. Experiments confirm that the proposed weighting scheme has better performance than the baseline of BoF-based image retrieval systems. Meanwhile, the results demonstrate the effectiveness of the weighting scheme in terms of the precision of top-N returned images.

引文

[1]Ritendra Datta, Jia Li, James Z. Wang, "Content-based image retrieval:approaches and trends of the new age",7th ACM SIGMM international workshop on Multimedia information retrieval,2005: 253-262.
    [2]Datta R, Jia Li, James Z. Wang. "Content-based image retrieval:approaches and trends of the new age," In ACM Computing Surveys,40(2),2008.
    [3]王惠锋,孙正兴,王箭.语义图像检索研究进展.计算机研究与进展,2002,39(5)：513-523.
    [4]李志欣,施智平,李志清等.图像检索中语义映射方法综述.计算机辅助设计与图形学学报,2008,20(8)：1085-1096.
    [5]M. Flickner, H. Sawhney et al., "Query by image and video content:the QBIC System", IEEE Computer,1995,28(9):23-32.
    [6]J.R.Bach, C.Fuller, A.Gupta et al., "The Virage Image Search Engine:An Open Framework for Image Management", In Proc. SPIE, Storage and Retrieval for Still Image and Video Databases IV, vol 2670. San Jose, CA, USA,1996:76-87
    [7]J.Dowe, "Content-based retrieval in multimedia imaging", SPIE Storageand Retrieval for Image and Video Database,1993,1908:164-167.
    [8]A. Pentland, R.W. Picard, and S. Sclaroff, "Photobook:tools for content-based manipulation of image database", Proc. SPIE,1994,2185:34-47.
    [9]A. Pentland, R.W. Picard, and S. Sclaroff, "Photobook:Tools for content-based manipulation of image databases", Storage and Retrieval for Image and Video Database H,1996:34-47.
    [10]J.R. Smith and S.F. Chang, "VisualSEEK:a fully automated content-based image query system", Proc. ACM Multimedia,1996:87-98.
    [11]Mehrotra S, Rui Y, Ortega Metal, "Supporting Content-t-based Queries over Image in MARS", Proc of IEEE International Conference on Multimedia Computing and Systems,1997:632-633.
    [12]Yong Rui, Thomas S. Huang and Sharad Mehrotra, "Content-based Image Retrieval With Relevance Feedback in MARS", In Proc. IEEE International Conference on Image Processing,1997,2:815-818.
    [13]J.Z. Wang, J. Li, and G. Wiederhold, "SIMPLIcity:semantics-sensitive integrated matching for picture libraries", IEEE Trans, on PAMI,2001,23(9):1-17.
    [14]Informedia, http://www.informedia.cs.cmu.edu/,2012-4-22.
    [15]R.Rahmani, S.A.Goldman, H.Zhang et al., "Localized content based image retrieval", IEEE Transaction on Pattern Analysis and Machine Intelligence,2008,30(11):1902-1912.
    [16]陈秀新,邢素霞.图像/视频检索与图像融合.机械工业出版社,2012.
    [17]MIRES, http://www.intsci.ac.cn/image/mires.html.
    [18]章毓晋.基于内容的视觉信息检索.北京：科学出版社,2003.
    [19]TinEye, http://www.tineye.com/.
    [20]百度识图,http://shitu.baidu.com/.
    [21]谷歌图片,http://www.google.com.hk/imghp?hl=zh-CN&tab=wi.
    [22]Picitup-Computer Vision Solution, http://www2.picitup.com/.
    [23]淘淘搜,http://www.taotaosou.com/.
    [24]图想,]http://imagine.taobao.com/imagine/index.htm.
    [25]Tiltomo, http://www.tiltomo.com/.
    [26]Incogna, http://www.incogna.com/.
    [27]Gazopa, http://www.gazopa.com/.
    [28]C. Harris and M. Stephens, "A combined corner and edge detector", Alvey Vision Conference, 1988:147-151.
    [29]S.M. Smith and J.M. Brady, "Susan-a new approach to low level image processing", International Journal of Computer Vision,1997,23(1):45-78.
    [30]Edward Rosten and Tom Drummond, "Machine learning for high-speed corner detection", European Conference on Computer Vision,2006,1:430-443.
    [31]Edward Rosten and Tom Drummond, "Fusing points and lines for high performance tracking", IEEE International Conference on Computer Vision,2005,2:1508-1511.
    [32]T. Lindeberg, "Feature detection with automatic scale selection", International Journal of Computer Vision,1998,30(2):79-116.
    [33]K. Mikolajczyk and C. Schmid, "Scale & affine invariant interest point detectors", International Journal of Computer Vision,2004,60(1):63-86.
    [34]D.G. Lowe, "Distinctive image features from scale-invariant keypoints", International Journal of Computer Vision,2004,60(2):91-110.
    [35]H. Bay, T. Tuytelaars, and L. Van Gool, "Surf:Speed-up robust feature", European Conference of Computer Vision,2006:404-417.
    [36]J.Matas, O.Chum, M.Urban, and T.Pajdla, "Robust wide baseline stereo from maximally stable extremal regions", Proc. of British Machine Vision Conference,2002:384-396.
    [37]K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors", IEEE Transaction on Pattern Analysis and Machine Intelligence,27(10),2005:1615-1630.
    [38]N. Dalal and B Triggs. "Histogram of oriented gradients for human detection", CVPR,2005:886-893.
    [39]W. Freeman, E. Adelson, "The design and use of steerable filters", IEEE Transaction on Pattern Analysis and Machine Intelligence,1992,13(9):891-906.
    [40]L. Van Gool, T. Moons and D. Ungureanu, "Affine/photometric invariants for planar intensity patterns", In Proceedings of the European Conference on Computer Vision, Lecture Notes in Computer Science, 1996,1064:642-651.
    [41]Jing Li, Nigel M. Allinson, "A comprehensive review of current local features for computer vision", Neurocomputing,2008,71(10-12):1771-1787.
    [42]Y. Ke, R. Sukthankar, "PCA-SIFT:A more distinctive representation for local image descriptors", IEEE International Conference on Computer Vision and Pattern Recognition,2004,2:506-513.
    [43]Luo Juan and Oubong Gwun, "A comparision of SIFT, PCA-SIFT and SURF", International Journal of Image Processing,2009,3(4):143-152.
    [44]E Rublee, V Rabaud, K Konolige, G.Bradski, "ORB:an efficient alternative to SIFT or SURF", IEEE International Conference on Computer Vision,2011:2564-2571.
    [45]M. Calonder, V. Lepetit, C. Strecha, and P. Fua. Brief:Binary robust independent elementary features. European Conference on Computer Vision,2010:778-792.
    [46]Anna Bosch, Andrew Zisserman, Xavier Munoz, "Representing shape with a spatial pyramid kernel", CIVR. Amsterdam, Netherlands:ACM,2007:401-408.
    [47]袁杰,魏宝刚,王李冬.一种综合PHOG形状和小波金字塔能量分布特征的图像检索方法.电子学报,2011,39(9)：2114-2119.
    [48]杨恒,王庆.一种新的局部不变特征检测和描述算法.计算机学报,2010,33(5)：935.944.
    [49]T.Ojala, M.Pietikainen, D.Harwood, "A comparative study of texture measures with classification based on featured distributions", Pattern Recognition,1996,29(1):51-59.
    [50]Jianxin Wu, J James M. Rehg, "CENTRIST:a visual descriptor for scene categorization", IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(8):1489-1501.
    [51]T.Ojala, MPietikainen and T.Maenpaa, "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns", IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(7):971-987.
    [52]Xiaoyu Wang, Tony X Han, "An HOG-LBP human detector with partial occlusion handling", International Conference of Computer Vision,2009:32-39.
    [53]Shengcai Liao, Xiangxin Zhu, Zhen Lei, Lun Zhang, and Stan Z. Li, "Learning Multi-scale Block Local Binary Patterns for Face Recognition", In Proceedings of the 2nd IAPR/IEEE International Conference on Biometrics (ICB 2007), Seoul, Korea, August 2007:828-837.
    [54]M.Heikkila, M.PietikSinen and C.Schmid, "Description of interest regions with center-symmetric local binary patterns", In Proceedings of Computer Vision, Graphics and Image Processing (ICVGIP),2006, 4338:58-69.
    [55]T.Ahonen, J.Matas, He Chu, and M.Pietikainen, "Rotation invariant image description with local binary pattern histogram Fourier features", In SCIA'09 Proceedings of the 16th Scandinavian Conference on Image Analysis,2009,5575:61-70.
    [56]郑永斌,黄新生,丰松江SIFT和旋转不变LBP相结合的图像匹配算法.计算机辅助设计与图形学学报,2010,22(2)：286-292.
    [57]王叶,张洪刚,方旭等.基于改进的LBP的低分辨率车牌汉字识别.中文信息学报,2009,23(5)：86-91.
    [58]王玮,黄非非,李见为等.采用LBP金字塔的人脸描述与识别.计算机辅助设计与图形学学报,2009,21(1)：94-106.
    [59]孙君顶,毋小省.纹理谱描述符及其在图像检索中的应用.计算机辅助涉及与图形学报,2010,22(3)：516-520.
    [60]J. Sivic and A. Zisserman, "Video google:A text retrieval approach to object matching in videos," ICCV, 2003:1470-1477.
    [61]Grauman K, Darrell T. The pyramid match kernel:discriminative classication with sets of image features. In:Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. Beijing, China:IEEE,2005,1458-1465.
    [62]Lazebnik S, Schmid C and Ponce J, "Beyond bags of features spatial pyramid matching for recognizing natural scene categories", IEEE Conference on Computer Vision & Pattern Recognition, 2006:2169-2178.
    [63]S. Savarese, J. Winn, and A. Criminisi,"Discriminative object class models of appearance and shape by correlations", IEEE Conference on Computer Vision & Pattern Recognition,2006:2033-2040.
    [64]H. Ling and S. Soatto, "Proximity distribution kernels for geometric context in category recognition", IEEE International Conference on Computer Vision,2007:1-8.
    [65]D. Liu, G. Hua, P. Viola, and T. Chen, "Integrated feature selection and higher-order spatial feature extraction for object categorization", IEEE Conference on Computer Vision & Pattern Recognition, 2008:1-8.
    [66]Yi Yang and S.Newsam, "Spatial pyramid co-occurrence for image classification", IEEE International Conference on Computer Vision,2011:1465-1472.
    [67]H.Jegou, M.Douze and C.Schmid, "Packing bag-of-features",IEEE International Conference on Computer Vision,2009:2357-2364.
    [68]H.Jegou, M.Douze and C.Schmid, "Improving bag-of-features for large scale image search",International Journal of Computer Vision,2010,87(3):191-212.
    [69]F.Perronnin, C.R. Dance. "Fisher kernel on visual vocabularies for image categorization", IEEE Conference on Computer Vision & Pattern Recognition,2007:1-8
    [70]H.Jegou, D.Matthijs, S.Cordelia and P.Patrick, "Aggregating local descriptors into a compact image representation", IEEE Conference on Computer Vision & Pattern Recognition,2010:3304-3311.
    [71]何云峰,周玲,于俊清,徐涛,管涛.基于局部特征聚合的图像检索方法.计算机学报,2011,34(11)：2224-2233.
    [72]M. J. Swain and D. H. Ballard, "Color indexing", International Journal of Computer Vision,1991, 7(1):11-32.
    [73]Y.Rubner, C.Tomasi and L.J.Guibas, "The earth mover's distance as a metric for image retrieval", Internatinal Journal of Computer Vision,2000,40(2):99-121.
    [74]H.Jegou, H.Harzallah andC.Schmid, "A contextual dissimilarity measure for accurate and efficient image search", IEEE Conference on Computer Vision and pattern Recognition,2007:1-8
    [75]P. Indyk, R. Motwani, "Approximate nearest neighbor-Towards removing the curse of dimensionlity" In proceedings of SIGMOD,1998:604-613.
    [76]Brian Kulis, Kristen Grauman, "Kernelized locality-sensitive hashing for scalable image search", In Proceedings of the IEEE International Conference on Computer Vision, Kyoto. Japan,2009:2130-2137.
    [77]Ondrej Chum, Michal Perdoch, Jirl Matas, "Geometric min-hashing:Finding a (thick) neddle in a haystack", IEEE Conference on Computer Vision & Pattern Recognition,2009:17-24.
    [78]Zhou X. S., Huang T. S., "Relevance feedback for image retrieval:A comprehensive review", ACM Multimedia Systems Journal, Special Issue on CBIR,2003,8(6):536-544.
    [79]吴洪,卢汉清,马颂德.基于内容图像检索中相关反馈技术的回顾.计算机学报,2005,28(12)：1769-1779.
    [1]陈秀新,邢素霞.图像/视频检索与图像融合.机械工业出版社,2012.
    [2]M.F.Swainand,D.H.ballard, "Colorindexing", International Journal on Computer vision,1991, 7(1):11-32.
    [3]Hsin-Teng, HU W C, "A rotationally invariant two-phase scheme for corner detection", Pattern Reeognition Letters,1996,28(5):819-828.
    [4]Sun Junding, Zhang Ximin, Cui Jiangtao et al., "Image retrieval based on color distribution entropy", Pattern Reeognition Letters,2006,27:1122-1126.
    [5]R.S.John, Chang Shih-Fu, "Tools and techniques for color image retrieval",In Proceeding of SPIE:Storage and Retrieval for Image and Video Database,1995,2670:1-12.
    [6]M.Stricker, M.Orengo, "Similarity of color images", SPIE Storage and Retrieval for Image and Video Databases III,1995,2185:381-392.
    [7]曹莉华,柳伟,李国辉.基于多种主色调的图像检索算法研究与实现.计算机研究与发展,1999,36(1)：96-100.
    [8]G.Pass, R.Zabih, J.Miller, "Comparing images using color coherence vectors",In Proceeding of the fourth ACM Conference on Multimedia, NY, USA,1997:65-73.
    [9]Huang J, Zabih R., "Image indexing using color correlograms", IEEE International of Computer Vision & Pattern Recognition,1997:762-768.
    [10]刘丽,匡纲要.图像纹理特征提取方法综述.中国图象图形学报,2009,14(4)：622-635.
    [11]R.M.Haralick, K.Shanmugan, and I.Dinstein, "Textural Features for Image Classification", IEEE Transactions on Systems, Man, and Cybernetics,1973, SMC-3:610-621.
    [12]H. Tamura, S. Mori, and T. Yamawaki, "Texture features corresponding to visual perception", IEEE Transactions on Systems, Man, and Cybernetics,1978, SMC-8(6):460-473.
    [13]周明全,耿国华,韦娜.基于内容图像检索技术.北京：清华大学出版社,2007.
    [14]R.CheHappa, S.Chatterjee, "Classification of texture using ganssian Markov random fields", IEEE Transactions on Acoustics, Speech and Signal Processing,1985,33(4):959-963.
    [15]Chen C C. Huang C L, "Markov random fields for texture classification", Pattern Recognition Letter, 1993,14(11):907-914.
    [16]M.Hassner, J.Sklansky, "The use of markov random fields as models of texture", Computer Graphics and Image Processing,1980,12(3):357-370.
    [17]H.Kaneko, E.Yodognwa, "A markov random field application to texture classification", InProceedings of IEEE International Conference on Pattern Recognition and Image Processing,1982:221-225.
    [18]T.Ojala, M.Pietikainen, D.Harwood, "A comparative study of texture measures with classification based on featured distributions", Pattern Recognition,1996,29(1):51-59.
    [19]T.Ojala, MPietikainen and T.Maenpaa, "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns", IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(7):971-987.
    [20]Tan Xiaoyang, B.Triggs, "Enhanced local texture feature sets for face recognition under difficult lighting conditions", IEEE Transaction on Image Processing,2007,19(6):1635-1650.
    [21]M.Heikkila, M.Pietikainen, C.Schmid, "Description of interest regions with local binary patterns", Pattern Recognition,2009,42(3):425-436.
    [22]Tuytelaars T, Gool L V, "Matching widely separated views based on affine invariant regions", Ineternational Journal of Computer Vision,2004,59(1):61-85.
    [23]Schaffalitzky F, Zisserman A,"Multi-view matching for unordered image sets", European Conference of Computer Vision,, Cambridge, MA, USA, MIT Press,2002:414-431.
    [24]Pritchett P, Zisserman A, "Wide baseline stereo matching", Ineternational Conference of Computer Vision, New York, USA, ACM Press,1998:754-760.
    [25]Lowe D G,"Object recognition from local scale-invariant features",Ineternational Conference of Computer Vision, New York, USA, ACM Press,1999:1150-1157.
    [26]S. Belongie, J. Malik, J. Puzicha, "Shape matching and object recognition using shape contexts", IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(4):509-522.
    [27]Johnson A, Hebert M, "Using spin image for efficient object recognition in cluttered 3D scenes",IEEE Transactions on Pattern Analysis and Machine Intelligence,1999,21(5):433-449.
    [28]Obdrzalek S, Matas J, "Object recognition using local affine frames on distinguished regions", BMVC, Oxford, England, BMVA Press,2002:113-122.
    [29]Mikolajczyk K, Leibe B, Schiele B, "Local features for object class recognition", Ineternational Conference of Computer Vision, New York, ACM Press,2005:1792-1799.
    [30]Dorko G, Schmid C, "Selection of scale invariant neighborhoods for object class recognition", Ineternational Conference of Computer Vision, New York, ACM Press,2003:634-640.
    [31]C.Schmid, R.Mohr, "Local grayvalue invariants for image retrieval",IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(5):530-534.
    [32]Sivic J, Zisserman A, "Video google:A text retrieval approach to object matching in videos", Ineternational Conference of Computer Vision, New York, USA:ACM Press,2003:1470-1478.
    [33]Sivic J, Schaffalitzky F, Zisserman A, "Object level grouping for video shots", Ineternational Journal of Computer Vision,2006,67(2):189-210.
    [34]Se S, Lowe D G, Little J. "Mobile robot localization and mapping with uncertainty using scale invariant visual and marks", International Journal of Robotics Research,2002,21(8):735-758.
    [35]Oliva A, Torralba A, "Modeling the shape of the scene:a holistic representation of the spatial envelope", International Journal of computer vision,2001,42(3):145-175.
    [36]T. Tuytelaars and K. Mikolajczyk, "Local invariant feature detectors:A survey",Foundations and Trends(?) in Computer Graohics and Vision,2008,3(3):177-280.
    [37]H. Moravec, "Towards automatic visual obstacle avoidance", International Joint Conference on Artificial Intelligence,1977:584.
    [38]C. Harris and M. Stephens, "A combined corner and edge detector", Alvey Vision Conference, 1988:147-151.
    [39]S.M. Smith and J.M. Brady, "Susan-a new approach to low level image processing", International Journal of Computer Vision,1997,23(1):45-78.
    [40]Edward Rosten and Tom Drummond, "Machine learning for high-speed corner detection", European Conference on Computer Vision,2006,1:430-443.
    [41]T. Lindeberg, "Feature detection with automatic scale selection", International Journal of Computer Vision,1998,30(2):79-116.
    [42]K. Mikolajczyk and C. Schmid, "Scale & affine invariant interest point detectors", International Journal of Computer Vision,2004,60(1):63-86.
    [43]T.Kadir,M.Brady, "Scale, saliency and image description", International Journal of Computer Vision, 2001,45(2):83-105.
    [44]D.G. Lowe, "Distinctive image features from scale-invariant keypoints", International Journal of Computer Vision,2004,60(2):91-110.
    [45]H. Bay, T. Tuytelaars, and L. Van Gool, "Surf:Speed-up robust feature", European Conference of Computer Vision,2006:404-417.
    [46]J.Matas, O.Chum, M.Urban, and T.Pajdla, "Robust wide baseline stereo from maximally stable extremal regions", Proc. of British Machine Vision Conference,2002:384-396.
    [47]Y. Ke, R. Sukthankar, "PCA-SIFT:A more distinctive representation for local image descriptors", IEEE International Conference on Computer Vision and Pattern Recognition,2004,2:506-513.
    [48]K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors", IEEE Transaction on Pattern Analysis and Machine Intelligence,27(10),2005:1615-1630.
    [49]W. Freeman, E. Adelson, "The design and use of steerable filters", IEEE Transaction on Pattern Analysisand Machine Intelligence,1991,13(9):891-906.
    [50]Jing Li, Nigel M. Allinson, "A comprehensive review of current local features for computer vision", Neurocomputing,2008,71(10-12):1771-1787
    [51]王永明,王贵锦.图像局部不变性特征与描述.北京：国防工业出版社,2010.
    [52]L. Fei-Fei, R. Fergus and P. Perona, "Learning generative visual models from few training examples:an incremental Bayesian approach tested on 101 object categories",IEEE Computer Vision and Pattern Recognition, Workshop on Generative-Model Based Vision,2004.
    [53]G. Griffin and A. Holub and P. Perona, "Caltech-256 Object Category Dataset", Caltech Technical Report,2007.
    [54]J. Deng, A. Berg, K. Li and L. Fei-Fei, "What does classifying more than 10,000 image categories tell us?", In Proceedings of the 12th European Conference of Computer Vision,2010
    [55]O. Russakovsky and L. Fei-Fei, "Attribute Learning in Large-scale Datasets",In Proceedings of the 12th European Conference of Computer Vision,1st International Workshop on Parts and Attributes,2010.
    [56]J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei, "ImageNet:A Large-Scale Hierarchical Image Database", IEEE Computer Vision and Pattern Recognition,2009.
    [57]Xie Xing, Lu Lie, Jia Menglei, Li Hua, Seide Frank, Ma Wei-Ying, "Mobile search with multimodal queries", Proceeding of IEEE,2008,96(4).
    [58]M. douze, H. Jegou, H. Sandhawalia. "Evaluation of GIST descriptors for web-scale image search," CIVR,2009.
    [59]J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. "Object retrieval with large vocabularies and fast spatial matching," CVPR,2007.
    [60]B.S.Manjunath, Jens-Rainer Ohm, Viond V. Vasudevan, Akio Yamada, "Color and texture descriptors", IEEE Transaction on Circuits and Systems for Video Technology,2001,11(6):703-715.
    [61]Ka-man Wong, Lai-Man Po, "MPEG-7 dominant color descriptor based relevance feedback using merged palette histogram", ICASSP,2004,3:iii-433-6.
    [1]H. Moravec, "Towards automatic visual obstacle avoidance", International Joint Conference on Artificial Intelligence,1977:584.
    [2]C. Harris and M. Stephens, "A combined corner and edge detector", Alvey Vision Conference, 1988:147-151.
    [3]S.M. Smith and J.M. Brady, "Susan-a new approach to low level image processing", International Journal of Computer Vision,1997,23(1):45-78.
    [4]T. Lindeberg, "Feature detection with automatic scale selection", International Journal of Computer Vision,1998,30(2):79-116.
    [5]K. Mikolajczyk and C. Schmid, "Scale & affine invariant interest point detectors", International Journal of Computer Vision,2004,60(1):63-86.
    [6]D.G. Lowe, "Distinctive image features from scale-invariant keypoints", International Journal of Computer Vision,2004,60(2):91-110.
    [7]H. Bay, T. Tuytelaars, and L. Van Gool, "Surf:Speed-up robust feature", European Conference of Computer Vision,2006:404-417.
    [8]Edward Rosten and Tom Drummond, "Machine learning for high-speed corner detection", European Conference on Computer Vision,2006,1:430-443.
    [9]Edward Rosten and Tom Drummond, "Fusing points and lines for high performance tracking", IEEE International Conference on Computer Vision,2005,2:1508-1511.
    [10]张宗平,刘贵忠.基于小波的视频图像压缩研究进展.电子学报,2002,30(6)：883-889.
    [11]王永玉,孙衢,袁超伟.有效图像压缩的提升小波优化设计.北京邮电大学学报,2007,30(4)：64-68.
    [12]周祚峰,水鹏朗.交替使用小波去噪何全变差正则化的盲图像恢复算法.电子与信息学报,2008,30(12)：2912-2915.
    [13]S.G. Chang, Bin Yu and M.Vetterli, "Adaptive wavelet thresholding for image denoising and compression", IEEE Transaction on Image Processing,2000,9(9):1532-1546.
    [14]Yong Xu, Xiong Yang, Haibin Ling, and Hui Ji, "A newtexture descriptor using multifractal analysis in multi-orientation wavelet pyramid", IEEE Conference on Computer Vision and Pattern Recognition, 2010:161-168.
    [15]A. Teynor and H. Burkhardt, "Wavelet-based salient points with scale information for classification", International Conference on Pattern Recognition,2008:1-5.
    [16]E. Loupias, N. Sebe, S. Bres, and J.-M. Jolion, "Wavelet-based salient points for image retrieval", International Conference on Image Processing,2002,2:518-521.
    [17]W. Ayadi and A. Benazza-Benyahia, "Wavelet based statistical detection of salient points by the exploitation of the interscale redundancies", IEEE International Conference on Image Processing, 2009:1001-1004.
    [18]N.G. Kingsbury, "Complex wavelets for shift invariant analysis and filtering of signals", Journal of Applied and Computational Harmonic Analysis,2001,10(3):234-253.
    [19]王永明,王贵锦.图像局部不变性特征与描述.北京：国防工业出版社,2010.
    [20]Alison Noble, "Descriptions of Image Surfaces", PhD thesis, Department of EngineeringScience, OxfordUniversity 1989.
    [21]周明全,耿国华,韦娜.基于内容图像检索技术.清华大学出版社,2007.
    [22]A. Oliva and A. Torralba,"Modeling the shape of the scene:a holistic representation of the spatial envelope", International Journal of Computer Vision,2001,42(3):145-175.
    [23]R.Rahmani, S.A.Goldman, H.Zhang et al., "Localized content based image retrieval", IEEE Transaction on Pattern Analysis and Machine Intelligence,2008,30(11):1902-1912.
    [24]李杰,程义民,葛仕明等.基于显著点特征多示例学习的图像检索方法.光电子·激光,2008, 19(10)：1405-1409.
    [25]Tian Q, Sebe N, Lew M S, et al, "Image retrieval using wavelet-based salient points", Journal of Electronio Imaging,2001,10(4):835-849.
    [26]I.W.Selesnick,R.G.Baraniuk,N.G.Kingsbury.The Dual Tree Complex Wavelet Transform.IEEE Signal Processing Magazine,2005,22(6):123-151
    [27]高彦彦.基于双树复数小波的压缩传感图像重构算法研究.硕士论文,燕山大学,2009.
    [28]J. Fauqueur, N.G. Kingsbury, and R. Anderson, "Multiscale keypoint detection using the dual tree complex wavelet transform", IEEE International Conference on Image Processing,2006:1625-1628.
    [29]http://www.cs.ubc.ca/-lowe/keypoints/
    [30]Oxford database:http://www.robots.ox.ac.uk/-vgg/research/affine/.
    [31]C. Schmid, R. Mohr, and C. Bauckhage, "Evaluation of interest point detectors", International Journal of Computer Vision,2000,37(2):151-172.
    [32]丁贵广,戴海琼,徐文立.基于显著点局部分布特征的图像检索方法.光电子·激光,2005,16(9)：1101-1106.
    [33]孟繁杰,郭宝龙.一种基于兴趣点颜色及空间分布的图像检索方法.西安电子科技大学学报(自然科学版),2005,32(2)：256-259.
    [34]符祥,曾接贤.基于兴趣点匹配和空间分布的图像检索方法.中国激光,2010,37(3)：774-778.
    [35]Turgay Celik, Tardi Tjahjadi, "Multiscale texture classification and retrieval based on magnitude and phase features of complex wavelet subbands", Computers and Electrical Engineering,2011,37:729-743.
    [36]孟繁杰,郭宝龙.使用兴趣点局部分布特征及多示例学习的图像检索方法.西安电子科技大学学报(自然科学版),2011,38(2)：47-53.
    [37]苏小红,丁进,马培军.用兴趣点凸包和SVM加权反馈实现图像检索.计算机学报学报,2009,32(11)：2221-2228.
    [1]Ritendra Datta, Jia Li, James Z. Wang, "Content-based image retrieval:approaches and trends of the new age",7th ACM SIGMM international workshop on Multimedia information retrieval,2005: 253-262.
    [2]Jianhua Wu, Zhaorong Wei and Youli Chang, "Color and Texture Feature For Content Based Image Retrieval", JDCTA:International Journal of Digital Content Technology and its Applications,2010, 4(3):43-49.
    [3]Chi-Man Pun, Chan-Fong Wong, "Fast and Robust Color Feature Extraction for Content-based Image Retrieval", IJACT:International Journal of Advancements in Computing Technology,2011,3(6):75-83.
    [4]X. S. Zhou and T. S. Huang, "Relevance Feedback in Image Retrieval:A Comprehensive Review," Multimedia Systems,2003,8:536-544.
    [5]C. Bohm, S. Berchtold, and D. A. Keim, "Searching in High-Dimensional Spaceslndex Structures for Improving the Performance of Multimedia Databases," ACM Computing Surveys,2001,33(3):322-373.
    [6]H. Muller, N. Michoux, D. Bandon, and A. Geissbuhler, "A Review of Content-Based Image Retrieval Systems in Medical Applications-Clinical Benefits and Future Directions," International Journalof Medical Informatics,2004,73(1):1-23.
    [7]C.-c. Chen, H. Wactlar, J. Z. Wang, and K. Kiernan, "Digital Imagery for Significant Cultural and Historical Materials-An Emerging Research Field Bridging People, Culture, and Technologies," International Journal on Digital Libraries,2005,5(4):275-286.
    [8]周明全,耿国华,韦娜.基于内容图像检索技术.清华大学出版社,2007.
    [9]刘丽,匡纲要.图像纹理特征提取方法综述.中国图象图形学报,2009,14(4)：622-635.
    [10]Lowe D G, "Object recognition from local scale-invariant features",IEEE International Conference on Computer Vision,1999,2:1150-1157.
    [11]D.G. Lowe, "Distinctive image features from Scale-Invariant keypoints", International Journal of Computer Vision,2004,60(2):91-110.
    [12]Van de Sande KE, Gevers T and Snoek CG, "Evaluating color descriptors for object and scene recognition",IEEE Transactions on Pattern Analysis and Machine Intelligence,2010,32(9):1582-1596.
    [13]K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors", IEEE Transaction on Pattern Analysis and Machine Intelligence,2005,27(10):1615-1630.
    [14]M. Calonder, V. Lepetit, C. Strecha, and P. Fua, "Brief:Binary robust independent elementary features", European Conference on Computer Vision,2010:778-792.
    [15]E Rublee, V Rabaud, K Konolige, G.Bradski, "ORB:an efficient alternative to SIFT or SURF", IEEE International Conference on Computer Vision,2011:2564-2571.
    [16]J. Sivic and A. Zisserman, "Video google:A text retrieval approach to object matching in videos",IEEE International Conference on Computer Vision,2003:1470-1477.
    [17]S. Lazebnik, C. Schmid, and J. Ponce, "Beyond bags of features:Spatial pyramid matching for recognizing natural scene categories", IEEE Conference on Computer Vision and Pattern Recognition, 2006,2:2169-2178.
    [18]Jianxin Wu, J James M. Rehg, "CENTRIST:a visual descriptor for scene categorization", IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(8):1489-1501.
    [19]Yu-Ying Liu, Mei Chen, Hiroshi Ishikawa, Gadi Wollstein, Joel Schuman, and James M. Rehg, "Automated macular pathology diagnosis in retinal OCT Images using multi-scale spatial pyramid with local binary patterns", In Proceeding of Medical image computing and computer-assisted intervention: 2010, Part I:1-9.
    [20]Jianxin Wu, Christopher Geyer, and James M. Rehg, "Real-time human detection using contour cues", IEEE International Conference on Robotics and Automation,2011:860-867.
    [21]Ojala T, Pietikainen M and Harwood D, "A comparative study of texture measures with classification based on featured distribution", Pattern Recognition,1996,29(1):51-59.
    [22]R. Zabih, J. Woodfill, "Non-parametric local transforms for computing visual correspondence", European Conference on Computer Vision,1994,2:151-158.
    [23]Maenpaa T, Ojala T, Pietikainen M and Soriano M, "Robust texture classification by subsets of local binary patterns", International Conference on Pattern Recognition,2000,3:947-950.
    [24]Pietikainen M, Ojala T and Xu Z, "Rotation invariant texture classification using feature distributions", Pattern Recognition,2000,33:43-52.
    [25]Ojala T, Pietikainen M and Ma'enpaa T, "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns", IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(7):971-987.
    [1]S.Lazebnik, C.Schmid, J.Ponce, "Beyond bags of features spatial pyramid matching for recognizing natural scene categories", VPR,2006,2169-2178.
    [2]K.Grauman, T.Darrell,"The pyramid match kernel:discriminative classication with sets of image features", IEEE International Conference on Computer Vision and Pattern Recognition,2005:1458-1465.
    [3]Shiliang Zhang, Qi Tian, Gang Hua et al.,"Generating descriptive visual words and visual phrases for large-scale image applications", IEEE Transactions on Image Processing,2011,20(9):2664-2677.
    [4]张琳波,王春恒,肖柏华等.基于Bag-of-Phrases勺图像表示方法.自动化学报,2012,38(1)：46-54.
    [5]J.Yuan, Y.Wu, M.Yang, "Discovery of collocation patterns:from visual words to visual phrases", IEEE Conference on Computer Vision and Pattern Recogntion,2007:1-8.
    [6]D.Liu, G.Hua, P.Viola, et al.,"Integrated feature selection and high-order spatial feature extraction for object categorization", IEEE Conference on Computer Vision and Pattern Recogntion,2008:1-8
    [7]T.Quak, V.Ferrari, B.Leibe, et al., "Efficient mining of frequent and distinctive feature configurations", International Conference on Computer Vision,2007:1-8.
    [8]J. Sivic and A. Zisserman, "Video google:A text retrieval approach to object matching in videos",International Conference on Computer Vision,2003:1470-1477.
    [9]E.Nowak, F. Jurie, B. Triggs, "Sampling strategies for bag-of-features image classification", European Conference on Computer Vision,2006,3954:490-503.
    [10]D.G. Lowe, "Distinctive image features from Scale-Invariant keypoints", Internationa Journal of Computer Vision,2004,60(2):91-110.
    [11]Van de Sande KE, Gevers T and Snoek CG, "Evaluating color descriptors for object and scene recognition", Pattern Analysis and Machine Intelligence,2010,32(9):1582-1596.
    [12]D. Nister, H. Stewenius, "Scalable recognition with a vocabulary tree", IEEE Conference on Computer Vision and Pattern Recogntion,2006:2161-2168.
    [13]J. Philbin, O.Chum, M. Isard, J. Sivic, A. Zisserman, "Object retrieval with large vocabularies and fast spatial matching", IEEE Conference on Computer Vision and Pattern Recogntion,2007:1-8.
    [14]J. C. van Gemert, J. M. Geusebroek, C. J. Veenman, and A. W. M. Smeulders, "Kernel codebooks for scene categorization", European Conference on Computer Vision,2008:696-709.
    [15]F. Perronnin, J. Sanchez, and T. Mensink, "Improving the fisher kernel for large-scale image classification", European Conference on Computer Vision,2010:143-156.
    [16]J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, "Locality-constrained linear coding for image classification", IEEE Conference on Computer Vision and Pattern Recogntion,2010:3360-3367.
    [17]Jegou H, Schmid C, Harzallah H, Verbeek J, "Accurate image search using the contextual dissimilarity measure", Pattern Analysis and Machine Intelligence,2010,32(1):2-11.
    [18]Chum O, Philbin J, Sivic J, Isard M, Zisserman A, "Total recall:Automatic query expansion with a generative feature model for object retrieval", International Conference on Computer Vision,2007:1-8.
    [19]K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors", IEEE Transaction on Pattern Analysis and Machine Intelligence,2005,27(10):1615-1630.
    [20]HerbertBay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, "SURF:Speeded Up Robust Features", Computer Vision and Image Understanding (CVIU),2008,110(3):346-359.
    [21]J.Philbin, O.Chum, M.Isard, etc, "Lost in quantization:improving particular object retrieval in large scale image databases", IEEE Conference on Computer Vision and Pattern Recogntion,2008:1-8.
    [22]Ken Chatfield, Victor Lempitsky, Andrea Vedaldi, etc, "The devil is in the details:an evalution of recent feature encoding methods", Brithsh Machine Vision Conference,2011.
    [23]K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, LV.Gool, "A comparison of affine region detectors", Internationa Journal of Computer Vision,2005,65(1):43-72.
    [24]T. Lindeberg, "Feature detection with automatic scale selection", International Journal of Computer Vision,1998,30(2):79-116.
    [25]王永明,王贵锦.图像局部不变性特征与描述.国防工业出版社.2010.
    [26]M. Heikkil, M. Pietikainen, and C. Schmid, "Description of interest regions with local binary patterns", Pattern Recognition,2009,42(3):425-436.
    [27]Jianxin Wu, J.M. Rehg, "Beyond Euclidean distance:creating effective visual codebooks using the histogram intersection kernel", International Conference on Computer Vision,2009:630-637.
    [28]Odone, F., Barla, A., and Verri, A., "Building kernels from binary strings for image matching", IEEE Transaction on Image Processing,2005,14(2):169-180.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700