摘要
摘 要
近年来,信息网络的高速发展和多媒体业务的普及极大地改变了人类自身
的生产、生活方式,人们对多媒体信息业务的需求促使新的多媒体业务的开发
成为当前信息领域的热点。图像数据海量、无序等特点决定了大部分图像信息
业务的构建都必须解决图像数据的高效压缩和有效检索两个关键问题。目前图
像的高效压缩已经取得了长足发展,图像普遍以压缩格式存在,为了进一步加
速图像信息的检索过程,基于压缩域的图像/视频检索技术的研究受到了人们的
关注,压缩域基于内容的图像检索系统的开发也引起了广泛的重视。
本论文通过对影响图像检索性能的多个关键技术的研究,基于 VB6.0 和数
据库 SQL Server 2000 开发了一套压缩域基于内容的图像检索系统。主要工作包
括:
1.讨论了目前典型的几个原型系统,并从用户的角度出发,通过对目前图
像检索系统基本框架结构的分析,设计了一个界面友好、能够对图像数据库进
行有效管理且灵活、安全的图像检索系统;
2.提出了一种用于提高图像检索精度的,基于自适应最优特征维的相关反
馈算法,使检索趋势能够按照不同用户的意志自适应改变;
3.讨论了多种用于提高图像检索响应速度的高维索引关键技术,并结合不
同树的优点,提出了一种混合树算法,改进了图像的检索响应速度;
4.针对图像上传和入库工作的复杂性,提出类模块式入库的方案,通过实
验对其可行性和有效性进行了讨论。
With the booming of digitized information and network, the patterns of
production and life of human beings have been changed dramatically due to the rapid
development of information network and multimedia applications. The increasing
need for multimedia information has made the development of new network-based
multimedia applications become one of the hottest subjects in the field of
information technology. Due to the enormous and unstructured multimedia data,
solutions must be provided for their effective compression and efficient indexing in
order to realize all kinds of multimedia applications. Recently, effective compression
technique has substantially developed and images exist widely in compressed format.
To make fast retrieval, indexing techniques of image/video data in
compressed-domain have witnessed a booming interest. And people attach
importance to the research and development of content-based image retrieval system
in compressed-domain extensively.
This paper discusses several key techniques which influence image retrieval
performance and develops a new content-based image retrieval system in compressed-
domain based on VB6.0 and database SQL Server 2000.The major contents are:
1. Some current typical contented-based image retrieval systems are reviewed.
Based on the structure analysis of these systems, this paper presents a flexible and
safe system with a friendly user interface and which can provide efficient image
database management.
2. A novel relevance feedback algorithm using the optimal feature components
adaptive extraction is put forward. Retrieval results can be changed with different
users.
3. A new compound high-dimension indexing method is constructed by taking
advantage of some other high-dimension techniques. Simulation results show the
efficiency of the method.
4. A model method according to the complexity of uploading and downloading
images and features is proposed whose feasibility and efficiency are verified.
引文
参考文献
1 何立民, 万跃华. 数字图书馆中基于内容的图像检索技术. 现代图书情报技术, 2002,
年刊, (93): 26~36
2 黄祥林. 基于压缩域的图像检索技术初步研究. 北京工业大学博士毕业论文, 2001
3 李向阳, 庄越挺, 潘云鹤. 基于内容的图像检索技术与系统. 计算机研究与发展, 2001,
38(3):344~354
4 魏海, 沈兰荪. 小波变换域内基于方向梯度相角直方图的图像检索算法. 电路与系统
学报, 2001, 6(2): 20~24
5 魏海, 沈兰荪, 李晓华. 基于迭代分形的图像压缩和检索方法. 中国图象图形学报,
2002, 7(11):1198~1203
6 魏海, 沈兰荪. 基于分类矢量量化的图像压缩和检索算法. 电子学报, 2001, 29(7):1~4
7 黄祥林, 沈兰荪. 基于 DCT 压缩域的纹理图像分类. 电子与信息学报, 2002,
24(2):216~221
8 黄祥林, 沈兰荪. 一种具有旋转不变性的压缩域纹理图像分类方法. 电子与信息学报,
2002, 24(11):1441~1446
9 黄祥林, 宋磊, 沈兰荪. 基于 DCT 压缩域的图像检索方法. 电子学报, 2002,
30(12):1786~1789
10 Information Technology–Multimedia Content Description Interface–Part 3: Visual. ISO/IEC
JTC1/SC29/WG11/N4062, Singapore, 2001
11 MPEG-7 Applications Document V. 8, ISO/IEC JTC1/SC29/ WG11 N2728[S], Seoul,
Korea. 1999, 3
12 石军, 常义林. 图像检索技术综述. 西安电子科技大学学报, 2003, 8, 30(4):486~491
13 Jia Kebin.A Study of Query by Image Content According to Color and Shape. Korea China
joint Symposium, Jan., 2000, korea
14 Wei-Ying Ma; Hong Jiang Zhang. Benchmarking of Image Features for Content-Based
Retrieval.Signals, Systems & Computers, 1998. Conference Record of the Thirty-Second
Asilomar Conference, 1998, 1:253~257
15 M. J. Swain,D H Ballard. Color Indexing. Int journal of computer vision, 1991, 7(1):11~32
16 G. Pass, R. Zabih, J. Miller. Comparing Images Using Color Coherence Vectors. 4th ACM
conf on multimedia,boston,11/96:65~73
17 J. M. Zachary, S. S. Iyengar. Content Based Image Retrieval Systems.IEEE Symposium on
Application~Specific Systems and Software Engineering and Technology, 1999: 136 ~143
18 J. Huang. Image Indexing Using Color Correlogram. IEEE int conf on computer vision and
- 77 -
北京工业大学工学硕士论文
pattern recognition, puerto rico,1997: 762~768
19 徐建华. 图像处理与分析. 科学出版社, 1994
20 M. Flickner et al. Query by Image and Video Content: The QBIC System. IEEE Computer,
1995, 28:23~32
21 黄祥林, 宋磊, 沈兰荪. 用于图像检索的连通直方图方法. 电路与系统学报, 2002,
7(4):58~61
22 C. Faloutsos, R. Barber, M. Flickner, et al. Efficient and Effective Querying by Image
Content. J intelligent information systems, 1994, 3(1):231~262
23 S. Loncaric. A survey of shape analysis techniques. Pattern Recognition, 1998,
31(8):983~1001
24 J. Mott-Smith. Medial Axis Transformations, in Picture Processing and Psychopictories,
Lipkin and Rosenfeld eds., Academic Press New York, 1970:267~283
25 H. Blum, R Nagel. Shape Description Using Weighted Symmetric Axis Features. Pattern
Recognition 10, 1978:167~180
26 S. Peleg, A Rosenfeld. A Min-Max Medial Axis Transformation. IEEE Trans Pattern Anal
Mech Intell 3, 1981:208~210
27 L. Davis. Two~dimensional shape representation. In: Handbook of Pattern Recognition and
Image Processing, I Young and K S Fu eds, Academic Press New York, 1986:233~245
28 M. R. Teague. Image Analysis via The General Theory Of Moments. J Opt Soc Amer 70,
1980:920~930
29 S. O. Belkasim, M. Shridhar, M Ahmadi. Pattern Recognition With Moment Invariants: a
comparative study and new results. Pattern Recognition 24,1991:1117~1138
30 R. J. Prokop, A. P. Reeves. A survey of Moment-Based Techniques for Unoccluded Object
Representation and Recognition. CVGIP: graphical models image process 54,1992:438~460
31 H. Kim, K Park, M. Kim. Shape Decomposition by Collinearity. Pattern Recognition Lett
6,1987:335~340
32 H. V. Jagadish, A. M. Bruckstein. On Sequential Shape Descriptions. Pattern Recognition
25,1992:165~172
33 A. Taza, C. Suen. Discrimination of Planar Shapers Using Shape Matrices. IEEE trans SMC
19, 1989: 1281~1289
34 B. Kartikeyan, A Sarkar. Shape Description by Time Series. IEEE trans Pattern Anal Mech
Intell 11, 1989:977~984
35 H. Samet. Connected Component Labeling Using Quadtrees. Journal of ACM, 1981,
28(3):487~501
36 M. Shneier. Calculations of Geometric Properties Using Quadtrees. Computer graphics and
- 78 -
参考文献
image processing, 16,1981:296~302
37 H. Samet. Computing Perimeters of Regions In Image Representated by Quadtrees. IEEE
Trans PAMI-3,1981:683~687
38 Z. Q. Liu,J. P. Sun. Structured Image Retrieval. J Visual Languages and computing ,1997,
8(3):333~357
39 V. N. Gudivada, G S Jung. an Algorith for Content-Based Retrieval In Multimedia
Databases. proc intl conf multimedia computing and systems,1996:193~200
40 S. K. Chang. Iconic Indexing by 2D String. IEEE trans pattern analysis and machine
intelligence,1984, 6(4): 413~428
41 A. P. Sistla, C. Yu, C. Liu, et al. Similarity Based Retrieval Of Pictures Using Indices On
Spatial Relationships. Proc intl conf very large databases,9/95: 619~629
42 D. Toman. Point vs. Interval-bassed Query Languages for Temporal Databases. Proc Fifth
ACM SIGACT/MOD/ART symp principles of database systems,1996:58~67
43 Y. Theodoridis, M. Vazirgiannis, T. Sellis. Spatio-temporal Indexing for Large Multimedia
Applications. Proc intl conf multimedia computing and systems, 1996:441~448
44 贾克斌. 语义引导的图像内容查询方法的研究. 中国科技大学博士论文, 1998
45 黄祥林, 沈兰荪. 基于 DCT 压缩域的图像字符定位. 中国图象图形学报, 7(A)(1):22~26
46 徐曼. 基于内容的图像检索技术的研究与系统实现. 南京理工大学硕士论文,
2002:17~18
47 I. J Cox., M. L. Miller, S. M. Omohundro et al. Pichunter: Bayesian Relevance Feedback
for Image Retrieval System. In: Int’l Conf. on Pattern Recognition. Vienna, Austria,
1996:361~369
48 D. White, R. Jain. Similarity Indexing: Algorithms and Performance. In Proc. SPIE: Storage
and Retrieval for Image and Video Database, 1997
49 J. T. Robinson. The k-d-b-tree: A Search Structure for Large Multidimensional Dynamic
Indexes. In Proceedings of the 1981 ACM-SIGMOD Conference, 1981:10~18
50 A. Guttman. R-trees: A Dynamic Index Structrre for Spatial Searching. In ACM Proc. Int.
Conf. Manag. Data(SIGMOD), 1984:47~57
51 N. Beckmann, H. P. Kriegal, et al. The R*-tree: An Efficient and Robust Access Method for
Points and Rectangles. ACM, 1990, 322~331
52 K. Lin, H.V. Jagadish, and C. Faloutsos. The TV-Tree: An Index Structure for
High-Dimensional VLDB Journal, 1994:517~542
53 S. Berchtold, D. A. Keim, H. P. Kriegel. The X-Tree: An Index Structure for
High-Dimensional Data. Proceedings on the 22nd VLDB Conference, Mumbai (Bombay),
India, 1996:28~39
- 79 -
北京工业大学工学硕士论文
54 D. A. White and R. Jain, Similarity Indexing with the SS-tree. Proc. Of the 12th Int. Conf.
on Data Engineering, New Orleans USA, 1996:516~523
55 N. Katayama, S. Satoh. The SR-Tree: An Index Structure for High-Dimensional Nearest
Neighbor Queries. Proc. Of ACM SIGMOD Int. Conf. on Management of Data, Tucson,
Arizona, 1997:369~380
56 S. Berchtold C. B?hm, H. P. Kriegel. The Pyramid-Technique: Towards Breaking the Curse
of Dimensionality. ACM SIGMOD, Seattle, WA, USA, 1998:142~153
57 D. H. Lee, H. J. Kim. An Efficient Algorithm for Hyperspherical Range Query Processing
in High-Dimensional Data Space, Information Processing Letters, May, 2002, 83(2):
115~123
58 D. H. Lee, H. J. Kim. An Efficient Nearest Neighbor Search in High-Dimensional Data
Spaces, Information Processing Letters, March, 2002, 81(5):239~246
59 Yasushi Sakurai, Masatoshi Yoshikawa, et al. The A-tree: An Index Structure for
High-Dimensional Spaces Using Relative Approximation. Proc. of the 26th International
Conference on Very Large Data Bases, Cairo, Egypt, 2000:516~526
60 S. Berchtold, C. B?hm, et al. Independent Quantization: An Index Compression Technique
for High-Dimensional Data Spaces. 16th International Conference on Data Engineering
(ICDE), San Diego, CA, 2000
61 Weber, H. J. Schek, S. Blott. A Quantitative Analysis and Performance Study for
Similarity-Search Methods In High-Dimensional Spaces. The Int’l Conf. on Very Large
Databases, New York, 1998
62 冯玉才, 曹奎, 曹忠升. 一种支持快速相似检索的多维索引结构. 软件学报, 2002,
13(8):1678~1685
63 薛向阳, 罗航哉, 吴立德. 用代数格实现点数据索引. 计算机学报, 2000, 23(6):629~633
64 韩 莹 洁 , 孙 永 强 , 黄 林 鹏 . 自 适 应 近 似 树 . 计 算 机 研 究 与 发 展 , 2002, 12,
39(12):1751~1757
65 M. S. Kankanhalli. Introduction to Multimedia Information Retrieval. 2000, http:
//comp.nus.edu.sg
66 R. Brunelli, O. Mich. On the Use of Histograms for Image Retrieval. IEEE International
Conference on Multimedia Computing and Systems, 1999, 2:143~147
67 J. K. Wu, A. D. Narasimhalu. Fuzzy Retrieval of Image Databases. Proceedings of the first
Asian fuzzy systems symposium. Singapore, 93,11:38~40
68 F. Idris, S. Panchanathan. Review of Image and Video Indexing Techniques. Journal of
visual communication and image representation. 97, 8(2):146~166
69 W. Y. Ma, H. J. Zhang. Benchmarking of Image Features for Content-Based Retrieval.
- 80 -
参考文献
Signals, Systems & Computers, Conference Record of the Thirty-Second Asilomar
Conference, 1998:253~257
70 K. Wu, A. D. Narasimhalu. Identifying Faces Using Multiple Retrievals. IEEE Multimedia,
Procedings of the first Asian fuzzy systems symposium. Singapore, 93:38~40
71 Y. A. Aslandogan, C. T. Yu. Techniques and systems for image and video retrieval. IEEE
Transactions on Knowledge and Data Engineering, 1999, 11(1):56~63
72 V. N. Gudivada, V. V. Raghavan. Content based image retrieval systems. IEEE Computer,
9/95, 28(9):18~22
73 J. R. Smith. Image retrieval evaluation. IEEE Workshop on Content-Based Access of Image
and Video Libraries, 1998:112~113
74 陈韶斌, 丁明跃, 周成平等. 一个图像数据库检索系统的结构设计和快速检索方法. 计
算机与数字工程, 2001, 29(3):34~40
75 V. E. Ogle, M. Stonebraker. Chabot: Retrieval From A Relational Database Of Images.
IEEE computer, 1995, 28(9):40~48
76 章毓晋. 基于内容图像检索研究的进展. 中国学术期刊文摘, 1999, 5(2):269~270
77 J.R. Smith and S. F. Chang. VisualSEEk: A Fully Automated Content-based Image Query
System. In Proc. The Fourth ACM International Multimedia Conference, 1996, 87~98
78 J R Bach, C. Fuller, et. al., The Virage Image Search Engine: An Open Framework for
Image Management. In Proc. SPIE: Storage and Retrieval for Still Image and Video
Databases, 1996, (4):76~87
79 R. W. Piccard, A. Pentland and S. Sclaroff. Photobook: Content-based Manipulation of
Image Databases. International Journal of Computer Vision, 1996, 18(3):233~254.
80 R. W. Picard. A Society of Models for Video and Image Libraries. IBM SYSTEMS
JOURNAL, 1996, 35(3&4):292~312
81 J. R. Smith and S. F. Chang. VisualSEEk: A Fully Automated Contentbased Image Query
System. In Proc. The Fourth ACM International Multimedia Conference, 1996, 87~98
82 Ma, W.Y., and Manjunath, B.S., NETRA: A Toolbox for Navigating Large Image Databases,
In Proc. IEEE International Conference on Image Processing, Santa Barbara, California,
1997, 1:568~571
83 J R Bach, C Fuller, et. al., The Virage Image Search Engine: An Open Framework for Image
Management. In Proc. SPIE: Storage and Retrieval for Still Image and Video Databases,
1996, (4):76~87
84 朱兴全, 张宏江, 刘文印等. iF ind:一个结合语义和视觉特征的图像相关反馈检索系统.
计算机学报, 2002, 25(7):681~688
85 段立娟, 高文, 马继勇. Rich Get Richer—图像检索中的一种自适应的相关反馈方法.
- 81 -
北京工业大学工学硕士论文
计算机研究与发展, 2001, 38(8):960~965
86 柴兴无, 乔京成, 陈芸生. 对多媒体数据库管理系统的层次结构分析. 计算机科学,
1995, 22(1)
87 宋磊. 视频监控若干关键技术的初步研究. 北京工业大学硕士论文. 2003:50~52
88 M. K. Mandal, T. Aboulnass, S. Panchanathan. Fast Wavelet Histogram Techniques for
Image Indexing. Journal of Electronic Imaging, 1998, 75(1/2):99~110
89 S. K. Chang. Image Information Systems: Where Do We Go from Here. IEEE Trans on
Know ledge and Data Engineering, 1992, (5) : 431~441
90 张磊, 林福宗, 张钹. 基于前向神经网络的图像检索相关反馈算法设计. 计算机学报, 2002,
25(7):673~680
91 石艳霞. 信息检索中“相关性”与“相关反馈”研究概述. 晋图学刊, 2002, (2):13~15
92 Y. Rui, T. S. Huang. RelevanceFeedback: APower Tool for Interactive Content-Based Image Retrieval.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998,
8(5):644~654
93 Y. Rui and T. S. Huang. A Novel Relevance Feedback Technique in Image Retrieval. In
Procedings of the 7th ACM International Conference on Multimedia. ACM press,
1999:67~70
94 Y. Rui, T. S. Huang, and S. Mehrotra. Content-based Image Retrieval with Relevance
Feedback in MARS. In Proc. IEEE Int. Conf. Image Processsig, 1997:815~818
95 K. Porkaew, M. Ortega, S. Mehrota. Query Reformulation for Content Based Multimedia
Retrieval in MARS. IEEE Int. Conf. Multimedia Computing and Systems, 1999, 2:747~751
96 苏中, 张宏江, 马少平. 基于贝叶斯分类器的图像检索相关反馈算法. 软件学报, 2002,
13(10):2001~2006
97 Y. M. Wu and A. D. Zhang. A Feature Re-weighting Approach for Relevance Feedback in
Image Retrieval. In Proceedings of International Conference on Image Processing, 2002, 2:
581~584
98 朱旭娟, 李晓华, 沈兰荪. 一种自适应提取最优特征维的相关反馈算法. 电路与系统学
报, 2004, 9(1):36~40
99 I. Kamel, C. Faloutsos. Hilbert R-tree: An Improved R-Tree Using Fractals[A]. Proc. 20th
Int . Conf. on Very Large Databases[C]. 1994:500~509