基于MPEG-21的三维矩阵彩色图像表征、编码与质量评价研究

英文题名：Research on Representation, Coding and Quality Assessment of Color Image Based on 3-D Matrix in MPEG-21
作者：陈强
论文级别：博士
学科专业名称：通信与信息系统
中文关键词：彩色图像 ; 三维矩阵 ; 编码 ; 质量评价 ; MPEG-21
英文关键词：Color Image ; 3-D Matrix ; Coding ; Quality Assessment ; MPEG-21
学位年度：2006
导师：陈贺新
学科代码：081001
学位授予单位：吉林大学
论文提交日期：2006-10-01

摘要

由于彩色图像表示的三帧之间存在着色度冗余,如果不针对彩色图像表示的性质来充分利用色度冗余信息,那么将不会形成更为有效的压缩编码。与传统的彩色图像的压缩方法相比,基于MPEG-21的三维矩阵理论能够将彩色图像用一个统一的数学模型来表示,通过RGB空间到YCbCr空间的颜色空间转换后,采用新的三维子阵联合分割方式,将分块后的两类三维子阵作为一个数字项来考虑,能够与未来多媒体框架标准MPEG-21有很好的兼容性;同时采用线性非均匀标量量化方法,进一步提高了编码效率。仿真实验结果表明该方法的性能要稍优于JPEG标准。利用三维矩阵变换对彩色图像压缩编码的有效性,本文从主观和客观两方面出发,给出了一种来更为符合人眼视觉感知特性的彩色图像质量综合评价方案。
With the development of Internet and multimedia application technologies, color images are used widely than ever, for which can reflect the real world and comply with the perception of human eyes, additionally, the advance of compression and coding technologies for color images is one of key factors. An uncompressed color image will need more space, which will cause more trouble for its transmission over Internet; however, compressed one will need smaller space and can be transmitted conveniently.
It is well known that the fundamental theory is that there is much redundant information within one image according to information theory and generally speaking, the data distribution rule of one image cannot be observed only by human eyes and there is much relevance between image data representation by statistics, such as the position relevance of pixels. For a color image represented by RGB (Red, Green and Blue) color space, there exists not only position relevance for pixels in one frame but also color relevance in the same position of different frames, i.e. redundant information in one color image includes statistic, structural, knowledge and visional redundancy as well as color redundancy, which also indicates the specificity of compression and coding for color image and usually, the method of color space conversion is used firstly to remove some color redundancy and then the compression methods adopted for gray image is transplanted directly.
Because image is two-dimensional data representation and the progress of data compression technologies will also accelerate that of image compression technologies, transforms, such as DCT and wavelet transform, are used to convert image representation from space domain to frequency or transformed domain and correspondingly, many compression methods, such as RLE (Run-Length Encoding), predictive coding, Huffman coding and arithmetic coding will also be

引文

[1] P.Curwen “High-definition Television: A case study of industrial policy versus the market,” European Business Review, 1994, 94(1), pp.17-23
    [2] H.Leopold, A. Campbell and N. Singer “Will B-ISDN services meet the needs of distributed multimedia communications?” Proc. of the 4th IEE Conf. on Telecom., Manchester, UK, 1993, pp.139-145
    [3] L.Porta, T.F. Veeraraghavan et al “B-ISDN: a technological discontinuity,” IEEE Communications Magazine, 1994, 32(10), pp.84-97
    [4] 余松煜等 “现代图像信息压缩技术,” 北京: 科学出版社, 1998, 第1版, pp.28-40
    [5] G.K.Wallace “The JPEG still picture compression standard,” IEEE Trans. on Consumer Electronics,1992, 38(1), pp.18-34
    [6] D.L.Gall “MPEG: A video compression standard for multimedia applications,” Communications of the ACM, 1991, 34(4), pp.46-58
    [7] M.Nelson and J.G.Ailly “The Data Compression Book (2nd Edition),” IDG Books Worldwide, Inc 1995, pp.21-129
    [8] I.Burnett, R.Van de Walle et al “MPEG-21:goals and achievements,” IEEE Multimedia, 2003, pp.60-70
    [9] ISO/IEC JTC1/SC29/WG11 TR 21000-1“Information technology-Multimedia Framework MPEG-21 Part 1: Vision, technologies and strategy,” Nov.2001, pp. 1-32
    [10] D.Salomon著吴乐南等译 “数据压缩原理与应用(第2版),” 北京: 电子工业出版社, 2003.09, pp.21-65
    [11] 高文 “多媒体数据压缩技术,” 北京: 电子工业出版社, 1994.04, 第1版, pp.3-6
    [12] S.Süsstrunk, R.Buckley and S.Swen “Standard RGB color spaces,” Proc. of the Seventh Color Imaging Conference: Color Science, Systems, and Applications, pp. 127-134
    [13] D.Hilbert “What is color vision?” Philosophical Studies, 1992, Vol.68, pp.351-370
    [14] V.R.Algazi and D.J.Sakrison “On the optimality of Karhunen-Loeve expansion,” IEEE Trans. on Information Theory, 1969, pp.319-321
    [15] N.Ahmed, T.Natarajan and K.R.Rao “Discrete cosine transform,” IEEE Trans. on Computer, 1974, Vol.1, pp.90-93
    [16] K.W.Henderson “Some notes on the Walsh functions,” IEEE Trans. on Electronic Computers, 1964, 13(1), pp.50-52
    [17] W.K.Prath, H. C. Andrews and L. R. Welch “Slant transform image coding,” IEEE Trans. on Communications, 1974, 22(8), pp.1075-1093
    [18] J.W.Woods and S.D.O'Neil “Subband coding of images,” IEEE Trans. on Acoustic, Speech and Signal Processing, 1986, 34(5), pp.1278-1288
    [19] F.Parke “Parameterized-models for facial animation,” IEEE Computers Graphics andApplication Magazine, 1982, 2(6), pp.61-68
    [20] R.Forchheimer “Image coding: From waveforms to animation,” IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989, 37(12), pp.2008-2023
    [21] I.Daubechies “Ten lectures on wavelets,” Society for Industrial and applied mathematics, Philadelphia, PA, 1992, pp.21-45
    [22] A.B.Watson, “Image compression using the discrete cosine transform,” Mathematica Journal, 1994,4(1), pp.81-88
    [23] A.S.Lewis and G.Knowles “Image compression using the 2-D wavelet transform,” IEEE Trans. on Image Processing, 1992, 1(2), pp.244-250
    [24] I.Daubechies “The wavelet transform, time-frequency localization and Fourier analysis,” IEEE Trans. on Information Theory, 1990, Vol.36, pp.961-1004
    [25] S.A.Mallat “A theory for multiresolution signal decomposition: the wavelet representation,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 1989, 11(7), pp.674-693
    [26] A.E.Jacquin “Image coding based on a fractal theory of iterated contractive image transformations,” IEEE Trans. on Image Processing, 1992, 1(1), pp.18-30
    [27] ISO/IEC JTC1/SC29/WG11 WD v2.0 21000-2“Information Technology-Multimedia Framework-Part 2: Digital item declaration,” Mar. 2001, pp. 1-77
    [28] 朱艳秋、陈贺新、戴逸松 “彩色图像三维 DCT 变换压缩编码,” 中国图象图形学报, 1997,2(11), pp.795-800
    [29] 魏政刚等 “图像质量评价方法的历史、现状和未来,” 中国图象图形学报, 1998,3(5), pp.236-239
    [30] 周建鹏等 “一种图像质量的感知测量方法,” 中国图象图形学报, 1998,3(3), pp.200-204
    [31] A.Skodras, C.Christopoulos and T.Ebrahimi “The JPEG 2000 still image compression standard,” IEEE Signal Processing Magazine, Sep. 2001, pp.36-58
    [32] S.W.Golomb “Run-length encoding,” IEEE Trans. on Information Theory, 1996,12(3), pp.399-401
    [33] W.Berghorn, T.Boskamp et al “Context conditioning and run-length coding for hybrid, embedded progressive image coding”, IEEE Trans. on Image Processing,2001,10(12), pp.1791-1800
    [34] G.Lakhani “Modified JPEG Huffman coding,” IEEE Trans. on Image Processing, 2003, 12(2), pp.159-169
    [35] J.Ziv and A.Lempel “A universal algorithm for sequential data compression,” IEEE Trans. on Information Theory, 1977, 23(3), pp.337-343
    [36] D.Chevion, E.D.Karnin and A.C.Walach “High efficiency, multiplication free approximation of arithmetic coding,” Proc. of Data Compression Conference, 1991, pp.43-52
    [37] Dai Yang, Hongmei Ai et al “Adaptive Karhunen-Loeve transform for enhancedmultichannel audio coding,” Proc. of SPIE Mathematics of data/image coding, compression, and encryption IV, with applications, 2001,Vol.4475, pp.43-54
    [38] Y.Linde, A.Buzo and R.M.Gray “An algorithm for vector quantizer design,” IEEE Trans on Com., 1980, 28(1), pp.84-95
    [39] 孙圣和、陆哲明 “矢量量化技术及应用” 北京: 科技出版社, 2002.05, 第 1 版, pp.31-49
    [40] Jinhua Yu “Advantages of uniform scalar dead-zone quantization in image coding system,” Proc. of IEEE Int. Conf. on Communications, Circuits and Systems.2004, pp.805-808
    [41] ITU-T Recommendation G.711 “Pulse code modulation (PCM) of voice frequencies,” Nov.1988
    [42] P.J.Burt and E.H.Adelson, “The Laplacian pyramid as a compact image code,” IEEE Trans. on Communications, 1983, pp.532-540
    [43] W.Sweldens “The Lifting Scheme: A custom-design construction of biorthogonal wavelets,” Appl.Comut.Harmon.Appl.,1996, 3(2), pp.186-200
    [44] I.Daubechies and W.Sweldens “Factoring wavelet transforms into lifting steps,” Journal of Fourier Analysis and Application,1998, 5(3), pp.245-267
    [45] A.R.Calderbank, I.Daubechies et al “Wavelet transforms that map integers to intergers,” Applied and Computational Harmonic analysis, 1998, 5(3), pp.332-369
    [46] A.E.Jacquin “Image coding based on a fractal theory of iterated contractive image transformations,” IEEE Trans. on Image Processing, 1992,1(1), pp.18-30
    [47] 李孝安、张晓缋 “神经网络与神经计算机导论” 西安: 西安电子工业出版社, 1994.10, 第 1 版,pp.34-41
    [48] G.K.Wallace “Overview of the JPEG: still image compression standard,” Proc. of the SPIE Conf. on Image Processing Algorithms and Techniques, Vol.1244, pp.220-233
    [49] M.A.Golner, W.B.Mikhael et al “Region based variable quantization for JPEG image compression,” IEEE Midwest Symposium on Circuits and Systems, Lansing,MI,USA, Aug 2000, pp.8-11
    [50] 焦晓、朱光喜、马明罡 “JPEG2000 的编码技术,” 计算机仿真, 2003,20(9), pp.112-114
    [51] M.D.Adams “The JPEG-2000 still image compression standard,” ISO/IEC JTC1/SC 29/WG11 N2412, 2001.09
    [52] M.Charrier, D.S.Cruz and M.Larsson “JPEG2000: the next millennium compression atandard for still images,” Proc. of IEEE Int. Conf. on Multimedia Computing and Systems, 1999, Vol.1, pp.131-132
    [53] MPEG Industry Forum “MPEG-4-the media standard,” www.m4if.org/public/doc uments/vault/m4-out-20027.pdf, Nov. 2002, pp.1-30
    [54] B.L.Tseng, Ching-Yung Lin and J.R.Smith “Using MPEG-7 and MPEG-21 for personalizing video,” IEEE Multimedia, 2004, pp.42-52
    [55] ISO TC46/SC9 Review of MPEG-21 CD 21000-3 “Multimedia Framework-Part 3: Digital item identification and description,” Feb.2002, pp.1-16
    [56] ISO/IEC JTC 1/SC 29/WG 11 FCD “Information Technology-Multimedia Framework-Part 4: Intellectual property management and protection components,” Apr.2005
    [57] ISO/IEC JTC 1/SC 29/WG 11 CD “Information Technology-Multimedia Framework-Part 5: Rights expression language,” Mar.2003, pp.1-61
    [58] ISO/IEC JTC 1/SC 29/WG 11 CD “Information Technology-Multimedia Framework-Part 6: Rights data dictionary,” Jul.2002, pp.1-51
    [59] ISO/IEC JTC 1/SC 29/WG 11 FCD “Information Technology-Multimedia Framework-Part 7: Digital item adaptation,” Jul.2003, pp.1-175
    [60] ISO/IEC JTC 1/SC 29/WG 11 FCD “Information Technology-Multimedia Framework-Part 8: Reference software,” May.2005
    [61] ISO/IEC JTC1/SC29/WG11 ISO/IEC FCD 21000-9 “Information technology-Multimedia Framework-Part 9: File format,” Aug.2004
    [62] ISO/IEC JTC 1/SC 29/WG 11 FCD “Information Technology-Multimedia Framework-Part 10: Digital item processing,” Oct.2004
    [63] ISO/IEC JTC 1/SC 29/WG 11 TR “Information Technology-Multimedia Framework-Part 11: Evaluation tools for persistent association technologies,” Dec.2003
    [64] ISO/IEC JTC 1/SC 29/WG 11 TR 21000-12 “Information Technology-Multimedia Framework-Part 12: Test bed for MPEG-21 resource delivery,” Dec.2003
    [65] ISO/IEC JTC 1/SC 29/WG 11 TR 21000-13 “Information Technology-Multimedia Framework-Part 13: Scalable video coding,” Oct.2004
    [66] ISO/IEC JTC 1/SC 29/WG 11 CD 21000-14 “Information Technology-Multimedia Framework-Part 14: Conformance testing,” Apr.2005, pp.1-9
    [67] ISO/IEC JTC 1/SC 29/WG 11 CD 21000-15 “Information Technology-Multimedia Framework-Part 15: Event reporting,” Oct.2004
    [68] ISO/IEC JTC 1/SC 29/WG 11 FCD 21000-16 “Information Technology-Multimedia Framework-Part 16: Binary format,” Nov.2004
    [69] ISO/IEC JTC 1/SC 29/WG 11 CD 21000-17 “Information Technology-Multimedia Framework-Part 17: Fragment identification of MPEG resources,” Jul. 2005, pp.1-35
    [70] ISO/IEC JTC 1/SC 29/WG 11 CD 21000-18 “Information Technology-Multimedia Framework-Part 18: Digital item streaming,” Oct. 2005, pp.1-48
    [71] R.J.Glushko, J.M.Tenenbaum and B.Meltzer “An XML framework for agent-based E-commerce,” Communications of the ACM, 1999, 42(3), pp.106-114
    [72] Qiang Chen, Hexin Chen et al “The trend to composition of electronic services in MPEG-21,” Proc. of IEEE Int. Conf. on SOLI, 2005, pp.829-832
    [73] Qiang Chen, Hexin Chen et al “Reliable description and interactive management of user preferences in MPEG-21,” Proc. of IEEE Int. Conf. on SOLI, 2006, pp.29-33
    [74] 朱艳秋、陈贺新、戴逸松 “彩色图像三维矩阵变换压缩编码,” 电子学报,1997,25(7), pp.16-21
    [75] 桑爱军、陈贺新 “三维矩阵彩色图像 WDCT 压缩编码,” 电子学报, 2002,30(4), pp.594-597
    [76] 林福宗 “图像文件格式(上)-Windows 编程,” 北京: 清华大学出版社, 1996.12, 第 1版, pp.40-48
    [77] 唐传尧 “图像电子学基础,” 北京: 电子工业出版社., 1995.10, 第 1 版, pp.291-298
    [78] 田玉敏、梁若莹 “计算机彩色输入输出设备常用颜色空间及其转换,” 计算机工程, 2002, 28(9), pp.198-200
    [79] C.Poynton “Color science and color appearance models for CG, HDTV and D-cinema,” SIGGRAPH2004 Course 2, 2004.05, pp.33-38
    [80] K.R.Castleman 著, 朱志刚等译 “Digital Image Processing,” 北京: 电子工业出版社, 2002.07, 第 2 版, pp.473-479
    [81] H.J.Trussell, E.Saber and Michael “Color image processing-basics and special issue overview,” IEEE Signal Processing Magazine, 2005, pp.14-22
    [82] L.A.Rowe “Equiment, facilities and transmission,” Berkeley Multimedia Research Center, University of California at Berkeley, Oct. 2000, pp.6-12
    [83] 刘韶桑爱军等 “基于YC子阵的彩色图像三维矩阵变换压缩编码,” 吉林大学学报(工学版), 2006, 36(4), pp.569-573
    [84] Qiang Chen, Hexin Chen et al “Enabling color image compression by 3-D submatrix integration transform in MPEG-21,” Proc. of IEEE Int. Conf. on Information Acquisition, 2006, Vol.1, pp.262-267
    [85] 桑爱军陈贺新 “基于三维离散余弦变换的彩色图象压缩编码,” 中国图象图形学报 2002, 7(12), pp.1269-1273
    [86] M.Antonini, M.Barlaud et al “Image coding using vector quantization in the wavelet transform domain,” Proc. of IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 1990, 90(4), pp.2297-2300
    [87] A.Gyorgy and T.Linder “Optimal entropy-constrained scalar quantization of a uniform source,” IEEE Trans. on Information Theory, 2000, Vol.46, pp.2704-2711
    [88] G.Lakhani “Improving DC coding models of JPEG arithmetic coder,” IEEE Signal Processing Letters, 2004, pp.505-508
    [89] A.M.Eskicioglu, P.S.Fisher and S.Chen “Image quality measures and their performance,” IEEE Trans. on Communications, 1995, 43(12), pp.2959-2965
    [90] B.Girod “What’s wrong with mean-squared error?” Digital Images and Human Vision. A.B.Watson, Ed. Cambridge, MA: MIT Press, 1993, pp.207-220
    [91] I.Avcibas, B.Sankur and K.Sayood “Statistical evaluation of image quality measures,” Journal of Electronic Imaging, 2002, 11(2), pp.206-223
    [92] Van Dijk.M. and J.B.Martens “Subjective quality assessment of compressed images,” Signal Processing,1997,Vol.58,pp.235-252
    [93] M.P.Eckert and A.P.Bradley, “Perceptual quality metrics applied to still image compression,” Signal Processing, Vol. 70, 1998, pp. 177–200
    [94] ITU-R “Methodology for the subjective assessment of the quality of television pictures,” ITU,1995, BT, pp.500-510
    [95] F.W.Campbell and J.G.Robson “Application of Fourier analysis to the visibility of gratings,” J.Physiol., 1968, Vol.197,pp.551-556
    [96] G.C.Higgins “Image quality criteria,” J. Appl. Photogr. Eng., 1977, 3(2), pp.53-60
    [97] E.M.Granger and J.C.Heurtley “Visual chromaticity-modulation transfer function,” J.Opt.Soc.,1973, Vol.63, pp.1173-1174
    [98] C.Zetzsche and G.Hauske “Multiple channel model for the prediction of subjective image quality,” Proc. of SPIE conference on human vision, visual processing and digital display, 1989,Vol.1077, pp.209-216
    [99] L.G.Roberts “Machine perception of three-dimensional solids,” Optical and Electro-Optical Information Processing,1965, Cambridge, MA, MIT Press, pp.159-197
    [100] P.C.Teo and D.J.Heeger “Perceptual image distortion,” First International Conference on Image Processing, 1994, Vol.2, pp.982–986
    [101] K.T. Mullen “The contrast sensitivity of human color vision to red-green and blue-yellow chromatic gratings,” J.Physiol., 1985, Vol.359, pp.381-400
    [102] C.R.Carlson and R.W.Cohen “A simple psycho-physical model for predicting the visibility of displayed information,” Proc. of Soc, 1980,Inf.Displ.21,pp.229-246
    [103] A.Poirson and B.Wandell “Appearance of colored patterns pattern-color separability,” Journal of the Optical Society of America, 1993, 10(12), pp.2458–2470
    [104] H.L.Task, A.R.Pinkus and J.P.Hornseth “A comparison of several television display image quality measures,” Proc. of Soc. Inf. Displ.19,1978, pp.113-119
    [105] P.G.J.Barten “The effects of picture size and definition on perceived image quality,” IEEE Trans. on Electron, 1989, Vol.36, pp.1865-1869
    [106] Zhou Wang and A.C.Bovik “A universal image quality index,” IEEE Signal Processing Letters, 2002, Vol.9, pp.81–84
    [107] C.R.Carlson “Sine-wave threshold contrast-sensitivity function:dependence on display size,” RCA Rev.1982,Vol.43, pp.675-683
    [108] S.Winkler “Issues in vision modeling for perceptual video quality assessment,” Signal Processing, Vol. 78, 1999, pp.231–252
    [109] R.A.Peters II “A new algorithm for image noise reduction using mathematical morphology,” IEEE Trans. on Image Processing, 1995 Vol.4, No.3, pp.554-568
    [110] A.Vassilev “Contrast sensitivity near borders: significance of test stimulus form, size, and duration,” Vision Research, 1973,Vol.13 pp.719-730
    [111] R.Matrik, M.Petrou and J.Kittler “Error-sensitivity assessment of vision algorithms,” Proc. of IEEE Int. Conf. on Vision Image Signal Processing, 1998,145(2),pp.124-130
    [112] P.G.J.Barten “The SQRI method: a new method for the evaluation of visible resolution on a display,” Proc. of Soc. Inf. 1987,Displ.28, pp.253-262
    [113] E.V.Zee and M.H.W.A.Boesten “The influence of luminance and size on the image quality of complex scenes,” IPO Annual Progress Report (Institute for Perception Research, Eindhoven, Netherlands), 1980, pp.69-75
    [114] M.D.Health and S.Sarkar “A robust visual method for assessing the relative performance of edge-detection algorithms,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 1997, 19(12), pp.1338-1359
    [115] 章毓晋 “图像工程(上册)-图像处理与分析,” 北京: 清华大学出版社, 1999.03, 第 1版, pp.181-186
    [116] R.J.Qian and T.S.Huang “Optimal edge detection in two-dimensional images,” IEEE Trans. on Image Processing, 1996,5(7), pp.1215-1220
    [117] V.Berzins “Accuracy of Laplacian edge detection,” CVGIP,1984,Vol.27, pp.195-210
    [118] 郑南宁 “计算机视觉与模式识别,” 北京: 国防工业出版社, 1998.03, 第 1 版, pp.49-69
    [119] D.Marr and E.Hildreth “Theory of edge detection,” Proc. of the Royal Society, London, 1980, B(207), pp.187-217
    [120] J.Canny “A computational approach to edge detection,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 1986, 8(6), pp.679-698
    [121] D.A.Silverstein and J.E.Farrell “The relationship between image fidelity and image quality,” Proc. of IEEE International Conference on Image Processing, 1996, Vol.11, pp.881-884
    [122] S.Daly “Quantitative performance assessment of an algorithm for the determination of image fidelity,” SID Digest, 1993, pp.317–320
    [123] Zhou Wang, A.C.Bovik and L.Lu “Why is image quality assessment so difficult?” Proc. of IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 2002,Vol.4, pp.3313-3316
    [124] A.D.Fleming et al “Automated assessment of diabetic retinal image quality based on clarity and field definition,” Invest.Ophthalmol.Vis.Sci.47, 2006, pp.1120-1125
    [125] Zhou Wang and A.C.Bovik “Image quality assess: from error visibility to structural similarity,” IEEE Trans. on Image Processing, 2004, 13(4), pp.600-612
    [126] A.B.Watson “DCT quantization matrices visually optimized for individual images,” Proc. of SPIE conference on human vision, visual processing and digital display IV, Bellingham, WA,1993, Vol.1913, pp.202-216
    [127] D.J.Lee “Color space conversion for linear color grading,” Proc. of SPIE Intelligent Robots and Computer Vision, XIX, 2000, Vol.4197, pp.358-366
    [128] Qiang Chen, Hexin Chen et al “Quality assessment of color images based on DSF in MPEG-21,” Proc. of IEEE Int. Conf. on Information Acquisition, 2006, Vol.1, pp.268-273

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700