用户名: 密码: 验证码:
复杂凝聚态体系中近红外光谱信号的信息提取及定量表示
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
本论文采用二甲苯三种同分异构体混合物样品的近红外光谱数据,对近红外光谱模型建立中的信息提取和信息量的定量表示进行了系统性研究。该研究对近红外预测模型预测准确度的提高、近红外光谱中所含信息量的增强和信息量的定量表示具有重要的指导意义和应用价值。
     本论文对比研究了近红外光谱信息提取中的几种连续波段的选择方法,指出了多重共线性对预测模型精确度的严重危害性。首次提出在同分异构体组分混合物的近红外光谱数据处理中使用差谱技术选择光谱波段,可以部分克服多重共线性问题,提高模型的预测准确度;特别对相关系数法进行了系统性研究,指出了该方法的优缺点,首次指明了相关系数与近红外光谱数据体系中经过PCA提取出来的主要的主成分载荷之间的相似关系,对主成分物理意义的解释有一定的指导意义。
     首次深入研究了互相关分析方法在近红外光谱信息提取中的作用,理论推导出了近红外光谱信号经过互相关运算后,在一定的条件下变换后的光谱信号强度和目标组分的浓度呈线性关系,经过互相关分析后目标组分的光谱质量得到了提升;首次将互相关算法应用到近红外光谱模型的传递中,通过对不同类型近红外光谱仪器采集到的同一种混合物体系的近红外光谱进行互相关分析,突显出了不同光谱仪器间存在的固有差别,发现通过补偿的办法可以弥补这个差别,从而起到模型传递的作用。
     首次将信息论的部分概念和观点引入到近红外光谱的分析领域,从信息论的角度,重新审视了近红外光谱的数据结构;首次采用组分熵对近红外光谱数据的重叠程度进行了定量表示,在对高度重叠的二甲苯三种同分异构体的近红外光谱数据的分析中取得了良好的效果;在改进多组分数据体系经PCA分解后的信息量增益公式的基础上,首次提出一种加权重的熵代数来定量表示多组分近红外光谱中所含有的信息量,并采用该代数来评价近红外光谱预处理操作对数据的变换品质,通过实验分析证实了该代数可以有效地表征近红外光谱数据中的信息增益,清晰地区分近红外光谱预处理对最终预测结果准确度的好坏影响。
This thesis has put the emphases on the information extraction and quantitative expression during the model building of near infrared spectrum using the spectral data sets of the dimethyl benzene’isomer. The research work may be important to the application and accuracy development of the NIR model.
     In this thesis several kinds of wavelength selection methods, including Interval Partial Least-Squares Regression and Moving Window Partial Least-Square Regression, have been studied and compared. Both their merits and demerits have been pointed out. Specially, the method based on the correlation with concentration was carefully studied. The similarity between the PCs and the correlation coefficient was pointed out, and this will help us to better understand the meaning of PCs. The subtractive spectra technology was used to partly overcome the multicollinearity of the NIR spectrum data sets by wavelength selection.
     The Cross-Correlation algorithm was introduced as spectral pretreatment in NIR field. The algorithm gave a substantial improvement of the performance when the NIR spectra with complex backgrounds and big noise. What is more, the Cross-Correlation algorithm was also applied to calibration transfer of the NIR spectra, and a good result was gotten.
     The information theory was also introduced for analysis spectral data structure in the NIR field. The component entropy was used to evaluate the degree of overlapping of NIR spectra, and it gave a powerful solution for this problem. A new algorithm, named weighted Shannon’s entropy, stemming from the Shannon’s information, is used to assess the criteria for NIR spectral data optimization. The analysis results show that the quantitative analysis gives an accurate result and the algorithm works very well for assessing a right criterion for the NIR data pretreatment. Key words:
引文
[1] Nils Bo Büchmann, Henrik Josefsson,and Ian A Cowe. Performance of European artificial neural network(ANN) calibration for moisture and protein in cereals using the Danish near-infrared transmission(NIT) network.Cereal Chem,2001,78(5):572~577
    [2] 陈斌,王豪,林松等.基于相关系数法与遗传算的啤酒酒精度近红外光谱分析.农业工程学报,2005,21(7)99~102
    [3] Michael J.McShane,Gerard L. Coté, et al .Assessment of partial least-squares calibration and wavelength selection for complex near-infrared spectra.Applied Spectroscopy,1998,52(6):878~884
    [4] Michael J. McShane,Gerard L. Coté,et al. Spiegelman, Variable selection in multivariate calibration of a spectroscopic glucose sensor.1997,51(10),1559~1564
    [5] Jouan-Rimbaud, Delphine. Massart, Desire-Luc. Leardi, Riccardo.et al. Genetic Algorithms as a Tool for Wavelength Selection in Multivariate Calibration.Analytical Chemistry,1995, 67(23), 4295~301
    [6] C.B. Lucasius and G. Kateman. Genetic algorithms for large-scale optimization in chemometrics: An application. TrAC Trends in Analytical Chemistry,1991,10(8):254~261
    [7] Leardi, Riccardo.González, Amparo Lupiá?ez. Genetic algorithms applied to feature selection in PLS regression: how and when to use them.Chemometrics and Intelligent Laboratory Systems,1998,41(2),195~207
    [8] L. N?rgaard, A. Saudland, J. Wagner,et al .Interval Partial Least Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy.Applied Spectroscopy,2000,54(3):413~419
    [9] Jiang, J.-H.; Berry, R. J.; Siesler,,H. W.et al. Wavelength Interval Selection in Multicomponent Spectral Analysis by Moving Window Partial Least-Squares Regression with Applications to Mid-Infrared and Near-Infrared Spectroscopic Data. Anal. Chem,2002,74(14):3555~3565
    [10] R. Todeschini,D. Galvagni. Kohonen artificial neural networks as a tool for wavelength selection in multicomponent spectrofluorimetric PLS modeling: application to phenol, o-cresol,m-cresol and p-cresol mixture.Trends in Analytical Chemistry,1999,18(2):93~98
    [11] 王惠文.偏最小二乘回归方法及其应用.北京:国防工业出版社,1999:67~70
    [12] 梁逸曾,俞汝勤.化学计量学.北京:高等教育出版社,2003:135
    [13]G. M. Hieftje, R.I. Bystroff, and R. Lim . Application of Correlation Analysis for Signal-to-Noise Enhancement in Flame Spectrometry . Analytical Chemistry , 1973 : 45(2),253~258
    [14] WF Bryant, M. Trivedi, B. Hinchman. Data-blocking cross-correlation peak detection in computerized gas chromatography-mass spectrometry. Analytical Chemistry,1980:52(1),38~43
    [15] D.C. Villalanti, M. F. Burke, and J. B. Phillips. Frequency modulated correlation chromatography. Analytical Chemistry,1979:51, 2222~2225
    [16] 刘青格,陈斌.基于互相关分析技术的近红外光谱信息特征的提取.农业机械学报,2003:34(3):79~81
    [17] 蔡长年,汪润生.信息论.北京:人民邮电出版社,1962
    [18] Karel Eckschlager and Klaus Danzer. Information Theory in Analytical Chemistry.New York: John Wiley & Sons, INC,(1994)
    [19] Kilg, R; Smit,H.C. The quality of quantitation in chromatography Part I. Information theoretical considerations. Analytica Chimica Acta, 1996,333, 3~10
    [20] Geert van Marlen. Information theory applied to selection of peaks for retrieval of mass spectra.Analytical Chemistry, 1976,48(3):595~598
    [21] F.H. Heite,P. F. Dupuis,H.A. et al. Numerical taxonomy and information theory applied to feature selection from filed infrared spectra for automated interpretation. Analytica Chimica Acta.1978,103(4): 313~321
    [22] P. C. Thijssen and G. Kateman.Optimal designs with information theory in least-squares problems. Analytica Chimica Acta, 1984,157,99~115
    [23] B?rd Buttingsrud, Bj?rn K Alsberg. A new maximum entropy-based method for deconvolution of spectra with heteroscedastic noise. Journal of Chemometrics, 2004,18(12): 537~547
    [1] Herschel, W.Investigation of the Powers of the prismatic Colours to heat and illuminate Objects; with Remarks that ... with Telescopes of large Apertures and high magnifying Powers.Philosophical Transactions,1800,90:255~326
    [2] 郑一善.分子光谱学导论.上海:上海科学技术出版社,1963
    [3] G.赫兹堡.分子光谱与分子结构,第一卷(王鼎昌译).科学出版社,1983.8
    [4] 严衍禄,赵龙莲,韩东海等.近红外光谱分析基础与应用.北京:中国轻工业出版社,2005
    [5] 陆婉珍,袁洪福,徐广通等.现代近红外光谱分析技术.北京:中国石化出版社,2000.
    [6] Kaye W. Near infrared spectroscopy- a review. Spectrochim Acta 1954,(6):257~287
    [7] Garrigues S, Andrade J M . Multivariate calibrations in Fourier transform infrared spectrometry for prediction of kerosene properties. Anal Chim Acta,1995,317:95~105
    [8] Kelly, Jeffrey J. and James B Callis. Nondestructive Analytical Procedure for Simultaneous Estimation of the Major Classes of Hydrocarbon Constituents of Finished Gasolines. Analytical Chemistry,1990.62(14): 1444~1451
    [9] 吉海彦,严衍禄.在国产近红外光谱仪实验样机用偏最小二乘法定量分析大麦的成分.分析化学,198(5):607~611
    [10] 黄岚,丁海曙,王广志.用近红外光漫反射光谱无创检测血糖的初步研究.光谱学与光谱分析,2002,22(3)387~397
    [11] Bin Chen, Xi-guang Fu ,Dao-li Lu. Improvement of predicting precision of oil content in instant noodles by using wavelet transforms to treat near-infrared spectroscopy. Journal of food engineering,2002,53:373~376
    [12] 许禄,邵学广.化学计量学方法.北京:科学出版社,2004
    [13] 倪永年.化学计量学在分析化学中的应用.北京,科学出版社,2004
    [14] 梁逸曾,俞汝勤.化学计量学.北京,高等教育出版社,2003.5
    [15] A.Savitzky and M.J.E. Golay.Smoothing and differentiation of data by simplified least squares procedures. Anal.Chem,1964,36:1627~1639
    [16] Demetriads-Shah, T. Steven M. and Clark, J. Remote Sens. Environ.1990,33:35
    [17] Rundquist, D. Han, L. Schalles J.et al.Photogramm. Eng. Remote Sens,1996,62(2):195
    [18] Michael P.; Szczepanski and Augustus W. FountainⅢ. Use of Multivariate Analysis to Determine Temperature from Low Resolution Infrared Spectra of Carbon Dioxide. Applied Spectroscopy,54,197(2000)]
    [19] Mandel J.American Statistican.1982,36:15
    [20] Colub G.H,Van Loan F.Matrix Computation.Baltimore (MD):John Hopkine UniversityPress,1983
    [21] D.M. Allen. The relationship between variable selection and data augmentation and a method for prediction. Technometrics, 1974, 16(1):125~127,
    [1] 王惠文.偏最小二乘回归方法及其应用.北京:国防工业出版社,1999:67~70
    [2] 梁逸曾,俞汝勤.化学计量学.北京:高等教育出版社,2003:135
    [3] 李政军,黄伟,林华山等.傅立叶红外差谱技术在复合材料分析中的应用.光谱实验室,2003:20(4)509~512
    [4] 马晓国,张展霞,钱浩雯.计算机差谱法消除 ICP-AES 中光谱干扰.光谱学与光谱分析,1994:14(4)55~62
    [5] 武中臣,徐晓轩,张静等.采用近红外差谱法选择信息区提高同分异构体预测模型的准确度. 光谱实验室, 2005,22(1):38~41
    [6] Michael J.McShane,Gerard L. Coté, et al .Assessment of partial least-squares calibration and wavelength selection for complex near-infrared spectra.Applied Spectroscopy,1998,52(6): 878~884
    [7] Michael J. McShane,,Gerard L. Coté,,et al. Spiegelman, Variable selection in multivariate calibration of a spectroscopic glucose sensor.1997,51(10),1559~1564
    [8] Jouan-Rimbaud, D.; Walczak, B.; Massart, D.L.; Comparison of multivariate methods based on latent vectors and methods based on wavelength selection for the analysis of near-infrared spectroscopic data.Analytica Chimica Acta.1995,304(3):285~295
    [9] 武中臣,徐晓轩,杨仁杰等.提高二甲苯三种同分异构体预测模型准确度的相关系数法. 光谱学与光谱分析,2005,25(1):136~140
    [10] 陆婉珍,袁洪福,徐广通等.现代近红外光谱分析技术.北京:中国石化出版社,2000.
    [11] 冯新泸,史永刚 .近红外光谱及其在石油产品分析中的应用.北京:中国石化出版社,2002,179
    [12] 徐永群,孙素琴,许锦文 .红外指纹图谱库与阵列相关系数法快速鉴别中药材.光谱实验室,2002;19(5):606~610
    [13] 胡伟,唐婉莹,周申范等.色谱流动相的相关系数法聚类分析.分析化学,1997;25(5):614
    [14] 陈斌,王豪,林松等.基于相关系数法与遗传算的啤酒酒精度近红外光谱分析.农业工程学报,2005,21(7)99~102
    [15] 武中臣,徐晓轩,杨仁杰,俞钢,张存洲.采用主成分分析结合相关系数法提高苯及其同系物含量预测结果的准确度.光谱学与光谱分析,2004,24(12):1566~1570
    [16] L. N?rgaard,A. Saudland, J. Wagner,,et al .Interval Partial Least Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy.Applied Spectroscopy,2000,54(3):413~419
    [17] M. Forina; C. Casolino and C.P. Millan. Iterative predictor weighting (IPW) PLS: a technique for the elimination of useless predictors in regression problems.J.Chemom,1999, 13(2):165~184
    [18] Da Chen; Bin Hu; Xueguang Shao et al.Variable selection by modified IPW (iterative predictor weighting)-PLS (partial least squares) in continuous wavelet regression models Analyst,2004:129,664~669
    [19] Jiang, J.-H.; Berry, R. J.;Siesler,H. W.et al. Wavelength Interval Selection in Multicomponent Spectral Analysis by Moving Window Partial Least-Squares Regression with Applications to Mid-Infrared and Near-Infrared Spectroscopic Data. Anal. Chem,2002,74(14):3555~3565
    [1]G. M. Hieftje, R.I. Bystroff, and R. Lim . Application of Correlation Analysis for Signal-to-Noise Enhancement in Flame Spectrometry. Analytical Chemistry,1973: 45(2),253~258
    [2] WF Bryant, M. Trivedi, B. Hinchman.Data-blocking cross-correlation peak detection in computerized gas chromatography-mass spectrometry. Analytical Chemistry,1980:52(1),38~43
    [3] D.C. Villalanti, M. F. Burke, and J. B. Phillips. Frequency modulated correlation chromatography. Analytical Chemistry,1979:51,2222~2225
    [4] Gary Horlick.Detection of Spectral information Utilizing Cross-Correlation Techniques. Analytical Chemistry,1973:45(2)319~324
    [5]C. K. Mann, J. R. Goleniewski and C. A. Sismanidis.Spectrophotometric Analysis by Cross-Correlation, Applied Spectroscopy,1982:36(3)223~227
    [6] 刘青格,陈斌. 基于互相关分析技术的近红外光谱信息特征的提取.农业机械学报,2003:34(3):79~81
    [7] 武中臣,徐晓轩,张静等.互相关分析在近红外光谱预处理中的应用. 光谱实验室, 2005,22(1):126~129
    [8] 褚小立,袁洪福,陆婉珍.光谱多元校正中地模型传递.光谱学与光谱分析,2001,21(6):881~885
    [9] 王艳斌,袁洪福,陆婉珍.一种基于目标因子分析地模型传递方法.光谱学与光谱分析,2005:25(3):398~401
    [10] 褚小立,袁洪福,陆婉珍.普鲁克分析用于近红外光谱仪地分析模型传递.分析化学仪器装置与实验技术,2002,30(1):114~119
    [11] Sj?blom, Jonas; Svensson, Olof; Josefson, Mats.An evaluation of orthogonal signal correction applied to calibration transfer of near infrared spectra.Chemometrics and Intelligent Laboratory Systems,1998,44:229~244
    [12] 武中臣,徐晓轩,杨仁杰等.互相关分析在近红外光谱模型传递中的应用. 光谱学与光谱分析,2005,25(12):1975~1977
    [1] 蔡长年,汪润生.信息论.北京:人民邮电出版社,1962
    [2] 维纳.控制论(郝季仁译).北京:科学出版社,1962
    [3] 梁逸曾,俞汝勤.化学计量学.北京:高等教育出版社,2003.5
    [4] Danzer, K. Zur Anwendung. Informations theoretischer Grundlagen in der Analytik. Habilitation thesis. Technical University Chemnitz,1979
    [5] C.E.Shanno. A Mathematical Theory of Communication. Bell Syst.Tech.J,1948,27:379~423
    [6] Shannon,C.E.and Weaver,W. The Mathematical Theory of Communication. University of Illinois Press.Urbana,1949
    [7] Liteanu,C. and Rica,J. Utilization of the amount of information in evaluation of analytical methods..Anal.Chem, 1979,51,1986~1995
    [8] Karel Eckschlager and Klaus Danzer. Information Theory in Analytical Chemistry. New York::John Wiley & Sons,INC,1994
    [9] Karel Eckschlager and Vladimír ?těpánek. Information theory as applied to chemical analysis.New York: a wiley-interscience publication,1979
    [10]K.Eckschlager and V. ?těpánek. Analytical Measurement and Information(advances in the information theoretic approach to chemical analyses). New York: John Wiley & Sons,INC,1985
    [11]Kilg, R.; Smit, H.C. The quality of quantitation in chromatography Part I. Information theoretical considerations. Analytica Chimica Acta, 1996,333: 3~10
    [12]Geert van Marlen..Information theory applied to selection of peaks for retrieval of mass spectra. Analytical Chemistry, 1976,48(3):595~598
    [13] F. H. Heite, P. F. Dupuis, et al. Numerical taxonomy and information theory applied to feature selection from filed infrared spectra for automated interpretation. Analytica Chimica Acta,1978,103(4) :313~321
    [14] P. C. Thijssen and G. Kateman. Optimal designs with information theory in least-squares problems. Analytica Chimica Acta, 1984,157:99~115
    [15] B?rd Buttingsrud, Bj?rn K . Alsberg. A new maximum entropy-based method for deconvolution of spectra with heteroscedastic noise. Journal of Chemometrics, 2004,18(12): 537~547
    [16] Martens H. Multivariate Calibration (2nd edn). Wiley: Chichester, 1989, chap.7.
    [17] I.T. Joliffe. Principle Component Analysis. Springer, New York,1986
    [18] A.Savitzky and M.J.E. Golay.Smoothing and differentiation of data by simplified least squares procedures. Anal.Chem,1964,36:1627~1639
    [19] Willem Windig, Neal B. Gallagher, Jeremy M. Shaver, et al.A new approach for interactive self-modeling mixture analysis.Chemometrics and Intelligent Laboratory Systems, 2005,77:85~96
    [20] Demetriads-Shah, T., Steven M. and Clark, J. Remote Sens. Environ,1990.33:35
    [21] Rundquist, D., Han, L., Schalles J., and Peake, J. Photogramm. Eng. Remote Sens,1996, 62(2):195

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700