A New Method for Denoising Analysis of DNA Sequencing Signals
Abstract
In fluorescence-based DNA sequencing, noise degrades both the accuracy and the detection limit of the analysis. Compared with other filtering methods, wavelet analysis offers good resolution in both the time and frequency domains. In wavelet denoising, the choice of wavelet basis function, denoising threshold, and decomposition level directly determines the quality of the result. To build a realistic noise model and to evaluate the denoising algorithm accurately, a simulated DNA sequencing signal was constructed by adding noise recorded from the actual experimental system to an ideal fluorescence signal. The denoising results show that the sym7 wavelet basis, a decomposition level of 5 (lev=5), and the fixed-form soft threshold effectively remove the noise from the DNA sequencing signal; after processing, the signal-to-noise ratio (SNR) improved by more than a factor of 5. When the same algorithm was applied to experimental DNA electrophoresis fluorescence signals, the denoised signals were more realistic and reliable than those obtained with algorithms based on a random noise model.
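The procedure summarized in the abstract (decompose to 5 levels, soft-threshold the detail coefficients with the fixed-form threshold, reconstruct, then compare SNR against the known ideal signal) can be sketched as follows. This is only an illustration under several assumptions that are not taken from the paper: a Haar wavelet stands in for sym7 so that the sketch needs nothing beyond the Python standard library, the noise level is estimated from the finest detail coefficients with the median absolute deviation, the fixed-form threshold is taken as sigma * sqrt(2 ln N), SNR is expressed in decibels, and pseudo-random Gaussian noise is only a placeholder for the noise recorded from the real instrument. All function names are hypothetical.

```python
import math
import random

def haar_dwt(x):
    """One level of the orthonormal Haar transform (even-length input)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out

def soft_threshold(coeffs, t):
    """Shrink every coefficient toward zero by t; zero those below t."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]

def median(values):
    v = sorted(values)
    mid = len(v) // 2
    return v[mid] if len(v) % 2 else 0.5 * (v[mid - 1] + v[mid])

def denoise(signal, levels=5):
    """Haar stand-in for the sym7 / level-5 / fixed-form soft-threshold scheme."""
    approx, details = list(signal), []
    for _ in range(levels):
        approx, d = haar_dwt(approx)
        details.append(d)          # details[0] is the finest scale
    # Assumption: noise sigma from the finest details (MAD / 0.6745).
    sigma = median([abs(c) for c in details[0]]) / 0.6745
    # Fixed-form (universal) threshold.
    t = sigma * math.sqrt(2.0 * math.log(len(signal)))
    details = [soft_threshold(d, t) for d in details]
    for d in reversed(details):    # rebuild from coarsest to finest
        approx = haar_idwt(approx, d)
    return approx

def snr_db(clean, estimate):
    """SNR in dB of an estimate measured against the known ideal signal."""
    p_signal = sum(c * c for c in clean)
    p_noise = sum((c - e) ** 2 for c, e in zip(clean, estimate))
    return 10.0 * math.log10(p_signal / p_noise)

# Simulated trace: ideal Gaussian peaks plus noise. The paper adds noise
# recorded from the real system; random noise here is only a placeholder.
random.seed(0)
n = 1024                           # divisible by 2**5 for five Haar levels
centers = [200, 450, 600, 850]
ideal = [sum(math.exp(-((i - c) ** 2) / (2 * 12.0 ** 2)) for c in centers)
         for i in range(n)]
noisy = [v + random.gauss(0.0, 0.3) for v in ideal]
denoised = denoise(noisy, levels=5)

print("SNR before (dB):", round(snr_db(ideal, noisy), 2))
print("SNR after  (dB):", round(snr_db(ideal, denoised), 2))
```

Because the ideal signal is known in the simulation, the SNR after denoising can be measured directly, which is the point of building the test signal from an ideal trace plus separately acquired noise. With a wavelet library available, the Haar steps would be replaced by a sym7 decomposition, and the placeholder noise by a recorded noise trace.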
