A Noise Removal Algorithm for Time Series Microarray Data
详细信息    查看全文
  • 作者:Naresh Doni Jayavelu (22)
    Nadav Bar (22)
  • 关键词:Microarray time series data ; Noise ; Smoothing ; Fourier ; Network component analysis ; Principal component analysis ; clustering
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2013
  • 出版时间:2013
  • 年:2013
  • 卷:8154
  • 期:1
  • 页码:163-174
  • 全文大小:273KB
  • 参考文献:1. Bar-Joseph, Z., Gitter, A., Simon, I.: Studying and modelling dynamic biological processes using time-series gene expression data. Nat. Rev. Genet.聽13(8), 552鈥?64 (2012) CrossRef
    2. D鈥檋aeseleer, P.: How does gene expression clustering work. Nat. Biotech.聽23(12), 1499鈥?501 (2005), http://dx.doi.org/10.1038/nbt1205-1499 CrossRef
    3. Guo, Y.S., Cheng, J.Z., Jin, G.F., Gutkind, J.S., Hellmich, M.R., Townsend, C.M.: Gastrin stimulates cyclooxygenase-2 expression in intestinal epithelial cells through multiple signaling pathways: evidence for involvement of erk5 kinase transactivation of the epidermal growth factor. Journal of Biological Chemistry聽277(50), 48755鈥?8763 (2002) CrossRef
    4. Hardin, J., Wilson, J.: A note on oligonucleotide expression values not being normally distributed. Biostatistics聽10(3), 446鈥?50 (2009), http://biostatistics.oxfordjournals.org/content/10/3/446.abstract CrossRef
    5. Lewin, A., Bochkina, N., Richardson, S.: Fully bayesian mixture model for differential gene expression: simulations and model checks. Statistical Applications in Genetics and Molecular Biology聽6 (2007)
    6. Liao, J.C., Boscolo, R., Yang, Y.L., Tran, L.M., Sabatti, C., Roychowdhury, V.P.: Network component analysis: reconstruction of regulatory signals in biological systems. Proc. Natl. Acad. Sci. U S A聽100(26), 15522鈥?5527 (2003) CrossRef
    7. Posekany, A., Felsenstein, K., Sykacek, P.: Biological assessment of robust noise models in microarray data analysis. Bioinformatics (2011), http://bioinformatics.oxfordjournals.org/content/early/2011/01/19/bioinformatics.btr018.abstract
    8. Raychaudhuri, S., Stuart, J.M., Altman, R.B.: Principal components analysis to summarize microarray experiments: Application to sporulation time series. In: Pac. Symp. Biocomput., pp. 452鈥?63 (2000)
    9. Ringner, M.: What is principal component analysis. Nat. Biotech.聽26(3), 303鈥?04 (2008), http://dx.doi.org/10.1038/nbt0308-303 CrossRef
    10. Rousseeuw, P.J.: Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics聽20, 53鈥?5 (1987), http://www.sciencedirect.com/science/article/pii/0377042787901257 CrossRef
    11. Sloutsky, R., Jimenez, N., Swamidass, S.J., Naegle, K.M.: Accounting for noise when clustering biological data. Brief Bioinform. (October 2012), http://dx.doi.org/10.1093/bib/bbs057
    12. Subramaniam, D., Ramalingam, S., May, R., Dieckgraefe, B.K., Berg, D.E., Pothoulakis, C., Houchen, C.W., Wang, T.C., Anant, S.: Gastrin-mediated interleukin-8 and cyclooxygenase-2 gene expression: Differential transcriptional and posttranscriptional mechanisms. Gastroenterology聽134(4), 1070鈥?082 (2008) CrossRef
    13. Tang, V., Yan, H.: Noise reduction in microarray gene expression data based on spectral analysis. International Journal of Machine Learning and Cybernetics聽3, 51鈥?7 (2012), http://dx.doi.org/10.1007/s13042-011-0039-7 CrossRef
    14. Tu, Y., Stolovitzky, G., Klein, U.: Quantitative noise analysis for gene expression microarray experiments. Proceedings of the National Academy of Sciences聽99(22), 14031鈥?4036 (2002), http://www.pnas.org/content/99/22/14031.abstract CrossRef
    15. Warren Liao, T.: Clustering of time series data-a survey. Pattern Recogn.聽38(11), 1857鈥?874 (2005), http://dx.doi.org/10.1016/j.patcog.2005.01.025 CrossRef
  • 作者单位:Naresh Doni Jayavelu (22)
    Nadav Bar (22)

    22. Systems Biology Group, Department of Chemical Engineering, Norwegian University of Science and Technology (NTNU), Trondheim, NO-7491, Norway
  • ISSN:1611-3349
文摘
High-throughput technologies such as microarray data are a great resource for studying and understanding biological systems at a low cost. However noise present in the data makes it less reliable, and thus many computational methods and algorithms have been developed for removing the noise. We propose a novel noise removal algorithm based on Fourier transform functions. The algorithm optimizes the coefficients of the first and second order Fourier functions and selects the function which maximizes the Spearman correlation to the original data. To demonstrate the performance of this algorithm we compare the prediction accuracy of well known modelling tools, such as network component analysis (NCA), principal component analysis (PCA) and k-means clustering. We compared the performance of these tools on the original noisy data and the data treated with the algorithm. We performed the comparison analysis using three independent real biological data sets (each data set with two replicates). In all cases the proposed algorithm removes the noise in the data and substantially improves the predictions of modelling tools.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700