Putative null distributions corresponding to tests of differential expression in the Golden Spike dataset are intensity dependent
详细信息    查看全文
  • 作者:Daniel P Gaile (1) (2)
    Jeffrey C Miecznikowski (1) (2)
  • 刊名:BMC Genomics
  • 出版年:2007
  • 出版时间:December 2007
  • 年:2007
  • 卷:8
  • 期:1
  • 全文大小:429KB
  • 参考文献:1. Choe SE, Boutros M, Michelson AM, Church GM, Halfon MS: Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset. / Genome Biol 2005,6(2):R16. CrossRef
    2. Dabney AR, Storey JD: A reanalysis of a published Affymetrix GeneChip control dataset. / Genome Biol 2006.,7(3):
    3. Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. / Bioinformatics 2003, 19:185鈥?93. [Evaluation Studies] CrossRef
    4. Schadt EE, Li C, Ellis B, Wong WH: Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data. / J Cell Biochem Suppl 2001, (Suppl 37):120鈥?25.
    5. Irazarry RA, Cope LM, Wu Z: Feature-level exploration of a published Affymetrix GeneChip control dataset. / Genome Biol 2006.,7(8):
    6. Irazarry RA, Wu Z, Jaffee HA: Comparison of Affymetrix GeneChip expression measures. / Bioinformatics 2006,22(7):789鈥?94. CrossRef
    7. Koenker R: [http://www.r-project.org] / quantreg: Quantile Regression 2006. [R package version 3.85]
    8. Koenker RW, D'Orey V: [Algorithm AS 229] Computing Regression Quantiles. / Applied Statistics 1987, 36:383鈥?93. CrossRef
    9. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: Open software development for computational biology and bioinformatics. / Genome Biology 2004, 5:R80. CrossRef
    10. Irizarry R, Wu Z: / SpikeInSubset: Part of Affymetrix's Spike-In Experiment Data 2006. [R package version 1.2.1]
    11. R Development Core Team: [http://www.R-project.org] / R: A language and environment for statistical computing R Foundation for Statistical Computing, Vienna, Austria 2005. [ISBN 3鈥?00051鈥?7鈥?]
    12. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. / Biostatistics 2003,4(2):249鈥?64. CrossRef
    13. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. / Nucleic Acids Res 2003,31(4):e15. CrossRef
    14. Affymetrix: / Microarray Suite User Guide version 5.0 2001. [Affymetrix, Santa Clara, CA]
    15. Irizarry RA, Gautier L, Bolstad BM: / affy: Methods for Affymetrix Oligonucleotide Arrays 2006. [R package version 1.12.2]
    16. Bolstad B: [http://bmbolstad.com] / affyPLM: Methods for fitting probe-level models 2006. [R package version 1.10.0]
    17. Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. / Bioinformatics 2003,19(2):185鈥?93. [Comparative Study] CrossRef
    18. Affymetrix: / Statistical Algorithms Description Document 2002. [Affymetrix, Santa Clara, CA]
    19. Hothorn T: On Exact Rank Tests in R. / R News 2001, 1:11鈥?2.
    20. Hothorn T, Hornik K: / exactRankTests: Exact Distributions for Rank and Permutation Tests 2006. [R package version 0.8鈥?2]
  • 作者单位:Daniel P Gaile (1) (2)
    Jeffrey C Miecznikowski (1) (2)

    1. Department of Biostatistics, University at Buffalo, Buffalo, New York, USA
    2. New York State Center of Excellence in Bioinformatics and Life Sciences, Buffalo, New York, USA
文摘
Background We provide a re-analysis of the Golden Spike dataset, a first generation "spike-in" control microarray dataset. The original analysis of the Golden Spike dataset was presented in a manuscript by Choe et al. and raised questions concerning the performance of several statistical methods for the control of the false discovery rate (across a set of tests for differential expression). These original findings are now in question as it has been reported that the p-values associated with the tests of differential expression for null probesets (i.e., probesets designed to be fold change 1 across the two arms of the experiment) are not uniformly distributed. Two recent publications have speculated as to the reasons the null distributions are non-uniform. A publication by Dabney and Storey concludes that the non-uniform distributions of null p-values are the direct consequence of an experimental design which requires technical replicates to approximate biological replicates. Irizarry et al. identify four characteristics of the feature level data (three related to experimental design and one artifact). Irizarry et al. argue that the four observed characteristics imply that the assumptions common to most pre-processing algorithms are not satisfied and hence the expression measure methodologies considered by Choe et al. are likely to be flawed. Results We replicate and extend the analyses of Dabney and Storey and present our results in the context of a two stage analysis. We provide evidence that the Stage I pre-processing algorithms considered in Dabney and Storey fail to provide expression values that are adequately centered or scaled. Furthermore, we demonstrate that the distributions of the p-values, test statistics, and probabilities associated with the relative locations and variabilities of the Stage II expression values vary with signal intensity. We provide diagnostic plots and a simple logistic regression based test statistic to detect these intensity related defects in the processed data. Conclusion We agree with Dabney and Storey that the null p-values considered in Choe et al. are indeed non-uniform. We also agree with the conclusion that, given current pre-processing technologies, the Golden Spike dataset should not serve as a reference dataset to evaluate false discovery rate controlling methodologies. However, we disagree with the assessment that the non-uniform p-values are merely the byproduct of testing for differential expression under the incorrect assumption that chip data are approximate to biological replicates. Whereas Dabney and Storey attribute the non-uniform p-values to violations of the Stage II model assumptions, we provide evidence that the non-uniformity can be attributed to the failure of the Stage I analyses to correct for systematic biases in the raw data matrix. Although we do not speculate as to the root cause of these systematic biases, the observations made in Irizarry et al. appear to be consistent with our findings. Whereas Irizarry et al. describe the effect of the experimental design on the feature level data, we consider the effect on the underlying multivariate distribution of putative null p-values. We demonstrate that the putative null distributions corresponding to the pre-processing algorithms considered in Choe et al. are all intensity dependent. This dependence serves to invalidate statistical inference based upon standard two sample test statistics. We identify a flaw in the characterization of the appropriate "null" probesets described in Choe et al. and we provide a corrected analysis which reduces (but does not eliminate) the intensity dependent effects.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700