Peak Finder Metaserver - a novel application for finding peaks in ChIP-seq data
详细信息    查看全文
  • 作者:Marcin Kruczyk (5) (6)
    Husen M Umer (5)
    Stefan Enroth (5)
    Jan Komorowski (5) (6)
  • 关键词:Transcription factor ; Peak finder ; ChIP ; seq ; Metaserver
  • 刊名:BMC Bioinformatics
  • 出版年:2013
  • 出版时间:December 2013
  • 年:2013
  • 卷:14
  • 期:1
  • 全文大小:214 KB
  • 参考文献:1. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. / Nat Methods 2008,5(7):621鈥?28. CrossRef
    2. Qin ZS, Yu J, Shen J, Maher CA, Hu M, Kalyana-Sundaram S, Yu J, Chinnaiyan AM: HPeak: an HMM-based algorithm for defining read-enriched regions in ChIP-Seq data. / BMC Bioinformatics 2010, 11:369. CrossRef
    3. Fejes AP, Robertson G, Bilenky M, Varhol R, Bainbridge M, Jones SJ: FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology. / Bioinformatics 2008,24(15):1729鈥?730. CrossRef
    4. Pepke S, Wold B, Mortazavi A: Computation for ChIP-seq and RNA-seq studies. / Nat Methods 2009, 6:S22-S32. CrossRef
    5. Bujnicki JM: Protein-structure prediction by recombination of fragments. / Chembiochem 2005, 7:19鈥?7. CrossRef
    6. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nussbaum C, Myers RM, Brown M, Li W, / et al.: Model-based analysis of ChIP-Seq (MACS). / Genome Biol 2008,9(9):R137. CrossRef
    7. Ji H, Jiang H, Ma W, Johnson DS, Myers RM, Wong WH: An integrated software system for analyzing ChIP-chip and ChIP-seq data. / Nat Biotechnol 2008,26(11):1293鈥?300. CrossRef
    8. Jothi R, Cuddapah S, Barski A, Cui K, Zhao K: Genome-wide identification of in vivo protein鈥揇NA binding sites from ChIP-Seq data. / Nucleic Acids Res 2008,36(16):5221鈥?231. CrossRef
    9. Wang X, Zhang X: Pinpointing transcription factor binding sites from ChIP-seq data with SeqSite. / BMC Syst Biol 2011,5(Suppl 2):S3. CrossRef
    10. Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. / Bioinformatics 2010,26(6):841鈥?42. CrossRef
    11. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, / et al.: The sequence alignment/map format and SAMtools. / Bioinformatics 2009,25(16):2078鈥?079. CrossRef
    12. Rye MB, S忙trom P, Drabl酶s F: A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs. / Nucleic Acids Res 2011,39(4):e25-e25. CrossRef
    13. Polman JAE, Welten JE, Bosch DS, de Jonge RT, Balog J, van der Maarel SM, de Kloet ER, Datson NA: A genome-wide signature of glucocorticoid receptor binding in neuronal PC12 cells. / BMC Neurosci 2012, 13:118. CrossRef
  • 作者单位:Marcin Kruczyk (5) (6)
    Husen M Umer (5)
    Stefan Enroth (5)
    Jan Komorowski (5) (6)

    5. Department of Immunology, Genetics and Pathology, SciLifeLab Uppsala, Rudbeck Laboratory, Uppsala University, SE, 751 85, Uppsala, Sweden
    6. Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw, Pawi艅skiego 5a Street, 02-106, Warszawa, Poland
  • ISSN:1471-2105
文摘
Background Finding peaks in ChIP-seq is an important process in biological inference. In some cases, such as positioning nucleosomes with specific histone modifications or finding transcription factor binding specificities, the precision of the detected peak plays a significant role. There are several applications for finding peaks (called peak finders) based on different algorithms (e.g. MACS, Erange and HPeak). Benchmark studies have shown that the existing peak finders identify different peaks for the same dataset and it is not known which one is the most accurate. We present the first meta-server called Peak Finder MetaServer (PFMS) that collects results from several peak finders and produces consensus peaks. Our application accepts three standard ChIP-seq data formats: BED, BAM, and SAM. Results Sensitivity and specificity of seven widely used peak finders were examined. For the experiments we used three previously studied Transcription Factors (TF) ChIP-seq datasets and identified three of the selected peak finders that returned results with high specificity and very good sensitivity compared to the remaining four. We also ran PFMS using the three selected peak finders on the same TF datasets and achieved higher specificity and sensitivity than the peak finders individually. Conclusions We show that combining outputs from up to seven peak finders yields better results than individual peak finders. In addition, three of the seven peak finders outperform the remaining four, and running PFMS with these three returns even more accurate results. Another added value of PFMS is a separate report of the peaks returned by each of the included peak finders.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700