FRESCo: finding regions of excess synonymous constraint in diverse viruses
详细信息    查看全文
  • 作者:Rachel S Sealfon (1) (2)
    Michael F Lin (3)
    Irwin Jungreis (1) (2)
    Maxim Y Wolf (1) (2)
    Manolis Kellis (1) (2)
    Pardis C Sabeti (2) (4)

    1. MIT
    ; Computer Science and Artificial Intelligence Laboratory ; Cambridge ; MA ; 02139 ; USA
    2. Broad Institute
    ; Cambridge ; MA ; 02142 ; USA
    3. DNANexus
    ; Mountain View ; CA ; 94040 ; USA
    4. Department of Organismic and Evolutionary Biology
    ; Harvard University ; Cambridge ; MA ; 02138 ; USA
  • 刊名:Genome Biology
  • 出版年:2015
  • 出版时间:December 2015
  • 年:2015
  • 卷:16
  • 期:1
  • 全文大小:2,692 KB
  • 参考文献:1. Eveleth, DD, Marsh, JL (1987) Overlapping transcription units in Drosophila: sequence and structure of the Cs gene. Mol Gen Genet 209: pp. 290-8 CrossRef
    2. Rogozin, IB, Spiridonov, AN, Sorokin, AV, Wolf, YI, Jordan, IK, Tatusov, RL (2002) Purifying and directional selection in overlapping prokaryotic genes. Trends Genet 18: pp. 228-32 CrossRef
    3. Lin, MF, Kheradpour, P, Washietl, S, Parker, BJ, Pedersen, JS, Kellis, M (2011) Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes. Genome Res 21: pp. 1916-28 CrossRef
    4. Stergachis, AB, Haugen, E, Shafer, A, Fu, W, Vernot, B, Reynolds, A (2013) Exonic transcription factor binding directs codon choice and affects protein evolution. Science 342: pp. 1367-72 CrossRef
    5. Plotkin, J, Kudia, G (2011) Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet 12: pp. 32-42 CrossRef
    6. Kudia, G, Murray, A, Tollervey, D, Plotkin, J (2009) Coding-sequence determinants of gene expression in Escherichia coli. Science 10: pp. 255-8
    7. Grundhoff, A, Sullivan, CS (2011) Virus-encoded microRNAs. Virology 411: pp. 325-43 CrossRef
    8. Siegrist, CA, Durand, B, Emery, P, David, E, Hearing, P, Mach, B (1993) RFX1 is identical to enhancer factor C and functions as a transactivator of the hepatitis B virus enhancer. Mol Cell Biol 13: pp. 6375-84
    9. Mizokami, M, Orito, E, Ohba, K, Ikeo, K, Lau, JY, Gojobori, T (1997) Constrained evolution with respect to gene overlap of hepatitis B virus. J Mol Evol 44: pp. S83-90 CrossRef
    10. Kim, DY, Firth, AE, Atasheva, S, Frolova, EI, Frolov, I (2011) Conservation of a packaging signal and the viral genome RNA packaging mechanism in alphavirus evolution. J Virol 85: pp. 8022-36 CrossRef
    11. Steward, M, Vipond, IB, Millar, NS, Emmerson, PT (1993) RNA editing in Newcastle disease virus. J Gen Virol 74: pp. 2539-47 CrossRef
    12. Xia, X (1996) Maximizing transcription efficiency causes codon usage bias. Genetics 144: pp. 1309-20
    13. Bulmer, M (1987) Coevolution of codon usage and transfer RNA abundance. Nature 325: pp. 728-30 CrossRef
    14. Kimchi-Sarfaty, C, Oh, JM, Kim, I-W, Sauna, ZE, Calcagno, AM, Ambudkar, SV (2007) A 鈥渟ilent鈥?polymorphism in the MDR1 gene changes substrate specificity. Science 315: pp. 525-8 CrossRef
    15. Simmonds, P, Smith, DB (1999) Structural constraints on RNA virus evolution. J Virol 73: pp. 5787-94
    16. Gog, JR, Afonso, EDS, Dalton, RM, Leclercq, I, Tiley, L, Elton, D (2007) Codon conservation in the influenza A virus genome defines RNA packaging signals. Nucleic Acids Res 35: pp. 1897-907 CrossRef
    17. Firth, AE, Atkins, JF (2009) A conserved predicted pseudoknot in the NS2A-encoding sequence of West Nile and Japanese encephalitis flaviviruses suggests NS1鈥?may derive from ribosomal frameshifting. Virol J 6: pp. 14 CrossRef
    18. Mayrose, I, Stern, A, Burdelova, E, Sabo, Y, Laham-Karam, N, Zamostiano, R (2013) Synonymous site conservation in the HIV-1 genome. BMC Evol Biol 13: pp. 164 CrossRef
    19. Firth, AE (2014) Mapping overlapping functional elements embedded within the protein-coding regions of RNA viruses. Nucleic Acids Res 42: pp. 12425-39 CrossRef
    20. Kosakovsky-Pond, S, Frost, S (2005) Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol 22: pp. 1208-22 CrossRef
    21. Kosakovsky-Pond, S, Muse, S (2005) Site-to-site variation of synonymous substitution rates. Mol Biol Evol 22: pp. 2375-85 CrossRef
    22. Kosakovsky-Pond, S, Scheffler, K, Gravenor, M, Poon, A, Frost, S (2010) Evolutionary fingerprinting of genes. Mol Biol Evol 27: pp. 520-36 CrossRef
    23. Sandelin, A, Alkema, W, Engstr枚m, P, Wasserman, WW, Lenhard, B (2004) JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res 32: pp. D91-4 CrossRef
    24. Pond, SLK, Frost, SDW, Muse, SV (2005) HyPhy: hypothesis testing using phylogenies. Bioinformatics 21: pp. 676-9 CrossRef
    25. Clyde, K, Barrera, J, Harris, E (2008) The capsid-coding region hairpin element (cHP) is a critical determinant of dengue virus and West Nile virus RNA synthesis. Virology 379: pp. 314-23 CrossRef
    26. Melian, EB, Hinzman, E, Nagasaki, T, Firth, AE, Wills, NM, Nouwens, AS (2010) NS1鈥?of flaviviruses in the Japanese encephalitis virus serogroup is a product of ribosomal frameshifting and plays a role in viral neuroinvasiveness. J Virol 84: pp. 1641-7 CrossRef
    27. Trevelyan, B, Smallman-Raynor, M, Cliff, AD (2005) The spatial structure of epidemic emergence: geographical aspects of poliomyelitis in north-eastern USA, July-October 1916. J R Stat Soc Ser A Stat Soc 168: pp. 701-22 CrossRef
    28. Goodfellow, I, Chaudhry, Y, Richardson, A, Meredith, J, Almond, JW, Barclay, W (2000) Identification of a cis-acting replication element within the poliovirus coding region. J Virol 74: pp. 4590-600 CrossRef
    29. Han, J-Q, Townsend, HL, Jha, BK, Paranjape, JM, Silverman, RH, Barton, DJ (2007) A phylogenetically conserved RNA structure in the poliovirus open reading frame inhibits the antiviral endoribonuclease RNase L. J Virol 81: pp. 5561-72 CrossRef
    30. Song, Y, Liu, Y, Ward, CB, Mueller, S, Futcher, B, Skiena, S (2012) Identification of two functionally redundant RNA elements in the coding sequence of poliovirus using computer-generated design. Proc Natl Acad Sci U S A 109: pp. 14301-7 CrossRef
    31. Burrill, CP, Westesson, O, Schulte, MB, Strings, VR, Segal, M, Andino, R (2013) Global RNA structure analysis of poliovirus identifies a conserved RNA structure involved in viral replication and infectivity. J Virol 87: pp. 11670-83 CrossRef
    32. Burns, CC, Shaw, J, Campagnoli, R, Jorba, J, Vincent, A, Quay, J (2006) Modulation of poliovirus replicative fitness in HeLa cells by deoptimization of synonymous codon usage in the capsid region. J Virol 80: pp. 3259-72 CrossRef
    33. Mueller, S, Papamichail, D, Coleman, JR, Skiena, S, Wimmer, E (2006) Reduction of the rate of poliovirus protein synthesis through large-scale codon deoptimization causes attenuation of viral virulence by lowering specific infectivity. J Virol 80: pp. 9687-96 CrossRef
    34. Simmonds, P, Welch, J (2006) Frequency and dynamics of recombination within different species of human enteroviruses. J Virol 80: pp. 483-93 CrossRef
    35. Mohan, KV, Atreya, CD (2001) Nucleotide sequence analysis of rotavirus gene 11 from two tissue culture-adapted ATCC strains, RRV and Wa. Virus Genes 23: pp. 321-9 CrossRef
    36. Li, W, Manktelow, E, Kirchbach, JC, Gog, JR, Desselberger, U, Lever, AM (2010) Genomic analysis of codon, sequence and structural conservation with selective biochemical-structure mapping reveals highly conserved and dynamic structures in rotavirus RNAs with potential cis-acting functions. Nucleic Acids Res 38: pp. 7718-35 CrossRef
    37. Belhouchet, M, Mohd Jaafar, F, Firth, AE, Grimes, JM, Mertens, PPC, Attoui, H (2011) Detection of a fourth orbivirus non-structural protein. PLoS One 6: pp. e25697 CrossRef
    38. Firth, AE (2008) Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene. Virol J 5: pp. 48 CrossRef
    39. Dijk, AA, Huismans, H (1988) In vitro transcription and translation of bluetongue virus mRNA. J Gen Virol 69: pp. 573-81 CrossRef
    40. Wade-Evans, AM, Mertens, PP, Belsham, GJ (1992) Sequence of genome segment 9 of bluetongue virus (serotype 1, South Africa) and expression analysis demonstrating that different forms of VP6 are derived from initiation of protein synthesis at two distinct sites. J Gen Virol 73: pp. 3023-6 CrossRef
    41. Chung, BY-W, Miller, WA, Atkins, JF, Firth, AE (2008) An overlapping essential gene in the Potyviridae. Proc Natl Acad Sci U S A 105: pp. 5897-902 CrossRef
    42. Haldeman-Cahill, R, Dar貌s, JA, Carrington, JC (1998) Secondary structures in the capsid protein coding sequence and 3鈥?nontranslated region involved in amplification of the tobacco etch virus genome. J Virol 72: pp. 4072-9
    43. Hofacker, IL, Stadler, PF, Stocsits, RR (2004) Conserved RNA secondary structures in viral genomes: a survey. Bioinformatics 20: pp. 1495-9 CrossRef
    44. Roossinck, MJ (2002) Evolutionary history of cucumber mosaic virus deduced by phylogenetic analyses. J Virol 76: pp. 3382-7 CrossRef
    45. Grubman, MJ, Baxt, B (2004) Foot-and-mouth disease. Clin Microbiol Rev 17: pp. 465-93 CrossRef
    46. Mason, PW, Bezborodova, SV, Henry, TM (2002) Identification and characterization of a cis-acting replication element (cre) adjacent to the internal ribosome entry site of foot-and-mouth disease virus. J Virol 76: pp. 9686-94 CrossRef
    47. Heath, L, Walt, E, Varsani, A, Martin, DP (2006) Recombination patterns in aphthoviruses mirror those found in other picornaviruses. J Virol 80: pp. 11827-32 CrossRef
    48. Mehedi, M, Falzarano, D, Seebach, J, Hu, X, Carpenter, M, Schnittler, H (2011) A new Ebola virus nonstructural glycoprotein expressed through RNA editing. J Virol 85: pp. 5406-14 CrossRef
    49. Coleman, JR, Papamichail, D, Skiena, S, Futcher, B, Wimmer, E, Mueller, S (2008) Virus attenuation by genome-scale changes in codon pair bias. Science 320: pp. 1784-7 CrossRef
    50. Edgar, RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: pp. 1792-7 CrossRef
    51. Stamatakis, A (2006) RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22: pp. 2688-90 CrossRef
    52. Gruber AR, Findei脽 S, Washietl S, Hofacker IL, Stadler PF. RNAz 2.0: improved noncoding RNA detection. Pac Symp Biocomput. 2010;69鈥?9.
    53. Darty, K, Denise, A, Ponty, Y (2009) VARNA: Interactive drawing and editing of the RNA secondary structure. Bioinformatics 25: pp. 1974-5 CrossRef
  • 刊物主题:Animal Genetics and Genomics; Human Genetics; Plant Genetics & Genomics; Microbial Genetics and Genomics; Fungus Genetics; Bioinformatics;
  • 出版者:BioMed Central
  • ISSN:1465-6906
文摘
Background The increasing availability of sequence data for many viruses provides power to detect regions under unusual evolutionary constraint at a high resolution. One approach leverages the synonymous substitution rate as a signature to pinpoint genic regions encoding overlapping or embedded functional elements. Protein-coding regions in viral genomes often contain overlapping RNA structural elements, reading frames, regulatory elements, microRNAs, and packaging signals. Synonymous substitutions in these regions would be selectively disfavored and thus these regions are characterized by excess synonymous constraint. Codon choice can also modulate transcriptional efficiency, translational accuracy, and protein folding. Results We developed a phylogenetic codon model-based framework, FRESCo, designed to find regions of excess synonymous constraint in short, deep alignments, such as individual viral genes across many sequenced isolates. We demonstrated the high specificity of our approach on simulated data and applied our framework to the protein-coding regions of approximately 30 distinct species of viruses with diverse genome architectures. Conclusions FRESCo recovers known multifunctional regions in well-characterized viruses such as hepatitis B virus, poliovirus, and West Nile virus, often at a single-codon resolution, and predicts many novel functional elements overlapping viral genes, including in Lassa and Ebola viruses. In a number of viruses, the synonymously constrained regions that we identified also display conserved, stable predicted RNA structures, including putative novel elements in multiple viral species.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700