Sequence search and analysis of gene products containing RNA recognition motifs in the human genome
详细信息    查看全文
  • 作者:Sony Malhotra (15)
    Ramanathan Sowdhamini (15)

    15. National Centre for Biological Sciences (TIFR)
    ; GKVK Campus ; Bellary Road ; Bangalore ; 560 065 ; India
  • 关键词:RNA recognition motif ; Homo sapiens ; Genome ; wide survey ; Domain architecture ; Splicing
  • 刊名:BMC Genomics
  • 出版年:2014
  • 出版时间:December 2014
  • 年:2014
  • 卷:15
  • 期:1
  • 全文大小:1,052 KB
  • 参考文献:1. Latchman, D (2007) Garland Science. Gene Regulation.
    2. Le Jeune, E, Ladurner, AG (2004) Analysing gene expression, edited by S. Lorkowski and P. Cullen. Protein Sci Publ Protein Soc 13: pp. 1950-1952 CrossRef
    3. Jackson, DA, Pombo, A, Iborra, F (2000) The balance sheet for transcription: an analysis of nuclear RNA metabolism in mammalian cells. FASEB J Off Publ Fed Am Soc Exp Biol 14: pp. 242-254
    4. Ambrose, CM, Duyao, MP, Barnes, G, Bates, GP, Lin, CS, Srinidhi, J, Baxendale, S, Hummerich, H, Lehrach, H, Altherr, M (1994) Structure and expression of the Huntington鈥檚 disease gene: evidence against simple inactivation due to an expanded CAG repeat. Somat Cell Mol Genet 20: pp. 27-38 CrossRef
    5. Aerts, S, Cools, J (2013) Cancer: Mutations close in on gene regulation. Nature 499: pp. 35-36 CrossRef
    6. Madhamshettiwar, PB, Maetschke, SR, Davis, MJ, Reverter, A, Ragan, MA (2012) Gene regulatory network inference: evaluation and application to ovarian cancer allows the prioritization of drug targets. Genome Med 4: pp. 41 CrossRef
    7. Cl茅ry, A, Blatter, M, Allain, FH-T (2008) RNA recognition motifs: boring? Not quite. Curr Opin Struct Biol 18: pp. 290-298 CrossRef
    8. Burd, CG, Dreyfuss, G (1994) Conserved structures and diversity of functions of RNA-binding proteins. Science 265: pp. 615-621 CrossRef
    9. FROM STRUCTURE TO FUNCTION OF RNA BINDING DOMAINS [http://www.ncbi.nlm.nih.gov/books/NBK63528/
    10. King, OD, Gitler, AD, Shorter, J (2012) The tip of the iceberg: RNA-binding proteins with prion-like domains in neurodegenerative disease. Brain Res 1462: pp. 61-80 CrossRef
    11. Birney, E, Kumar, S, Krainer, AR (1993) Analysis of the RNA-recognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res 21: pp. 5803-5816 CrossRef
    12. Gamberi, C, Johnstone, O, Lasko, P (2006) Drosophila RNA binding proteins. Int Rev Cytol 248: pp. 43-139 CrossRef
    13. Kerner, P, Degnan, SM, Marchand, L, Degnan, BM, Vervoort, M (2011) Evolution of RNA-binding proteins in animals: insights from genome-wide analysis in the sponge Amphimedon queenslandica. Mol Biol Evol 28: pp. 2289-2303 CrossRef
    14. Tamburino, AM, Ryder, SP, Walhout, AJM (2013) A compendium of Caenorhabditis elegans RNA binding proteins predicts extensive regulation at multiple levels. G3 Bethesda Md 3: pp. 297-304 CrossRef
    15. Lorkovi膰, ZJ, Barta, A (2002) Genome analysis: RNA recognition motif (RRM) and K homology (KH) domain RNA-binding proteins from the flowering plant Arabidopsis thaliana. Nucleic Acids Res 30: pp. 623-635 CrossRef
    16. McKee, AE, Minet, E, Stern, C, Riahi, S, Stiles, CD, Silver, PA (2005) A genome-wide in situ hybridization map of RNA-binding proteins reveals anatomically restricted expression in the developing mouse brain. BMC Dev Biol 5: pp. 14 CrossRef
    17. Bateman, A, Birney, E, Durbin, R, Eddy, SR, Howe, KL, Sonnhammer, EL (2000) The Pfam protein families database. Nucleic Acids Res 28: pp. 263-266 CrossRef
    18. Bateman, A, Coin, L, Durbin, R, Finn, RD, Hollich, V, Griffiths-Jones, S, Khanna, A, Marshall, M, Moxon, S, Sonnhammer, ELL, Studholme, DJ, Yeats, C, Eddy, SR (2004) The Pfam protein families database. Nucleic Acids Res 32: pp. D138-D141 CrossRef
    19. Finn, RD, Mistry, J, Tate, J, Coggill, P, Heger, A, Pollington, JE, Gavin, OL, Gunasekaran, P, Ceric, G, Forslund, K, Holm, L, Sonnhammer, ELL, Eddy, SR, Bateman, A (2009) The Pfam protein families database. Nucleic Acids Res 38: pp. D211-D222 CrossRef
    20. Punta, M, Coggill, PC, Eberhardt, RY, Mistry, J, Tate, J, Boursnell, C, Pang, N, Forslund, K, Ceric, G, Clements, J, Heger, A, Holm, L, Sonnhammer, ELL, Eddy, SR, Bateman, A, Finn, RD (2012) The Pfam protein families database. Nucleic Acids Res 40: pp. D290-D301 CrossRef
    21. Thompson, JD, Higgins, DG, Gibson, TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: pp. 4673-4680 CrossRef
    22. Larkin, MA, Blackshields, G, Brown, NP, Chenna, R, McGettigan, PA, McWilliam, H, Valentin, F, Wallace, IM, Wilm, A, Lopez, R, Thompson, JD, Gibson, TJ, Higgins, DG (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23: pp. 2947-2948 formatics/btm404" target="_blank" title="It opens in new window">CrossRef
    23. Guindon, S, Dufayard, J-F, Lefort, V, Anisimova, M, Hordijk, W, Gascuel, O (2010) New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol 59: pp. 307-321 CrossRef
    24. Edgar, RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: pp. 1792-1797 CrossRef
    25. Tamura, K, Stecher, G, Peterson, D, Filipski, A, Kumar, S (2013) MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Mol Biol Evol 30: pp. 2725-2729 CrossRef
    26. Berman, HM, Westbrook, J, Feng, Z, Gilliland, G, Bhat, TN, Weissig, H, Shindyalov, IN, Bourne, PE (2000) The protein data bank. Nucleic Acids Res 28: pp. 235-242 CrossRef
    27. ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids fordjournals.org/content/38/suppl_2/W529.short" class="a-plus-plus">http://nar.oxfordjournals.org/content/38/suppl_2/W529.short
    28. Marchler-Bauer, A, Panchenko, AR, Shoemaker, BA, Thiessen, PA, Geer, LY, Bryant, SH (2002) CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res 30: pp. 281-283 CrossRef
    29. Altschul, SF, Madden, TL, Sch盲ffer, AA, Zhang, J, Zhang, Z, Miller, W, Lipman, DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: pp. 3389-3402 CrossRef
    30. Eddy, SR (2011) Accelerated profile HMM searches. PLoS Comput Biol 7: pp. e1002195 CrossRef
    31. BLASTclust http://toolkit.tuebingen.mpg.de/blastclust#
    32. NCBI News: Spring 2004BLASTLab NCBI News: Spring 2004BLASTLab
    33. Ashburner, M, Ball, CA, Blake, JA, Botstein, D, Butler, H, Cherry, JM, Davis, AP, Dolinski, K, Dwight, SS, Eppig, JT, Harris, MA, Hill, DP, Issel-Tarver, L, Kasarskis, A, Lewis, S, Matese, JC, Richardson, JE, Ringwald, M, Rubin, GM, Sherlock, G (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25: pp. 25-29 CrossRef
    34. Shamoo, Y, Abdul-Manan, N, Williams, KR (1995) Multiple RNA binding domains (RBDs) just don鈥檛 add up. Nucleic Acids Res 23: pp. 725-728 CrossRef
    35. Huang, DW, Sherman, BT, Lempicki, RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4: pp. 44-57 CrossRef
    36. Huang, DW, Sherman, BT, Lempicki, RA (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37: pp. 1-13 CrossRef
    37. Kanehisa, M, Goto, S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28: pp. 27-30 CrossRef
    38. Castello, A, Fischer, B, Hentze, MW, Preiss, T (2013) RNA-binding proteins in Mendelian disease. Trends Genet TIG 29: pp. 318-327 CrossRef
    39. Ward, JJ, McGuffin, LJ, Bryson, K, Buxton, BF, Jones, DT (2004) The DISOPRED server for the prediction of protein disorder. Bioinformatics 20: pp. 2138-2139 formatics/bth195" target="_blank" title="It opens in new window">CrossRef
    40. Gray, DA, Woulfe, J (2013) Structural disorder and the loss of RNA homeostasis in aging and neurodegenerative disease. Front Genet 4: pp. 149 CrossRef
    41. Vanderweyde, T, Youmans, K, Liu-Yesucevitz, L, Wolozin, B (2013) Role of stress granules and RNA-binding proteins in neurodegeneration: a mini-review. Gerontology 59: pp. 524-533 CrossRef
    42. Wolozin, B (2012) Regulated protein aggregation: stress granules and neurodegeneration. Mol Neurodegener 7: pp. 56 CrossRef
    43. Daubner, GM, Cl茅ry, A, Allain, FH-T (2013) RRM鈥揜NA recognition: NMR or crystallography鈥nd new findings. Curr Opin Struct Biol 23: pp. 100-108 CrossRef
    44. Maris, C, Dominguez, C, Allain, FH-T (2005) The RNA recognition motif, a plastic RNA-binding platform to regulate post-transcriptional gene expression. FEBS J 272: pp. 2118-2131 CrossRef
    45. Castello, A, Fischer, B, Eichelbaum, K, Horos, R, Beckmann, BM, Strein, C, Davey, NE, Humphreys, DT, Preiss, T, Steinmetz, LM, Krijgsveld, J, Hentze, MW (2012) Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell 149: pp. 1393-1406 CrossRef
    46. Tompa, P, Csermely, P (2004) The role of structural disorder in the function of RNA and protein chaperones. FASEB J 18: pp. 1169-1175 CrossRef
    47. Korneta, I, Bujnicki, JM (2012) Intrinsic disorder in the human spliceosomal proteome. PLoS Comput Biol 8: pp. e1002641 CrossRef
    48. Lukong, KE, Chang, K, Khandjian, EW, Richard, S (2008) RNA-binding proteins in human genetic disease. Trends Genet TIG 24: pp. 416-425 CrossRef
    49. FigTree http://tree.bio.ed.ac.uk/software/figtree/
    50. Ren, J, Wen, L, Gao, X, Jin, C, Xue, Y, Yao, X (2009) DOG 1.0: illustrator of protein domain structures. Cell Res 19: pp. 271-273 CrossRef
    51. Hamosh, A, Scott, AF, Amberger, JS, Bocchini, CA, McKusick, VA (2005) Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 33: pp. D514-D517
  • 刊物主题:Life Sciences, general; Microarrays; Proteomics; Animal Genetics and Genomics; Microbial Genetics and Genomics; Plant Genetics & Genomics;
  • 出版者:BioMed Central
  • ISSN:1471-2164
文摘
Background Gene expression is tightly regulated at both transcriptional and post-transcriptional levels. RNA-binding proteins are involved in post-transcriptional gene regulation events. They are involved in a variety of functions such as splicing, alternative splicing, nuclear import and export of mRNA, RNA stability and translation. There are several well-characterized RNA-binding motifs present in a whole genome, such as RNA recognition motif (RRM), KH domain, zinc-fingers etc. In the present study, we have investigated human genome for the presence of RRM-containing gene products starting from RRM domains in the Pfam (Protein family database) repository. Results In Pfam, seven families are recorded to contain RRM-containing proteins. We studied these families for their taxonomic representation, sequence features (identity, length, phylogeny) and structural properties (mapping conservation on the structures). We then examined the presence of RRM-containing gene products in Homo sapiens genome and identified 928 RRM-containing gene products. These were studied for their predicted domain architectures, biological processes, involvement in pathways, disease relevance and disorder content. RRM domains were observed to occur multiple times in a single polypeptide. However, there are 56 other co-existing domains involved in different regulatory functions. Further, functional enrichment analysis revealed that RRM-containing gene products are mainly involved in biological functions such as mRNA splicing and its regulation. Conclusions Our sequence analysis identified RRM-containing gene products in the human genome and provides insights into their domain architectures and biological functions. Since mRNA splicing and gene regulation are important in the cellular machinery, this analysis provides an early overview of genes that carry out these functions.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700