Impact of residue accessible surface area on the prediction of protein secondary structures
详细信息    查看全文
  • 作者:Amir Momen-Roknabadi (1) (3)
    Mehdi Sadeghi (2) (3)
    Hamid Pezeshk (4)
    Sayed-Amir Marashi (1) (5) (6)
  • 刊名:BMC Bioinformatics
  • 出版年:2008
  • 出版时间:December 2008
  • 年:2008
  • 卷:9
  • 期:1
  • 全文大小:372KB
  • 参考文献:1. Kmiecik S, Gront D, Kolinski A: Towards the high-resolution protein structure prediction. Fast refinement of reduced models with all-atom force field. / BMC Struct Biol 2007, 7:43. CrossRef
    2. Xiang Z: Advances in homology protein structure modeling. / Curr Protein Pept Sci 2006, 7:217鈥?27. CrossRef
    3. Djurdjevic DP, Biggs MJ: Ab initio protein fold prediction using evolutionary algorithms: influence of design and control parameters on performance. / J Comput Chem 2006, 27:1177鈥?195. CrossRef
    4. Wu S, Skolnick J, Zhang Y: Ab initio modeling of small proteins by iterative TASSER simulations. / BMC Biol 2007, 5:17. CrossRef
    5. Jauch R, Yeo HC, Kolatkar PR, Clarke ND: Assessment of CASP7 structure predictions for template free targets. / Proteins 2007, 69:57鈥?7. CrossRef
    6. Rost B: Protein structure prediction in 1D, 2D, and 3D. / Encyclopedia of Computational Chemistry / (Edited by: von Rague-Schleyer P, Allinger NL, Clark TC, Gasteiger J, Kollman PA, Schaefer HF). Sussex, John Wiley & Sons 1998, 2242鈥?255.
    7. Chou PY, Fasman GD: Prediction of protein conformation. / Biochemistry 1974, 13:222鈥?45. CrossRef
    8. Chou PY, Fasman GD: Empirical predictions of protien conformations. / Annu Rev Biochem 1978, 47:251鈥?76. CrossRef
    9. Chen H, Gu F, Huang Z: Improved Chou-Fasman method for protein secondary structure prediction. / BMC Bioinformatics 2006, 7:S14. CrossRef
    10. Asai K, Hayamizu S, Handa K: Prediction of protein secondary structure by the hidden Markov model. / Comput Appl Biosci 1993, 9:141鈥?46.
    11. Martin J, Gibrat JF, Rodolphe F: Analysis of an optimal hidden Markov model for secondary structure prediction. / BMC Struct Biol 2006, 6:25. CrossRef
    12. Garnier J, Osguthorpe DJ, Robson B: Analysis of the Accuracy and Implications of Simple Methods for Predicting the Secondary Structure of Globular Proteins. / J Mol Biol 1978, 120:97鈥?20. CrossRef
    13. Garnier J, Gibrat JF, Robson B: GOR method for predicting protein secondary structure from amino acid sequence. / Methods Enzymol 1996, 266:540鈥?53. CrossRef
    14. Nishikawa K: Assessment of secondary-structure prediction of proteins -comparison of computerized Chou-Fasman methods with others. / Biochim Biophys Acta 1983, 748:285鈥?99. CrossRef
    15. Raghava GPS: Protein secondary structure prediction using nearest neighbor and neural network approach. / CASP 2000, 4:75鈥?8.
    16. Cuff JA, Barton GJ: Evaluation and improvement of multiple sequence methods for protein secondary structure prediction. / Proteins 1999, 34:508鈥?19. CrossRef
    17. Pollastri G, Przybylski DR B, Baldi P: Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. / Proteins 2002,47(2):228鈥?35. CrossRef
    18. Rost B Sander, C.: Prediction of protein secondary structure at better than 70 % Accuracy. / J Mol Biol 1993,232(2):584鈥?99. CrossRef
    19. Jones D: Protein secondary structure prediction based on position-specific scoring matrices. / J Mol Biol 1999, 292:195鈥?02. CrossRef
    20. Guo J, Chen H, Sun Z, Lin Y: A novel method for protein secondary structure prediction using dual-layer SVM and profiles. / Proteins 2004, 54:738鈥?43. CrossRef
    21. Hua S, Sun Z: A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach. / J Mol Biol 2001, 308:397鈥?07. CrossRef
    22. Ward JJ, McGuffin LJ, Buxton BF, Jones DT: Secondary structure prediction with support vector machines. / Bioinformatics 2003, 19:1650鈥?655. CrossRef
    23. Karypis G: YASSPP: better kernels and coding schemes lead to improvements in protein secondary structure prediction. / Proteins 2006, 64:575鈥?86. CrossRef
    24. Ofer D, Yaoqi Z: Achieving 80% Ten-fold Cross-validated Accuracy for Secondary Structure Prediction by Large-scale Training. / Proteins 2007, 66:838鈥?45.
    25. Rost B: Review: protein secondary structure prediction continues to rise. / J Struct Biol 2001, 134:204鈥?18. CrossRef
    26. Rost B: Rising accuracy of protein secondary structure prediction. / Protein Structure Determination, Analysis and Modeling for Drug Discovery / (Edited by: Chasman D). New York , Dekker 2003, 207鈥?49. CrossRef
    27. Pollastri G, Martin AJM, Mooney C, Vullo A: Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information. / BMC Bioinformatics 2007, 8:201. CrossRef
    28. Costantini S, Colonna G, Facchiano AM: Amino acid propensities for secondary structures are influenced by the protein structural class. / Biochem Biophys Res Commun 2006, 342 :441鈥?51. CrossRef
    29. Costantini S Colonna, G, Facchiano, A.M: PreSSAPro: A software for the prediction of secondary structure by amino acid properties. / Comput Biol Chem 2007, 31:389鈥?92. CrossRef
    30. Marashi SA, Behrouzi R, Pezeshk H: Adaptation of proteins to different environments: A comparison of proteome structural properties in Bacillus subtilis and Escherichia coli. / J Theor Biol 2007, 244:127鈥?32. CrossRef
    31. Adamczak R, Porollo A, Meller J: Combining prediction of secondary structure and solvent accessibility in proteins. / Proteins 2005, 59:467鈥?75. CrossRef
    32. Macdonald JR, Johnson WC: Environmental features are important in determining protein secondary structure. / Protein Sci 2001, 10:1172鈥?177. CrossRef
    33. Zhu ZY, Blundell TL: The use of amino acid patterns of classified helices and strands in secondary structure prediction. / J Mol Biol 1996, 260:261鈥?76. CrossRef
    34. Zhong L, Johnson WC: Environment Affects Amino Acid Preference for Secondary Structure . / Proc Natl Acad Sci USA 1992,89(10):4462鈥?465. CrossRef
    35. Cohen BI, Presnell SR, Cohen FE: Origins of structural diversity within sequentially identical hexapeptides. / Protein Sci 1993, 2:2134鈥?145. CrossRef
    36. Han KF, Baker D: Global properties of the mapping between local amino acid sequence and local structure in proteins. / Proc Natl Acad Sci USA 1996, 93:5814鈥?818. CrossRef
    37. Kabsch W, Sander C: On the use of sequence homologies to predict protein structure: Identical pentapeptides can have completely different conformations. / Proc Natl Acad Sci USA 1984, 81:1075鈥?078. CrossRef
    38. Minor DL, Kim PS: Context-dependent secondary structure formation of a designed protein sequence. / Nature 1996, 380:730鈥?34. CrossRef
    39. Sudarsanam S: Structural diversity of sequentially identical subsequences of proteins: Identical octapeptides can have different conformations. / Proteins 1998, 30:228鈥?31. CrossRef
    40. Palliser CC, Parry DA: Quantitative comparison of the ability of hydropathy scales to recognize surface beta-strands in proteins. / Proteins 2001, 42:243鈥?55. CrossRef
    41. Kabsch W, Sander C: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. / Biopolymers 1983, 22:2577鈥?637. CrossRef
    42. Adamczak R, Porollo A, Meller J: Accurate prediction of solvent accessibility using neural networks-based regression. / Proteins 2004, 56:753鈥?67. CrossRef
    43. Wagner M, Adamczak R, Porollo A, Meller J: Linear regression models for solvent accessibility prediction in proteins. / J Comput Biol 2005, 12:355鈥?69. CrossRef
    44. Ahmad S, Gromiha MM, Sarai A: RVP-net: online prediction of real valued accessible surface area of proteins from single sequences. / Bioinformatics 2003, 19:1849鈥?851. CrossRef
    45. Hooft RWW, Sander C, Vriend G: Verification of Protein Structures: Side-Chain Planarity. / J Appl Cryst 1996, 29:714鈥?16. CrossRef
    46. Hobohm U, Scharf M, Schneider R, Sander C: Selection of a representative set of structures from the Brookhaven Protein Data Bank. / Protein Sci 1992, 1:409鈥?17. CrossRef
    47. Kloczkowski A, Ting KL, Jernigan RL, Garnier J: Combining the GOR V Algorithm With Evolutionary Information for Protein Secondary Structure Prediction FromAmino Acid Sequence. / Proteins 2002, 49:154鈥?66. CrossRef
    48. Brillouin L: Science and information theory. Academic Press 1956.
    49. Shannon CE: A mathematical theory of communication. / Bell Sys Tech J 1948, 27:379鈥?23.
    50. Shannon CE, Weaver W: The mathematical theory of communication. University of Illinois Press 1949.
    51. Fano R: Transmission of Information. John Wiley 1961.
    52. Forney GD: The Viterbi algorithm. / Proc IEEE 1973, 61:268鈥?78. CrossRef
  • 作者单位:Amir Momen-Roknabadi (1) (3)
    Mehdi Sadeghi (2) (3)
    Hamid Pezeshk (4)
    Sayed-Amir Marashi (1) (5) (6)

    1. Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran
    3. Bioinformatics Group, School of Computer Science, Institute for Studies in Theoretical Physics and Mathematics (IPM), Niavaran Square, Tehran, Iran
    2. National Institute of Genetic Engineering and Biotechnology, Tehran-Karaj Highway, Tehran, Iran
    4. School of Mathematics, Statistics and Computer Sciences and Center of Excellence in Biomathematics, College of Science, University of Tehran, Tehran, Iran
    5. IMPRS-CBSC, Max Planck Institute for Molecular Genetics, Ihnestr. 63-73, D-14195, Berlin, Berlin, Germany
    6. DFG-Research Center Matheon, FB Mathematik und Informatik, Freie Universit盲t Berlin, Arnimallee 6, D-14195, Berlin, Germany
  • ISSN:1471-2105
文摘
Background The problem of accurate prediction of protein secondary structure continues to be one of the challenging problems in Bioinformatics. It has been previously suggested that amino acid relative solvent accessibility (RSA) might be an effective factor for increasing the accuracy of protein secondary structure prediction. Previous studies have either used a single constant threshold to classify residues into discrete classes (buries vs. exposed), or used the real-value predicted RSAs in their prediction method. Results We studied the effect of applying different RSA threshold types (namely, fixed thresholds vs. residue-dependent thresholds) on a variety of secondary structure prediction methods. With the consideration of DSSP-assigned RSA values we realized that improvement in the accuracy of prediction strictly depends on the selected threshold(s). Furthermore, we showed that choosing a single threshold for all amino acids is not the best possible parameter. We therefore used residue-dependent thresholds and most of residues showed improvement in prediction. Next, we tried to consider predicted RSA values, since in the real-world problem, protein sequence is the only available information. We first predicted the RSA classes by RVP-net program and then used these data in our method. Using this approach, improvement in prediction was also obtained. Conclusion The success of applying the RSA information on different secondary structure prediction methods suggest that prediction accuracy can be improved independent of prediction approaches. Thus, solvent accessibility can be considered as a rich source of information to help the improvement of these methods.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700