Prediction of Tyrosine Sulfation with mRMR Feature Selection and Analysis
详细信息    查看全文
  • 作者:Shen Niu ; Tao Huang ; Kaiyan Feng ; Yudong Cai ; Yixue Li
  • 刊名:Journal of Proteome Research
  • 出版年:2010
  • 出版时间:December 3, 2010
  • 年:2010
  • 卷:9
  • 期:12
  • 页码:6490-6497
  • 全文大小:256K
  • 年卷期:v.9,no.12(December 3, 2010)
  • ISSN:1535-3907
文摘
Protein tyrosine sulfation is a ubiquitous post-translational modification (PTM) of secreted and transmembrane proteins that pass through the Golgi apparatus. In this study, we developed a new method for protein tyrosine sulfation prediction based on a nearest neighbor algorithm with the maximum relevance minimum redundancy (mRMR) method followed by incremental feature selection (IFS). We incorporated features of sequence conservation, residual disorder, and amino acid factor, 229 features in total, to predict tyrosine sulfation sites. From these 229 features, 145 features were selected and deemed as the optimized features for the prediction. The prediction model achieved a prediction accuracy of 90.01% using the optimal 145-feature set. Feature analysis showed that conservation, disorder, and physicochemical/biochemical properties of amino acids all contributed to the sulfation process. Site-specific feature analysis showed that the features derived from its surrounding sites contributed profoundly to sulfation site determination in addition to features derived from the sulfation site itself. The detailed feature analysis in this paper might help understand more of the sulfation mechanism and guide the related experimental validation.

Keywords:

Sulfation; maximum relevance minimum redundancy; incremental feature selection; nearest neighbor algorithm

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700