Separation of sequences from host–pathogen interface using triplet nucleotide frequencies

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

Separation of sequences from host–pathogen interface using triplet nucleotide frequencies

详细信息	查看全文 \| 推荐本文 \|

作者：Jeppe Emmersen ; Stephen Rudd ; Hans-Werner Mewes ; Igor V. Tetko
关键词：Plant– ; fungi interactions ; EST data analysis ; Bioinformatics ; Codon usage ; Codon bias
刊名：Fungal Genetics and Biology
出版年：2007
期刊代码：116_10871845
类别：abs
出版时间：April 2007
卷：44
期：4
页码：231-241
文件大小：264 K

摘要

The identification of genes involved in host–pathogen interactions is important for the elucidation of mechanisms of disease resistance and host susceptibility. A traditional way to classify the origin of genes sampled from a pool of mixed cDNA is through sequence similarity to known genes from either the pathogen or host organism or other closely related species. This approach does not work when the identified sequence has no close homologues in the sequence databases. In our previous studies, we classified genes using their codon frequencies. This method, however, explicitly required the prediction of CDS regions and thus could not be applied to sequences composed from the non-coding regions of genes. In this study, we show that the use of sliding-window triplet frequencies extends the application of the algorithm to both coding and non-coding sequences and also increases the prediction accuracy of a Support Vector Machine classifier from 95.6 ± 0.3 to 96.5 ± 0.2. Thus the use of the triplet frequencies increased the prediction accuracy of the new method by more than 20%compared to our previous approach. A functional analysis of sequences detected gene families having significantly higher or lower probability to be correctly classified compared to the average accuracy of the method is described. The server to perform classification of EST sequences using triplet frequencies is available at http://mips.gsf.de/proj/est3.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700