Statistics of N-terminal alignment as a guide for refining prokaryotic gene annotation

详细信息	查看全文 \| 推荐本文 \|

作者：Naoki Sato ; ^{naokisat@bio.c.u-tokyo.ac.jp} ; Naoyuki Tajima
关键词：Cyanobacteria ; Genome annotation ; Genome clustering ; Initiation codon ; Synechocystis sp. PCC 6803
刊名：Genomics
出版年：2012
期刊代码：170_08887543
类别：bio
出版时间：March, 2012
卷：99
期：3
页码：138-143
文件大小：910 K

摘要

Identification of a correct N-terminus of a protein is an important step in genome annotation. However, we sometimes encounter incorrectly annotated N-termini in genomic databases. We analyzed statistics of surplus or missing N-terminal amino acid residues in tentatively translated coding sequence of cyanobacterial database entries, and found that, on average, about 8-9%of the aligned proteins have a putative incorrect N-terminus, although the percentage was dependent on the database entry. In an attempt to find more plausible N-termini for these proteins, we were able to estimate a better-aligning N-terminus in 90%of the cases. TTG was found as a putative initiation codon in most cases of recessed N-termini. This statistical approach, applicable to any group of prokaryotes, will help identify a plausible translation initiation site for each protein-coding gene in newly sequenced genomes, and also is a method of refining the N-terminus of proteins in already published genomes.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700