Two Blast-independent tools, CyPerl and CyExcel, for harvesting hundreds of novel cyclotides and analogues from plant genomes and protein databases
详细信息    查看全文
文摘
Main conclusion Two high-throughput tools harvest hundreds of novel cyclotides and analogues in plants. Cyclotides are gene-encoded backbone-cyclized polypeptides displaying a diverse range of bioactivities associated with plant defense. However, genome-scale or database-scale evaluations of cyclotides have been rare so far. Here, a novel time-efficient Perl program, CyPerl, was developed for searching cyclotides from predicted ORFs of 34 available plant genomes and existing plant protein sequences from Genbank databases. CyPerl-isolated sequences were further analyzed by removing repeats, evaluating their cysteine-distributed regions (CDRs) and comparing with CyBase-collected cyclotides in a user-friendly Excel (Microsoft Office) template, CyExcel. After genome-screening, 186 ORFs containing 145 unique cyclotide analogues were identified by CyPerl and CyExcel from 30 plant genomes tested from 10 plant families. Phaseolus vulgaris and Zea mays were the richest two species containing cyclotide analogues in the plants tested. After screening protein databases, 266 unique cyclotides and analogues were identified from seven plant families. By merging with 288 unique CyBase-listed cyclotides, 510 unique cyclotides and analogues were obtained from 13 plant families. In total, seven novel plant families containing cyclotide analogues and 202 novel cyclotide analogues were identified in this study. This study has established two Blast-independent tools for screening cyclotides from plant genomes and protein databases, and has also significantly widened the plant distribution and sequence diversity of cyclotides and their analogues.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700