Automatic annotation of eukaryotic genes, pseudogenes and promoters

Victor Solovyev, Peter Kosarev, Igor Seledsov and Denis Vorobyev

(2006)

Victor Solovyev, Peter Kosarev, Igor Seledsov and Denis Vorobyev (2006) Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biology, 7 (Supplement 1). pp. . ISSN 1465-6906

Our Full Text Deposits

Full text access: Open

Full Text - 196.09 KB

Links to Copies of this Item Held Elsewhere


Abstract

Background The ENCODE gene prediction workshop (EGASP) has been organized to evaluate how well state-of-the-art automatic gene finding methods are able to reproduce the manual and experimental gene annotation of the human genome. We have used Softberry gene finding software to predict genes, pseudogenes and promoters in 44 selected ENCODE sequences representing approximately 1% (30 Mb) of the human genome. Predictions of gene finding programs were evaluated in terms of their ability to reproduce the ENCODE-HAVANA annotation. Results The Fgenesh++ gene prediction pipeline can identify 91% of coding nucleotides with a specificity of 90%. Our automatic pseudogene finder (PSF program) found 90% of the manually annotated pseudogenes and some new ones. The Fprom promoter prediction program identifies 80% of TATA promoters sequences with one false positive prediction per 2,000 base-pairs (bp) and 50% of TATA-less promoters with one false positive prediction per 650 bp. It can be used to identify transcription start sites upstream of annotated coding parts of genes found by gene prediction software. Conclusion We review our software and underlying methods for identifying these three important structural and functional genome components and discuss the accuracy of predictions, recent advances and open problems in annotating genomic sequences. We have demonstrated that our methods can be effectively used for initial automatic annotation of the eukaryotic genome.

Information about this Version

This is a Published version
This version's date is: 07/08/2006
This item is peer reviewed

Link to this Version

https://repository.royalholloway.ac.uk/items/38431758-5110-3989-2044-39951b522bd0/1/

Item TypeJournal Article
TitleAutomatic annotation of eukaryotic genes, pseudogenes and promoters
AuthorsSolovyev, Victor
Kosarev, Peter
Seledsov, Igor
Vorobyev, Denis
DepartmentsFaculty of Science\Computer Science

Identifiers

doi10.1186/gb-2006-7-s1-s10

Deposited by () on 07-Jan-2011 in Royal Holloway Research Online.Last modified on 07-Jan-2011

Notes

© 2006 Solovyev et al.; licensee BioMed Central Ltd.

This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

References


Details