ConsPred: a rule-based (re-)annotation framework for prokaryotic genomes

Weinmaier, Thomas, Platzer, Alexander, Frank, Jeroen, Hellinger, Hans-Jörg, Tischler, Patrick, Rattei, Thomas
Bioinformatics 2016 v.32 no.21 pp. 3327-3329
algorithms, bioinformatics, cloud computing, computer software, decision making, gene editing, genes, nucleotide sequences, prediction
Motivation: The rapidly growing number of available prokaryotic genome sequences requires fully automated, high-quality software solutions for their initial annotation and re-annotation. Here we present ConsPred, a prokaryotic genome annotation framework that performs intrinsic gene predictions, homology searches, and predictions of non-coding genes and CRISPR repeats, and integrates all evidence into a consensus annotation. ConsPred achieves comprehensive, high-quality annotations based on rules and priorities, similar to decision-making in manual curation, and avoids conflicting predictions. Parameters controlling the annotation process are configurable by the user. ConsPred has been used at the authors' institutions for more than 5 years and can easily be extended and adapted to specific needs.
Summary: The ConsPred algorithm for producing a consensus from the varying scores of multiple gene prediction programs approaches manual curation in accuracy. Its rule-based approach for choosing final predictions avoids overriding previous manual curations.
Availability and implementation: ConsPred is implemented in Java, Perl and Shell and is freely available under the Creative Commons license as a stand-alone in-house pipeline or as an Amazon Machine Image for cloud computing, see
Contact:
Supplementary information: Supplementary data are available at Bioinformatics online.
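
The idea of building a consensus annotation from rules and priorities can be illustrated with a minimal Python sketch. This is not ConsPred's actual implementation (which is written in Java, Perl and Shell). The Prediction record, the PRIORITY table, the source names, and the max_overlap parameter are all hypothetical, chosen only to show how ranking evidence by priority can keep higher-ranked calls (such as earlier manual curations) from being overridden while avoiding conflicting predictions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    start: int    # 1-based, inclusive
    end: int      # 1-based, inclusive
    strand: str   # "+" or "-"
    source: str   # which kind of evidence produced this call
    score: float  # higher means more confident


# Hypothetical priority table: a lower number wins a conflict.
PRIORITY = {"manual": 0, "homology": 1, "intrinsic": 2}


def overlap(a: Prediction, b: Prediction) -> int:
    """Number of positions shared by two predictions (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def consensus(predictions, max_overlap=0):
    """Greedy, priority-ordered selection of non-conflicting predictions.

    Simplification: strand is ignored when testing for conflicts, and a
    single user-configurable overlap threshold is the only rule applied.
    """
    # Rank by source priority first, then by descending score.
    ranked = sorted(predictions, key=lambda p: (PRIORITY[p.source], -p.score))
    accepted = []
    for candidate in ranked:
        # Keep a candidate only if it conflicts with nothing already kept.
        if all(overlap(candidate, kept) <= max_overlap for kept in accepted):
            accepted.append(candidate)
    return sorted(accepted, key=lambda p: p.start)
```

In this sketch a manually curated gene is always kept, and a lower-priority prediction is dropped when it overlaps an accepted one by more than max_overlap positions. A real pipeline would need further rules, for example strand-aware overlap handling and separate treatment of non-coding genes and CRISPR repeats.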