EuGène is an open gene finder for eukaryotic organisms. Compared to most existing gene finders, EuGène is characterized by its ability to simply integrate arbitrary sources of information in its prediction process.

As most existing gene finders, EuGène can exploit probabilistic models like Markov models for discriminating coding from non coding sequences or to discriminate effective splice sites from false splice sites (using various mathematical models). Beyond this EuGène is able to integrate information from several signal (splice site, translation start...) prediction software, similarity with existing sequences (EST, mRNA, 5'/3' EST from full length mRNA, proteins, genomic homologuous sequences) and output of existing gene finders... Based on all the available information, EuGène will output a prediction of maximal score i.e., maximally consistent with the information provided.

- N E W S -
TAIR9 Genome Release [June 19, 2009] contains
EuGene predictions (thanks to S. Aubourg, INRA-URGV)
See JCVI [news]
Even more new A. thaliana genes predicted by
EuGene validated by RT-PCR
See the [paper]
Hundreds of new A. thaliana genes predicted by
EuGene validated by RACEt
and included in TAIR 6 [publication]
Developed using GForge: mulcyber


- L A S T U P D A T E -
eugene v3.5g : Apr. 2009
eugene v3.5 : Sep. 2007
eugene v3.4 : Dec. 2006
eugene v3.3 : Sep. 2005
eugene v3.2 : Apr. 2005
eugene v3.1 : Feb. 2005
eugene v3.0 : Oct. 2004
EuGène Web : Aug. 21 2009


EuGène graphical output
Each source of information is integrated in EuGène by a small independant software component, called a "plugin". The plugin is responsible for the integration of the information but also for plotting the information on the graphical output of EuGène (if needed) and can also analyze the inconsistencies between the final prediction and the information provided.

There exists a large variety of plugins currently but if needed EuGène's users have the ability to extend EuGène. This can be done using two different approaches. One simple approach is to use the "Annotastruct" plugin. This plugin allows to inject information in EuGène using a GFF file. For the more serious user, it is possible to write a new plugin directly (in C++) and to load it dynamically into EuGène (without recompilation of eugene).
EuGène has been used extensively on the Arabidopsis genome where it has shown its excellent prediction quality. Recent updates of TAIR include hundreds of new genes predicted by EuGene which have been validated by RACE by TIGR [Reference]. It has been adapted to other plant and related organims. EuGène has been developped with funding from INRA and Génoplante.
The software is now OSI Certified Open Source Software under the terms of the Artistic License.