Data Release Policy

Our commitment

The Wellcome Sanger Institute is committed to data sharing wherever possible. This maximises the value and scientific impact of the data, and ensures transparency and equity in exploitation of the opportunities created.

For more information about the Institute's data policies, please refer to our policies page, which has sections on data sharing and use.

Types of genome

Annotated sequences represented in GeneDB are of two main types:

  • those already available from public sequence databases (e.g. NCBI or EMBL)
  • for some organisms, unfinished, assembled shotgun contigs, with semi-automatic analyses and annotation

Sequence datasets are available for the organisms represented in GeneDB.

Obtaining permission

Please note that permission of the principal investigator should be obtained before publishing solely bioinformatic analyses (e.g. metabolic reconstruction, synteny) of unpublished sequence/open reading frames/genes on a chromosome- or genome-wide scale.

Datasets (both sequence and analyses) are updated on a regular basis. As there has been a considerable investment in both acquiring the data, and undertaking these analyses, we request that you follow the accompanying guidelines when using data obtained directly from the GeneDB web site. Data or analyses obtained from "external" sources (e.g. keywords from SWISSPROT) should be credited as appropriate to the originating source, which in some cases, holds copyright.


Users of sequence data from the sequencing centres (see below) should not publish or otherwise disseminate the information without acknowledgement. An appropriate format is as follows:

These data were produced by the xxx Sequencing Group at the yyy Sequencing Centre and can be obtained from zzz.

Where "xxx" is the name of the relevant sequencing project, such as "Trypanosoma brucei" , "yyy" is the sequencing centre/submitter, and "zzz" is either an appropriate ftp directory, or a http URL, whence the data have been obtained. If an EMBL/Genbank accession number has been assigned to a sequence, this should be also be quoted.

For sequencing projects that are still in progress (e.g. Leishmania), please ensure that you use the latest available version of unfinished sequences.

Organism Sequencing Centres
Aspergillus fumigatus Sanger Institute
Leishmania Sanger Institute/EULEISH
Seattle Biomedical Research Institute
Trypanosoma brucei Sanger Institute/EULEISH


If you have any questions regarding the sequencing data, analyses, or their use in publications, please contact the appropriate organism curator or project manager, whose details are available on the pertinent home pages. Please note that we welcome input and comments from the community, whose contribution will be credited accordingly (see our privacy policy).


The data within GeneDB are provided in good faith, but no guarantee, express or implied, can be made as to their accuracy. We do not assume any legal liability or responsibility for any purpose for which the data are used.