
OMA Download – March 2014

The entire OMA database is available for download in several formats. It is also possible to download each group separately. This option is available in the group view. Please read our terms and conditions before integrating OMA data into your own research or database.

OMA Browser Archive: the download files from the previous releases of OMA are still available:
Jul 2013 |  Dec 2012 |  Mar 2012 |  May 2011 |  Nov 2010 |  May 2010 |  Oct 2009 |  Apr 2009

Orthology Relationships
The orthology relationships are available in two types: groups or pairs of orthologs. The information is given in terms of OMA identifiers (of the form HUMAN04376).
OMA groups, text format: oma-groups.txt.gz
OMA groups, OrthoXML format: oma-groups.orthoXML.xml.gz
Pairwise orthologs, text format: oma-pairs.txt.gz (all pairs)
Pairwise orthologs, OrthoXML format: oma-pairs.orthoXML.xml.gz
Pairs between two species: Genome Pair View
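
Since the orthology files refer to proteins by OMA identifier, it can be handy to split an identifier into its species code and protein number. The following is a minimal sketch. It assumes that every identifier follows the pattern of the example above (a five-character species code followed by a decimal number); the function name `split_oma_id` is hypothetical and not part of any OMA tool.

```python
import re

# Assumption: five-character species code, then a decimal protein number,
# as in the documented example HUMAN04376.
OMA_ID_PATTERN = re.compile(r"^([A-Z0-9]{5})(\d+)$")


def split_oma_id(oma_id):
    """Split an OMA identifier into (species_code, protein_number)."""
    match = OMA_ID_PATTERN.match(oma_id)
    if match is None:
        raise ValueError(f"not a recognised OMA identifier: {oma_id!r}")
    return match.group(1), int(match.group(2))


species, number = split_oma_id("HUMAN04376")
print(species, number)
```

Grouping the lines of the pairs file by species code is one possible use, for example to pull out all pairs between two genomes of interest.
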
Sequences
All sequences with the corresponding OMA identifiers can be downloaded as FASTA files. The proteins are all in one file, while the coding DNA is split into two files, one for eukaryotes and one for prokaryotes.
Protein sequences, FASTA format: oma-seqs.fa.gz
Protein sequences, SeqXML format: oma-seqs.seqxml.tgz
cDNA Eukaryotes: eukaryotes.cdna.fa.gz
cDNA Prokaryotes: prokaryotes.cdna.fa.gz
Protein Annotations: oma-protein-annotations.txt.gz
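
The gzip-compressed FASTA files can be read without unpacking them first. The sketch below uses only the Python standard library and assumes standard FASTA layout (a header line starting with `>`, followed by one or more sequence lines). The function names `parse_fasta` and `read_fasta_gz` are illustrative, and the exact content of the header lines in the OMA files is not specified here, so the whole header is returned unchanged.

```python
import gzip


def parse_fasta(lines):
    """Yield (header, sequence) tuples from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def read_fasta_gz(path):
    """Stream records from a gzip-compressed FASTA file such as oma-seqs.fa.gz."""
    with gzip.open(path, "rt") as handle:
        yield from parse_fasta(handle)


# Small in-memory example; the sequences are made up for illustration.
example = [">HUMAN00001\n", "MKT\n", "AAA\n", ">HUMAN00002\n", "MM\n"]
for header, sequence in parse_fasta(example):
    print(header, len(sequence))
```

Streaming record by record keeps memory use low, which matters because the protein file holds the sequences of all genomes in one file.
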
Identifier Mapping
Mappings of the OMA identifier to various other databases are available:
Mapping to UniProt: oma-uniprot.txt.gz
Mapping to Ensembl: oma-ensembl.txt.gz
Mapping to NCBI: oma-ncbi.txt.gz
Mapping to GO: oma-go.txt.gz
Mapping to Wormbase: oma-wormbase.txt.gz
Mapping to JGI: oma-jgi.txt.gz
Plant mapping (ARATH and ORYSA): oma-plants.txt.gz
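
A mapping file can be loaded into a dictionary for lookups by OMA identifier. The sketch below rests on assumptions that should be checked against the actual files: that each line holds an OMA identifier and an external identifier separated by a tab, and that lines starting with `#` are comments. The names `parse_mapping` and `load_mapping` are hypothetical.

```python
import gzip
from collections import defaultdict


def parse_mapping(lines):
    """Map each OMA identifier to the list of external identifiers listed for it.

    Assumed layout (not confirmed by the download page): two tab-separated
    columns, OMA identifier first, with optional '#' comment lines.
    """
    mapping = defaultdict(list)
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 2:
            continue  # ignore lines that do not fit the assumed layout
        mapping[fields[0]].append(fields[1])
    return dict(mapping)


def load_mapping(path):
    """Read a gzip-compressed mapping file such as oma-uniprot.txt.gz."""
    with gzip.open(path, "rt") as handle:
        return parse_mapping(handle)


# In-memory example; the external identifiers are placeholders.
example = ["# comment\n", "HUMAN04376\tEXT_A\n", "HUMAN04376\tEXT_B\n"]
print(parse_mapping(example))
```

A list is kept per OMA identifier because one protein may plausibly map to several entries in another database.
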
Other files
OMA Groups/Sequences in COGs format: All.cog.tar.gz
Species information (taxon IDs, scientific names, genome sources): oma-species.txt
Group descriptions: group-descriptions.txt
Close OMA Groups: oma-close-groups.txt
OMA ID History
These files map the OMA identifiers of updated genomes from an earlier release to the current one. We track only proteins with identical amino acid sequences.
From Jul 2013 release: IDHistory-Jul2013-to-Mar2014.txt.gz
From Dec 2012 release: IDHistory-Dec2012-to-Mar2014.txt.gz
From Mar 2012 release: IDHistory-Mar2012-to-Mar2014.txt.gz
From May 2011 release: IDHistory-May2011-to-Mar2014.txt.gz