CEGMA gene predictions for Assemblathon 2 entries.

Dataset type: Genomic
Data released on June 24, 2013

Bradnam KR; Fass JN; Korf IK (2013): CEGMA gene predictions for Assemblathon 2 entries. GigaScience Database. http://dx.doi.org/10.5524/100061


Assemblathon 2 genome assemblies were assessed for their genic content. This was done by using published tool (CEGMA) that looks for the presence of nearly full-length genes within a single scaffold sequence. Such genes must match HMMs made from a set of 458 highlyconserved genes that are presumed to be conserved in all eukaryotes.

Additional details

Read the peer-reviewed publication(s):

(PubMed: 23870653)

Related datasets:

doi:10.5524/100061 IsSupplementedBy doi:10.5524/100060
doi:10.5524/100061 IsSupplementedBy doi:10.5524/100062

Accessions (data generated as part of this study):

SRA: ERP002324
SRA: SRA026860
SRA: ERP002294

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
ERS218597499168Boa constrictor constrictor Boa constrictor constrictor
ERS22288013146Melopsittacus undulatusbudgerigarMelopsittacus undulatus Cell type:blood
Sex:male [PATO:0000384]
Common name:budgerigar
SRS140425106582Maylandia zebrazebra mbunaMaylandia zebra Sex:male [PATO:0000384]
Tissue:muscle and heart
Common name:zebra mbuna fish
Displaying 1-3 of 3 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
ERS222880, SRS140425, ERS218597Genome sequenceTAR79.41 MB2013-06-21
ReadmeTEXT0.75 KB2013-06-21
Displaying 1-2 of 2 File(s).
Date Action