De novo genome assembly and annotation data for the Murray cod (Maccullochella peelii), Australia's largest freshwater fish

Dataset type: Genomic, Transcriptomic
Data released on July 18, 2017

Austin CM; Lee YP; Harrisson KA; Tan MH; Croft LJ; Pavlova A; Sunnucks P; Gan HM (2017): De novo genome assembly and annotation data for the Murray cod (Maccullochella peelii), Australia's largest freshwater fish. GigaScience Database. http://dx.doi.org/10.5524/100329

DOI: 10.5524/100329

One of the most iconic Australian fish is the Murray cod, Maccullochella peelii (Mitchell, 1838), a freshwater species that can grow to ~1.8 metres in length and live for at least 48 years. The Murray cod is of conservation concern as a result of strong population contractions, but it is also popular for recreational fishing and is of growing aquaculture interest. In this study, we report the whole genome sequence of the Murray cod to support ongoing population genetics, conservation and management-related research, as well as to better understand the evolutionary ecology and history of the species.
A draft Murray cod genome of 633 Mbp (N50 = 109,974 bp; BUSCO and CEGMA completeness of 94.2% and 91.9%, respectively) with an estimated 148 Mbp of putative repetitive sequences was assembled from the combined sequencing data of two individual fish with an identical maternal lineage. 47.2 Gb of Illumina HiSeq data and 804 Mb of Nanopore data were generated from the first individual, while 23.2 Gb of Illumina MiSeq data were generated from the second individual. The inclusion of Nanopore reads for scaffolding, followed by gap-closing using Illumina data, led to a 29% reduction in the number of scaffolds and a 55% and 54% increase in the scaffold and contig N50, respectively. We also report the first Murray cod transcriptome, which was used to annotate the genome, leading to the identification of 26,539 protein-coding genes.
We present the whole genome of the Murray cod and anticipate that it will be a catalyst for a range of genetic, genomic and phylogenetic studies of the Murray cod and, more generally, of other fish species in the family Percichthyidae.
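
The N50 values quoted in the description summarise assembly contiguity: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal sketch of that calculation, not the pipeline used by the authors; the function name n50 and the toy lengths are illustrative assumptions.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Sequences are sorted from longest to shortest and accumulated until
    the running total reaches at least half of the combined length; the
    length of the sequence that crosses that threshold is the N50.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (hypothetical lengths, not taken from the dataset):
# total = 32, half = 16; the two longest sequences (8 + 8) reach 16,
# so the N50 is 8.
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))
```

Applied to the scaffold lengths in an assembly FASTA file, the same function would give the scaffold N50; applied to contig lengths, it would give the contig N50.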

Additional details

Read the peer-reviewed publication(s):

(PubMed: 28873963)

Accessions (data generated as part of this study):

BioProject: PRJNA290988
BioProject: PRJNA383091





Sample ID: SAMN03938494
Taxonomic ID: 135761
Common Name / Genbank Name: Murray cod
Scientific Name: Maccullochella peelii
Sample Attributes:
  Description: Sequencing of the Murray Cod genome
  Analyte type: DNA
  Alternative names: KMC200 or MCC0324
  ...

Sample ID: SAMN06759032
Taxonomic ID: 135761
Common Name / Genbank Name: Murray cod
Scientific Name: Maccullochella peelii
Sample Attributes:
  Description: Sequencing of the Murray Cod transcrip...
  Analyte type: RNA
  Alternative names: BrokenCreek1
  ...

Sample ID: SAMN07329765
Taxonomic ID: 135761
Common Name / Genbank Name: Murray cod
Scientific Name: Maccullochella peelii
Sample Attributes:
  Description: Sequencing of the Murray Cod genome
  Analyte type: DNA
  Isolation source: Fish Market in Melbourne, Victori...
  ...

Displaying 1-3 of 3 Sample(s).




Sample ID     Data Type         File Format  Size       Release Date
SAMN03938494  Genome sequence   FASTA        175.51 MB  2017-07-05
SAMN03938494  Genome sequence   FASTA        161.76 MB  2017-07-05
SAMN03938494  Genome sequence   FASTA        167.54 MB  2017-07-05
SAMN03938494  Annotation        GFF          507.95 MB  2017-07-05
SAMN03938494  Coding sequence   FASTA        17.42 MB   2017-07-05
SAMN03938494  Protein sequence  FASTA        8.12 MB    2017-07-05
SAMN03938494  Annotation        GFF          18.19 MB   2017-07-05
SAMN03938494  Text              TAR          296.22 MB  2017-07-05
SAMN03938494  Text              archive      1.89 MB    2017-07-05
SAMN03938494  Other             TSV          7.14 MB    2017-07-05

Displaying 1-10 of 12 File(s).

Funding body: Australian Research Council
Awardee: Prof P Sunnucks
Award ID: LP11020001

Date               Action
July 18, 2017      Dataset published
July 20, 2017      File removed: 4_cds.fa
July 20, 2017      File removed: 5_protein.fa
October 2, 2017    Manuscript link added: 10.1093/gigascience/gix063
November 9, 2022   Manuscript link updated: 10.1093/gigascience/gix063