Supporting data for "A draft genome assembly of the Chinese sillago (Sillago sinica), the first reference genome for Sillaginidae fishes"

Dataset type: Genomic
Data released on August 16, 2018

Xu S; Xiao S; Zhu S; Zeng X; Luo J; Liu J; Gao T; Chen N (2018): Supporting data for "A draft genome assembly of the Chinese sillago (Sillago sinica), the first reference genome for Sillaginidae fishes" GigaScience Database. http://dx.doi.org/10.5524/100490

DOI: 10.5524/100490

Sillaginidae, also known as smelt-whitings, is a family of benthic coastal marine fishes in the Indo-West Pacific with high ecological and economic importance. Many Sillaginidae species, including the Chinese sillago (Sillago sinica), have been recently described in China, providing valuable material for analyzing the genetic diversification of the family. Herein, we constructed a reference genome for the Chinese sillago, with the aim of setting up a platform for comparative analysis of all species in this family.
Using the single-molecule real-time DNA sequencing platform PacBio Sequel, we generated ~27.3 Gb of genomic DNA sequence for the Chinese sillago. We reconstructed a genome assembly of 534 Mb using a strategy that takes advantage of the complementary strengths of two genome assembly programs, Canu and FALCON. The assembly size was consistent with the genome size estimated by k-mer analysis. The assembled genome consisted of 802 contigs with a contig N50 length of 2.6 Mb. We annotated 22,122 protein-coding genes in the Chinese sillago genome using de novo prediction together with RNA-seq data and homology to other teleosts. According to the phylogenetic analysis using protein-coding genes, the Chinese sillago was closely related to Larimichthys crocea and Dicentrarchus labrax, and it diverged from their common ancestor around 69.5-82.6 million years ago.
Using long reads generated with PacBio sequencing technology, we have built a draft genome assembly for the Chinese sillago, which is the first reference genome for a Sillaginidae species. This genome assembly sets the stage for comparative analysis of the diversification and adaptation of fishes in Sillaginidae.
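
The contig N50 and the sequencing depth implied by the figures above can be checked with a short calculation. The following is a minimal Python sketch, not the authors' pipeline: the function names `n50` and `mean_depth` are illustrative, the toy contig lengths are made up, and only the ~27.3 Gb and 534 Mb values come from the description.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("lengths must not be empty")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def mean_depth(total_read_bases, assembly_bases):
    """Average sequencing depth: total read bases divided by assembly size."""
    return total_read_bases / assembly_bases


# Toy contig lengths (hypothetical, for illustration only).
toy_contigs = [40, 30, 20, 10]
print("toy N50:", n50(toy_contigs))

# Values taken from the description: ~27.3 Gb of reads, 534 Mb assembly.
print("approximate depth:", round(mean_depth(27.3e9, 534e6), 1))
```

With the values from the description, the depth estimate comes to roughly 51-fold raw long-read coverage of the assembly.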

Additional details

Read the peer-reviewed publication(s):

(PubMed: 30202912)

Accessions (data generated as part of this study):

BioProject: PRJNA437933





Sample ID | Taxonomic ID | Common Name | Genbank Name | Scientific Name | Sample Attributes
SAMN08813687 | 907714 | | | Sillago sinica | Alternative names: apple snail; Description: Chinese sillago DNA extracted from the...; Strain: Chinese sillago; ...
SAMN08903124 | 907714 | | | Sillago sinica | Specific host: N/A; Description: Chinese sillago RNA extracted from a 5...; Strain: Chinese sillago; ...
Displaying 1-2 of 2 Sample(s).




File Name | Sample ID | Data Type | File Format | Size | Release Date
 | | Text | TEXT | 5.13 MB | 2018-08-14
 | | Readme | TEXT | 2.48 KB | 2018-08-02
 | | Coding sequence | FASTA | 37.07 MB | 2018-08-02
 | | Sequence assembly | FASTA | 515.93 MB | 2018-08-02
 | | Annotation | GFF | 15.56 MB | 2018-08-02
 | | Annotation | GFF | 42.75 KB | 2018-08-02
 | | Protein sequence | FASTA | 13.07 MB | 2018-08-02
 | | Repeat sequence | GFF | 164.46 MB | 2018-08-02
 | | Annotation | GFF | 2.83 KB | 2018-08-02
 | | Annotation | GFF | 42.97 KB | 2018-08-02
Displaying 1-10 of 11 File(s).
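
The FASTA and GFF files listed above lend themselves to quick sanity checks, such as totalling sequence lengths or counting annotated features by type. The following is a minimal Python sketch under stated assumptions: the helper names `fasta_lengths` and `gff_feature_counts` are hypothetical, the inline toy records stand in for the real files (whose names are not shown in this listing), and it assumes the GFF files follow the usual nine-column, tab-separated layout with the feature type in the third column.

```python
import io
from collections import Counter


def fasta_lengths(handle):
    """Return {sequence_id: length} for a FASTA text stream."""
    lengths = {}
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The ID is the first whitespace-delimited token of the header.
            tokens = line[1:].split()
            current = tokens[0] if tokens else ""
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def gff_feature_counts(handle):
    """Count features by type (column 3) in a GFF text stream."""
    counts = Counter()
    for line in handle:
        # Skip directives, comments, and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9:
            continue  # not a well-formed feature line
        counts[fields[2]] += 1
    return counts


# Toy records (hypothetical), standing in for the real dataset files.
toy_fasta = ">ctg1 toy contig\nACGTAC\nGT\n>ctg2\nACG\n"
toy_gff = (
    "##gff-version 3\n"
    "ctg1\ttoy\tgene\t1\t8\t.\t+\t.\tID=g1\n"
    "ctg1\ttoy\tmRNA\t1\t8\t.\t+\t.\tID=t1;Parent=g1\n"
    "ctg2\ttoy\tgene\t1\t3\t.\t-\t.\tID=g2\n"
)

lengths = fasta_lengths(io.StringIO(toy_fasta))
print("total bases:", sum(lengths.values()))
print("feature counts:", dict(gff_feature_counts(io.StringIO(toy_gff))))
```

To run the same checks on a downloaded file, pass an open text handle, for example `open(path)`, in place of the `io.StringIO` objects. Whether the count of `gene` features matches the 22,122 protein-coding genes reported in the description depends on how the annotation file labels its features.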

Funding body | Awardee | Award ID | Comments
National Natural Science Foundation of China | T Gao | 41776171 |
National Natural Science Foundation of China | T Gao | 31572227 |
National Natural Science Foundation of China | S Xiao | 31602207 |

Date | Action
August 16, 2018 | Dataset publish
August 28, 2018 | Manuscript Link added: 10.1093/gigascience/giy108
November 11, 2022 | Manuscript Link updated: 10.1093/gigascience/giy108