Supporting data for the "Draft genome of the Chinese mitten crab, Eriocheir sinensis".
Dataset type: Genomic
Data released on January 18, 2016
The Chinese mitten crab, Eriocheir sinensis, is one of the most economically important representatives of the Crustacea in China, and an invasive species in much of Europe and North America. Its transition from swimming type to crawling during its early development process, anadromous migration for growth, and catadromous migration for breeding make it an attractive model for research. However, knowledge about the underlying molecular mechanisms to regulate these processes is still very limited. A total of 258.8-Gb raw reads for crab whole-genome sequencing were generated using the Illumina HiSeq2000 platform, and available from the SRA. Here we report the final genome assembly (1.12 Gb), about 67.5% of the estimated genome size (1.66 Gb), and composed of 17,553 scaffolds (>2 kb) with a N50 of 224 kb. We identified 14,436 genes using AUGUSTUS, of which 7,549 were shown to have significant supporting evidence using the Glean pipeline. This gene number is much higher than that of the Horseshoe crab (see GigaDB dataset 100091) and the annotation completeness evaluated by CEGMA is 66.9%.
Read the peer-reviewed publication(s):
Accessions (data generated as part of this study):
|Sample ID||Taxonomic ID||Common Name||Genbank Name||Scientific Name||Sample Attributes|
|Chinese mitten crab||95602||Chinese hairy crab||Chinese mitten crab||Eriocheir sinensis|| Tissue:muscle|
Geographic location (country and/or sea,region):Panjin, Liaoning Province, China.