Help Login Create account

Data released on March 07, 2014

Genomic data of the diploid cotton (Gossypium raimondii).

Cong, L; Gou, C; Kohel, R, J; Li, F; Li, Q; Liu, K; Lu, C; Percy, R, G; Shang, H; Shi, N; Song, C; Song, G; Wang, B; Wang, J; Wang, J; Wang, K; Wang, Z; Wei, H; Ye, W; Yin, Y; Yu, J, Z; Yu, S; Yuan, Y; Yue, Z; Zhang, X; Zheng, Z; Zhu, S; Zhu, Y; Zou, C (2014): Genomic data of the diploid cotton (Gossypium raimondii). GigaScience Database. RIS BibTeX Text

Cotton is one of the most economically important crop plants worldwide. Its fiber, commonly known as cotton lint, is the principal natural source for the textile industry.
We have sequenced and assembled a draft genome of G. raimondii, whose progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species Gossypium hirsutum and Gossypium barbadense.
We sequenced the 0.78 Gb genome to a depth of approximately 103 X with short reads from a series of libraries with various insert sizes ( 170 bp, 250 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb, 20 kb and 40 kb) on a HiSeq 2000 sequencer.
The assembled scaffolds of high quality sequences total 78.7 Gb, with the contig and scaffold N50 values of 44.9 kb and 2.3 Mb respectively. We identified 40,976 protein-coding genes with an mean length of 1104 bb.

Contact Submitter

Related manuscripts:

doi:10.1038/ng.2371 (PubMed: 22922876)

Accessions (data included in GigaDB):

BioProject: PRJNA82769


Samples: Table Settings


Common Name
Scienfic Name
Sample Attributes
Taxonomic ID
Genbank Name

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
Diploid Cotton CMD1029730Gossypium raimondii Cultivar:CMD10
Geographic location (country and/or sea,region):Ch...
Geographic location (latitude and longitude):not r...
Displaying 1-1 of 1 Sample(s).

Files: (FTP site) Table Settings


File Description
Sample ID
File Type
File Format
Release Date
Download Link
File Attributes

File NameSample IDFile TypeFile FormatSizeRelease Date 
Diploid Cotton CMD10Coding sequenceFASTA46.35 MB2014-03-07
Diploid Cotton CMD10Sequence assemblyFASTA739.33 MB2014-03-07
Diploid Cotton CMD10AnnotationGFF17.94 MB2014-03-07
Diploid Cotton CMD10Protein sequenceFASTA18.06 MB2014-03-07
ReadmeUNKNOWN0.97 KB2014-03-07
Displaying 1-5 of 5 File(s).

Other datasets you might like: