Help Login Create account

Data released on September 29, 2017

Supporting data for the near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum

Clavijo, B; Hall, R; Kingan, S; Puiu, D; Salzberg, S, L; Zimin, A, V (2017): Supporting data for the near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum GigaScience Database. http://dx.doi.org/10.5524/100356 RIS BibTeX Text

Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15,344,693,583 bases and has a weighted average (N50) contig size of 232,659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4,179,762,575 bp of T. aestivum that correspond to its D genome components.

Contact Submitter

Accessions (data included in GigaDB):

BioProject: PRJNA392179
GenBank: NMPL00000000

Keywords:

genome assembly wheat genome next-generation sequencing plant genomics 

Genomic

http://gigadb.org/images/data/cropped/100356.jpg

Funding:

  • Funding body - National Human Genome Research Institute
  • Award ID - R01HG006677
  • Awardee - SL Salzberg
  • Funding body - National Science Foundation
  • Award ID - IOS-1444893
  • Comment - Directorate for Biological Sciences
  • Awardee - AV Zimin
  • Funding body - National Science Foundation
  • Award ID - IOS-1238231
  • Comment - Directorate for Biological Sciences
  • Awardee - SL Salzberg
  • Funding body - National Science Foundation
  • Award ID - IOS-1238231
  • Comment - Directorate for Biological Sciences
  • Awardee - J Dvorak

Samples: Table Settings

Columns:

Common Name
Scienfic Name
Sample Attributes
Taxonomic ID
Genbank Name

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
Triticum aestivum WGS4565Canadian hard winter wheatbread wheatTriticum aestivum Description:Genomic DNA extracted from leaves of t...
Analyte type:DNA
Alternative accession-BioSample:SAMN07284949
...
+
Displaying 1-1 of 1 Sample(s).

Files: (FTP site) Table Settings

Columns:

File Description
Sample ID
File Type
File Format
Size
Release Date
Download Link
File Attributes

File NameSample IDFile TypeFile FormatSizeRelease Date 
ReadmeTEXT4.47 KB2017-09-26
Mixed archiveTAR1.15 MB2017-09-26
OtherTEXT2.94 MB2017-09-26
otherTAR125.55 MB2017-09-26
OtherTAR142.91 MB2017-09-26
Genome sequenceFASTA3.27 GB2017-09-26
Sequence assemblyFASTA3.31 GB2017-09-26
Sequence assemblyFASTA3.71 GB2017-09-26
Genome sequenceFASTA3.89 GB2017-09-26
Sequence assemblyFASTA3.92 GB2017-09-26
Displaying 1-10 of 13 File(s).

History:

+

Other datasets you might like: