Genome sequence of YH: the first diploid genome sequence of a Han Chinese individual.

Dataset type: Genomic
Data released on July 06, 2011

Genomic data from the YH (Homo sapiens) genome – first diploid genome sequence of a Han Chinese, a representative of the Asian population. The genomic DNA used in this study came from an anonymous male Han Chinese individual who has no known genetic diseases. The YH genome was assembled based on 3.3 billion reads using the Illumina Genome Analyzer. We achieved 117.7G nucleotides data and the genome was sequenced to 36-fold average coverage. By aligning the short reads with SOAP, 102.9G nucleotides were mapped onto the NCBI reference genome and 99.97% of the genome was covered. The raw sequences, alignments, consensus genome, variants and relevant tools are released for public use under a CC0 license.

Additional details

Read the peer-reviewed publication(s):

(PubMed: 18987735)

Related datasets:

doi:10.5524/100015 IsSupplementedBy doi:10.5524/100013
doi:10.5524/100015 IsSupplementedBy doi:10.5524/100014
doi:10.5524/100015 IsPreviousVersionOf doi:10.5524/100038 (It is a more recent version of this dataset)

There is a new version of this dataset available at: DOI: 10.5524/100038

Additional information:

Genome browser:

Accessions (data generated as part of this study):

ENA: ERP000053
BioProject: PRJEA39173


Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
YH9606HumanhumanHomo sapiens
Displaying 1-1 of 1 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
YHGenome sequenceFASTQ7.71 MB2011-07-06
YHGenome sequenceFASTQ13.31 MB2011-07-06
YHGenome sequenceFASTQ20.4 MB2011-07-06
YHGenome sequenceFASTQ22.24 MB2011-07-06
YHGenome sequenceFASTQ28.43 MB2011-07-06
YHGenome sequenceFASTQ27.77 MB2011-07-06
YHGenome sequenceFASTQ79.88 MB2011-07-06
YHGenome sequenceFASTQ56.16 MB2011-07-06
YHGenome sequenceFASTQ21.75 MB2011-07-06
YHGenome sequenceFASTQ22.03 MB2011-07-06
Displaying 1-10 of 1244 File(s).
Date Action