Genomic data from Escherichia coli O104:H4 isolate TY-2482

Dataset type: Genomic
Data released on June 03, 2011

The May 2011 outbreak of an E. coli infection in Europe resulted in serious concerns about the potential appearance of a new deadly strain of bacteria, Escherichia coli O104:H4 TY-2482. In response to this situation, and immediately after the reports of deaths, the University Medical Centre Hamburg-Eppendorf and BGI-Shenzhen worked together to sequence the bacterium and assess its human health risk. The bacterium’s genome was first sequenced using Life Technologies; Ion Torrent sequencing platform. According to the results of the draft assembly, the estimated genome size of this new E. coli strain is about 5.2 Mb. Sequence analysis indicated this bacterium is an EHEC serotype O104 E. coli strain. Comparative analysis showed that this bacterium has 93% sequence similarity with the EAEC 55989 E. coli strain, which was isolated in the Central African Republic and known to cause serious diarrhea. This strain of E. coli, however, has also acquired specific sequences that appear to be similar to those involved in the pathogenicity of hemorrhagic colitis and hemolytic-uremic syndrome. The acquisition of these genes may have occurred through horizontal gene transfer. To maximize its utility to the research community and aid those fighting the epidemic, this genomic data was released into the public domain under a CC0 license. To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to genomic data from the 2011 E. coli outbreak. This work is published from: China.

Additional details

Read the peer-reviewed publication(s):

(PubMed: 21793736)

Related datasets:

doi:10.5524/100001 IsCitedBy doi:10.5524/200125

Additional information:

Accessions (data generated as part of this study):

SRA: SRP006916
BioProject: PRJNA67657

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
TY-2482562E. coli Escherichia coli Isolate:TY-2482
Isolation source:stool sample from patient with he...
Displaying 1-1 of 1 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
TY-2482Genome sequenceFASTQ1.51 GB2012-02-28
TY-2482OtherEXCEL33.5 KB2012-02-28
TY-2482Sequence assemblyFASTA1.47 MB2012-02-28
TY-2482Sequence assemblyFASTA1.55 MB2012-02-28
TY-2482Sequence assemblyFASTA1.57 MB2012-02-28
TY-2482Sequence assemblyFASTA1.68 MB2012-02-28
TY-2482Sequence assemblyFASTA50 KB2012-02-28
TY-2482Sequence assemblyFASTA1.57 MB2012-02-28
ReadmeTEXT3.76 KB2019-01-28
ReadmeTEXT1.4 KB2012-02-28
Displaying 1-10 of 19 File(s).
Funding body Awardee Award ID Comments
National Natural Science Foundation of China M Li 31530068
Date Action
September 15, 2017 Relationship added : DOI 200029
September 15, 2017 Relationship removed : DOI 200029
October 2, 2017 Manuscript Link added : 10.1093/gigascience/gix082
October 2, 2017 Manuscript Link added : 10.1093/gigascience/gix078
March 12, 2018 Funder added : National Institutes of Health
July 30, 2018 Funder added : National Natural Science Foundation of China
August 21, 2018 Link added : PRJEB21098
August 21, 2018 Link removed : PRJEB21098
December 5, 2018 External Link removed : - no longer active, no alternative found
January 28, 2019 Additional file readme_100001.txt added
January 28, 2019 File readme_100001.txt updated
July 3, 2019 Relationship added : DOI 200096
July 3, 2019 Relationship removed : DOI 200096
August 1, 2019 Relationship added : DOI 200097
August 1, 2019 Relationship removed : DOI 200097
November 16, 2019 Relationship added : DOI 200099
November 16, 2019 Relationship removed : DOI 200099
December 14, 2019 Relationship added : DOI 100676
December 14, 2019 Relationship removed : DOI 100676
April 14, 2020 Relationship added : DOI 100721
April 14, 2020 Relationship removed : DOI 100721
August 18, 2020 Relationship added : DOI 100780
August 18, 2020 Relationship removed : DOI 100780
August 20, 2020 Additional file readme_100782.txt added
August 20, 2020 readme_100782.txt: additional file attribute added
August 24, 2020 Relationship added : DOI 200125
November 17, 2020 File 110601_I238_FCB067HABXX_L3_ESCqslRAADIAAPEI-2_1.fq.gz updated
November 23, 2020 Relationship added : DOI 200129
November 23, 2020 Relationship removed : DOI 200129
August 20, 2021 Additional file short_summary_output2.txt added
October 11, 2021 Additional file Drought_tolerance_horsegram_genes.csv added