Bacterial whole-genome read data from the Oxford Nanopore Technologies MinION™ nanopore sequencer.

Dataset type: Genomic
Data released on September 10, 2014

Quick J; Loman NJ (2014): Bacterial whole-genome read data from the Oxford Nanopore Technologies MinION™ nanopore sequencer. GigaScience Database.


The MinION is a new, portable single-molecule sequencer developed by Oxford Nanopore Technologies. It measures four inches in length and is powered from the USB 3.0 port of a laptop computer. By measuring the change in resistance produced when DNA strands translocate through and interact with a protein nanopore the device is able to deduce the underlying nucleotide sequence.
Here we present a read dataset from whole-genome shotgun sequencing of the model organism Escherichia coli K-12 substr. MG1655 generated on a MinION device with R7 chemistry during the early-access MinION Access Program (MAP).  
Three sequencing runs of the MinION were performed using R7 chemistry. The first run produced 43,656 forward reads (272Mb), 23,338 (125Mb) reverse reads, 20,087 two-direction (2D) reads (131Mb), of which 8% (10Mb) were full 2D. Full 2D reads means that the complementary strand was successfully slowed through the pore.
The R7 protocol has a modification to increase the relative number of full 2D reads (NONI). To exploit this, two new libraries were produced which included an overnight incubation stage (ONI-1 and ONI-2). Each library was run on an individual flow cell. This resulted in 6,534 & 8,260 forward reads, 2,171 & 2,945 reads and 1,740 & 2,394 2D reads (27%, 29%). In this case, 50% and 41.8% of reads were full 2D respectively. The mean fragment lengths for 2D reads from the three libraries were 6,543 (NONI) 6,907 (ONI-1) and 6,434 (ONI-2).
A fourth run was performed using a further modified chemistry (R7.3), which resulted in 39819 forward reads, 18889 complement reads, of which 11823 and 9563 were 2D and full 2D respectively.
Raw and assembled sequence data is provided to demonstrate the nature of data produced by the MinION™ platform and to encourage the development of customised methods for alignment and variant calling, de novo assembly and scaffolding. FAST5 files containing event data within the HDF5 container format are provided to assist with the development of improved base-calling methods.

Additional details

Read the peer-reviewed publication(s):

Accessions (data referenced by this study):

ENA: ERP007108

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
Ecoli_K12_MG1655511145E.coliE.coli Estimated genome size:4.6
Nucleic acid extraction:Invisorb Spin Cell Mini Ki...
Sample collection device or method:L-shaped spread...
Displaying 1-1 of 1 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
Ecoli_K12_MG1655Genome sequenceHDF546.37 GB2014-09-16
Ecoli_K12_MG1655Sequence assemblyFASTA187.01 MB2014-09-05
Ecoli_K12_MG1655Genome sequenceHDF557.17 GB2014-09-03
Ecoli_K12_MG1655Genome sequenceHDF521.09 GB2014-09-03
ISA-TabTEXT3.77 KB2014-09-30
ReadmeTEXT1.49 KB2014-09-03
Displaying 1-6 of 6 File(s).
Date Action
October 13, 2015 File Ecoli_R7_ONI.tgz updated
October 15, 2015 File Ecoli_R7_CombinedFasta.tgz updated
October 23, 2015 File Ecoli_R73.tgz updated
November 1, 2015 File Ecoli_R7_ONI.tgz updated
November 1, 2015 File Ecoli_R7_ONI.tgz updated
November 2, 2015 File Ecoli_R73.tgz updated
November 3, 2015 File Ecoli_R7_CombinedFasta.tgz updated