Supporting data for "An improved pig reference genome sequence to enable pig genetics and genomics research"

Dataset type: Genomic, Transcriptomic
Data released on April 15, 2020

The domestic pig (Sus scrofa) is important both as a food source and as a biomedical model, given its similarity to humans in size, anatomy, physiology, metabolism, pathology and pharmacology. The draft reference genome (Sscrofa10.2) of a purebred Duroc female pig, established using older clone-based sequencing methods, was incomplete, and its utility was limited by unresolved redundancies, short-range order and orientation errors, and associated misassembled genes. We present two annotated, highly contiguous chromosome-level genome assemblies created with more recent long-read technologies and a whole-genome shotgun strategy: one for the same Duroc female (Sscrofa11.1) and one for an outbred, composite-breed male (USMARCv1.0). Both assemblies are of substantially higher (>90-fold) continuity and accuracy than Sscrofa10.2. These highly contiguous assemblies, plus annotation of a further 11 short-read assemblies, provide an unprecedented view of the genetic make-up of this important agricultural and biomedical model species. We propose that the improved Duroc assembly (Sscrofa11.1) become the reference genome for genomic research in pigs.

Additional details

Read the peer-reviewed publication(s):

(PubMed: 32543654)

Genome browser:

http://www.ensembl.org/Sus_scrofa/Info/Index

http://www.ensembl.org/Sus_scrofa/Info/Strains

https://www.ncbi.nlm.nih.gov/genome/gdv/browser/genome/?id=GCF_000003025.6

Accessions (data generated as part of this study):

BioProject: PRJNA13421
BioProject: PRJNA392765

Accessions (data referenced by this study):

BioProject: PRJNA309108
BioProject: PRJNA186497
BioProject: PRJNA144099
BioProject: PRJEB19386
BioProject: PRJEB33353
BioProject: PRJNA351265
BioProject: PRJEB9115





Sample ID | Taxonomic ID | Common Name | Genbank Name | Scientific Name | Sample Attributes
Bamei_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
Berkshire_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
Duroc_2-14 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for reference genome ...; Sex: female; Alternative accession-BioProject: PRJNA13421; ...
Hampshire_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
Jinhua_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
Landrace_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
Large_White_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
MARC1423004 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for reference genome ...; Sex: male; Alternative accession-BioProject: PRJNA392765; ...
Meishan_pig_v1 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: female; Alternative accession-BioProject: PRJNA309108; ...
minipig_v1.0 | 9823 | Pig | pig | Sus scrofa | Description: Genomic DNA used for annotation of non...; Sex: male; Alternative accession-BioProject: PRJNA144099; ...
Displaying 1-10 of 67 Sample(s).




File Name | Sample ID | Data Type | File Format | Size | Release Date
 |  | Structural variation | archive | 12.22 MB | 2018-11-03
 |  | Structural variation | archive | 12.36 MB | 2018-11-03
 |  | Structural variation | archive | 9.56 MB | 2018-11-03
 |  | Structural variation | archive | 9.56 MB | 2018-11-03
 |  | Image | PNG | 74.07 KB | 2018-11-03
 |  | Image | PNG | 78.77 KB | 2018-11-03
 |  | Image | PNG | 316.39 KB | 2018-11-03
 |  | Image | PNG | 235.89 KB | 2018-11-03
 |  | Image | PNG | 63.74 KB | 2018-11-03
 |  | Image | PNG | 13.69 KB | 2018-11-03
Displaying 1-10 of 73 File(s).

Funding body | Awardee | Award ID | Comments
Biotechnology and Biological Sciences Research Council | Alan L Archibald and Mick Watson | BBS/E/D/20211550 | Institute Strategic Programme Grant
Biotechnology and Biological Sciences Research Council | Alan L Archibald and Mick Watson | BBS/E/D/10002070 | Institute Strategic Programme Grant
Biotechnology and Biological Sciences Research Council | Nabeel Affara | BB/F021372/1 | Response Mode Grant
Biotechnology and Biological Sciences Research Council | Alan L Archibald | BB/M011461/1 | Response Mode Grant
Biotechnology and Biological Sciences Research Council | Paul Flicek | BB/M011615/1 | Response Mode Grant
Biotechnology and Biological Sciences Research Council | Alan L Archibald and Mick Watson | BB/M01844X/1 | Response Mode Grant
EU | Alan L Archibald | KBBE222664 | FP7 Programme Quantomics
Wellcome Trust | Paul Flicek | WT108749/Z/15/Z |
USDA | Derek M Bickhart and Benjamin D Rosen | 8042-31000-001-00-D | CRIS Project
USDA | Derek M Bickhart | 5090-31000-026-00-D | CRIS Project
USDA | Timothy P L Smith | 3040-31000-100-00-D | CRIS Project

Date | Action
April 15, 2020 | Dataset published
April 24, 2020 | Manuscript link added: 10.1093/gigascience/giaa051
October 7, 2022 | Manuscript link updated: 10.1093/gigascience/giaa051