Supporting data for ‘Piggy: A Rapid, Large-Scale Pan-Genome Analysis Tool for Intergenic Regions in Bacteria’
Dataset type: Software, Genomic
Data released on February 22, 2018
Despite overwhelming evidence that variation in intergenic regions (IGRs) in bacteria can directly influence phenotypes, most current approaches for analysing pan-genomes focus exclusively on protein-coding sequences. To address this we present Piggy, a novel pipeline that emulates Roary except that it is based only on IGRs. We demonstrate the use of Piggy for pan-genome analyses of Staphylococcus aureus and Escherichia coli using large genome datasets. For S. aureus, we show that highly divergent (“switched”) IGRs are associated with differences in gene expression, and we establish a multi-locus reference database of IGR alleles (igMLST; implemented in BIGSdb). Piggy is available at https://github.com/harry-thorpe/piggy and registered with SciCrunch (RRID: SCR_015941).
Read the peer-reviewed publication(s):
Thorpe, H. A., Bayliss, S. C., Sheppard, S. K., & Feil, E. J. (2018). Piggy: a rapid, large-scale pan-genome analysis tool for intergenic regions in bacteria. GigaScience, 7(4). doi:10.1093/gigascience/giy015