Supporting data for "A workflow for simplified analysis of ATAC-cap-seq data in R"

Dataset type: Workflow, Software
Data released on June 19, 2018

Shrestha RK; Ding P; Jones JDG; MacLean D (2018): Supporting data for "A workflow for simplified analysis of ATAC-cap-seq data in R" GigaScience Database. http://dx.doi.org/10.5524/100476

DOI10.5524/100476

ATAC-cap-seq is a high-throughput sequencing method that combines targeted nucleic acid enrichment of precipitated DNA fragments with an upstream ATAC-seq step. There are increased analytical difficulties arising from working with a set of regions of interest that may be small in number and biologically dependent. Common statistical pipelines for RNAseq might be assumed to apply but can give misleading results on ATAC-cap-seq data. A tool is needed to allow a non-specialist user to quickly and easily summarise data and apply sensible and effective normalisation and analysis. We developed atacR to allow a user to easily analyse their ATAC enrichment experiment. It provides comprehensive summary functions and diagnostic plots for studying enriched tag abundance. Applying between-sample normalisation is made straightforward and functions for normalising based on user-defined control regions, whole library size and regions selected from the least variable regions in a dataset are provided. Three methods for detecting differential abundance of tags from enriched methods are provided, including Bootstrap t, Bayes Factor and a wrapped version of the standard exact test in the edgeR package. We compared the precision, recall and F-score of each detection method on resampled datasets at varying replicate, significance threshold and genes changed, we found that the Bayes factor method had greatest overall detection power, though edgeR was slightly stronger in simulations with lower numbers of genes changed. Our package allows a non-specialist user to easily and effectively apply methods appropriate to the analysis of ATAC-cap-seq in a reproducible manner. The package is implemented in pure R and is fully interoperable with common workflows in Bioconductor.

Additional details

Read the peer-reviewed publication(s):

Shrestha, R. K., Ding, P., Jones, J. D. G., & MacLean, D. (2018). A workflow for simplified analysis of ATAC-cap-seq data in R. GigaScience, 7(7). doi:10.1093/gigascience/giy080

Additional information:

https://github.com/TeamMacLean/atacr





File NameSample IDData TypeFile FormatSizeRelease Date 
HTMLHTML1.31 MB2018-06-14
R MarkdownRMD7.62 KB2018-06-14
HTMLHTML1003.32 KB2018-06-14
R MarkdownRMD3.85 KB2018-06-14
GitHub archivearchive17.2 MB2018-06-14
ReadmeTEXT2.92 KB2018-06-14
Tabular DataCSV126.37 KB2018-06-14
Displaying 1-7 of 7 File(s).
Funding body Awardee Award ID Comments
EU H2020 P Ding 656243 Marie Skłodowska-Curie Grant
Date Action
June 19, 2018 Dataset publish
July 4, 2018 Manuscript Link added : 10.1093/gigascience/giy080