Supporting data for "PM4NGS, a project management framework for Next-Generation Sequencing data analysis"

Dataset type: Software, Transcriptomic, Workflow, Bioinformatics
Data released on November 13, 2020

Vera-Alvarez R; Pongor L; Mariño-Ramírez L; Landsman D (2020): Supporting data for "PM4NGS, a project management framework for Next-Generation Sequencing data analysis" GigaScience Database. http://dx.doi.org/10.5524/100833

DOI10.5524/100833

FAIR (Findability, Accessibility, Interoperability, and Reusability) Next-Generation Sequencing (NGS) data analysis rely on complex computational biology workflows and pipelines to guarantee reproducibility, portability and scalability. Moreover, workflow languages, managers and container technologies have helped address the problem of data analysis pipeline execution across multiple platforms in scalable ways.
Here, we present a project management framework for NGS data analysis called PM4NGS. This framework is comprised of an automatic creation of a standard organizational structure of directories and files, bioinformatics tool management, using Docker or Bioconda, and data analysis pipelines in common workflow language (CWL) format. Pre-configured Jupyter notebooks with minimum Python code are included in PM4NGS to produce a project report and publication-ready figures. Three pipelines are presented in this manuscript for demonstration purposes including the analysis of RNA-Seq, ChIP-Seq and ChIP-exo datasets.
PM4NGS is an open-source framework that creates a standard organizational structure for NGS data analysis projects. PM4NGS is easy to install, configure and use by non-bioinformaticians on personal computers and laptops. It permits execution of the NGS data analysis on Windows 10 with the Windows Subsystem for Linux feature activated. The framework aims to reduce the gap between researcher in experimental laboratories producing NGS data and workflows for data analysis. PM4NGS docs

Additional details

Read the peer-reviewed publication(s):

(PubMed: 33410471)

Additional information:

https://pm4ngs.readthedocs.io/en/latest/

Github links:

https://github.com/ncbi/pm4ngs-rnaseq





File NameSample IDData TypeFile FormatSizeRelease Date 
GitHub archivezip12.69 MB2020-11-10
ReadmeTEXT2.6 KB2020-11-13
Displaying 1-2 of 2 File(s).
Funding body Awardee Award ID Comments
U.S. National Library of Medicine Intramural Research Program
Date Action
November 13, 2020 Dataset publish
December 14, 2020 Manuscript Link added : 10.1093/gigascience/giaa141
March 28, 2022 Manuscript Link updated : 10.1093/gigascience/giaa141