Help Login Create account

Data released on March 07, 2017

Supporting data for "Enhancing Knowledge Discovery from Cancer Genomics Data with Galaxy"

Albuquerque, M, A; Boutros, P, C; Grande, B, M; Grewal, J, K; Jessa, S; Krzywinski, M; Morin, R, D; Pararajalingam, P; Ritch, E, J; Shah, S, P (2017): Supporting data for "Enhancing Knowledge Discovery from Cancer Genomics Data with Galaxy" GigaScience Database. RIS BibTeX Text

The field of cancer genomics has demonstrated the power of massively parallel sequencing techniques to inform on the genes and specific alterations that drive tumor onset and progression. Although large comprehensive sequence data sets continue to be made increasingly available, data analysis remains an ongoing challenge, particularly for laboratories lacking dedicated resources and bioinformatics expertise.
To address this, we have produced a collection of Galaxy tools that represent many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We developed new methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability and summarized their runtimes on multiple cloud service providers. Some tools represent extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate these to achieve data integration and visualizations are demonstrated on a cohort of 96 diffuse large B-cell lymphomas and enabled the discovery of multiple candidate lymphoma-related genes. Our toolkit is available from our GitHub repository as Galaxy tool and dependency definitions and has been deployed using virtualization on multiple platforms including Docker.

Contact Submitter

Read the peer-reviewed publication(s):

Albuquerque, M. A., Grande, B. M., Ritch, E. J., Pararajalingam, P., Jessa, S., Krzywinski, M., … Morin, R. D. (2017). Enhancing knowledge discovery from cancer genomics data with Galaxy. GigaScience, 6(5), 1–13. doi:10.1093/gigascience/gix015

Additional information:


Lymphoma Driver Cancer Genome Pipeline Workflow Tool Cloud 


Files: (FTP site) Table Settings


File Description
Sample ID
Data Type
File Format
Release Date
Download Link
File Attributes

File NameSample IDData TypeFile FormatSizeRelease Date 
ReadmeTEXT2 KB2017-03-02
GitHub archivearchive52.81 MB2017-03-02
Displaying 1-2 of 2 File(s).



Other datasets you might like: