Help Login Create account

Data released on January 16, 2017

Supporting data for "drVM: a new tool for efficient genome assembly of known eukaryotic viruses from metagenomes"

Liao, Y; Lin, H (2017): Supporting data for "drVM: a new tool for efficient genome assembly of known eukaryotic viruses from metagenomes" GigaScience Database. RIS BibTeX Text

Virus discovery using high-throughput next-generation sequencing (NGS) has become more commonplace. However, although analysis of deep NGS data allows us to identity potential pathogens, the entire analytical procedure requires competency in the bioinformatics domain, which includes implementing proper software packages and preparing prerequisite databases. Simple and user-friendly bioinformatics pipelines are urgently required to obtain complete viral genome sequences from metagenomic data.
Here we present a pipeline, drVM (detect and reconstruct known viral genomes from metagenomes), for rapid viral read identification, genus-level read partition, read normalization, de novo assembly, sequence annotation and coverage profiling. The first two procedures and sequence annotation rely on known viral genomes as a reference database. We also present the validation results of the analysis of over 300 previously published sequencing runs, to provide complete viral genome assemblies for a variety of virus types including DNA viruses, RNA viruses and retroviruses.
drVM is available for free download here and is also assembled as a Docker container, an Amazon machine image and a virtual machine to facilitate seamless deployment. GigaDB is hosting the version of the software as reviewed in the associated article, please refer to the sourceforge site for the most recent updates to the software.

Contact Submitter

Read the peer-reviewed publication(s):

Lin, H.-H., & Liao, Y.-C. (2017). drVM: a new tool for efficient genome assembly of known eukaryotic viruses from metagenomes. GigaScience, 6(2), 1–10. doi:10.1093/gigascience/gix003

Additional information:


Next-generation sequencing (NGS) bioinformatics metagenomics detection reconstruction virus 

Software, Genomic


  • Funding body - National Health Research Institute Taiwan
  • Award ID - PH-105-PP-05
  • Funding body - Ministry of Science and Technology Taiwan
  • Award ID - MOST-104-2320-B-400-021-MY2

Files: (FTP site) Table Settings


File Description
Sample ID
Data Type
File Format
Release Date
Download Link
File Attributes

File NameSample IDData TypeFile FormatSizeRelease Date 
Mixed archivearchive223.67 MB2017-01-12
Mixed archivearchive1.19 GB2017-01-12
Mixed archivearchive36.44 MB2017-01-12
Mixed archiveTAR6.65 MB2017-01-12
Mixed archivearchive2.55 GB2017-01-12
ArticlePDF2.7 MB2017-01-12
TextTEXT1.86 KB2017-01-12
Mixed archiveTAR1.55 MB2017-01-12
Mixed archivearchive1.36 GB2017-01-12
Mixed archiveTAR419.99 MB2017-01-12
Displaying 1-10 of 13 File(s).



Other datasets you might like: