Supporting data for "Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples"

Dataset type: Genomic
Data released on August 04, 2017

Arulandhu AJ; Barreto Crespo MT; Birck M; Burns M; Costessi A; Duijsings D; Gaspar FB; Hagelaar R; Haynes E; Hochegger R; Holst-Jensen A; Prins TW; Rechenmann F; Scholtens IMJ; Staats M; Voorhuijzen MM; Klingl A; Lundberg L; Natale C; Niekamp H; Barbante A; Perri E; Rosec J; Seyfarth R; Sovova T; Moorleghem CV; Ruth Sv; Peelen T; Kok E (2017): Supporting data for "Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples" GigaScience Database.


DNA metabarcoding provides great potential for species identification in complex samples such as food supplements and traditional medicines. Such a method would aid CITES (the Convention on International Trade in Endangered Species of Wild Fauna and Flora) enforcement officers to combat wildlife crime by preventing illegal trade of endangered plant and animal species. The objective of this research was to develop a multi-locus DNA metabarcoding method for forensic wildlife species identification and to evaluate the applicability and reproducibility of this approach across different laboratories. A DNA metabarcoding method was developed that makes use of 12 DNA barcode markers that have demonstrated universal applicability across a wide range of plant and animal taxa, and that facilitate the identification of species in samples containing degraded DNA. The DNA metabarcoding method was developed based on Illumina MiSeq amplicon sequencing of well-defined experimental mixtures, for which a bioinformatics pipeline with user-friendly web interface was developed. The performance of the DNA metabarcoding method was assessed in an international validation trial by 16 laboratories, in which the method was found to be highly reproducible and sensitive enough to identify species present in a mixture at 1% dry weight content. The advanced multi-locus DNA metabarcoding method assessed in this study provides reliable and detailed data on the composition of complex food products, including information on the presence of CITES-listed species. The method can provide improved resolution for species identification, while verifying species with multiple DNA barcodes contributes to an enhanced quality assurance.

Additional details

Read the peer-reviewed publication(s):

Arulandhu, A. J., Staats, M., Hagelaar, R., Voorhuijzen, M. M., Prins, T. W., Scholtens, I., … Kok, E. (2017). Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples. GigaScience, 6(10), 1–18. doi:10.1093/gigascience/gix080

Additional information:

Accessions (data included in GigaDB):

BioProject: PRJEB18620

Sample IDTaxonomic IDCommon NameGenbank NameScientific NameSample Attributes
EM_21427524  mixed sample Description:Experimental mixture containing 90% Bos taurus and 10% Lactuca sativa
Alternative accession-SRA Sample:ERS1545973
Alternative accession-BioSample:SAMEA80893918
EM_31427524  mixed sample Description:Experimental mixture containing 1% Bos taurus and 99% Lactuca sativa
Alternative accession-SRA Sample:ERS1545974
Alternative accession-BioSample:SAMEA80894668
EM_41427524  mixed sample Description:Experimental mixture containing 10% Bos taurus and 90% Lactuca sativa
Alternative accession-SRA Sample:ERS1545975
Alternative accession-BioSample:SAMEA80895418
EM_51427524  mixed sample Description:Experimental mixture containing 99% Bos taurus and 1% Aloe variegata
Alternative accession-SRA Sample:ERS1545976
Alternative accession-BioSample:SAMEA80896168
EM_61427524  mixed sample Description:Experimental mixture containing 95% Bos taurus, 1% Parapenaeopsis sp., 1% Anguilla anguilla, 1% Crocodylus niloticus, 1% Gallus gallus domesticus and 1% Pieris brassicae
Alternative accession-SRA Sample:ERS1545977
Alternative accession-BioSample:SAMEA80896918
EM_71427524  mixed sample Description:Experimental mixture containing 85% Bos taurus, 3% Parapenaeopsis sp., 3% Anguilla anguilla, 3% Crocodylus niloticus, 3% Gallus gallus domesticus and 3% Pieris brassicae
Alternative accession-SRA Sample:ERS1545978
Alternative accession-BioSample:SAMEA80897668
EM_81427524  mixed sample Description:Experimental mixture containing 1% Echinocactus sp., 1% Euphorbia sp., 1% Aloe variegata, 1% Dendrobium sp., 1% Cycas revoluta and 95% Lactuca Sativa
Alternative accession-SRA Sample:ERS1545979
Alternative accession-BioSample:SAMEA80898418
EM_91427524  mixed sample Description:Experimental mixture containing 3% Echinocactus sp., 3% Euphorbia sp., 3% Aloe variegata, 3% Dendrobium sp., 3% Cycas revoluta and 85% Lactuca Sativa
Alternative accession-SRA Sample:ERS1545980
Alternative accession-BioSample:SAMEA80899168
EM_101427524  mixed sample Description:Experimental mixture containing 10% Bos taurus, 10% Parapenaeopsis sp., 10% Anguilla anguilla, 10% Crocodylus niloticus, 10% Gallus gallus domesticus, 10% Pieris brassicae, 10% Echinocactus sp., 10% Euphorbia sp., 10% Aloe variegata, 10% Dendrobium sp., 10% Cycas revoluta and 10% Lactuca Sativa
Alternative accession-SRA Sample:ERS1545981
Alternative accession-BioSample:SAMEA80899918
EM_11427524  mixed sample Description:Experimental mixture containing 99% Bos taurus and 1% Lactuca sativa
Alternative accession-SRA Sample:ERS1545972
Alternative accession-BioSample:SAMEA80893168
Displaying 1-10 of 177 Sample(s).

File NameSample IDData TypeFile FormatSizeRelease Date 
GitHub archivearchive13.58 MB2017-07-28
S5_Lab_5Tabular dataTSV7.83 KB2017-07-28
S5_Lab_5Tabular dataTSV4.97 KB2017-07-28
S5_Lab_5Tabular dataTSV8.55 KB2017-07-28
S5_Lab_5Tabular dataTSV8.11 KB2017-07-28
S5_Lab_5Tabular dataTSV0.81 KB2017-07-28
S5_Lab_5Tabular dataTSV7.63 KB2017-07-28
S5_Lab_5Tabular dataTSV14.48 KB2017-07-28
S5_Lab_5Tabular dataTSV15.1 KB2017-07-28
S5_Lab_5Tabular dataTSV14.63 KB2017-07-28
Displaying 1-10 of 14 File(s).
Funding body Awardee Award ID Comments
European Commission FP7-KBBE-2013-7-613908-Decathlon Seventh Framework Programme (FP7)

Date Action
August 4, 2017 Dataset publish
October 2, 2017 Manuscript Link added : 10.1093/gigascience/gix080