Supporting data for "A Chromosomal-Level Genome Assembly for the insect vector for Chagas disease, Triatoma rubrofasciata"

Dataset type: Genomic, Transcriptomic
Data released on July 01, 2019

Liu Q; Guo YH; Zhang Y; Hu W; Li YY; Zhu D; Zhou ZB; Wu JT; Chen NS; Zhou XN (2019): Supporting data for "A Chromosomal-Level Genome Assembly for the insect vector for Chagas disease, Triatoma rubrofasciata". GigaScience Database. http://dx.doi.org/10.5524/100614

DOI: 10.5524/100614

Triatoma rubrofasciata is a widespread pathogen vector for Chagas disease, an illness that affects approximately seven million people worldwide. Despite its importance to human health, its evolutionary origin has not been conclusively determined. A reference genome for T. rubrofasciata is not yet available.
We sequenced the genome of a female T. rubrofasciata individual using a single-molecule DNA sequencing technology (the PacBio Sequel platform) and reconstructed a whole-genome assembly of 680 Mb that covers 90% of the nuclear genome (757 Mb). Through Hi-C analysis, we reconstructed full-length chromosomes of this female individual, which has 13 unique chromosomes (2n = 24 = 22 + X1 + X2), with a contig N50 of 2.72 Mb and a scaffold N50 of 50.7 Mb. The assembly achieved a high base-level accuracy of 99.99%. This platinum-grade genome assembly has 12,691 annotated protein-coding genes. More than 95.1% of BUSCO genes were complete and single-copy, indicating a high level of genome completeness.
The platinum-grade genome assembly and its annotation provide valuable information for future in-depth comparative genomics studies, including analysis of sex determination in T. rubrofasciata and of the pathogenesis of Chagas disease.
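The summary figures in the abstract rest on a few simple calculations. The sketch below is illustrative only: it shows how an N50 value is conventionally computed and re-derives the coverage and chromosome-count figures from the numbers quoted above. The function name `n50` and the toy contig lengths are hypothetical and are not taken from the dataset or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    account for at least half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy contig lengths (hypothetical, for illustration only).
toy_contigs = [50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Figures quoted in the abstract.
assembled_mb = 680       # assembly size
nuclear_genome_mb = 757  # nuclear genome size
coverage = assembled_mb / nuclear_genome_mb

# Karyotype 2n = 24 = 22 + X1 + X2: 22 autosomes form 11 pairs,
# and X1 and X2 are two distinct sex chromosomes.
unique_chromosomes = 22 // 2 + 2

print(toy_n50)
print(f"{coverage:.1%}")
print(unique_chromosomes)
```

With the abstract's numbers, the coverage evaluates to roughly 89.8%, which the abstract rounds to 90%, and the karyotype yields the stated 13 unique chromosomes.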

Additional details

Read the peer-reviewed publication(s):

(PubMed: 31425588)

Genome browser:

https://trace.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA516044&go=go

Accessions (data generated as part of this study):

BioProject: PRJNA516044
BioSample : SRR8466737
BioSample : SRR8466736
BioSample : SRR8466756
BioSample : SRR8468315
BioSample : SRR8468316
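When retrieving these records programmatically, it can help to check what kind of record an accession refers to before querying it. The sketch below classifies accessions by their standard INSDC prefixes (PRJNA/PRJEB/PRJDB for BioProject, SAMN/SAMEA/SAMD for BioSample, SRR/ERR/DRR for sequencing runs). The function name `classify_accession` is hypothetical, and the patterns are a simplified assumption rather than a full validation of accession syntax. Note that identifiers beginning with SRR carry the prefix normally used for SRA runs.

```python
import re

# Simplified prefix patterns for common INSDC accession types.
ACCESSION_PATTERNS = {
    "BioProject": re.compile(r"^PRJ(NA|EB|DB)\d+$"),
    "BioSample": re.compile(r"^SAM(N|EA|D)\d+$"),
    "SRA run": re.compile(r"^[SED]RR\d+$"),
}


def classify_accession(accession):
    """Return the record type implied by the accession prefix, or 'unknown'."""
    for kind, pattern in ACCESSION_PATTERNS.items():
        if pattern.match(accession):
            return kind
    return "unknown"


for acc in ["PRJNA516044", "SRR8466737", "SAMN10781786"]:
    print(acc, classify_accession(acc))
```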





Sample ID: NIPD-004
Taxonomic ID: 162384
Common Name: large kissing bug
Genbank Name:
Scientific Name: Triatoma rubrofasciata
Sample Attributes:
  Description: DNA extracted from adult female Triato...
  Common name: large kissing bug
  Alternative accession-BioSample: SAMN10781786
  ...
Displaying 1-1 of 1 Sample(s).




File Name  Sample ID  Data Type         File Format  Size      Release Date
                      Readme            TEXT         4.96 KB   2020-05-17
                      Other             TEXT         0.82 KB   2020-05-17
                      Other             TEXT         0.72 KB   2020-05-17
                      Genome sequence   FASTA        196.5 MB  2020-05-17
                      Coding sequence   FASTA        5.82 MB   2020-05-17
                      Annotation        GFF          1.23 MB   2020-05-17
                      Annotation        GFF          1.32 KB   2020-05-17
                      Protein sequence  FASTA        3.8 MB    2020-05-17
                      Annotation        GFF          86.81 MB  2020-05-17
                      Annotation        GFF          6.46 KB   2020-05-17
Displaying 1-10 of 31 File(s).
Funding body                                  Awardee   Award ID        Comments
Ministry of Science and Technology of China   X-N Zhou  2016YFC1202000
Chinese Academy of Sciences                   N-S Chen
Date Action
July 1, 2019 Dataset publish
July 4, 2019 Manuscript Link added : 10.1093/gigascience/giz089
May 17, 2020 Author provided new chromosome level assembly along with matching set of analysis results files. All original files are kept, but moved to the sub-folder contig_assembly. Here is the list of file name and location changes:
  annotation_short_summary_Triatoma_rubrofasciata.txt -> contig_assembly/Triatoma_con_annotation_BUSCO.txt
  assembly_short_summary_Triatoma_rubrofasciata.txt -> contig_assembly/Triatoma_con_assembly_BUSCO.txt
  SingleCopy.pep.msa -> contig_assembly/Triatoma_con_SingleCopy.pep.msa
  SingleCopy.phylip -> contig_assembly/Triatoma_con_SingleCopy.phylip
  Triatoma_assembly_fasta.zip -> contig_assembly/Triatoma_con_assembly.fasta.gz
  Triatoma_cds.fa -> contig_assembly/Triatoma_con_cds.fa.gz
  Triatoma_filtered_snp.vcf.gz -> contig_assembly/Triatoma_con_snp.vcf.gz
  Triatoma_genome.gff -> contig_assembly/Triatoma_con_genome.gff.gz
  Triatoma_miRNA.gff -> contig_assembly/Triatoma_con_miRNA.gff.gz
  Triatoma_pep.fa -> contig_assembly/Triatoma_con_pep.fa.gz
  Triatoma_repeats annotation.gff -> contig_assembly/Triatoma_con_repeats_annotation.gff
  Triatoma_rRNA.gff -> contig_assembly/Triatoma_con_rRNA.gff.gz
  Triatoma_snRNA.gff -> contig_assembly/Triatoma_con_snRNA.gff.gz
  Triatoma_tree.raw.reroot.nwk -> contig_assembly/Triatoma_con_tree.evolution.nwk
  Triatoma_tRNA.gff -> contig_assembly/Triatoma_con_tRNA.gff.gz
May 28, 2020 The new files provided by the authors for the chromosome level assembly are:
  chromosome_assembly/Triatoma_chr_annotation_BUSCO.txt
  chromosome_assembly/Triatoma_chr_assembly_BUSCO.txt
  chromosome_assembly/Triatoma_chr_SingleCopy.pep.msa
  chromosome_assembly/Triatoma_chr_SingleCopy.phylip
  chromosome_assembly/Triatoma_chr_assembly.fasta.gz
  chromosome_assembly/Triatoma_chr_cds.fa.gz
  chromosome_assembly/Triatoma_chr_snp.vcf.gz
  chromosome_assembly/Triatoma_chr_genome.gff.gz
  chromosome_assembly/Triatoma_chr_miRNA.gff.gz
  chromosome_assembly/Triatoma_chr_pep.fa.gz
  chromosome_assembly/Triatoma_chr_repeat_annotation.gff.gz
  chromosome_assembly/Triatoma_chr_rRNA.gff.gz
  chromosome_assembly/Triatoma_chr_snRNA.gff.gz
  chromosome_assembly/Triatoma_chr_tree.evolution.nwk
  chromosome_assembly/Triatoma_chr_tRNA.gff.gz
May 28, 2020 File readme_100614.txt updated
October 14, 2022 Manuscript Link updated : 10.1093/gigascience/giz089
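Scripts written against the original file layout may still refer to the pre-May 2020 file names. As a minimal sketch, the renames recorded in the May 17, 2020 history entry can be captured in a lookup table so that old names resolve to their current locations. The names `RENAMED` and `current_location` are hypothetical; the mapping itself is copied from the history entry, including the space in "Triatoma_repeats annotation.gff".

```python
# Old file name -> current path, as listed in the May 17, 2020 history entry.
RENAMED = {
    "annotation_short_summary_Triatoma_rubrofasciata.txt": "contig_assembly/Triatoma_con_annotation_BUSCO.txt",
    "assembly_short_summary_Triatoma_rubrofasciata.txt": "contig_assembly/Triatoma_con_assembly_BUSCO.txt",
    "SingleCopy.pep.msa": "contig_assembly/Triatoma_con_SingleCopy.pep.msa",
    "SingleCopy.phylip": "contig_assembly/Triatoma_con_SingleCopy.phylip",
    "Triatoma_assembly_fasta.zip": "contig_assembly/Triatoma_con_assembly.fasta.gz",
    "Triatoma_cds.fa": "contig_assembly/Triatoma_con_cds.fa.gz",
    "Triatoma_filtered_snp.vcf.gz": "contig_assembly/Triatoma_con_snp.vcf.gz",
    "Triatoma_genome.gff": "contig_assembly/Triatoma_con_genome.gff.gz",
    "Triatoma_miRNA.gff": "contig_assembly/Triatoma_con_miRNA.gff.gz",
    "Triatoma_pep.fa": "contig_assembly/Triatoma_con_pep.fa.gz",
    "Triatoma_repeats annotation.gff": "contig_assembly/Triatoma_con_repeats_annotation.gff",
    "Triatoma_rRNA.gff": "contig_assembly/Triatoma_con_rRNA.gff.gz",
    "Triatoma_snRNA.gff": "contig_assembly/Triatoma_con_snRNA.gff.gz",
    "Triatoma_tree.raw.reroot.nwk": "contig_assembly/Triatoma_con_tree.evolution.nwk",
    "Triatoma_tRNA.gff": "contig_assembly/Triatoma_con_tRNA.gff.gz",
}


def current_location(name):
    """Return the current path for an old file name; unknown names pass through unchanged."""
    return RENAMED.get(name, name)


print(current_location("Triatoma_pep.fa"))
```

Names that were never renamed, such as readme_100614.txt, are returned as given. The chromosome-level file names are not derived here because they do not always mirror the contig-level names (for example, "repeats_annotation" versus "repeat_annotation").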