Supporting data for "Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome"

Dataset type: Genomic
Data released on November 13, 2017

Fu X; Li J; Tian Y; Quan W; Zhang S; Liu Q; Liang F; Zhu X; Zhang L; Wang D; Hu J (2017): Supporting data for "Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome". GigaScience Database.


Fireflies are a family of beetles (order Coleoptera) and are among the best-known and best-loved insects because of their bioluminescence. However, fireflies are in danger of extinction because of the large-scale destruction of their habitat. To improve our understanding of fireflies and to protect them effectively, we sequenced the whole genome of the terrestrial firefly Pyrocoelia pectoralis.
Here, we developed a highly reliable genome resource for the terrestrial firefly Pyrocoelia pectoralis (E. Oliv., 1883) (Coleoptera: Lampyridae) using single-molecule real-time (SMRT) sequencing on the PacBio Sequel platform. In total, 57.8 Gb of long reads were generated and assembled into a 760.4 Mb genome, which is close to the estimated genome size and covers 98.7% complete and 0.7% partial insect BUSCOs. K-mer analysis showed that the genome is highly heterozygous. Nevertheless, the long-read assembly is highly contiguous, with a contig N50 of 3.04 Mb and a longest contig of 13.69 Mb. Furthermore, 135,589 SSRs and 341 Mb of repeat sequences were detected. A total of 23,092 genes were predicted, of which 88.44% were annotated with one or more related functions.
We assembled a high-quality firefly genome, which will not only provide insights into the conservation and biodiversity of fireflies but also offer a wealth of information for studying the mechanisms of their sexual communication, bioluminescence and evolution.
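
For readers unfamiliar with the contig N50 statistic quoted in the abstract, the following is a minimal sketch of how it is conventionally computed: the length of the contig at which the running total of lengths, sorted from longest to shortest, first reaches half of the total assembly length. The function name `n50` and the toy contig lengths are illustrative assumptions and are not taken from the scripts in this dataset. The last two lines derive approximate ratios from figures stated in the abstract, assuming "Gb" and "Mb" denote gigabases and megabases and measuring depth against the assembly size rather than the estimated genome size.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("no contig lengths given")


# Hypothetical toy contig lengths (NOT from the P. pectoralis assembly).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Ratios derived from figures quoted in the abstract.
read_depth = 57.8e9 / 760.4e6      # long-read bases per assembled base
repeat_fraction = 341 / 760.4      # repeat sequence as a share of the assembly

print(f"toy N50: {toy_n50}")
print(f"approximate read depth: {read_depth:.1f}x")
print(f"approximate repeat fraction: {repeat_fraction:.1%}")
```

On these figures the long reads correspond to roughly 76-fold depth over the assembly, and repeats make up a little under half of it.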

Additional details

Read the peer-reviewed publication(s):

Fu, X., Li, J., Tian, Y., Quan, W., Zhang, S., Liu, Q., … Hu, J. (2017). Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome. GigaScience, 6(12). doi:10.1093/gigascience/gix112

Accessions (data generated as part of this study):

BioProject: PRJNA394639


Sample ID: PP01
Taxonomic ID: 417401
Scientific Name: Pyrocoelia pectoralis
Sample Attributes:
  Description: Genomic DNA extracted from a female ad...
  Alternative names: firefly
  Geographic location (latitude and longitude): 30.5 ...

Data Type          File Format   Size       Release Date
script             Python        5.41 KB    2017-11-09
script             Python        3 KB       2017-11-09
script             Python        1.67 KB    2017-11-09
script             Python        5.66 KB    2017-11-09
Readme             TEXT          3.82 KB    2017-11-09
script             UNKNOWN       11.99 KB   2017-11-09
script             UNKNOWN       2.85 KB    2017-11-09
script             UNKNOWN       13.43 KB   2017-11-09
script             Python        11.21 KB   2017-11-09
Sequence assembly  FASTA         3.99 MB    2017-11-09
Displaying 1-10 of 28 File(s).
Funding body                           Awardee   Award ID
National Science Foundation of China   X Fu      31672349
National Science Foundation of China   X Fu      31372252

Date Action
November 13, 2017 Dataset publish
January 9, 2018 Manuscript Link added : 10.1093/gigascience/gix112
August 23, 2018 External Link updated :