Data released on November 13, 2017

Supporting data for "Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome"

Fu, X; Li, J; Tian, Y; Quan, W; Zhang, S; Liu, Q; Liang, F; Zhu, X; Zhang, L; Wang, D; Hu, J (2017): Supporting data for "Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome". GigaScience Database.

Fireflies are a family of beetles (order Coleoptera) and are among the best-known and best-loved insects because of their bioluminescence. However, fireflies are in danger of extinction because of the large-scale destruction of their habitat. To improve the understanding of fireflies and protect them effectively, we sequenced the whole genome of the terrestrial firefly Pyrocoelia pectoralis.
Here, we developed a highly reliable genome resource for the terrestrial firefly Pyrocoelia pectoralis (E. Oliv., 1883) (Coleoptera: Lampyridae) using single-molecule real-time (SMRT) sequencing on the PacBio Sequel platform. In total, 57.8 Gb of long reads were generated and assembled into a 760.4 Mb genome, which is close to the estimated genome size and covers 98.7% complete and 0.7% partial insect BUSCOs. K-mer analysis showed that the genome is highly heterozygous. Nevertheless, the long-read assembly is highly contiguous, with a contig N50 of 3.04 Mb and a longest contig of 13.69 Mb. Furthermore, 135,589 SSRs and 341 Mb of repeat sequences were detected. A total of 23,092 genes were predicted, of which 88.44% were annotated with one or more related functions.
We assembled a high-quality firefly genome, which will not only provide insights into the conservation and biodiversity of fireflies but also offer a wealth of information for studying the mechanisms of their sexual communication, bioluminescence and evolution.
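For readers unfamiliar with the contiguity statistic quoted in the abstract, the following is a minimal sketch of how a contig N50 is conventionally computed: the length of the contig at which the running sum of contig lengths, taken from longest to shortest, first reaches half of the total assembly length. The function names `n50` and `fold_coverage` and the toy contig lengths are illustrative assumptions and are not taken from the dataset or from the authors' pipeline. The coverage helper simply divides the reported read yield by the reported assembly size.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (in bases)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def fold_coverage(read_bases, assembly_bases):
    """Approximate sequencing depth as total read bases / assembly size."""
    return read_bases / assembly_bases


# Toy contig lengths (hypothetical, NOT from the P. pectoralis assembly).
toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print("toy N50:", n50(toy_contigs))

# Figures reported in the abstract: 57.8 Gb of reads, 760.4 Mb assembly.
print("approx. depth: %.1fx" % fold_coverage(57.8e9, 760.4e6))
```

Applied to the contig lengths in the released assembly FASTA, `n50` should give a value comparable to the reported 3.04 Mb, assuming the same set of contigs is used.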

Read the peer-reviewed publication(s):

Fu, X., Li, J., Tian, Y., Quan, W., Zhang, S., Liu, Q., … Hu, J. (2017). Long-read sequence assembly of the firefly Pyrocoelia pectoralis genome. GigaScience, 6(12), 1–7. doi:10.1093/gigascience/gix112

Additional information:

Accessions (data included in GigaDB):

BioProject: PRJNA394639



Keywords: firefly, Pyrocoelia pectoralis, genome, long reads, assembly



  • Funding body - National Science Foundation of China
  • Location - China
  • Award ID - 31672349
  • Awardee - X Fu
  • Funding body - National Science Foundation of China
  • Location - China
  • Award ID - 31372252
  • Awardee - X Fu

Samples:

  • Sample ID - PP01
  • Taxonomic ID - 417401
  • Scientific Name - Pyrocoelia pectoralis
  • Sample Attributes:
      Description: Genomic DNA extracted from a female ad...
      Alternative names: firefly
      Geographic location (latitude and longitude): 30.5 ...
Displaying 1-1 of 1 Sample(s).

Files: (FTP site)

Sample ID  Data Type          File Format  Size       Release Date
-          annotation         FASTA        920.18 KB  2017-11-09
-          annotation         FASTA        3.66 MB    2017-11-09
-          Coding Sequence    FASTA        32.35 MB   2017-11-09
-          Sequence assembly  FASTA        3.99 MB    2017-11-09
-          Sequence assembly  FASTA        16.41 KB   2017-11-09
-          Repeat sequence    FASTA        67.08 MB   2017-11-09
-          Sequence assembly  FASTA        194.57 KB  2017-11-09
-          protein sequence   FASTA        12.16 MB   2017-11-09
-          annotation         FASTA        0.75 KB    2017-11-09
PP01       Sequence assembly  FASTA        218.19 MB  2017-11-09
Displaying 1-10 of 28 File(s).


