Data released on November 13, 2017
Fireflies are a family of insects within the beetle order Coleoptera, or winged beetles, which are one of the most well known and loved insect species because of their bioluminescence. However, the firefly is in danger of extinction because of the massive destruction of its living environment. In order to improve the understanding of fireflies and protect them effectively, we sequenced the whole genome of the terrestrial firefly Pyrocoelia pectoralis.
Here, we developed a highly reliable genome resource for the terrestrial firefly Pyrocoelia pectoralis (E. Oliv., 1883) (Coleoptera: Lampyridae) using single molecule real time (SMRT) Sequencing on the PacBio Sequel platform. In total, 57.8Gb long reads were generated and assembled into a final size of 760.4Mb genome which is close to the estimated genome size and covered 98.7% complete and 0.7% partial insect BUSCOs. The k-mer analysis showed this genome is highly heterozygous. However, our long-read assembly demonstrates continuousness with a contig N50 length of 3.04Mb and the longest contig length of 13.69Mb. Furthermore, 135,589 SSRs and 341Mb of repeat sequences were detected. A total of 23,092 genes were predicted in which 88.44% genes were annotated with one or more related functions.
We assembled a high quality firefly genome, which will not only provide insights into the conservation and biodiversity of fireflies, but also provide a wealth of information to study the mechanisms of their sexual communication, bio-luminescence and evolution.