Supporting data for "Efficient and accurate detection of splice junctions from RNA-Seq with Portcullis"

Dataset type: Software
Data released on October 16, 2018

Mapleson DL; Venturini L; Kaithakottil GG; Swarbreck D (2018): Supporting data for "Efficient and accurate detection of splice junctions from RNA-Seq with Portcullis" GigaScience Database. http://dx.doi.org/10.5524/100519

DOI10.5524/100519

Next generation sequencing (NGS) technologies enable rapid and cheap genome-wide transcriptome analysis, providing vital information about gene structure, transcript expression and alternative splicing. Key to this is the the accurate identification of exon-exon junctions from RNA sequenced (RNA-Seq) reads. A number of RNA-Seq aligners capable of splitting reads across these splice junctions (SJs) have been developed, however, it has been shown that while they correctly identify most genuine SJs available in a given sample, they also often produce large numbers of incorrect SJs.
Herein we describe the extent of this problem using popular RNA-Seq mapping tools, and present a new method, called Portcullis, to rapidly filter false SJs junctions derived from spliced alignments. We show that Portcullis distinguishes between genuine and false positive junctions to a high-degree of accuracy across different species, samples, expression levels, error profiles and read lengths. Portcullis is portable, efficient and to our knowledge is currently the only SJ prediction tool that reliably scales for use with large RNA-Seq datasets and large, highly-fragmented genomes, whilst delivering accurate SJs





File NameSample IDData TypeFile FormatSizeRelease Date 
GitHub archivearchive42.97 MB2018-10-12
ReadmeTEXT1.72 KB2018-10-12
Displaying 1-2 of 2 File(s).
Funding body Awardee Award ID Comments
Biotechnology and Biological Sciences Research Council(BBSRC) Federica Di Palma BB/CSP1720/1 Core Strategic Programme Grant
Biotechnology and Biological Sciences Research Council(BBSRC) Ksenia Krasileva BB/J003743/1 LOLA Award
Biotechnology and Biological Sciences Research Council(BBSRC) Neil Hall BB/CCG1720/1 Capability in Genomics and Single Cell
Date Action
October 16, 2018 Dataset publish
November 3, 2018 Manuscript Link added : 10.1093/gigascience/giy131
March 10, 2022 External Link added : https://doi.org/10.6084/m9.figshare.7205135.v1
March 10, 2022 External Link added : https://doi.org/10.6084/m9.figshare.6051875.v1
March 10, 2022 External Link added : https://doi.org/10.6084/m9.figshare.6051878.v1
November 11, 2022 Manuscript Link updated : 10.1093/gigascience/giy131