In:
Genome Biology, Springer Science and Business Media LLC, Vol. 22, No. 1 ( 2021-12)
Abstract:
Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment in a two-pass approach. This method, available in the software package 2passtools ( https://github.com/bartongroup/2passtools ), improves the accuracy of spliced alignment and transcriptome assembly for species both with and without existing high-quality annotations.
Type of Medium:
Online Resource
ISSN:
1474-760X
DOI:
10.1186/s13059-021-02296-0
Language:
English
Publisher:
Springer Science and Business Media LLC
Publication Date:
2021
detail.hit.zdb_id:
2040529-7