A porting of the Trinity RNA assembly pipeline, https://trinityrnaseq.github.io, that uses Nextflow to handle the underlying sub-tasks. This enables additional capabilities to better use HPC resources, such as packing of tasks to fill up nodes and use of node-local disks to improve I/O. By design, the pipeline separates the workflow logic (main file) and the cluster-specific configuration (config files), improving portability.
Based on a pipeline by Sydney Informatics Hub: ...
Workflow for Spliced RNAseq data Steps:
- FastQC (Read Quality Control)
- fastp (Read Trimming)
- STAR (Read mapping)
- featurecounts (transcript read counts)
- kallisto (transcript [pseudo]counts)