Teams: Not specified
Organizations: Not specifiedhttps://orcid.org/0000-0002-1399-293X
Dad, husband and PhD. Scientist, technologist and engineer. Bibliophile. Philomath. Passionate about science, medicine, research, computing and all things geeky!
Teams: MAB - ATGC
Organizations: Centre National de la Recherche Scientifique (CNRS)https://orcid.org/0000-0003-1590-8313
Workflow for Illumina Quality Control and Filtering
Multiple paired datasets will be merged into single paired dataset.
- FastQC on raw data files
- fastp for read quality trimming
- BBduk for phiX and (optional) rRNA filtering
- Kraken2 for taxonomic classification of reads (optional)
- BBmap for (contamination) filtering using given references (optional)
- FastQC on filtered (merged) data
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default ...
polya_liftover - sc/snRNAseq Snakemake Workflow
A [Snakemake][sm] workflow for using PolyA_DB and UCSC Liftover with Cellranger.
Some genes are not accurately annotated in the reference genome. Here, we use information provide by the [PolyA_DB v3.2][polya] to update the coordinates, then the [USCS Liftover][liftover] tool to update to a more recent genome. Next, we use [Cellranger][cr] to create the reference and count matrix. Finally, by taking advantage of the integrated [Conda][conda] and ...
Flashlite-Trinity contains two workflows that run Trinity on the University of Queensland's HPC, Flashlite. Trinity performs de novo transcriptome assembly of RNA-seq data by combining three independent software modules Inchworm, Chrysalis and Butterfly to process RNA-seq reads. The algorithm can detect isoforms, handle paired-end reads, multiple insert sizes and strandedness. Users can run Flashlite-Trinity on single samples, or smaller samples requiring <500Gb ...
Description: Trinity @ NCI-Gadi contains a staged Trinity workflow that can be run on the National Computational Infrastructure’s (NCI) Gadi supercomputer. Trinity performs de novo transcriptome assembly of RNA-seq data by combining three independent software modules Inchworm, Chrysalis and Butterfly to process RNA-seq reads. The algorithm can detect isoforms, handle paired-end reads, multiple insert sizes and strandedness. ...
ORSON combine state-of-the-art tools for annotation processes within a Nextflow pipeline: sequence similarity search (PLAST, BLAST or Diamond), functional annotation retrieval (BeeDeeM) and functional prediction (InterProScan). When required, BUSCO completness evaluation and eggNOG Orthogroup annotation can be activated. While ORSON results can be analyzed through the command-line, it also offers the possibility to be compatible with BlastViewer or Blast2GO graphical tools.
A porting of the Trinity RNA assembly pipeline, https://trinityrnaseq.github.io, that uses Nextflow to handle the underlying sub-tasks. This enables additional capabilities to better use HPC resources, such as packing of tasks to fill up nodes and use of node-local disks to improve I/O. By design, the pipeline separates the workflow logic (main file) and the cluster-specific configuration (config files), improving portability.
Based on a pipeline by Sydney Informatics Hub: ...