Workflows
What is a Workflow?Filters
Workflow (hybrid) metagenomic assembly and binning
- Workflow Illumina Quality: https://workflowhub.eu/workflows/336?version=1
- FastQC (control)
- fastp (quality trimming)
- kraken2 (taxonomy)
- bbmap contamination filter
- Workflow Longread Quality:
- NanoPlot (control)
- filtlong (quality trimming)
- kraken2 (taxonomy)
- minimap2 contamination filter
- Kraken2 taxonomic classification of FASTQ reads
- SPAdes/Flye (Assembly)
- Pilon/Medaka/PyPolCA (Assembly polishing)
- QUAST (Assembly ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Changlin Ke
Submitter: Bart Nijsse
Workflow for microbial (meta-)genome annotation
Input is a (meta)genome sequence in fasta format.
-
bakta
-
KoFamScan (optional)
-
InterProScan (optional)
-
eggNOG mapper (optional)
-
To RDF conversion with SAPP (optional, default on) --> SAPP conversion Workflow in WorkflowHub
Workflow for converting (genome) annotation tool output into a GBOL RDF file (TTL/HDT) using SAPP
Current formats / tools:
- EMBL format
- InterProScan (JSON/TSV)
- eggNOG-mapper (TSV)
- KoFamScan (TSV)
git: https://gitlab.com/m-unlock/cwl
SAPP (Semantic Annotation Platform with Provenance):
https://gitlab.com/sapp
https://academic.oup.com/bioinformatics/article/34/8/1401/4653704
Workflow for LongRead Quality Control and Filtering
- NanoPlot (read quality control) before and after filtering
- Filtlong (read trimming)
- Kraken2 taxonomic read classification before and after filtering
- Minimap2 read filtering based on given references
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default
All tool CWL files and other workflows can be found here: https://gitlab.com/m-unlock/cwl/workflows
**How to setup and use an UNLOCK ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Germán Royval
Submitter: Bart Nijsse
Workflow for Illumina Quality Control and Filtering
Multiple paired datasets will be merged into single paired dataset.
Summary:
- FastQC on raw data files
- fastp for read quality trimming
- BBduk for phiX and (optional) rRNA filtering
- Kraken2 for taxonomic classification of reads (optional)
- BBmap for (contamination) filtering using given references (optional)
- FastQC on filtered (merged) data
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default ...
- Deprecated -
See our updated hybrid assembly workflow: https://workflowhub.eu/workflows/367
And other workflows: https://workflowhub.eu/projects/16#workflows
Workflow for sequencing with ONT Nanopore data, from basecalled reads to (meta)assembly and binning
- Workflow Nanopore Quality
- Kraken2 taxonomic classification of FASTQ reads
- Flye (de-novo assembly)
- Medaka (assembly polishing)
- metaQUAST (assembly quality reports)
When Illumina reads are provided:
- Workflow ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Germán Royval
Submitter: Jasper Koehorst
Workflow for Metagenomics from bins to metabolic models (GEMs)
Summary
- Prodigal gene prediction
- CarveMe genome scale metabolic model reconstruction
- MEMOTE for metabolic model testing
- SMETANA Species METabolic interaction ANAlysis
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default
All tool CWL files and other workflows can be found here: Tools: https://gitlab.com/m-unlock/cwl Workflows: https://gitlab.com/m-unlock/cwl/workflows
**How ...
Workflow for Metagenomics binning from assembly
Minimal inputs are: Identifier, assembly (fasta) and a associated sorted BAM file
Summary
- MetaBAT2 (binning)
- MaxBin2 (binning)
- SemiBin (binning)
- DAS Tool (bin merging)
- EukRep (eukaryotic classification)
- CheckM (bin completeness and contamination)
- BUSCO (bin completeness)
- GTDB-Tk (bin taxonomic classification)
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default
**All tool CWL ...
Workflow Kallisto RNAseq
(pseudoalignment on transcripts)
- Workflow Illumina Quality: https://workflowhub.eu/workflows/336?version=1
- kallisto
All tool CWL files and other workflows can be found here:
Tools: https://git.wur.nl/unlock/cwl/-/tree/master/cwl
Workflows: https://git.wur.nl/unlock/cwl/-/tree/master/cwl/workflows
How to setup and use an UNLOCK workflow:
https://m-unlock.gitlab.io/docs/setup/setup.html
- deprecated -
Workflow for sequencing with ONT Nanopore, from basecalling to assembly.
- Guppy (basecalling of raw reads)
- MinIONQC (quality check)
- FASTQ merging from multi into one file
- Kraken2 (taxonomic classification)
- Krona (classification visualization)
- Flye (de novo assembly)
- Medaka (assembly polishing)
- QUAST (assembly quality reports)
All tool CWL files and other workflows can be found here: Tools: https://git.wur.nl/unlock/cwl/-/tree/master/cwl Workflows: ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Germán Royval
Submitter: Jasper Koehorst