Workflows
Workflow that takes DataONE data packages (raw datasets plus metadata written in Ecological Metadata Language, EML) as input and creates a DwC occurrence.csv file that is almost ready to include in a Darwin Core Archive, using EML annotations at the attribute level.
A port of the Trinity RNA assembly pipeline, https://trinityrnaseq.github.io, that uses Nextflow to handle the underlying sub-tasks. This enables additional capabilities to make better use of HPC resources, such as packing tasks to fill up nodes and using node-local disks to improve I/O. By design, the pipeline separates the workflow logic (main file) from the cluster-specific configuration (config files), improving portability.
Based on a pipeline by Sydney Informatics Hub: ...
Workflow for tracking objects in CellProfiler: https://training.galaxyproject.org/training-material/topics/imaging/tutorials/object-tracking-using-cell-profiler/tutorial.html
VIRify
VIRify is a recently developed pipeline for the detection, annotation, and taxonomic classification of viral contigs in metagenomic and metatranscriptomic assemblies. The pipeline is part of the repertoire of analysis services offered by MGnify. VIRify’s taxonomic classification relies on the detection of taxon-specific profile hidden Markov models (HMMs), built upon a set of 22,014 orthologous protein domains and referred to as ViPhOGs. VIRify was implemented in CWL. What do I need? The ...
Type: Nextflow
Creators: Martin Beracochea, Martin Hölzer, Alexandre Almeida, Guillermo Rangel-Pineros and Ekaterina Sakharova
Submitter: Laura Rodriguez-Navas
VIRify
VIRify is a recently developed pipeline for the detection, annotation, and taxonomic classification of viral contigs in metagenomic and metatranscriptomic assemblies. The pipeline is part of the repertoire of analysis services offered by MGnify. VIRify’s taxonomic classification relies on the detection of taxon-specific profile hidden Markov models (HMMs), built upon a set of 22,014 orthologous protein domains and referred to as ViPhOGs. VIRify was implemented in CWL. What do I need? The ...
Type: Common Workflow Language
Creators: Martin Beracochea, Martin Hölzer, Alexandre Almeida, Guillermo Rangel-Pineros and Ekaterina Sakharova
Submitter: Laura Rodriguez-Navas
In the rare disease research workflow, researchers submit their raw data (FASTQ), run the RD-Connect mapping and variant calling pipeline, and obtain unannotated gVCF files, which they can then submit to the RD-Connect GPAP or analyse on their own.
This demonstrator focuses on the variant calling pipeline. The raw genomic data is processed using the RD-Connect pipeline (Laurie et al., 2016) running on the standards (GA4GH) compliant, interoperable container ...
Type: Common Workflow Language
Creators: José Mª Fernández, Laura Rodriguez-Navas
Submitter: Laura Rodriguez-Navas
Non-functional workflow to get a global view of possibilities for plant virus classification.
Galaxy version of the pre-processing of reads from COVID-19 samples: QC plus human read cleaning. Based on https://github.com/Finn-Lab/Metagen-FastQC/blob/master/metagen-fastqc.sh
CWL version of the md_list.cwl workflow for HPC.
Workflow to build different indices for different tools from a genome and transcriptome.
This workflow expects an (annotated) genome in GBOL ttl format.
Steps:
- SAPP: rdf2gtf (genome fasta)
- SAPP: rdf2fasta (transcripts fasta)
- STAR index (Optional for Eukaryotic origin)
- bowtie2 index
- kallisto index
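The three indexing steps above can be sketched as plain command lines. This is a minimal illustration, not the workflow's actual implementation: it assumes the two SAPP steps have already produced a genome FASTA, a transcripts FASTA and a GTF file, it reads "Optional for Eukaryotic origin" as "build the STAR index only for eukaryotic genomes", and the function name, file names and output layout are all hypothetical.

```python
from pathlib import Path


def build_index_commands(genome_fasta, transcripts_fasta, gtf, out_dir,
                         eukaryote=False, threads=4):
    """Return the index-building commands as argument lists (nothing is run)."""
    out = Path(out_dir)  # hypothetical output layout: one sub-directory per tool
    commands = []

    # Assumption: the STAR index is only built for genomes of eukaryotic origin.
    if eukaryote:
        commands.append([
            "STAR", "--runMode", "genomeGenerate",
            "--runThreadN", str(threads),
            "--genomeDir", str(out / "star"),
            "--genomeFastaFiles", str(genome_fasta),
            "--sjdbGTFfile", str(gtf),
        ])

    # bowtie2 index built from the genome FASTA; the last argument is the index prefix.
    commands.append([
        "bowtie2-build", "--threads", str(threads),
        str(genome_fasta), str(out / "bowtie2" / "genome"),
    ])

    # kallisto index built from the transcripts FASTA.
    commands.append([
        "kallisto", "index",
        "-i", str(out / "kallisto" / "transcripts.idx"),
        str(transcripts_fasta),
    ])
    return commands


if __name__ == "__main__":
    for cmd in build_index_commands("genome.fasta", "transcripts.fasta",
                                    "genome.gtf", "indices", eukaryote=True):
        print(" ".join(cmd))
```

Each returned list could be passed to `subprocess.run(cmd, check=True)` once the output directories exist and the tools are installed. Returning the commands instead of running them keeps the sketch easy to inspect.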