Workflows

What is a Workflow?

44 Workflows visible to you, out of a total of 44

Combined workflows for large genome assembly

Galaxy Australia, Australian BioCommons

Combined workflow for large genome assembly

The tutorial document for this workflow is here: https://doi.org/10.5281/zenodo.5655813

What it does: A workflow for genome assembly, containing subworkflows:

Data QC
Kmer counting
Trim and filter reads
Assembly with Flye
Assembly polishing
Assess genome quality

Inputs:

long reads and short reads in fastq format
reference genome for Quast

Outputs:

Data information - QC, kmers
Filtered, trimmed reads
Genome assembly, assembly graph, ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.230.1

Download

Created: 8th Nov 2021 at 06:08, Last updated: 11th Nov 2021 at 03:37

Assess genome quality

Galaxy Australia, Australian BioCommons

Assess genome quality; can run alone or as part of a combined workflow for large genome assembly.

What it does: Assesses the quality of the genome assembly: generate some statistics and determine if expected genes are present; align contigs to a reference genome.
Inputs: polished assembly; reference_genome.fasta (e.g. of a closely-related species, if available).
Outputs: Busco table of genes found; Quast HTML report, and link to Icarus contigs browser, showing contigs aligned to a reference ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.229.1

Download

Created: 8th Nov 2021 at 06:03, Last updated: 9th Nov 2021 at 01:12

Racon polish with long reads, x4

Galaxy Australia, Australian BioCommons

Assembly polishing subworkflow: Racon polishing with long reads

Inputs: long reads and assembly contigs

Workflow steps:

minimap2 : long reads are mapped to assembly => overlaps.paf.
overaps, long reads, assembly => Racon => polished assembly 1
using polished assembly 1 as input; repeat minimap2 + racon => polished assembly 2
using polished assembly 2 as input, repeat minimap2 + racon => polished assembly 3
using polished assembly 3 as input, repeat minimap2 + racon => ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.227.1

Download

Created: 8th Nov 2021 at 05:45, Last updated: 9th Nov 2021 at 01:12

Assembly with Flye

Galaxy Australia, Australian BioCommons

Assembly with Flye; can run alone or as part of a combined workflow for large genome assembly.

What it does: Assembles long reads with the tool Flye
Inputs: long reads (may be raw, or filtered, and/or corrected); fastq.gz format
Outputs: Flye assembly fasta; Fasta stats on assembly.fasta; Assembly graph image from Bandage; Bar chart of contig sizes; Quast reports of genome assembly
Tools used: Flye, Fasta statistics, Bandage, Bar chart, Quast
Input parameters: None required, but recommend ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.225.1

Download

Created: 8th Nov 2021 at 05:07, Last updated: 9th Nov 2021 at 01:11

Trim and filter reads - fastp

Galaxy Australia, Australian BioCommons

Trim and filter reads; can run alone or as part of a combined workflow for large genome assembly.

What it does: Trims and filters raw sequence reads according to specified settings.
Inputs: Long reads (format fastq); Short reads R1 and R2 (format fastq)
Outputs: Trimmed and filtered reads: fastp_filtered_long_reads.fastq.gz (But note: no trimming or filtering is on by default), fastp_filtered_R1.fastq.gz, fastp_filtered_R2.fastq.gz
Reports: fastp report on long reads, html; fastp report ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.224.1

Download

Created: 8th Nov 2021 at 04:56, Last updated: 9th Nov 2021 at 01:11

kmer counting - meryl

Galaxy Australia, Australian BioCommons

Kmer counting step, can run alone or as part of a combined workflow for large genome assembly.

What it does: Estimates genome size and heterozygosity based on counts of kmers
Inputs: One set of short reads: e.g. R1.fq.gz
Outputs: GenomeScope graphs
Tools used: Meryl, GenomeScope
Input parameters: None required
Workflow steps: The tool meryl counts kmers in the input reads (k=21), then converts this into a histogram. GenomeScope: runs a model on the histogram; reports estimates. k-mer ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.223.1

Download

Created: 8th Nov 2021 at 04:47, Last updated: 9th Nov 2021 at 01:10

Data QC

Galaxy Australia, Australian BioCommons

Data QC step, can run alone or as part of a combined workflow for large genome assembly.

What it does: Reports statistics from sequencing reads.
Inputs: long reads (fastq.gz format), short reads (R1 and R2) (fastq.gz format).
Outputs: For long reads: a nanoplot report (the HTML report summarizes all the information). For short reads: a MultiQC report.
Tools used: Nanoplot, FastQC, MultiQC.
Input parameters: None required.
Workflow steps: Long reads are analysed by Nanoplot; Short reads ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.222.1

Download

Created: 8th Nov 2021 at 04:34, Last updated: 9th Nov 2021 at 01:09

Racon polish with Illumina reads, x2

Galaxy Australia, Australian BioCommons

Assembly polishing subworkflow: Racon polishing with short reads

Inputs: short reads and assembly (usually pre-polished with other tools first, e.g. Racon + long reads; Medaka)

Workflow steps:

minimap2: short reads (R1 only) are mapped to the assembly => overlaps.paf. Minimap2 setting is for short reads.
overlaps + short reads + assembly => Racon => polished assembly 1
using polished assembly 1 as input; repeat minimap2 + racon => polished assembly 2
Racon short-read polished ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.228.1

Download

Created: 8th Nov 2021 at 05:50, Last updated: 9th Nov 2021 at 01:09

Flashlite-Supernova

Australian BioCommons, Sydney Informatics Hub

Work-in-progress

The Flashlite-Supernova pipeline runs Supernova to generate phased whole-genome de novo assemblies from a Chromium prepared library on University of Queensland's HPC, Flashlite.

Infrastructure_deployment_metadata: FlashLite (QRISCloud)

Type: Shell Script

Creators: None

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.151.1

Download

Created: 19th Aug 2021 at 00:21, Last updated: 9th Sep 2021 at 02:31

Flashlite-Trinity

Australian BioCommons, Sydney Informatics Hub

(Show All)

Stable

Flashlite-Trinity contains two workflows that run Trinity on the University of Queensland's HPC, Flashlite. Trinity performs de novo transcriptome assembly of RNA-seq data by combining three independent software modules Inchworm, Chrysalis and Butterfly to process RNA-seq reads. The algorithm can detect isoforms, handle paired-end reads, multiple insert sizes and strandedness. Users can run Flashlite-Trinity on single samples, or smaller samples requiring <500Gb ...

Type: Shell Script

Creators: Tracy Chew, Rosemarie Sadsad, Georgina Samaha, Cali Willet

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.149.1

Download

Created: 19th Aug 2021 at 00:17, Last updated: 7th Sep 2021 at 07:27

Workflows

Filters ×

Filters