Workflows

What is a Workflow?

27 Workflows visible to you, out of a total of 27

ERGA-BGE Genome Report ANNOT analyses

ERGA Annotation

Stable

The workflow requires the user to provide:

ENSEMBL link address of the annotation GFF3 file
ENSEMBL link address of the assembly FASTA file
NCBI taxonomy ID
BUSCO lineage
OMArk database

Thw workflow will produce statistics of the annotation based on AGAT, BUSCO and OMArk.

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1096.1

Created: 9th Aug 2024 at 15:14, Last updated: 24th Feb 2025 at 15:24

ERGA-BGE Genome Report ASM analyses (one-asm WGS Illumina PE + HiC)

ERGA Assembly

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, Illumina WGS reads + HiC reads

The workflow requires the following:

Species Taxonomy ID number
NCBI Genome assembly accession code
BUSCO Lineage
WGS accurate reads accession code
NCBI HiC reads accession code

The workflow will get the data and process it to generate genome profiling (genomescope, smudgeplot -optional-), assembly stats (gfastats), merqury stats (QV, completeness), BUSCO, snailplot, contamination blobplot, and ...

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1103.2

Created: 19th Aug 2024 at 10:38, Last updated: 5th Dec 2024 at 16:48

GALOP - Genome Assembly using Long reads Pipeline

Bioinformatics Laboratory for Genomics and Biodiversity (LBGB), ERGA Assembly

(Show All)

Work-in-progress

GALOP - Genome Assembly using Long reads Pipeline

This repository contains an exact copy of the standard Genoscope long reads assembly pipeline.

At the moment, this is not intended for users to download as it uses grid submission commands that will only work at Genoscope. As time goes on, we intend to make this pipeline available to a broader audience. However, genome assembly and polishing commands are accessible in the lib/assembly.py and lib/polishing.py files.

galop.py -h 
Mandatory
...

Type: Python

Creators: Benjamin Istace, Jean-Marc Aury, Caroline Belser

Submitter: Benjamin Istace

DOI: 10.48546/workflowhub.workflow.1200.2

Created: 12th Nov 2024 at 07:37, Last updated: 14th Nov 2024 at 06:55

Swedish Earth Biogenome Project Genome Assembly Workflow

NBIS, ERGA Assembly

Work-in-progress

Swedish Earth Biogenome Project - Genome Assembly Workflow

The primary genome assembly workflow for the Earth Biogenome Project at NBIS.

Workflow overview

General aim:

flowchart LR 
hifi[/ HiFi reads /] --> data_inspection 
ont[/ ONT reads /] --> data_inspection 
hic[/ Hi-C reads /] --> data_inspection 
data_inspection[[ Data inspection ]] --> preprocessing 
preprocessing[[ Preprocessing ]] --> assemble 
assemble[[ Assemble ]] --> validation 
validation[[ Assembly
...

Type: Nextflow

Creators: Mahesh Binzer-Panchal, Martin Pippel

Submitter: Mahesh Binzer-Panchal

Created: 23rd Aug 2024 at 14:16

HiC scaffolding pipeline

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC scaffolding pipeline

Snakemake pipeline for scaffolding of a genome using HiC reads using yahs.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda --cores N

where N is number of cores to use. There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring the cluster.json file run:

./run_cluster ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.796.3

Created: 16th Mar 2024 at 09:01

Purge retained haplotypes using Purge-Dups

ERGA Assembly, Biodiversity Genomics Europe (general)

Purge dups

This snakemake pipeline is designed to be run using as input a contig-level genome and pacbio reads. This pipeline has been tested with snakemake v7.32.4. Raw long-read sequencing files and the input contig genome assembly must be given in the config.yaml file. To execute the workflow run:

snakemake --use-conda --cores N

Or configure the cluster.json and run using the ./run_cluster command

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.506.2

Created: 16th Jun 2023 at 14:56, Last updated: 16th Mar 2024 at 07:49

HiC contact map generation

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC contact map generation

Snakemake pipeline for the generation of .pretext and .mcool files for visualisation of HiC contact maps with the softwares PretextView and HiGlass, respectively.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda

There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.795.2

Created: 14th Mar 2024 at 09:50, Last updated: 14th Mar 2024 at 09:52

ERGA ONT+Illumina Assembly+QC NextDenovo+HyPo v2403 (WF2)

ERGA Assembly

Work-in-progress

The workflow takes raw ONT reads and trimmed Illumina WGS paired reads collections, the ONT raw stats table (calculated from WF1) and the estimated genome size (calculated from WF1) to run NextDenovo and subsequently polish the assembly with HyPo. It produces collapsed assemblies (unpolished and polished) and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 11th Mar 2024 at 14:45

ERGA ONT+Illumina Assembly+QC Flye+HyPo v2403 (WF2)

ERGA Assembly

Stable

The workflow takes raw ONT reads and trimmed Illumina WGS paired reads collections, and the estimated genome size and Max depth (both calculated from WF1) to run Flye and subsequently polish the assembly with HyPo. It produces collapsed assemblies (unpolished and polished) and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 11th Mar 2024 at 12:41

CLAWS (CNAG's long-read assembly workflow in Snakemake)

ERGA Assembly

Stable

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

Snakemake Pipeline used for de novo genome assembly @CNAG. It has been developed for Snakemake v6.0.5.

It accepts Oxford Nanopore Technologies (ONT) reads, PacBio HFi reads, illumina paired-end data, illumina 10X data and Hi-C reads. It does the preprocessing of the reads, assembly, polishing, purge_dups, scaffodling and different evaluation steps. By default it will preprocess the reads, run Flye + Hypo + purge_dups + yahs and evaluate ...

Type: Snakemake

Creators: Jessica Gomez-Garrido, Fernando Cruz (CNAG), Francisco Camara (CNAG), Tyler Alioto (CNAG)

Submitter: Jessica Gomez-Garrido

DOI: 10.48546/workflowhub.workflow.567.2

Created: 12th Sep 2023 at 14:23, Last updated: 2nd Feb 2024 at 12:24

Workflows

Filters ×

GALOP - Genome Assembly using Long reads Pipeline

Swedish Earth Biogenome Project - Genome Assembly Workflow

Workflow overview

HiC scaffolding pipeline

Prerequisites

Purge dups

HiC contact map generation

Prerequisites

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

Filters