79 items tagged with 'Genomics'.

Teams: Sydney Informatics Hub

Organizations: The University of Sydney

https://orcid.org/0000-0003-0662-9101

Expertise: Bioinformatics, Genomics, Genetics

Senior Bioinformatics Engineer - Sydney Informatics Hub | Australian BioCommons

Sora Yonezawa

Teams: bonohulab

Organizations: Hiroshima University

https://orcid.org/0009-0004-1874-3117

Expertise: Bioinformatics

Tools: CWL, Genomics, Python, R, Transcriptomics, Jupyter notebook

Hiroshima University, Graduate School of Integrated Sciences for life, Laboratory of Genome Informatics, Ph.D student GitHub: https://github.com/yonesora56

Mahesh Binzer-Panchal

Teams: NBIS, ERGA Assembly

Organizations: NBIS – National Bioinformatics Infrastructure Sweden

https://orcid.org/0000-0003-1675-0677

Expertise: Bioinformatics, Genomics, Scientific workflow developement, Workflows

Tools: Nextflow, nf-core

I'm a bioinformatician for the National Bioinformatics Infrastrure Sweden. I specialise in de novo genome assembly and workflow development with Nextflow. I'm also a Nextflow ambassador and nf-core maintainer.

Laura Najera Cortazar

Teams: Biodiversity Genomics Europe (general), iBOL Europe Metabarcoding

Organizations: BIOPOLIS Association (BIOPOLIS-CIBIO)

https://orcid.org/0000-0001-8650-7248

Expertise: Molecular Biology, phylogenetics, evolution

Tools: Databases, Genomics, Genetic analysis

Zsolt Balázs

Teams: KrauthammerLab

Organizations: University of Zurich

https://orcid.org/0000-0003-3537-7441

Expertise: Bioinformatics

Tools: Genomics, Molecular Biology, Single Cell analysis

Clea Siguret

Teams: Not specified

Organizations: Not specified

https://orcid.org/0009-0005-6140-0379

Expertise: Bioinformatics, Computer Science, Data Management, Genomics, Python, R, Scientific workflow developement, Workflows, phylogenomics

Tools: Galaxy, Genomics, Git, Python, R, Workflows

Bram Danneels

Teams: EBP-Nor

Organizations: University of Bergen

https://orcid.org/0000-0001-7446-8325

Expertise: Genomics, Metagenomics, NGS, Python, evolution

Tools: Genomics, Python, Snakemake, Transcriptomics

Naser Elmi

Teams: Cimorgh IT solutions

Organizations: cimorgh IT

Expertise: Bioinformatics, Genomics, Metagenomics, Microbiology, NGS, Python, R, bash, WDL

Tools: Mathematical Modelling, R, WDL

Priyanka Surana

Teams: Not specified

Organizations: Not specified

https://orcid.org/0000-0002-7167-0875

Expertise: Bioinformatics, Genomics, Scientific workflow developement

Tools: Nextflow, nf-core, Python, R

Dongyang Wang

Teams: Not specified

Organizations: Not specified

https://orcid.org/0000-0001-6440-6980

Expertise: Bioinformatics, Genomics, Machine Learning

Tools: Python, R, Machine Learning

I am a Ph.D. student in Gong lab. I am interested in cancer genomics, including the mining of genetic risk determinants in cancer, functional prediction of genetic variants, tumor-associated molecular epidemiology, large-scale data integration, analysis, and mining, as well as the construction of bioinformatical data platforms.

Saskia Hiltemann

Teams: Galaxy Training Network

Organizations: Erasmus University Medical Centre

https://orcid.org/0000-0003-3803-468X

Expertise: Genomics, amplicon analysis, Microbiology

Tools: Galaxy

Post-doc at ErasmusMC, Galaxy Training Network (GTN) Lead

Juan Caballero

Teams: MGnify

Organizations: EMBL-EBI

https://orcid.org/0000-0002-6160-3644

Expertise: Bioinformatics, Genomics, Metagenomics, Data Management

Tools: CWL, Jupyter notebook, Nextflow, Molecular Biology, Workflows, Microbiology, Transcriptomics, Perl, Python, R

Stephen Moss

Teams: Not specified

Organizations: Not specified

https://orcid.org/0000-0002-1399-293X

Expertise: Bioinformatics, Computer Science, Data Management, Genetics, Genomics, Machine Learning, Metagenomics, NGS, Scientific workflow developement, Software Engineering

Tools: Databases, Galaxy, Genomics, Jupyter notebook, Machine Learning, Nextflow, nf-core, PCR, Perl, Python, R, rtPCR, Snakemake, Transcriptomics, Virology, Web, Web services, Workflows

Dad, husband and PhD. Scientist, technologist and engineer. Bibliophile. Philomath. Passionate about science, medicine, research, computing and all things geeky!

[email protected] Rivals

Teams: MAB - ATGC

Organizations: Centre National de la Recherche Scientifique (CNRS)

https://orcid.org/0000-0003-3791-3973

Expertise: Bioinformatics, Genomics, algorithm, Machine Learning, Metagenomics, NGS, Computer Science

Tools: Transcriptomics, Genomics, Python, C/C++, Web services, Workflows

bonohulab

Toward data-driven genome breeding (digital breeding), we are developing data analysis infrastructure technology essential for genome editing, focusing on gene function analysis using bioinformatics called BioDX.

Space: Hiroshima workflow community

Public web page: https://bonohu.hiroshima-u.ac.jp/index_en.html

Organisms: Not specified

Illumina Protocol - Testing

COPO

No description specified

Creator: Felix Shaw

Submitter: Felix Shaw

Download

Created: 9th Oct 2024 at 08:20

The BGE guide to using WorkflowHub

Biodiversity Genomics Europe (general)

This is a project specific guide for the Bioiversity Genomics Europe (BGE project use of WorkflowHub.

Creator: Stian Soiland-Reyes

Submitter: Stian Soiland-Reyes

DOI: 10.48546/workflowhub.sop.15.1

External Link

Created: 30th Apr 2024 at 14:38

EpiCurator: an immunoinformatic workflow to predict and prioritize SARS-CoV-2 epitopes

yPublish - Bioinfo tools

Abstract (Expand)

The ongoing coronavirus 2019 (COVID-19) pandemic, triggered by the emerging SARS-CoV-2 virus, represents a global public health challenge. Therefore, the development of effective vaccines is an urgent …

Authors: Cristina S. Ferreira, Yasmmin C. Martins, Rangel Celso Souza, Ana Tereza R. Vasconcelos

Date Published: 2021

Publication Type: Journal

DOI: 10.7717/peerj.12548

Citation: PeerJ 9:e12548

Created: 23rd Oct 2023 at 15:04, Last updated: 23rd Oct 2023 at 15:06

ONTeater

ELIXIR Biodiversity Community

Stable

ONTeater is a eukaryotic genome assembly pipeline intended to produce highly-contiguous genomes with a single input of Oxford NanoPore Tech (ONT) longread data, although PacBio is accepted as well. Information can be found here. Originally developed to support genome assembly efforts by OIKOS genomics, predominantly of nonmodel vertebrate species.

Type: Nextflow

Creators: None

Submitter: Keiler Collier

Created: 18th Jun 2025 at 14:12, Last updated: 21st Jun 2025 at 08:15

Click-qPCR

Ultra-simple tool for interactive qPCR data analysis developed by R and Shiny.

Read this document in Japanese (日本語版のユーザーガイドはこちら)

Overview

Click-qPCR is a user-friendly Shiny web application designed for the straightforward analysis of real-time quantitative PCR (qPCR) data.

This tool is readily accessible via a web browser at https://kubo-azu.shinyapps.io/Click-qPCR/, requiring no local installation for end-users. ...

Type: Unrecognized workflow type

Creator: Azusa Kubota

Submitter: Azusa Kubota

Created: 4th Jun 2025 at 06:10

Workflow Constructed From History 'IWTomics Workflow'

Galaxy Training Network

Interval-Wise Testing for omics data

Associated Tutorial

This workflows is part of the tutorial Interval-Wise Testing for omics data, available in the GTN

Thanks to...

Tutorial Author(s): Marzia A Cremona, Fabio Cumbo ...

Type: Galaxy

Creators: None

Submitter: GTN Bot

Created: 2nd Jun 2025 at 14:15

BVSim: A Benchmarking Variation Simulator Mimicking Human Variation Spectrum

Structural Variation Analysis

Stable

BVSim: A Benchmarking Variation Simulator Mimicking Human Variation Spectrum

Getting Started
Installation
General Functions and Parameters
Shared Parameters
Output Naming Conventions
[Write the Relative ...

Type: Unrecognized workflow type

Creators: Yongyi Luo, Zhen Zhang, Jiandong Shi, Jingyu Hao, Sheng Lian, Taobo Hu, Toyotaka Ishibashi, Depeng Wang, Shu Wang, Weichuan Yu, Xiaodan Fan

Submitter: Zhen Zhang

DOI: 10.48546/workflowhub.workflow.1361.1

Created: 10th May 2025 at 14:56, Last updated: 10th May 2025 at 15:23

gSpreadComp

Kasmanas

gSpreadComp: Streamlining Microbial Community Analysis for Resistance, Virulence, and Plasmid-Mediated Spread

Overview

gSpreadComp is a UNIX-based, modular bioinformatics toolkit designed to streamline comparative genomics for analyzing microbial communities. It integrates genome annotation, gene spread calculation, plasmid-mediated horizontal gene transfer (HGT) detection and resistance-virulence ranking within the analysed microbial community to help researchers identify potential ...

Type: Shell Script

Creator: Jonas Kasmanas

Submitter: Jonas Kasmanas

DOI: 10.48546/workflowhub.workflow.1340.3

Created: 15th Apr 2025 at 11:29

sanger-tol/curationpretext

Tree of Life Genome Assembly, Tree of Life Genome Analysis

Work-in-progress

sanger-tol/curationpretext

[![Cite with ...

Type: Nextflow

Creators: Damon-Lee Pointon, Mahesh Panchel, Yumi Sims, Will Eagles, Matthieu Muffato, Solenne Correard, Josie Paris

Submitter: Damon-Lee Pointon

Created: 12th Mar 2025 at 10:23

sanger-tol/curationpretext

Tree of Life Genome Assembly, Tree of Life Genome Analysis

Work-in-progress

[![Cite ...

Type: Nextflow

Creators: Damon-Lee Pointon, Mahesh Panchel

Submitter: Damon-Lee Pointon

Created: 12th Mar 2025 at 10:19

nf-core/phaseimpute

nf-core

Phasing and imputation pipeline

Type: Nextflow

Creators: Louis Le Nezet, Anabella Trigila

Submitter: WorkflowHub Bot

Created: 10th Dec 2024 at 04:04

skim2mito

NHM Clark group

Stable

skim2mito

skim2mito is a snakemake pipeline for the batch assembly, annotation, and phylogenetic analysis of mitochondrial genomes from low coverage genome skims. The pipeline was designed to work with sequence data from museum collections. However, it should also work with genome skims from recently collected samples.

Setup
Example data
Input
Output
Filtering contaminants
[Assembly and ...

Type: Snakemake

Creators: None

Submitter: Oliver White

Created: 12th Mar 2024 at 15:03, Last updated: 7th Oct 2024 at 13:24

covid-sequence-analysis-workflow

SARS-CoV-2 Data Hubs

Stable

covid-sequence-analysis-workflow

This is the official repository of the SARS-CoV-2 variant surveillance pipeline developed by Danish Technical University (DTU), Eotvos Lorand University (ELTE), EMBL-EBI, Erasmus Medical Center (EMC) under the Versatile Emerging infectious disease Observatory (VEO) project. The project consists of 20 European partners. It is funded by the European Commission.

The ...

Type: Nextflow

Creator: David Yuan

Submitter: David Yuan

DOI: 10.48546/workflowhub.workflow.664.1

Created: 14th Nov 2023 at 09:42

nf-core/pairgenomealign

nf-core

[![Cite ...

Type: Nextflow

Creators: charles-plessy , charles-plessy

Submitter: WorkflowHub Bot

Created: 28th Aug 2024 at 04:03, Last updated: 6th Feb 2025 at 04:04

ERGA-BGE Genome Report ASM analyses (one-asm HiFi + HiC)

ERGA Assembly

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, HiFi WGS reads + HiC reads

The workflow requires the following:

Species Taxonomy ID number
NCBI Genome assembly accession code
BUSCO Lineage
WGS accurate reads accession code
NCBI HiC reads accession code

The workflow will get the data and process it to generate genome profiling (genomescope, smudgeplot -optional-), assembly stats (gfastats), merqury stats (QV, completeness), BUSCO, snailplot, contamination blobplot, and HiC ...

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1104.1

Created: 20th Aug 2024 at 14:19, Last updated: 5th Dec 2024 at 16:47

ERGA-BGE Genome Report ASM analyses (one-asm WGS Illumina PE + HiC)

ERGA Assembly

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, Illumina WGS reads + HiC reads

The workflow requires the following:

Species Taxonomy ID number
NCBI Genome assembly accession code
BUSCO Lineage
WGS accurate reads accession code
NCBI HiC reads accession code

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1103.2

Created: 19th Aug 2024 at 10:38, Last updated: 5th Dec 2024 at 16:48

ERGA-BGE Genome Report ANNOT analyses

ERGA Annotation

Stable

The workflow requires the user to provide:

ENSEMBL link address of the annotation GFF3 file
ENSEMBL link address of the assembly FASTA file
NCBI taxonomy ID
BUSCO lineage
OMArk database

Thw workflow will produce statistics of the annotation based on AGAT, BUSCO and OMArk.

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1096.1

Created: 9th Aug 2024 at 15:14, Last updated: 24th Feb 2025 at 15:24

cfDNA UniFlow: A unified preprocessing pipeline for cell-free DNA data from liquid biopsies

KircherLab

Stable

cfDNA UniFlow is a unified, standardized, and ready-to-use workflow for processing whole genome sequencing (WGS) cfDNA samples from liquid biopsies. It includes essential steps for pre-processing raw cfDNA samples, quality control and reporting. Additionally, several optional utility functions like GC bias correction and estimation of copy number state are included. Finally, we provide specialized methods for extracting coverage derived signals and visualizations comparing cases and controls. ...

Type: Snakemake

Creator: Sebastian Röner

Submitter: Sebastian Röner

DOI: 10.48546/workflowhub.workflow.1091.2

Created: 7th Aug 2024 at 12:56, Last updated: 11th Nov 2024 at 08:21

Deepconsensus for Sequel2/2e subreads

WGGC

deepconsensus 1.2 snakemake pipeline

This snakemake-based workflow takes in a subreads.bam and results in a deepconsensus.fastq

no methylation calls !

The metadata id of the subreads file needs to be: "m[numeric][numeric][numeric].subreads.bam"

Chunking (how many subjobs) and ccs min quality filter can be adjusted in the config.yaml

the checkpoint model for deepconsensus1.2 should be accessible like this: gsutil cp -r gs://brain-genomics-public/research/deepconsensus/models/v1.2/model_checkpoint/* ...

Type: Snakemake

Creators: None

Submitter: dan rick

Created: 12th Jul 2024 at 09:59

bacterial_genome_annotation/main

Intergalactic Workflow Commission (IWC)

Tests Passing

Annotation of an assembled bacterial genomes to detect genes, potential plasmids, integrons and Insertion sequence (IS) elements.

Type: Galaxy

Creators: ABRomics , Pierre Marin, Clea Siguret, abromics-consortium

Submitter: WorkflowHub Bot

Created: 20th Jun 2024 at 03:01

quality-and-contamination-control/main

Intergalactic Workflow Commission (IWC)

Tests Not available

Short paired-end read analysis to provide quality analysis, read cleaning and taxonomy assignation

Type: Galaxy

Creators: ABRomics , Pierre Marin, Clea Siguret, abromics-consortium

Submitter: WorkflowHub Bot

Created: 20th Jun 2024 at 03:02, Last updated: 2nd Oct 2024 at 10:54

amr_gene_detection/main

Intergalactic Workflow Commission (IWC)

Tests Not available

Antimicrobial resistance gene detection from assembled bacterial genomes

Type: Galaxy

Creators: ABRomics , Pierre Marin, Clea Siguret, abromics-consortium

Submitter: WorkflowHub Bot

Created: 19th Jun 2024 at 03:01

bacterial-genome-assembly/main

Intergalactic Workflow Commission (IWC)

Tests Not available

Assembly of bacterial paired-end short read data with generation of quality metrics and reports

Type: Galaxy

Creators: Abromics , Pierre Marin, Clea Siguret, abromics-consortium

Submitter: WorkflowHub Bot

Created: 18th Jun 2024 at 03:01, Last updated: 2nd Oct 2024 at 11:00

MOLGENIS/VIP: Variant Interpretation Pipeline

MOLGENIS

Stable

Variant Interpretation Pipeline (VIP) that annotates, filters and reports prioritized causal variants in humans, see https://github.com/molgenis/vip for more information.

Type: Unrecognized workflow type

Creators: None

Submitter: Dennis Hendriksen

Download

Created: 21st Jun 2021 at 09:33, Last updated: 12th Jun 2024 at 10:50

nf-core/sarek

nf-core

An open-source analysis pipeline to detect germline or somatic variants from whole genome or targeted sequencing

Type: Nextflow

Creators: Maxime Garcia, Szilveszter Juhos

Submitter: WorkflowHub Bot

Created: 4th Jun 2024 at 11:33

nf-core/circdna

nf-core

Pipeline for the identification of circular DNAs

Type: Nextflow

Creator: Daniel Schreyer

Submitter: WorkflowHub Bot

Created: 4th Jun 2024 at 11:32

nf-core/bactmap

nf-core

A mapping-based pipeline for creating a phylogeny from bacterial whole genome sequences

Type: Nextflow

Creator: Anthony Underwood

Submitter: WorkflowHub Bot

Created: 4th Jun 2024 at 11:32

GSC (Genotype Sparse Compression)

Genome Data Compression Team

Stable

GSC (Genotype Sparse Compression)

Genotype Sparse Compression (GSC) is an advanced tool for lossless compression of VCF files, designed to efficiently store and manage VCF files in a compressed format. It accepts VCF/BCF files as input and utilizes advanced compression techniques to significantly reduce storage requirements while ensuring fast query capabilities. In our study, we successfully compressed the VCF files from the 1000 Genomes Project (1000Gpip3), consisting of 2504 samples and 80 ...

Type: Docker

Creator: Xiaolong Luo

Submitter: Xiaolong Luo

DOI: 10.48546/workflowhub.workflow.887.1

Created: 18th May 2024 at 14:18

GSC (Genotype Sparse Compression)

Genome Data Compression Team

Stable

GSC (Genotype Sparse Compression)

Type: Common Workflow Language

Creators: None

Submitter: Xiaolong Luo

Created: 17th May 2024 at 17:51

Parabricks-Genomics-nf

Sydney Informatics Hub

Parabricks-Genomics-nf is a GPU-enabled pipeline for alignment and germline short variant calling for short read sequencing data. The pipeline utilises NVIDIA's Clara Parabricks toolkit to dramatically speed up the execution of best practice bioinformatics tools. Currently, this pipeline is configured specifically for NCI's Gadi HPC.

NVIDIA's Clara Parabricks can deliver a significant ...

Type: Nextflow

Creator: Georgina Samaha

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.836.1

Created: 25th Apr 2024 at 00:19

sanger-tol/treeval v1.1.0 - Ancient Aurora

Tree of Life Genome Assembly

Stable

...

Type: Nextflow

Creators: Damon-Lee Pointon, William Eagles, Ying Sims

Submitter: Damon-Lee Pointon

Created: 9th Apr 2024 at 10:22

HiC contact map generation

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC contact map generation

Snakemake pipeline for the generation of .pretext and .mcool files for visualisation of HiC contact maps with the softwares PretextView and HiGlass, respectively.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda

There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.795.2

Created: 14th Mar 2024 at 09:50, Last updated: 14th Mar 2024 at 09:52

Somatic-ShortV-nf

Sydney Informatics Hub, Australian BioCommons

(Show All)

Work-in-progress

This is a Nextflow implementaion of the GATK Somatic Short Variant Calling workflow. This workflow can be used to discover somatic short variants (SNVs and indels) from tumour and matched normal BAM files following GATK's Best Practices Workflow. The workflowis currently optimised to run efficiently and at scale on the National Compute Infrastructure, Gadi.

Type: Nextflow

Creators: Nandan Deshpande, Tracy Chew, Cali Willet, Georgina Samaha

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.691.1

Created: 20th Dec 2023 at 01:12, Last updated: 20th Dec 2023 at 01:16

Inclusion Body Myositis Active Subnetwork Identification Workflow

EJPRD WP13 case-studies workflows

Workflow for Creating a large disease network from various datasets and databases for IBM, and applying the active subnetwork identification method MOGAMUN.

Type: Common Workflow Language

Creators: Daphne Wijnbergen, Mridul Johari

Submitter: Daphne Wijnbergen

DOI: 10.48546/workflowhub.workflow.681.7

Created: 27th Nov 2023 at 12:52, Last updated: 1st Feb 2024 at 11:26

ANNOTATO - ERGA Genome Annotation Workflow in Nextflow

ERGA Annotation, Bioinformatics Laboratory for Genomics and Biodiversity (LBGB)

Stable

ANNOTATO - Annotation workflow To Annotate Them Oll

ANNOTATO - Annotation workflow To Annotate Them Oll
Overview of the workflow
Input data
Pipeline steps
Output data
Prerequisites
Installation
Running ANNOTATO
Before running the pipeline (IMPORTANT) ...

Type: Nextflow

Creator: Phuong Doan

Submitters: Tom Brown, Phuong Doan

DOI: 10.48546/workflowhub.workflow.654.2

Created: 9th Nov 2023 at 09:43, Last updated: 24th Nov 2023 at 15:24

ERGA Protein-coding gene annotation workflow

ERGA Annotation

ERGA Protein-coding gene annotation workflow.

Adapted from the work of Sagane Joye:

https://github.com/sdind/genome_annotation_workflow

Prerequisites

The following programs are required to run the workflow and the listed version were tested. It should be noted that older versions of snakemake are not compatible with newer versions of singularity as is noted here: https://github.com/nextflow-io/nextflow/issues/1659.

conda v 23.7.3 ...

Type: Snakemake

Creator: Sagane Joye-Dind

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.569.1

Created: 12th Sep 2023 at 20:29, Last updated: 13th Sep 2023 at 14:40

CLAWS (CNAG's long-read assembly workflow in Snakemake)

ERGA Assembly

Stable

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

Snakemake Pipeline used for de novo genome assembly @CNAG. It has been developed for Snakemake v6.0.5.

It accepts Oxford Nanopore Technologies (ONT) reads, PacBio HFi reads, illumina paired-end data, illumina 10X data and Hi-C reads. It does the preprocessing of the reads, assembly, polishing, purge_dups, scaffodling and different evaluation steps. By default it will preprocess the reads, run Flye + Hypo + purge_dups + yahs and evaluate ...

Type: Snakemake

Creators: Jessica Gomez-Garrido, Fernando Cruz (CNAG), Francisco Camara (CNAG), Tyler Alioto (CNAG)

Submitter: Jessica Gomez-Garrido

DOI: 10.48546/workflowhub.workflow.567.2

Created: 12th Sep 2023 at 14:23, Last updated: 2nd Feb 2024 at 12:24

ARA (Automated Record Analysis)

ARA-dev

Stable

ARA (Automated Record Analysis) : An automatic pipeline for exploration of SRA datasets with sequences as a query

Requirements

Docker
Please checkout the Docker installation guide.

Mamba package manager
Please checkout the mamba or micromamba official installation guide.
We prefer mamba over conda since it is faster and uses ...

Type: Perl

Creators: Anand Maurya, Maciej Szymanski, Wojciech Karlowski

Submitter: Anand Maurya

DOI: 10.48546/workflowhub.workflow.546.1

Created: 31st Jul 2023 at 13:44, Last updated: 31st Jul 2023 at 13:49

prepareChIPs:

Black Ochre Data Labs

Work-in-progress

prepareChIPs

This is a simple snakemake workflow template for preparing single-end ChIP-Seq data. The steps implemented are:

Download raw fastq files from SRA
Trim and Filter raw fastq files using AdapterRemoval
Align to the supplied genome using bowtie2
Deduplicate Alignments using Picard MarkDuplicates
Call Macs2 Peaks using macs2

A pdf of the rulegraph is available here

Full details for each step are given below. Any additional ...

Type: Snakemake

Creator: Stevie Pederson

Submitter: Stevie Pederson

DOI: 10.48546/workflowhub.workflow.528.1

Created: 9th Jul 2023 at 09:54

CWL-based (single-sample) workflow for germline variant calling

Biodata Analysis Group

Stable

A CWL-based pipeline for calling small germline variants, namely SNPs and small INDELs, by processing data from Whole-genome Sequencing (WGS) or Targeted Sequencing (e.g., Whole-exome sequencing; WES) experiments.

On the respective GitHub folder are available:

The CWL wrappers and subworkflows for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data

Briefly, the workflow performs the following steps:

Quality control of Illumina reads ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.527.1

Created: 5th Jul 2023 at 10:48

CWL-based (multi-sample) workflow for germline variant calling

Biodata Analysis Group

Stable

On the respective GitHub folder are available:

The CWL wrappers and subworkflows for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data

Briefly, the workflow performs the following steps:

Quality control of Illumina reads ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.526.1

Created: 5th Jul 2023 at 10:44

Purge retained haplotypes using Purge-Dups

ERGA Assembly, Biodiversity Genomics Europe (general)

Purge dups

This snakemake pipeline is designed to be run using as input a contig-level genome and pacbio reads. This pipeline has been tested with snakemake v7.32.4. Raw long-read sequencing files and the input contig genome assembly must be given in the config.yaml file. To execute the workflow run:

snakemake --use-conda --cores N

Or configure the cluster.json and run using the ./run_cluster command

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.506.2

Created: 16th Jun 2023 at 14:56, Last updated: 16th Mar 2024 at 07:49

Mobilome Annotation Pipeline

MGnify

Stable

Mobilome Annotation Pipeline (former MoMofy)

Bacteria can acquire genetic material through horizontal gene transfer, allowing them to rapidly adapt to changing environmental conditions. These mobile genetic elements can be classified into three main categories: plasmids, phages, and integrative elements. Plasmids are mostly extrachromosmal; phages can be found extrachromosmal or as temperate phages (prophages); whereas integrons are stable inserted in the chromosome. Autonomous elements are ...

Type: Nextflow

Creators: Alejandra Escobar, Martin Beracochea

Submitters: Martin Beracochea, Alejandra Escobar

Created: 6th Apr 2023 at 10:40, Last updated: 17th Jun 2025 at 15:24

IGVreport-nf

Sydney Informatics Hub, Australian BioCommons

Work-in-progress

IGVreport-nf

Description
Diagram
User guide
Workflow summaries
Metadata
Component tools
Required (minimum) inputs/parameters
Additional notes
Help/FAQ/Troubleshooting
Acknowledgements/citations/credits

Description

Quickly generate [IGV .html ...

Type: Nextflow

Creators: Georgina Samaha, Tracy Chew

Submitter: Georgina Samaha

Created: 21st Mar 2023 at 05:17

GermlineStructuralV-nf

Sydney Informatics Hub, Australian BioCommons

(Show All)

GermlineStructuralV-nf is a pipeline for identifying structural variant events in human Illumina short read whole genome sequence data. GermlineStructuralV-nf identifies structural variant and copy number events from BAM files using Manta, Smoove, and TIDDIT. Variants are then merged using SURVIVOR, ...

Type: Nextflow

Creators: Georgina Samaha, Marina Kennerson, Tracy Chew, Sarah Beecroft

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.431.1

Created: 31st Jan 2023 at 23:40, Last updated: 18th Dec 2023 at 05:36

IndexReferenceFasta-nf

Sydney Informatics Hub, Australian BioCommons

Stable

IndexReferenceFasta-nf

===========

Description
Diagram
User guide
Benchmarking
Workflow summaries
Metadata
Component tools
Required (minimum) inputs/parameters
Additional notes
Help/FAQ/Troubleshooting
Acknowledgements/citations/credits ...

Type: Nextflow

Creator: Georgina Samaha

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.393.1

Created: 12th Oct 2022 at 03:34

Metagenomic GEMs from Assembly

UNLOCK

Work-in-progress

Workflow for Metagenomics from bins to metabolic models (GEMs)

Summary

Prodigal gene prediction
CarveMe genome scale metabolic model reconstruction
MEMOTE for metabolic model testing
SMETANA Species METabolic interaction ANAlysis

Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default

All tool CWL files and other workflows can be found here: Tools: https://gitlab.com/m-unlock/cwl Workflows: https://gitlab.com/m-unlock/cwl/workflows

**How ...

Type: Common Workflow Language

Creators: Bart Nijsse, Jasper Koehorst

Submitter: Bart Nijsse

Created: 7th Jul 2022 at 09:23, Last updated: 2nd Nov 2022 at 15:41

LongRead Quality Control and Filtering

UNLOCK

(Show All)

Work-in-progress

Workflow for LongRead Quality Control and Filtering

NanoPlot (read quality control) before and after filtering
Filtlong (read trimming)
Kraken2 taxonomic read classification before and after filtering
Minimap2 read filtering based on given references

Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default

All tool CWL files and other workflows can be found here: https://gitlab.com/m-unlock/cwl/workflows

**How to setup and use an UNLOCK ...

Type: Common Workflow Language

Creators: Bart Nijsse, Jasper Koehorst, Germán Royval

Submitter: Bart Nijsse

Created: 21st Apr 2022 at 17:19, Last updated: 7th Apr 2023 at 15:07

Nanopore Assembly Workflow - Deprecated -

UNLOCK

(Show All)

- Deprecated -

See our updated hybrid assembly workflow: https://workflowhub.eu/workflows/367

And other workflows: https://workflowhub.eu/projects/16#workflows

Workflow for sequencing with ONT Nanopore data, from basecalled reads to (meta)assembly and binning

Workflow Nanopore Quality
Kraken2 taxonomic classification of FASTQ reads
Flye (de-novo assembly)
Medaka (assembly polishing)
metaQUAST (assembly quality reports)

When Illumina reads are provided:

Workflow ...

Type: Common Workflow Language

Creators: Bart Nijsse, Jasper Koehorst, Germán Royval

Submitter: Jasper Koehorst

Created: 6th Jan 2022 at 07:38, Last updated: 2nd Feb 2023 at 15:16

Workflow for Illumina Quality Control and Filtering

UNLOCK

(Show All)

Stable

Workflow for Illumina Quality Control and Filtering

Multiple paired datasets will be merged into single paired dataset.

Summary:

FastQC on raw data files
fastp for read quality trimming
BBduk for phiX and (optional) rRNA filtering
Kraken2 for taxonomic classification of reads (optional)
BBmap for (contamination) filtering using given references (optional)
FastQC on filtered (merged) data

Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default ...

Type: Common Workflow Language

Creators: Bart Nijsse, Jasper Koehorst, Changlin Ke

Submitter: Bart Nijsse

Created: 21st Apr 2022 at 14:00, Last updated: 7th Apr 2023 at 15:02

V-pipe (main multi-virus version)

V-Pipe

Stable

...

Type: Snakemake

Creators: Ivan Topolsky, Kim Philipp Jablonski

Submitter: Ivan Topolsky

DOI: 10.48546/workflowhub.workflow.301.5

Created: 30th Mar 2022 at 17:50, Last updated: 10th Jun 2024 at 19:38

Bootstrapping-for-BQSR @ NCI-Gadi

Sydney Informatics Hub

Work-in-progress

Bootstrapping-for-BQSR @ NCI-Gadi is a pipeline for bootstrapping a variant resource to enable GATK base quality score recalibration (BQSR) for non-model organisms that lack a publicly available variant resource. This implementation is optimised for the National Compute Infrastucture's Gadi HPC. Multiple rounds of bootstrapping can be performed. Users can use Fastq-to-bam @ NCI-Gadi and Germline-ShortV @ NCI-Gadi to ...

Type: Shell Script

Creators: Tracy Chew, Rosemarie Sadsad, Cali Willet

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.153.1

Download

Created: 18th Aug 2021 at 00:26, Last updated: 7th Sep 2021 at 07:24

GATK4 Fastq to joint-called cohort VCF with Cromwell on local cluster (no job scheduler)

Australian BioCommons, Pawsey Supercomputing Research Centre

Work-in-progress

Local Cromwell implementation of GATK4 germline variant calling pipeline

See the GATK website for more information on this toolset

Assumptions

Using hg38 human reference genome build
Running 'locally' i.e. not using HPC/SLURM scheduling, or containers. This repo was specifically tested on Pawsey Nimbus 16 CPU, 64GB RAM virtual machine, primarily running in the /data volume storage partition.
Starting from short-read Illumina paired-end fastq ...

Type: Workflow Description Language

Creators: None

Submitter: Sarah Beecroft

Download

Created: 17th Aug 2021 at 05:47

Fastq-to-bam @ NCI-Gadi

Australian BioCommons, Sydney Informatics Hub

(Show All)

Stable

Fastq-to-BAM @ NCI-Gadi is a genome alignment workflow that takes raw FASTQ files, aligns them to a reference genome and outputs analysis ready BAM files. This workflow is designed for the National Computational Infrastructure's (NCI) Gadi supercompter, leveraging multiple nodes on NCI Gadi to run all stages of the workflow in parallel, either massively parallel using the scatter-gather approach or parallel by sample. It consists of a number of stages and follows the BROAD Institute's best practice ...

Type: Shell Script

Creators: Cali Willet, Tracy Chew, Georgina Samaha, Rosemarie Sadsad, Andrey Bliznyuk, Ben Menadue, Rika Kobayashi, Matthew Downton, Yue Sun

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.146.1

Download

Created: 17th Aug 2021 at 05:45, Last updated: 31st Sep 2022 at 00:23

GATK4 Fastq to joint-called cohort VCF with Cromwell on SLURM

Australian BioCommons, Pawsey Supercomputing Research Centre

Work-in-progress

SLURM HPC Cromwell implementation of GATK4 germline variant calling pipeline

See the GATK website for more information on this toolset

Assumptions

Using hg38 human reference genome build
Running using HPC/SLURM scheduling. This repo was specifically tested on Pawsey Zeus machine, primarily running in the /scratch partition.
Starting from short-read Illumina paired-end fastq files as input

Dependencies

The following versions have been ...

Type: Workflow Description Language

Creators: None

Submitter: Sarah Beecroft

Download

Created: 17th Aug 2021 at 05:42, Last updated: 17th Aug 2021 at 05:56

Germline-ShortV @ NCI-Gadi

Australian BioCommons, Sydney Informatics Hub

(Show All)

Work-in-progress

Germline-ShortV @ NCI-Gadi is an implementation of the BROAD Institute's best practice workflow for germline short variant discovery. This implementation is optimised for the National Compute Infrastucture's Gadi HPC, utilising scatter-gather parallelism to enable use of multiple nodes with high CPU or memory efficiency. This workflow requires sample BAM files, which can be generated using the Fastq-to-bam @ NCI-Gadi pipeline. Germline-ShortV can be applied ...

Type: Shell Script

Creators: Rosemarie Sadsad, Georgina Samaha, Tracy Chew, Cali Willet

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.143.1

Download

Created: 17th Aug 2021 at 05:35, Last updated: 9th Sep 2021 at 02:34

ORSON: workflow for prOteome and tRanScriptome functiOnal aNnotation

SeBiMER

(Show All)

Stable

ORSON combine state-of-the-art tools for annotation processes within a Nextflow pipeline: sequence similarity search (PLAST, BLAST or Diamond), functional annotation retrieval (BeeDeeM) and functional prediction (InterProScan). When required, BUSCO completness evaluation and eggNOG Orthogroup annotation can be activated. While ORSON results can be analyzed through the command-line, it also offers the possibility to be compatible with BlastViewer or Blast2GO graphical tools.

Type: Nextflow

Creators: Cyril Noel, Alexandre Cormier, Patrick Durand, Laura Leroi, Pierre Cuzin

Submitter: Patrick Durand

DOI: 10.48546/workflowhub.workflow.136.1

Download

Created: 8th Jul 2021 at 15:18, Last updated: 8th Jul 2021 at 15:38

Genome Assembly Workflows for ERGA-BGE genomes

Pipelines used by the genomes assembly teams part of the Biodiversity Genomics Europe project

https://biodiversitygenomics.eu/

Maintainers: Tom Brown

Number of items: 3

Tags: Assembly, Genomics, Biodiversity

Created: 4th Sep 2024 at 09:54, Last updated: 14th Nov 2024 at 08:35

Genome Evaluation for ERGA-BGE Reports

Collection of Galaxy workflows for generating results used for creating ERGA-BGE Reports

For a given genome, two workflows should be run: the assembly evaluation (ASM analyses), and the annotation evaluation (ANNOT analyses)

Depending on the kind of data used for the genome assembly, you should choose HiFi or ONT (Illumina) workflows for ASM analyses

Maintainers: Diego De Panis

Number of items: 3

Tags: Genomics, QC, Genome assembly

Created: 20th Aug 2024 at 14:44, Last updated: 26th Aug 2024 at 13:03

ERGA Assembly Galaxy ONT+Illumina & HiC Pipelines (Flye-HyPo + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be Oxford Nanopore raw reads plus Illumina WGS reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output one scaffolded collapsed assembly and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for ONT, and another one for Illumina that can be used independently for the WGS and HiC reads), WF1, WF2, WF3, WF4

Maintainers: Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, ONT, illumina, Hi-C

Created: 8th Jan 2024 at 09:54, Last updated: 11th Mar 2024 at 12:42

ERGA Assembly Galaxy ONT+Illumina & HiC Pipelines (NextDenovo-HyPo + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be Oxford Nanopore raw reads plus Illumina WGS reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output one scaffolded collapsed assembly and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for ONT, and another one for Illumina that can be used independently for the WGS and HiC reads), WF1, WF2, WF3, WF4

Maintainers: Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, ONT, illumina, Hi-C

Created: 8th Jan 2024 at 09:51, Last updated: 11th Mar 2024 at 14:45

Workflows used by BGE project

This is a general collection of workflows used by or developed by members of the BGE project.

Maintainers: Stian Soiland-Reyes

Number of items: 0

Tags: Genomics, Biodiversity

Created: 23rd Oct 2023 at 12:09, Last updated: 23rd Oct 2023 at 12:14

ERGA Assembly Galaxy HiFi & HiC Pipelines (Hifiasm-HiC + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be PacBio HiFi reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output two scaffolded haplotype assemblies and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for HiFi and one for Illumina HiC), WF1, WF2, WF3, WF4

Maintainers: Tom Brown, Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, HiFi, Hi-C

Created: 16th Jun 2023 at 15:07, Last updated: 20th Nov 2023 at 16:20

Vertebrate Genomes Pipelines (VGP) workflows

The Vertebrate Genomes Pipelines in Galaxy are intended to allow a user to generate high-quality near error-free assemblies of species from a user's own data or from the GenomeArk database.

Maintainers: Stian Soiland-Reyes

Number of items: 3

Tags: vgp, vertebrates, Genomics, Biodiversity

Created: 27th Jan 2023 at 11:51, Last updated: 27th Jan 2023 at 11:58

Click-qPCR

Overview

Associated Tutorial

Thanks to...

BVSim: A Benchmarking Variation Simulator Mimicking Human Variation Spectrum

Table of Contents

gSpreadComp: Streamlining Microbial Community Analysis for Resistance, Virulence, and Plasmid-Mediated Spread

Overview

sanger-tol/curationpretext

skim2mito

Contents

covid-sequence-analysis-workflow

deepconsensus 1.2 snakemake pipeline

GSC (Genotype Sparse Compression)

GSC (Genotype Sparse Compression)

HiC contact map generation

Prerequisites

ANNOTATO - Annotation workflow To Annotate Them Oll

ERGA Protein-coding gene annotation workflow.

Prerequisites

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

ARA (Automated Record Analysis) : An automatic pipeline for exploration of SRA datasets with sequences as a query

Requirements

prepareChIPs

Purge dups

Mobilome Annotation Pipeline (former MoMofy)

IGVreport-nf

Description

IndexReferenceFasta-nf

Workflow for Metagenomics from bins to metabolic models (GEMs)

Workflow for LongRead Quality Control and Filtering

- Deprecated -

See our updated hybrid assembly workflow: https://workflowhub.eu/workflows/367

And other workflows: https://workflowhub.eu/projects/16#workflows

Workflow for Illumina Quality Control and Filtering

Local Cromwell implementation of GATK4 germline variant calling pipeline

Assumptions

SLURM HPC Cromwell implementation of GATK4 germline variant calling pipeline

Assumptions

Dependencies