Viral Amplicon Analysis Pipeline for ONT Data
Version 1

Workflow Type: Galaxy

Viral Amplicon Analysis Pipeline for ONT Data

Description

Galaxy workflow for processing viral amplicon datasets, such as MPXV (Mpox), sequenced using the Oxford Nanopore Technologies (ONT) platform. The workflow uses the latest Fieldbioinformatics Artic Minion pipeline that uses the clair3 variant caller.

The ARTIC minion is a pipeline for working with viral nanopore sequencing data, generated from tiling amplicon schemes. It is designed to help run the artic bioinformatics protocols; for example the MPXV protocol. The ARTIC minion pipeline performs genome mapping, variant calling and consensus assembly.

Inputs

  • ONT Reads: Oxford Nanopore Technologies (ONT) sequenced reads from ARTIC assay with fastqsanger encoding. [fastqsanger, fastqsanger.gz]
  • Claire3 Model: Basecalling model to be used for processing ONT data. The basecalling model must be uploaded to your Galaxy History. Available models can be downloaded from the Rerio repository: https://github.com/nanoporetech/rerio. Please note: By using these models, you agree to the Oxford Nanopore Technologies, Ltd. Public License agreement [tar.gz]
  • Normalise Reads: Sample at most this number of reads per amplicon and strand. This parameter sets the value for the ARTIC minion input Coverage normalisation depth (Default value 400). [integer]
  • Oxford Nanopore Technologies, Ltd. Public License agreement: Users must agree to the Oxford Nanopore Technologies, Ltd. Public License agreement in order to run this workflow. [boolean]
  • Maximum Read Length: Optional input specifying the maximum permitted read length. Reads longer than this value will be discarded during filtering (Default value 2600). [integer]
  • Minimum Read Length: Optional input specifying the minimum permitted read length. Reads shorter than this value will be discarded during filtering (Default value 200). [integer]
  • Amplicon scheme name: Name of the amplicon scheme on labs.primalscheme.com to fetch primer BED and reference (Default value artic-inrb-mpox).
  • Amplicon scheme version: Version of the amplicon scheme on labs.primalscheme.com in format vX.X.X (Default value v1.0.0).
  • Amplicon scheme amplicon length: Length of the amplicons in the scheme. Only required if the scheme named has multiple lengths (Default value 2500).

NOTE: Alternatively, the primer bed file (primer.bed) and reference genome fasta file (reference.fasta) can be provided from the user's History if previously imported into Galaxy from labs.primalscheme.com.

Outputs

  • Collated QC Results: MultiQC combined QC reports output by Fastplong. [html,tabular]
  • Collated BAM Files: Collection of the bam files output by Artic minion. [bam]
  • Collated Alignment Reports: Collection of the alignment reports output by Artic minion. [tabular]
  • Collated VCF Outputs: Collection of vcf files output by Artic minion. [vcf]
  • Combined FASTA Outputs: Single fasta file of all consensus sequences output by Artic minion. [fasta]
  • Plot Depth: Plot of the mapped read depth calculated by Mosdepth. [pdf]

Workflow Steps

  1. Filter ONT reads by length using Fastplong.
  2. Run the ARTIC minion pipeline tool to perform the following processing steps.
    • Align reads to reference genome using minimap2
    • Trim primers
    • Call variants (mutations)
    • Create consensus genome sequence
  3. Calculate the mapped read depth using Mosdepth and convert bedgraph output to tabular using Convert bedgraph to tsv.
  4. Rename consensus sequence headers produced by Artic minion from extended path/file names to the sample barcode (e.g. >barcode09/KJ642613.1_masked/ARTIC/clair3 KJ642613.1_masked -> >barcode09) using rename header and clean header.
  5. Combine Fastplong QC reports using MultiQC.
  6. Plot mapped read depth using Plotting tool.
  7. Create a collection of the Artic minion output bam files using Build list.
  8. Create a collection of the Artic minion output alignment reports using Build list.
  9. Create a collection of the Artic minion output vcf files using Build list.
  10. Merge all Artic minion output consensus sequences into a single fasta file using Concatenate datasets.

Inputs

ID Name Description Type
Amplicon scheme amplicon length Amplicon scheme amplicon length Length of the amplicons in the scheme. Only required if the scheme named has multiple lengths
  • int
Amplicon scheme name Amplicon scheme name Name of the amplicon scheme on labs.primalscheme.com to fetch primer BED and reference for
  • string
Amplicon scheme version Amplicon scheme version Version of the amplicon scheme on labs.primalscheme.com in format vX.X.X
  • string
Clair3 Model Clair3 Model Model must be tar.gz
  • File
I agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement I agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement To use Rerio models (e.g. for ONT R10 chemistry) you must agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement, which can be found in the nanoporetech/rerio Github repository.
  • boolean
Maximum read length Maximum read length Reads longer than this value will be discarded.
  • int?
Minimum read length Minimum read length Reads shorter than this value will be discarded.
  • int?
Normalise Reads Normalise Reads Sample at most this number of reads per amplicon and strand.
  • int
ONT Reads ONT Reads ONT reads from ARTIC assay with fastqsanger encoding
  • File[]

Steps

ID Name Description
9 Fastplong toolshed.g2.bx.psu.edu/repos/iuc/fastplong/fastplong/0.3.0+galaxy1
10 Artic minion toolshed.g2.bx.psu.edu/repos/iuc/artic_minion/artic_minion/1.7.3+galaxy0
11 Collated - QC Results toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.27+galaxy3
12 Mosdepth: calc depth (bedgraph) toolshed.g2.bx.psu.edu/repos/iuc/mosdepth/mosdepth/0.3.8+galaxy0
13 rename header Expected header: barcode09/KJ642613.1_masked/ARTIC/clair3 KJ642613.1_masked renamed to: barcode09 toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3
14 Collated - BAM Files __BUILD_LIST__
15 Collate Alignment Reports __BUILD_LIST__
16 Collated - VCF Outputs __BUILD_LIST__
17 Convert bedgraph to tsv Converts bedgraph to per base tsv for plotting toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_makewindowsbed/2.31.1+galaxy0
18 clean header Rename from: >{barcode}.fastq.gz to >{barcode} toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3
19 Plot Depth toolshed.g2.bx.psu.edu/repos/devteam/xy_plot/XY_Plot_1/1.0.2
20 Combine fasta outputs Combine the output of renamed artic consensus. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cat/9.5+galaxy0

Outputs

ID Name Description Type
alignment_report alignment_report n/a
  • File
html_report html_report n/a
  • File
stats_table stats_table n/a
  • File
Collated - BAM Files Collated - BAM Files n/a
  • File
Collated - VCF Outputs Collated - VCF Outputs n/a
  • File
Depth Plots Depth Plots n/a
  • File
All Consensus Sequences All Consensus Sequences n/a
  • File

Version History

Version 1 (earliest) Created 18th Jun 2026 at 01:10 by Tristan Reynolds

Initial commit


Open master 9341a2e
help Creators and Submitter
Creators
Submitter
Activity

Views: 0   Downloads: 0   Runs: 0

Created: 18th Jun 2026 at 01:10

Annotated Properties
Topic annotations
Scientific disciplines
Immunology and Microbiology
help Attributions

None

Total size: 855 KB
Powered by
(v.1.17.3)
Copyright © 2008 - 2026 The University of Manchester and HITS gGmbH