Generic variation analysis on WGS PE data
Version 1

Workflow Type: Galaxy

Generic variant calling

A generic workflow for identification of variants in a haploid genome such as genomes of bacteria or viruses. It can be readily used on MonkeyPox. The workflow accepts two inputs:

  • A genbank file with the reference genomes
  • A collection of paired fastqsanger files

The workflow outputs a collection of VCF files for each sample (each fastq pair). These VCF files serve as input to the Reporting workflow.

Workflow can be accessed directly on

The general idea of the workflow is:


ID Name Description Type
GenBank genome GenBank genome n/a
  • File
Paired Collection Paired Collection Illumina reads with fastqsanger encoding
  • File[]


ID Name Description
2 fastp
3 SnpEff build:
4 Map with BWA-MEM
5 Samtools view
6 Samtools stats
7 MarkDuplicates
8 MultiQC
9 Realign reads
10 Insert indel qualities
11 Call variants
12 Lofreq filter
13 SnpEff eff:


ID Name Description Type
fastp_pe fastp_pe n/a
  • File
fastp_html_report fastp_html_report n/a
  • File
SnpEff4.3 database for SnpEff4.3 database for n/a
  • File
Fasta sequences for Fasta sequences for n/a
  • File
Map with BWA-MEM on input dataset(s) (mapped reads in BAM format) Map with BWA-MEM on input dataset(s) (mapped reads in BAM format) n/a
  • File
filtered_mapped_reads filtered_mapped_reads n/a
  • File
mapped_reads_stats mapped_reads_stats n/a
  • File
markduplicates_reads markduplicates_reads n/a
  • File
markduplicates_stats markduplicates_stats n/a
  • File
preprocessing_and_mapping_reports preprocessing_and_mapping_reports n/a
  • File
realigned_deduplicated_filtered_mapped_reads realigned_deduplicated_filtered_mapped_reads n/a
  • File
realigned_deduplicated_filtered_mapped_reads_with_indel_quals realigned_deduplicated_filtered_mapped_reads_with_indel_quals n/a
  • File
called_variants called_variants n/a
  • File
soft_filtered_variants soft_filtered_variants n/a
  • File
_anonymous_output_1 _anonymous_output_1 n/a
  • File
SnpEff eff: on input dataset(s) - stats SnpEff eff: on input dataset(s) - stats n/a
  • File

Version History

Version 1 (earliest) Created 1st Jun 2022 at 13:46 by Anton Nekrutenko

Initial commit

Open master 9738be7
help Creators and Submitter
Not specified
Additional credit

Wolfgang Maier

Discussion Channel

Views: 3053   Downloads: 294

Created: 1st Jun 2022 at 13:46

Last updated: 1st Jun 2022 at 16:51

help Tags
help Attributions


Total size: 37.4 KB
Powered by
Copyright © 2008 - 2024 The University of Manchester and HITS gGmbH