MGnify - amplicon analysis pipeline
Version 1

Workflow Type: Common Workflow Language
Stable

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free-to-use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past two years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0). It replaces the previous single pipeline with multiple analysis pipelines that are tailored to the input data and formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.

Documentation: https://docs.mgnify.org/en/latest/analysis.html#amplicon-analysis-pipeline


Inputs

ID                  Name  Description  Type
single_reads        n/a   n/a          File?
forward_reads       n/a   n/a          File?
reverse_reads       n/a   n/a          File?
qc_min_length       n/a   n/a          int
stats_file_name     n/a   n/a          string
ssu_db              n/a   n/a          File
lsu_db              n/a   n/a          File
ssu_tax             n/a   n/a          string
lsu_tax             n/a   n/a          string
ssu_otus            n/a   n/a          string
lsu_otus            n/a   n/a          string
rfam_models         n/a   n/a          string[]
rfam_model_clans    n/a   n/a          string
ssu_label           n/a   n/a          string
lsu_label           n/a   n/a          string
5s_pattern          n/a   n/a          string
5.8s_pattern        n/a   n/a          string
unite_db            n/a   n/a          File
unite_tax           n/a   n/a          string
unite_otu_file      n/a   n/a          string
unite_label         n/a   n/a          string
itsonedb            n/a   n/a          File
itsonedb_tax        n/a   n/a          string
itsonedb_otu_file   n/a   n/a          string
itsonedb_label      n/a   n/a          string
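
The inputs listed above correspond to the entries of a CWL job file. The following is a minimal sketch in Python of how such a job file might be assembled and checked. Only the input IDs and types are taken from this page; the helper names (`cwl_file`, `placeholder_values`, `build_job`), all paths and all values are hypothetical placeholders. The sketch also assumes, based only on the optional `File?` types, that a run supplies either `single_reads` or both `forward_reads` and `reverse_reads`; the pipeline documentation should be checked before relying on that.

```python
import json

# Input IDs and CWL types copied from the Inputs list on this page.
INPUT_TYPES = {
    "single_reads": "File?", "forward_reads": "File?", "reverse_reads": "File?",
    "qc_min_length": "int", "stats_file_name": "string",
    "ssu_db": "File", "lsu_db": "File",
    "ssu_tax": "string", "lsu_tax": "string",
    "ssu_otus": "string", "lsu_otus": "string",
    "rfam_models": "string[]", "rfam_model_clans": "string",
    "ssu_label": "string", "lsu_label": "string",
    "5s_pattern": "string", "5.8s_pattern": "string",
    "unite_db": "File", "unite_tax": "string",
    "unite_otu_file": "string", "unite_label": "string",
    "itsonedb": "File", "itsonedb_tax": "string",
    "itsonedb_otu_file": "string", "itsonedb_label": "string",
}


def cwl_file(path):
    """Return a CWL File object for a local path."""
    return {"class": "File", "path": path}


def placeholder_values():
    """Fill every required input with an obviously fake value (hypothetical)."""
    values = {}
    for name, cwl_type in INPUT_TYPES.items():
        if cwl_type.endswith("?"):
            continue  # optional inputs are left for the caller to set
        if cwl_type == "int":
            values[name] = 100
        elif cwl_type == "string[]":
            values[name] = [f"PLACEHOLDER_{name}"]
        elif cwl_type == "File":
            values[name] = f"/path/to/{name}"
        else:
            values[name] = f"PLACEHOLDER_{name}"
    return values


def build_job(values):
    """Check `values` against INPUT_TYPES and return a CWL job dictionary."""
    unknown = set(values) - set(INPUT_TYPES)
    if unknown:
        raise ValueError(f"unknown inputs: {sorted(unknown)}")
    job = {}
    for name, cwl_type in INPUT_TYPES.items():
        optional = cwl_type.endswith("?")
        base = cwl_type.rstrip("?")
        if name not in values:
            if not optional:
                raise ValueError(f"missing required input: {name}")
            continue
        value = values[name]
        if base == "File":
            job[name] = cwl_file(value)
        elif base == "int":
            job[name] = int(value)
        elif base == "string[]":
            job[name] = [str(item) for item in value]
        else:
            job[name] = str(value)
    # Assumption (not stated on this page): reads are single-end OR a pair.
    single = "single_reads" in job
    fwd = "forward_reads" in job
    rev = "reverse_reads" in job
    if not ((single and not fwd and not rev) or (not single and fwd and rev)):
        raise ValueError(
            "provide either single_reads or both forward_reads and reverse_reads"
        )
    return job


if __name__ == "__main__":
    example = placeholder_values()
    example["single_reads"] = "/path/to/reads.fastq.gz"  # hypothetical path
    print(json.dumps(build_job(example), indent=2))
```

The printed JSON could be saved to a file and passed to a CWL runner as the job file; the placeholder values would need to be replaced with real reference databases and parameters first.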

Steps

ID               Name  Description
before-qc        n/a   n/a
after-qc         n/a   n/a
touch_file_flag  n/a   n/a

Outputs

ID                              Name  Description  Type
qc-statistics                   n/a   n/a          Directory
qc_summary                      n/a   n/a          File
qc-status                       n/a   n/a          File
hashsum_paired                  n/a   n/a          File[]?
hashsum_single                  n/a   n/a          File?
fastp_filtering_json_report     n/a   n/a          File?
gz_files                        n/a   n/a          File[]
sequence-categorisation_folder  n/a   n/a          Directory?
taxonomy-summary_folder         n/a   n/a          Directory?
rna-count                       n/a   n/a          File?
ITS-length                      n/a   n/a          File?
suppressed_upload               n/a   n/a          Directory?
completed_flag_file             n/a   n/a          File?
no_tax_flag_file                n/a   n/a          File?
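
The output types above use CWL's type shorthand, in which a trailing `?` marks a value that may be absent and a trailing `[]` marks an array. The sketch below shows how that shorthand can be read in Python. `expand_type` and `required_outputs` are hypothetical helpers written for illustration, not functions from any CWL library, and the position of `"null"` within the expanded union is a choice made here (the order of a union's members does not change its meaning).

```python
# Output IDs and CWL types copied from the Outputs list on this page.
OUTPUT_TYPES = {
    "qc-statistics": "Directory",
    "qc_summary": "File",
    "qc-status": "File",
    "hashsum_paired": "File[]?",
    "hashsum_single": "File?",
    "fastp_filtering_json_report": "File?",
    "gz_files": "File[]",
    "sequence-categorisation_folder": "Directory?",
    "taxonomy-summary_folder": "Directory?",
    "rna-count": "File?",
    "ITS-length": "File?",
    "suppressed_upload": "Directory?",
    "completed_flag_file": "File?",
    "no_tax_flag_file": "File?",
}


def expand_type(shorthand):
    """Expand CWL type shorthand such as "File[]?" into its long form."""
    optional = shorthand.endswith("?")
    if optional:
        shorthand = shorthand[:-1]  # strip the optional marker
    if shorthand.endswith("[]"):
        expanded = {"type": "array", "items": shorthand[:-2]}
    else:
        expanded = shorthand
    # An optional type is a union of null and the underlying type.
    return ["null", expanded] if optional else expanded


def required_outputs(types):
    """Return the IDs whose type is not optional, in listing order."""
    return [name for name, cwl_type in types.items() if not cwl_type.endswith("?")]


if __name__ == "__main__":
    for name, cwl_type in OUTPUT_TYPES.items():
        print(name, "->", expand_type(cwl_type))
    print("always produced:", required_outputs(OUTPUT_TYPES))
```

Read this way, a script that consumes the pipeline's results should expect only the non-optional outputs to be present on every run and should check for the others before using them.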

Version History

v5.0.7 (earliest) Created 7th Jun 2022 at 09:28 by Martin Beracochea

Fix collect_scripts.py


Frozen v5.0.7 981aafc
Creators and Submitter
Creators
Not specified
Additional credit

Alex L Mitchell, Alexandre Almeida, Martin Beracochea, Miguel Boland, Josephine Burgin, Guy Cochrane, Michael R Crusoe, Varsha Kale, Simon C Potter, Lorna J Richardson, Ekaterina Sakharova, Maxim Scheremetjew, Anton Korobeynikov, Alex Shlemov, Olga Kunyavskaya, Alla Lapidus, Robert D Finn



Total size: 367 MB