MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline with multiple analysis pipelines that are tailored according to the input data, and that are formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.
Documentation: https://docs.mgnify.org/en/latest/analysis.html#assembly-analysis-pipeline
Click and drag the diagram to pan, double click or use the controls to zoom.
Inputs
ID | Name | Description | Type |
---|---|---|---|
contigs | n/a | n/a |
|
contig_min_length | n/a | n/a |
|
ssu_db | n/a | n/a |
|
lsu_db | n/a | n/a |
|
ssu_tax | n/a | n/a |
|
lsu_tax | n/a | n/a |
|
ssu_otus | n/a | n/a |
|
lsu_otus | n/a | n/a |
|
rfam_models | n/a | n/a |
|
rfam_model_clans | n/a | n/a |
|
other_ncrna_models | n/a | n/a |
|
ssu_label | n/a | n/a |
|
lsu_label | n/a | n/a |
|
5s_pattern | n/a | n/a |
|
5.8s_pattern | n/a | n/a |
|
CGC_config | n/a | n/a |
|
CGC_postfixes | n/a | n/a |
|
cgc_chunk_size | n/a | n/a |
|
protein_chunk_size_eggnog | n/a | n/a |
|
protein_chunk_size_hmm | n/a | n/a |
|
protein_chunk_size_IPS | n/a | n/a |
|
func_ann_names_ips | n/a | n/a |
|
func_ann_names_hmmer | n/a | n/a |
|
HMM_gathering_bit_score | n/a | n/a |
|
HMM_omit_alignment | n/a | n/a |
|
HMM_name_database | n/a | n/a |
|
hmmsearch_header | n/a | n/a |
|
EggNOG_db | n/a | n/a |
|
EggNOG_diamond_db | n/a | n/a |
|
EggNOG_data_dir | n/a | n/a |
|
InterProScan_databases | n/a | n/a |
|
InterProScan_applications | n/a | n/a |
|
InterProScan_outputFormat | n/a | n/a |
|
ips_header | n/a | n/a |
|
ko_file | n/a | n/a |
|
Uniref90_db_txt | n/a | n/a |
|
diamond_maxTargetSeqs | n/a | n/a |
|
diamond_databaseFile | n/a | n/a |
|
diamond_header | n/a | n/a |
|
go_config | n/a | n/a |
|
graphs | n/a | n/a |
|
pathways_names | n/a | n/a |
|
pathways_classes | n/a | n/a |
|
gp_flatfiles_path | n/a | n/a |
|
clusters_glossary | n/a | n/a |
|
Steps
ID | Name | Description |
---|---|---|
before-qc | n/a | n/a |
after-qc | n/a | n/a |
touch_file_flag | n/a | n/a |
touch_no_cds_flag | n/a | n/a |
Outputs
ID | Name | Description | Type |
---|---|---|---|
qc-status | n/a | n/a |
|
qc_summary | n/a | n/a |
|
hashsum_input | n/a | n/a |
|
qc-statistics_folder | n/a | n/a |
|
compressed_files | n/a | n/a |
|
index_fasta_file | n/a | n/a |
|
bgzip_index | n/a | n/a |
|
bgzip_fasta_file | n/a | n/a |
|
chunking_nucleotides | n/a | n/a |
|
chunking_proteins | n/a | n/a |
|
functional_annotation_folder | n/a | n/a |
|
stats | n/a | n/a |
|
pathways_systems_folder | n/a | n/a |
|
pathways_systems_folder_antismash | n/a | n/a |
|
pathways_systems_folder_antismash_summary | n/a | n/a |
|
sequence-categorisation_folder | n/a | n/a |
|
taxonomy-summary_folder | n/a | n/a |
|
rna-count | n/a | n/a |
|
completed_flag_file | n/a | n/a |
|
no_cds_flag_file | n/a | n/a |
|
no_tax_flag_file | n/a | n/a |
|
Version History
v5.0.7 (latest) Created 7th Jun 2022 at 09:03 by Martin Beracochea
Fix collect_scripts.py
Frozen
v5.0.7
981aafc
master @ 981aafc (earliest) Created 7th Jun 2022 at 08:41 by Martin Beracochea
Fix collect_scripts.py
Frozen
master
981aafc
Creators
Not specifiedAdditional credit
Alex L Mitchell, Alexandre Almeida, Martin Beracochea, Miguel Boland, Josephine Burgin, Guy Cochrane, Michael R Crusoe, Varsha Kale, Simon C Potter, Lorna J Richardson, Ekaterina Sakharova, Maxim Scheremetjew, Anton Korobeynikov, Alex Shlemov, Olga Kunyavskaya, Alla Lapidus, Robert D Finn
Submitter
Views: 3728 Downloads: 466
Created: 7th Jun 2022 at 08:41
Last updated: 7th Jun 2022 at 09:04
None