Taxonomy Assignment with QIIME2
This workflow performs taxonomic assignment for an input OTU/ASV (Operating Taxonomic Unit/Amplicon Sequence Variant) Feature Table and set of amplicon Representative Sequences using QIIME2. The workflow creates and trains a QIIME2 classifier for an input reference taxonomic database (e.g. HOMD, SILVA, PR2, UNITE) and uses the trained classifier to assign taxonomy to OTUs/ASVs in the input Feature Table.
Inputs
- Feature Table [
biom2]: OTU table with ASVs in rows and samples in columns containing ASV counts per sample (can be the output from previous QIIME2 or DADA2 processing) - Representative Sequences [
fasta]: File containing sequences for all ASVs detected in the samples (can be the output from previous QIIME2 or DADA2 processing) - Reference sequences [
fasta]: File containing sequences for the chosen reference taxonomy database - Reference taxonomy [
tabular]: File containing taxonomy assigned to sequences in the reference taxonomy database - Forward Primer Sequence [
text]: Forward primer sequence (5' -> 3'). Required for qiime2 feature-classifier extract-reads (input parameter:f_primer). Default value is the primer for the 16S rRNA V3-V4 region:CCTAYGGGRBGCASCAG. - Reverse Primer Sequence [
text]: Reverse primer sequence (5' -> 3'). Required for qiime2 feature-classifier extract-reads (input parameter:r_primer). Default value is the primer for the 16S rRNA V3-V4 region:GGACTACNNGGGTATCTAAT.
Outputs
- OTU/ASV table [
csv]: Modified version of input Feature Table with additional column with the assigned taxonomy and assignment confidence for each ASV as additional columns. - Feature Table Summary [
html]: QIIME2 visualisation of the input OTU/ASV Feature Tablebiom2dataset - Representative Sequences Summary [
html]: QIIME2 visualisation of the input Representative Sequencesfastadataset - Taxonomy Bar Plot [
qzv]: Interactive bar plot for OTU/ASV table with assigned taxonomy (this dataset cannot be viewed in Galaxy; to view this dataset, export it from Galaxy and use a dedicated QIIME2 viewer, e.g. https://view.qiime2.org/).
Processing Steps
- Convert all input datasets to QIIME2 artefacts using qiime2 tools import.
- Summarise the Feature Table and Representative Sequences datasets using qiime2 feature-table summarize and qiime2 feature-table tabulate-seqs, respectively, and extract
htmlvisualisations that can be viewed in Galaxy using QIIME vizualisation extractor. - Extract reference sequences (for the input reference taxonomy database) for training the feature classifier using qiime2 feature-classifier extract-reads.
- Train the QIIME2 Naive Bayes feature classifier on the reference database (extracted sequences and taxonomy) using qiime2 feature-classifier fit-classifier-naive-bayes.
- Run the classifier on the amplicon Representative Sequences to assign taxonomy to the sample ASVs using qiime2 feature-classifier classify-sklearn.
- Create a summary visualisation of the taxonomy assigned to the sample dataset using qiime2 taxa barplot.
- Export the assigned taxonomy table from a QIIME2 artefact to
tsvusing qiime2 tools export. - Convert the output
tsvtable totabularformat using Convert tabular to CSV and Convert CSV to tabular (requires intermediate conversion tocsv). - Convert the
biom2Feature Table totabularusing Convert between BIOM table formats. - Combine the ASV taxonomy assignment and Feature Table
tabulardatasets using Remove beginning and Join. - Convert the combined OTU/ASV table from
tabulartocsvformat using Convert tabular to CSV.
Sample Data
Sample input datasets for this workflow are available on Zenodo: https://doi.org/10.5281/zenodo.18263330
Contribution
Workflow created by Tristan Reynolds (provided Galaxy and bioinformatics expertise) and Amy Loughman (provided scientific requirements and expertise).
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| Feature Table (biom2) | #main/Feature Table (biom2) | Input feature table in biom2 format (e.g. as produced by "qiime2 dada2 denoise-paired"). |
|
| Forward Primer Sequence | #main/Forward Primer Sequence | Forward primer sequence (5' -> 3'). Required for "qiime2 feature-classifier extract-reads" (input parameter: f_primer). **Default value is the primer for the 16S rRNA V3-V4 region. |
|
| Reference Sequences (fasta) | #main/Reference Sequences (fasta) | Input reference database reference sequences in fasta format. |
|
| Reference Taxonomy (tabular) | #main/Reference Taxonomy (tabular) | Input reference database reference taxonomy in tabular format. |
|
| Representative Sequences (fasta) | #main/Representative Sequences (fasta) | Input representative sequences in fasta format (e.g. as produced by "qiime2 dada2 denoise-paired"). |
|
| Reverse Primer Sequence | #main/Reverse Primer Sequence | Reverse primer sequence (5' -> 3'). Required for "qiime2 feature-classifier extract-reads" (input parameter: r_primer). **Default value is the primer for the 16S rRNA V3-V4 region. |
|
Steps
| ID | Name | Description |
|---|---|---|
| 6 | qiime2 tools import | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2_core__tools__import/qiime2_core__tools__import/2023.5.0+dist.h193f7cc9.3 |
| 7 | Convert | toolshed.g2.bx.psu.edu/repos/iuc/biom_convert/biom_convert/2.1.15+galaxy1 |
| 8 | qiime2 tools import | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2_core__tools__import/qiime2_core__tools__import/2023.5.0+dist.h193f7cc9.3 |
| 9 | qiime2 tools import | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2_core__tools__import/qiime2_core__tools__import/2023.5.0+dist.h193f7cc9.3 |
| 10 | qiime2 tools import | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2_core__tools__import/qiime2_core__tools__import/2023.5.0+dist.h193f7cc9.3 |
| 11 | qiime2 feature-table summarize | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__feature_table__summarize/qiime2__feature_table__summarize/2023.5.0+q2galaxy.2023.5.0.2 |
| 12 | Remove beginning | Remove beginning1 |
| 13 | qiime2 feature-table tabulate-seqs | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__feature_table__tabulate_seqs/qiime2__feature_table__tabulate_seqs/2023.5.0+q2galaxy.2023.5.0.2 |
| 14 | qiime2 feature-classifier extract-reads | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__feature_classifier__extract_reads/qiime2__feature_classifier__extract_reads/2023.5.0+q2galaxy.2023.5.0.2 |
| 15 | Extract Feature Table Summary Visual | toolshed.g2.bx.psu.edu/repos/iuc/qiime_extract_viz/qiime_extract_viz/0.1.0+galaxy0 |
| 16 | Extract Representative Sequences Summary Visual | toolshed.g2.bx.psu.edu/repos/iuc/qiime_extract_viz/qiime_extract_viz/0.1.0+galaxy0 |
| 17 | qiime2 feature-classifier fit-classifier-naive-bayes | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__feature_classifier__fit_classifier_naive_bayes/qiime2__feature_classifier__fit_classifier_naive_bayes/2023.5.0+q2galaxy.2023.5.0.2 |
| 18 | qiime2 feature-classifier classify-sklearn | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__feature_classifier__classify_sklearn/qiime2__feature_classifier__classify_sklearn/2023.5.0+q2galaxy.2023.5.0.2 |
| 19 | qiime2 tools export | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2_core__tools__export/qiime2_core__tools__export/2023.5.0+dist.h193f7cc9.2 |
| 20 | qiime2 taxa barplot | toolshed.g2.bx.psu.edu/repos/q2d2/qiime2__taxa__barplot/qiime2__taxa__barplot/2023.5.0+q2galaxy.2023.5.0.2 |
| 21 | Convert tabular to CSV | tabular_to_csv |
| 22 | Convert CSV to tabular | csv_to_tabular |
| 23 | Join | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.5+galaxy2 |
| 24 | Convert tabular to CSV | tabular_to_csv |
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| Combined OTU Table | #main/Combined OTU Table | n/a |
|
| Feature Table Summary | #main/Feature Table Summary | n/a |
|
| Representative Sequences Summary | #main/Representative Sequences Summary | n/a |
|
| Visualise Taxa Bar Plot | #main/Visualise Taxa Bar Plot | n/a |
|
Version History
Version 1 (earliest) Created 2nd Mar 2026 at 00:01 by Tristan Reynolds
Initial commit
Frozen
Version-1
d3f1594
Creators and SubmitterCreators
Submitter
Views: 491 Downloads: 72 Runs: 1
Created: 2nd Mar 2026 at 00:01
AttributionsNone
Run on Galaxy
https://orcid.org/0000-0002-6606-5953