WGS2Protein v3.0.0
Version 1

Workflow Type: Galaxy
Stable

This workflow extracts protein-coding sequences from whole genome sequencing (WGS) data obtained from the European Nucleotide Archive (ENA). It automates the preprocessing, annotation, and selection of relevant protein sequences using tools such as Prokka, FASTA-to-Tabular, and pattern-based selection. The resulting dataset supports downstream analyses including comparative genomics, phylogenetics, and functional annotation.

Steps

ID Name Description
2 FastQC toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/0.74+galaxy1
3 Trimmomatic toolshed.g2.bx.psu.edu/repos/pjbriggs/trimmomatic/trimmomatic/0.39+galaxy2
4 FastQC toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/0.74+galaxy1
5 FastQC toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/0.74+galaxy1
6 Shovill toolshed.g2.bx.psu.edu/repos/iuc/shovill/shovill/1.1.0+galaxy2
7 FastQC toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/0.74+galaxy1
8 Prokka toolshed.g2.bx.psu.edu/repos/crs4/prokka/prokka/1.14.6+galaxy1
9 FASTA-to-Tabular toolshed.g2.bx.psu.edu/repos/devteam/fasta_to_tabular/fasta2tab/1.1.1
10 Select Grep1

Version History

Version 1 (earliest) Created 30th Jun 2025 at 10:26 by Crist John Pastor

Initial commit


Open master 8a0242c
help Creators and Submitter
Creator
  • Crist John M. Pastor
Submitter
Activity

Views: 13   Downloads: 4   Runs: 1

Created: 30th Jun 2025 at 10:26

help Tags

This item has not yet been tagged.

help Attributions

None

Total size: 1.8 MB
Powered by
(v.1.17.0-main)
Copyright © 2008 - 2025 The University of Manchester and HITS gGmbH