Research Object Crate for WGS2Protein v3.0.0

Original URL: https://workflowhub.eu/workflows/1767/ro_crate?version=1

This workflow extracts protein-coding sequences from whole genome sequencing (WGS) data obtained from the European Nucleotide Archive (ENA). It automates the preprocessing, annotation, and selection of relevant protein sequences, using Prokka, FASTA-to-Tabular, and a pattern-based selection step. The resulting dataset supports downstream analyses such as comparative genomics, phylogenetics, and functional annotation.
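
The sketch below illustrates, in plain Python, what the FASTA-to-tabular conversion and pattern-based selection stages described above do conceptually. It is not the workflow's own implementation: the function names `fasta_to_tabular` and `select_by_pattern`, the sample records, and the search pattern `gyrase` are all hypothetical and chosen only for illustration. It assumes the annotation step has already produced a protein FASTA file whose header lines carry a product description.

```python
import re
from typing import Iterable, Iterator, List, Tuple


def fasta_to_tabular(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield one (header, sequence) row per FASTA record."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines may be wrapped; collect and join them later.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def select_by_pattern(rows: Iterable[Tuple[str, str]], pattern: str) -> List[Tuple[str, str]]:
    """Keep rows whose header matches the regular expression (case-insensitive)."""
    regex = re.compile(pattern, re.IGNORECASE)
    return [(header, seq) for header, seq in rows if regex.search(header)]


# Made-up records in the style of an annotated protein FASTA file (illustrative only).
example_faa = """\
>ABC_00001 DNA gyrase subunit A
MSDLAREITPVNIEEELKSSYLDYAMSVIVGR
ALPDVRDGLKPVHRRVLYAM
>ABC_00002 hypothetical protein
MKKLLPTAAAGLLLLAAQPAMA
>ABC_00003 DNA gyrase subunit B
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDT
"""

table = list(fasta_to_tabular(example_faa.splitlines()))
selected = select_by_pattern(table, r"gyrase")

# Print the selected rows as tab-separated text, one record per line.
for header, sequence in selected:
    print(f"{header}\t{sequence}")
```

Converting each record to a single row first makes the selection a simple line-wise filter, which is presumably why a tabular conversion sits before the pattern-matching stage.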

Author
License
CC-BY-4.0

Contents

Main Workflow: WGS2Protein v3.0.0
Size: 28963 bytes
Main Workflow Diagram: WGS2Protein%20v.3.0.0.png
Size: 1857150 bytes