Research Object Crate for preparing genomic data for phylogeny recostruction (GTN)

Original URL: https://workflowhub.eu/workflows/358/ro_crate?version=1

This workflow begins from a set of genome assemblies of different samples, strains, species. The genome is first annotated with Funnanotate. Predicted proteins are furtner annotated with Busco. Next, 'ProteinOrtho' finds orthologs across the samples and makes orthogroups. Orthogroups where all samples are represented are extracted. Orthologs in each orthogroup are aligned with ClustalW. Test dataset: https://zenodo.org/record/6610704#.Ypn3FzlBw5k

Author
Miguel Roncoroni
License
CC-BY-4.0

Contents