IWC - Intergalactic Workflow Commission
Space: This Team is not associated with a Space
SEEK ID: https://workflowhub.eu/projects/33
Public web page: https://github.com/galaxyproject/iwc
Organisms: No Organisms specified
WorkflowHub PALs: No PALs for this Team
Team created: 12th Mar 2021
Related items
This workflow uses the decoupler tool in Galaxy to generate pseudobulk counts from an annotated AnnData file obtained from scRNA-seq analysis. Following the pseudobulk step, differential expression genes (DEG) are calculated using the edgeR tool. The workflow also includes data sanitation steps to ensure smooth operation of edgeR and minimizing potential issues. Additionally, a Volcano plot tool is used to visualize the results after the DEG analysis.
Type: Galaxy
Creators: Diana Chiang Jurado, Pavankumar Videm, Pablo Moreno
Submitter: WorkflowHub Bot
This workflow can only work on an experimental setup with exactly 2 conditions. It takes two collections of count tables as input and performs differential expression analysis. Additionally it filters for DE genes based on adjusted p-value and log2 fold changes thresholds. It also generates informative plots.
This workflow is used for GO and KEGG enrichment analysis using GOseq tools.
Workflow for variant analysis against a reference genome in GenBank format
Find and annotate variants in ampliconic SARS-CoV-2 Illumina sequencing data and classify samples with pangolin and nextclade
This workflow takes a VCF dataset of variants produced by any of the *-variant-calling workflows in https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling and generates tabular lists of variants by Samples and by Variant, and an overview plot of variants and their allele-frequencies.
COVID-19: variation analysis on ARTIC PE data
The workflow for Illumina-sequenced ampliconic data builds on the RNASeq workflow for paired-end data using the same steps for mapping and variant calling, but adds extra logic for trimming amplicon primer sequences off reads with the ivar package. In addition, this workflow uses ivar also to identify amplicons affected by primer-binding site mutations and, if possible, excludes reads derived from such ...
Assembly of bacterial paired-end short read data with generation of quality metrics and reports
Type: Galaxy
Creators: Abromics , Pierre Marin, Clea Siguret, abromics-consortium
Submitter: WorkflowHub Bot
Short paired-end read analysis to provide quality analysis, read cleaning and taxonomy assignation
Type: Galaxy
Creators: ABRomics , Pierre Marin, Clea Siguret, abromics-consortium
Submitter: WorkflowHub Bot
Purge contigs marked as duplicates by purge_dups (could be haplotypic duplication or overlap duplication). This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5)
Scaffolding with Bionano
Scaffolding using Bionano optical map data
Inputs
- Bionano data [cmap]
- Estimated genome size [txt]
- Phased assembly generated by Hifiasm [gfa1]
Outputs
- Scaffolds
- Non-scaffolded contigs
- QC: Assembly statistics
- QC: Nx plot
- QC: Size plot
Purge Duplicate Contigs
Purge contigs marked as duplicates by purge_dups in a single haplotype(could be haplotypic duplication or overlap duplication) This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5)
Inputs
- Genomescope model parameters [txt] (Generated by the k-mer profiling workflow)
- Hifi long reads - trimmed [fastq] (Generated by Cutadapt in the contigging workflow)
- Assembly to purge (e.g. hap1) ...
Contiging Solo:
Generate assembly based on PacBio Hifi Reads.
Inputs
- Hifi long reads [fastq]
- K-mer database [meryldb]
- Genome profile summary generated by Genomescope [txt]
- Homozygous Read Coverage. Optional, use if you think the estimation from Genomescope is inacurate.
- Genomescope Model Parameters generated by Genomescope [tabular]
- Database for busco lineage (recommended: latest)
- Busco lineage (recommended: vertebrata)
- Name of first assembly
- Name of second ...
Assembly with Hifi reads and Trio Data
Generate phased assembly based on PacBio Hifi Reads using parental Illumina data for phasing
Inputs
- Hifi long reads [fastq]
- Concatenated Illumina reads : Paternal [fastq]
- Concatenated Illumina reads : Maternal [fastq]
- K-mer database [meryldb]
- Paternal hapmer database [meryldb]
- Maternal hapmer database [meryldb]
- Genome profile summary generated by Genomescope [txt]
- Genome model parameters generated by Genomescope [tabular]
...
Create Meryl Database used for the estimation of assembly parameters and quality control with Merqury. Part of the VGP pipeline.
VGP Workflow #1
This workflow produces a Meryl database and Genomescope outputs that will be used to determine parameters for following workflows, and assess the quality of genome assemblies. Specifically, it provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality.
Inputs
- A collection of Hifi long reads in FASTQ format
- k-mer length
- Ploidy
Outputs
- Meryl Database of kmer counts
...
This workflow
- Reconstruct phylogeny (insert fragments in a reference)
- Alpha rarefaction analysis
- Taxonomic analysis
Type: Galaxy
Creators: Debjyoti Ghosh, Helmholtz-Zentrum für Umweltforschung - UFZ
Submitter: WorkflowHub Bot