SEEK ID: https://workflowhub.eu/people/112
Location: Not specified
ORCID: Not specified
Joined: 12th Mar 2021
Expertise: Not specified
Tools: Not specified
Related items
Biodiversity Genomics Europe, funded by Horizon Europe call HORIZON-CL6-2021-BIODIV-01-01, aims at aligning the resources and research agendas of both DNA barcoding and reference genome generation, thus opening the door for a true quantum leap in biodiversity genomics research in Europe.
Despite ground-breaking developments in both DNA barcoding and full genome sequencing, there remains a critical need to develop and strengthen functioning communities of practice ...
Teams: Vertebrate Genomes Pipelines in Galaxy
Web page: https://biodiversitygenomics.eu/
The Vertebrate Genomes Pipelines in Galaxy are intended to allow a user to generate high-quality near error-free assemblies of species from a user's own data or from the GenomeArk database
Space: Biodiversity Genomics Europe (BGE)
Public web page: https://galaxyproject.org/projects/vgp/workflows/
Organisms: Not specified
IWC - Intergalactic Workflow Commission
Space: This Team is not associated with a Space
Public web page: https://github.com/galaxyproject/iwc
Organisms: Not specified
A workflow for the analysis of pox virus genomes sequenced as half-genomes (for ITR resolution) in a tiled-amplicon approach
Virtual screening of the SARS-CoV-2 main protease with rDock and pose scoring
This workflow take as input a collection of paired fastq. Remove adapters with cutadapt, map pairs with bowtie2 allowing dovetail. Keep MAPQ30 and concordant pairs. BAM to BED. MACS2 with "ATAC" parameters.
This workflow take as input a collection of paired fastq. It uses HiCUP to go from fastq to validPair file. The pairs are filtered for MAPQ and sorted by cooler to generate a tabix dataset. Cooler is used to generate a balanced cool file to the desired resolution.
This workflow take as input a collection of paired fastq. It will remove bad quality and adapters with cutadapt. Map with Bowtie2 end-to-end. Will remove reads on MT and unconcordant pairs and pairs with mapping quality below 30 and PCR duplicates. Will compute the pile-up on 5' +- 100bp. Will call peaks and count the number of reads falling in the 1kb region centered on the summit. Will plot the number of reads for each fragment length.
This workflow takes as input a list of single-read fastqs. Adapters and bad quality bases are removed with cutadapt. Reads are mapped with STAR with ENCODE parameters and genes are counted simultaneously. The counts are reprocess to be similar to HTSeq-count output. FPKM are computed with cufflinks. Coverage (per million mapped reads) are computed with bedtools on uniquely mapped reads.
This workflow takes as input a list of paired-end fastqs. Adapters and bad quality bases are removed with cutadapt. Reads are mapped with STAR with ENCODE parameters and genes are counted simultaneously. The counts are reprocess to be similar to HTSeq-count output. FPKM are computed with cufflinks. Coverage (per million mapped reads) are computed with bedtools on uniquely mapped reads (with R2 orientation inverted).
ChIP-seq paired-end Workflow
Inputs dataset
- The workflow needs a single input which is a list of dataset pairs of fastqsanger.
Inputs values
- adapters sequences: this depends on the library preparation. If you don't know, use FastQC to determine if it is Truseq or Nextera.
- reference_genome: this field will be adapted to the genomes available for bowtie2.
- effective_genome_size: this is used by MACS2 and may be entered manually (indications are provided for heavily used genomes).
...
ChIP-seq single-read Workflow
Inputs dataset
- The workflow needs a single input which is a list of fastqsanger files.
Inputs values
- adapters sequence_forward: this depends on the library preparation. If you don't know, use FastQC to determine if it is Truseq or Nextera.
- reference_genome: this field will be adapted to the genomes available for bowtie2.
- effective_genome_size: this is used by MACS2 and may be entered manually (indications are provided for heavily used genomes).
...
COVID-19: consensus construction
This workflow aims at generating reliable consensus sequences from variant calls according to transparent criteria that capture at least some of the complexity of variant calling.
It takes a collection of VCFs (with DP and DP4 INFO fields) and a collection of the corresponding aligned reads (for the purpose of calculating genome-wide coverage) such as produced by any of the variant calling workflows in ...
COVID-19: variation analysis reporting
This workflow takes VCF datasets of variants produced by any of the "*-variant-calling" workflows in https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling and generates tabular reports of variants by samples and by variant, along with an overview plot of variants and their allele-frequencies across all samples.
ChIP-seq paired-end Workflow
Inputs dataset
- The workflow needs a single input which is a list of dataset pairs of fastqsanger.
Inputs values
- adapters sequences: this depends on the library preparation. If you don't know, use FastQC to determine if it is Truseq or Nextera.
- reference_genome: this field will be adapted to the genomes available for bowtie2.
- effective_genome_size: this is used by MACS2 and may be entered manually (indications are provided for heavily used genomes).
...
COVID-19: variation analysis on ARTIC PE data
The workflow for Illumina-sequenced ampliconic data builds on the RNASeq workflow for paired-end data using the same steps for mapping and variant calling, but adds extra logic for trimming amplicon primer sequences off reads with the ivar package. In addition, this workflow uses ivar also to identify amplicons affected by primer-binding site mutations and, if possible, excludes reads derived from such ...
VGP Workflow #1
This workflow produces a Meryl database and Genomescope outputs that will be used to determine parameters for following workflows, and assess the quality of genome assemblies. Specifically, it provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality.
Inputs
- Collection of Hifi long reads in FASTQ format
Outputs
- Meryl Database of kmer counts
- GenomeScope
- Linear plot
...
VGP Workflow #1
This workflow collects the metrics on the properties of the genome under consideration by analyzing the k-mer frequencies. It provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality. It uses reads from two parental genomes to partition long reads from the offspring into haplotype-specific k-mer databases.
Inputs
- Collection of Hifi long reads in FASTQ format
- Paternal short-read ...
Generic variation analysis on WGS PE data
This workflows performs paired end read mapping with bwa-mem followed by sensitive variant calling across a wide range of AFs with lofreq and variant annotation with snpEff. The reference genome can be provided as a GenBank file.
Parallel Accession Download
Downloads fastq files for sequencing run accessions provided in a text file using fasterq-dump. Creates one job per listed run accession, and is therefore much faster and more robust to errors when many accessions need to be downloaded.
GROMACS dcTMD free energy calculation
Perform an ensemble of targeted MD simulations of a user-specified size using the GROMACS PULL code and calculate dcTMD free energy and friction profiles for the resulting dissocation pathway. Note that pathway separation is not performed by the workflow; the user is responsible for checking the ensemble themselves.
The input protein (PDB) and ligand (SDF) files provided are parameterized by the 'Protein-ligand complex parameterization' subworkflow.
Note ...
Protein-ligand complex parameterization
Parameterizes an input protein (PDB) and ligand (SDF) file prior to molecular dynamics simulation with GROMACS.
This is a simple workflow intended for use as a subworkflow in more complex MD workflows. It is used as a subworkflow by the GROMACS MMGBSA and dcTMD workflows.
GROMACS MMGBSA free energy calculation
Perform an ensemble of MD simulations of a user-specified size using GROMACS, and calculate MMGBSA free energies using AmberTools. An ensemble average is calculated and returned to the user as the final input.
The input protein (PDB) and ligand (SDF) files provided are parameterized by the 'Protein-ligand complex parameterization' subworkflow.