Workflows
What is a Workflow?Filters
This workflow is composed with the XCMS tool R package (Smith, C.A. 2006) able to extract and the metaMS R package (Wehrens, R 2014) for the field of untargeted metabolomics.
ANNOTATO - Annotation workflow To Annotate Them Oll
Name: Increment Contact Person: support-compss@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs
Description
Increment is an application that takes three different values and increases them a number of given times.
The purpose of this application is to show parallelism between the different increments.
Execution instructions
Usage:
runcompss --lang=python src/increment.py N initValue1 initValue2 initValue3
where:
- N: Number of times to increase ...
Type: COMPSs
Creators: Javier Conejero, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Raül Sirvent
MMGBSA simulation and calculation
Contiging Solo w/HiC:
Generate phased assembly based on PacBio Hifi Reads using HiC data from the same individual for phasing.
Inputs
- Hifi long reads [fastq]
- HiC forward reads (if multiple input files, concatenated in same order as reverse reads) [fastq]
- HiC reverse reads (if multiple input files, concatenated in same order as forward reads) [fastq]
- K-mer database [meryldb]
- Genome profile summary generated by Genomescope [txt]
- Name of first assembly
- Name of second ...
Scaffolding using HiC data with YAHS.
Purge contigs marked as duplicates by purge_dups (could be haplotypic duplication or overlap duplication). This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5)
VGP Workflow #1
This workflow produces a Meryl database and Genomescope outputs that will be used to determine parameters for following workflows, and assess the quality of genome assemblies. Specifically, it provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality.
Inputs
- A collection of Hifi long reads in FASTQ format
- k-mer length
- Ploidy
Outputs
- Meryl Database of kmer counts
...
Create Meryl Database used for the estimation of assembly parameters and quality control with Merqury. Part of the VGP pipeline.
Assembly with Hifi reads and Trio Data
Generate phased assembly based on PacBio Hifi Reads using parental Illumina data for phasing
Inputs
- Hifi long reads [fastq]
- Concatenated Illumina reads : Paternal [fastq]
- Concatenated Illumina reads : Maternal [fastq]
- K-mer database [meryldb]
- Paternal hapmer database [meryldb]
- Maternal hapmer database [meryldb]
- Genome profile summary generated by Genomescope [txt]
- Bloom Filter
- Name of first haplotype
- Name of second haplotype ...