Workflows
What is a Workflow?Filters
Automated inference of stable isotope incorporation rates in proteins for functional metaproteomics
Summary
HPPIDiscovery is a scientific workflow to augment, predict and perform an insilico curation of host-pathogen Protein-Protein Interactions (PPIs) using graph theory to build new candidate ppis and machine learning to predict and evaluate them by combining multiple PPI detection methods of proteins according to three categories: structural, based on primary aminoacid sequence and functional annotations.
HPPIDiscovery contains three main steps: (i) acquirement of pathogen and host proteins ...
This Galaxy workflow takes a list of tumor/normal sample pair variants in VCF format and
- annotates them using the ENSEMBL Variant Effect Predictor and custom annotation data
- turns the annotated VCF into a MAF file for import into cBioPortal
- generates human-readable variant- and gene-centric reports
The input VCF is expected to encode somatic status, somatic p-value and germline p-value of each variant in varscan somatic format, i.e., via SS, SPV and GPV INFO keys, respectively.
The workflow takes a trimmed HiFi reads collection, Forward/Reverse HiC reads, and the max coverage depth (calculated from WF1) to run Hifiasm in HiC phasing mode. It produces both Pri/Alt and Hap1/Hap2 assemblies, and runs all the QC analysis (gfastats, BUSCO, and Merqury). The default Hifiasm purge level is Light (l1).
The ultimate-level complexity workflow is one among a collection of workflows designed to address tasks up to CTF estimation. In addition to the functionalities provided by layer 0 and 1 workflows, this workflow aims to enhance the quality of both acquisition images and processing.
Quality control protocols
…
Combination of methods
- CTF consensus
- New methods to compare ctf estimations
- CTF xmipp criteria (richer parameters i.e. ice detection)
Advantages:
- Control of ...
PAIRED-END workflow. Align reads on fasta reference/assembly using bwa mem, get a consensus, variants, mutation explanations.
IMPORTANT:
- For "bcftools call" consensus step, the --ploidy file is in "Données partagées" (Shared Data) and must be imported in your history to use the worflow by providing this file (tells bcftools to consider haploid variant calling).
- SELECT THE MOST ADAPTED VADR MODEL for annotation (see vadr parameters).
SINGLE-END workflow. Align reads on fasta reference/assembly using bwa mem, get a consensus, variants, mutation explanations.
IMPORTANT:
- For "bcftools call" consensus step, the --ploidy file is in "Données partagées" (Shared Data) and must be imported in your history to use the worflow by providing this file (tells bcftools to consider haploid variant calling).
- SELECT the mot ADAPTED VADR MODEL for annotation (see vadr parameters).
We assume the identifiers of the input list are like: sample_name_replicateID. The identifiers of the output list will be: sample_name
This repository contains the python code to reproduce the experiments in Dłotko, Gurnari "Euler Characteristic Curves and Profiles: a stable shape invariant for big data problems"
RepeatMasking Workflow
This workflow uses RepeatModeler and RepeatMasker for genome analysis.
-
RepeatModeler is a software package for identifying and modeling de novo families of transposable elements (TEs). At the heart of RepeatModeler are three de novo repeat search programs (RECON, RepeatScout and LtrHarvest/Ltr_retriever) which use complementary computational methods to identify repeat element boundaries and family relationships from sequence data.
-
RepeatMasker is a program that analyzes ...