Workflows
What is a Workflow?Filters
PISAD - Phsaed Intraspecies Sample Anomalies Detection tool
Summary
We developed PISAD, a tool designed to detect anomalies in cohort samples without requiring reference information. It is primarily divided into two stages. Stage 1: We select low-error data from the cohort and conduct reference-free SNP calling to construct a variant sketch. Stage 2: By comparing the k-mer counts of other cohort data to the variant sketch, we infer the relationships between the sample and other samples to ...
REFLOW is a workflow manager tool designed to streamline and automate tasks related to renewable energy potential analyses. It is built with Luigi and provides an automated, robust framework for data acquisition, processing, land/sea eligibility analysis, technology placements, simulations and visualizations. It is build with transparency and reproducibility in mind.
FAIR Statistics Aggregator for DOIs
Table of Contents
Introduction
This repository hosts a prototype tool designed to analyze and aggregate FAIR (Findable, Accessible, Interoperable, and Reusable) statistics for a list ...
RDM_system_connector
WARNING
This is a proof of concept, it has not been decided whether it will be developed into a fully functional tool. Feedback is therefore essential, especially as it is unclear whether this type of tool is useful at all, and if so, which parts, as the concept consists of many different parts. (source code readme:
- installation guide and short description
- [sphinx code ...
The workflow starts with selecting KLF4 as the search term. Gene sets with set labels containing KLF4 were queried from Enrichr[1]. Identified matching terms from the ENCODE TF ChIP-seq 2015[2] library were assembled into a collection of gene sets. A GMT was extracted from the Enrichr results for ENCODE_TF_ChIP-seq_2015. Identified matching terms from the ChEA 2022[4] library were assembled into a collection of gene sets. A GMT was extracted from the Enrichr results for ChEA_2022. Identified ...
The workflow starts with selecting Autophagy as the search term. Gene sets with set labels containing Autophagy were queried from Enrichr[1]. Identified matching terms from the MGI Mammalian Phenotype Level 4 2019[2] library were assembled into a collection of gene sets. A GMT was extracted from the Enrichr results for MGI_Mammalian_Phenotype_Level_4_2019. All the identified gene sets were combined using the union set operation. Reversers and mimickers from over 1 million signatures were identified ...
The workflow starts with selecting chr10:g.3823823G>A as the search term. The closest gene to the variant was found using MyVariant.info[1]. RNA-seq-like LINCS L1000 Signatures[3] which mimick or reverse the the expression of KLF6 were visualized. Median expression of KLF6 was obtained from the GTEx Portal[8] using the portal's API. To visualize the scored tissues, a vertical bar plot was created Fig..
- Lelong, S. et al. BioThings SDK: a toolkit for building high-performance data APIs in ...
Description
The Settlement Delineation and Analysis (SDA) workflows generates a settlement network from geospatial settlement data. It can process geotiff and shapefile inputs and was originally designed to operate on the World Settlement Footprint dataset. Through multiple workflow stages, a settlement network is constructed, contracted (i.e. clustered) and ultimately analysed with centrality measures. The output shapefile stores the ...
Workflow to download and prepare TCGA data.
The workflow divides the process of generating Gene Regulatory networks from TCGA cancer data in three steps:
- Downloading the raw data from GDC and saving the rds/tables needed later
- Preparing the data. This step includes filtering the data, normalizing it...
- Analysis of gene regulatory networks
PVGA is a powerful virus-focused assembler that does both assembly and polishing. For virus genomes, small changes will lead to significant differences in terms of viral function and pathogenicity. Thus, for virus-focused assemblers, high-accuracy results are crucial. Our approach heavily depends on the input reads as evidence to produce the reported genome. It first adopts a reference genome to start with. We then align all the reads against the reference genome to get an alignment graph. After ...