Workflow Type: Galaxy

Assembly with Hifi reads and Trio Data

Generate phased assembly based on PacBio Hifi Reads using parental Illumina data for phasing

Inputs

  1. Hifi long reads [fastq]
  2. Concatenated Illumina reads : Paternal [fastq]
  3. Concatenated Illumina reads : Maternal [fastq]
  4. K-mer database [meryldb]
  5. Paternal hapmer database [meryldb]
  6. Maternal hapmer database [meryldb]
  7. Genome profile summary generated by Genomescope [txt]
  8. Genome model parameters generated by Genomescope [tabular]
  9. Homozygous read coverage (Estimated from the Genomescope model if not provided)
  10. Lineage of the species being assembled
  11. Bloom Filter
  12. Name of first haplotype
  13. Name of second haplotype

Outputs

  1. Haplotype 1 assembly
  2. Haplotype 2 assembly
  3. QC: BUSCO report for both assemblies
  4. Merqury report for both assemblies
  5. Assembly statistics for both assemblies
  6. Nx Plot for both assemblies
  7. Size plot for both assemblies

Inputs

ID Name Description Type
Bits for bloom filter Bits for bloom filter Defaults to 37 if not specified. For genomes much larger than human, applying -f38 or even -f39 is preferred to save memory on k-mer counting.
  • int?
Genomescope Model Parameters Genomescope Model Parameters GenomeScope model parameters generated by K-mer profiling workflow
  • File
Genomescope Summary Genomescope Summary n/a
  • File
Hapmer Database : Maternal Hapmer Database : Maternal n/a
  • File
Hapmer Database : Paternal Hapmer Database : Paternal n/a
  • File
Homozygous Read Coverage Homozygous Read Coverage If empty, read coverage will be estimated from the Genomescope parameters.
  • int?
Lineage Lineage Taxonomic lineage for the organism being assembled for Busco analysis
  • string
Maternal Illumina reads (hap2) Maternal Illumina reads (hap2) n/a
  • File[]
Meryl Database : Child Meryl Database : Child n/a
  • File
Name for Haplotype 1 Name for Haplotype 1 n/a
  • string?
Name for Haplotype 2 Name for Haplotype 2 n/a
  • string?
Pacbio Reads Collection : child Pacbio Reads Collection : child n/a
  • File[]
Paternal Illumina reads (hap1) Paternal Illumina reads (hap1) n/a
  • File[]
SAK input file (Optional) SAK input file (Optional) n/a
  • File?
Utilize homology information to correct trio-phasing errors Utilize homology information to correct trio-phasing errors --trio-dual option
  • boolean?

Steps

ID Name Description
15 Cutadapt toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.8+galaxy0
16 Compute toolshed.g2.bx.psu.edu/repos/devteam/column_maker/Add_a_column1/2.0
17 Search in textfiles toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.3+galaxy0
18 MultiQC toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.11+galaxy1
19 Cut Cut1
20 Replace Text toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_line/9.3+galaxy0
21 Parse parameter value param_value_from_file
22 Convert Convert characters1
23 Homozygous read coverage for Hifiasm toolshed.g2.bx.psu.edu/repos/iuc/pick_value/pick_value/0.2.0
24 Cut Cut1
25 Hifiasm toolshed.g2.bx.psu.edu/repos/bgruening/hifiasm/hifiasm/0.19.8+galaxy1
26 Estimated genome size param_value_from_file
27 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
28 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
29 Raw Unitig Image toolshed.g2.bx.psu.edu/repos/iuc/bandage/bandage_image/2022.09+galaxy4
30 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
31 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
32 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
33 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
34 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
35 gfastats toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.6+galaxy0
36 Busco toolshed.g2.bx.psu.edu/repos/iuc/busco/busco/5.5.0+galaxy0
37 Busco toolshed.g2.bx.psu.edu/repos/iuc/busco/busco/5.5.0+galaxy0
38 Merqury toolshed.g2.bx.psu.edu/repos/iuc/merqury/merqury/1.3+galaxy4
39 Data prep Hap1 n/a
40 Data Prep Hap2 n/a
41 Text reformatting toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.3+galaxy0
42 Text reformatting toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.3+galaxy0
43 Plots n/a
44 Join toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.3+galaxy0

Outputs

ID Name Description Type
json_stats json_stats n/a
  • File
multiqc html report multiqc html report n/a
  • File
cutadapt multiqc stats cutadapt multiqc stats n/a
  • File
Estimated Genome size Estimated Genome size n/a
  • File
usable hap1 gfa usable hap1 gfa n/a
  • File
usable hap2 gfa usable hap2 gfa n/a
  • File
Hifiasm Trio hap1 Hifiasm Trio hap1 n/a
  • File
Hifiasm Trio hap2 Hifiasm Trio hap2 n/a
  • File
Busco Gff Hap1 Busco Gff Hap1 n/a
  • File
Busco Summary Hap1 Busco Summary Hap1 n/a
  • File
Busco Summary Image Hap1 Busco Summary Image Hap1 n/a
  • File
Busco Gff Hap2 Busco Gff Hap2 n/a
  • File
Busco Summary Image Hap2 Busco Summary Image Hap2 n/a
  • File
Busco Summary Hap2 Busco Summary Hap2 n/a
  • File
Merqury Images Merqury Images n/a
  • File
Merqury Trio Histogram Merqury Trio Histogram n/a
  • File
Size Plot Size Plot n/a
  • File
Nx Plot Nx Plot n/a
  • File
Assembly statistics for Hap1 and Hap2 Assembly statistics for Hap1 and Hap2 n/a
  • File

Version History

v0.1.4 (latest) Created 3rd May 2024 at 03:02 by WorkflowHub Bot

Updated to v0.1.4


Frozen v0.1.4 b639087

v0.1.3 (earliest) Created 1st May 2024 at 03:04 by WorkflowHub Bot

Updated to v0.1.3


Frozen v0.1.3 a2195cc
help Creators and Submitter
Creators
Not specified
Additional credit

Galaxy, VGP

Submitter
Activity

Views: 157

Created: 1st May 2024 at 03:04

Last updated: 3rd May 2024 at 03:02

help Tags
help Attributions

None

Total size: 161 KB
Powered by
(v.1.14.1)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH