Workflow Type: Galaxy
Stable

Takes fastqs and reference data, to produce a single cell counts matrix into and save in annData format - adding a column called sample with the sample name.

Inputs

ID Name Description Type
Barcode Whitelist file Barcode Whitelist file Obtain from kit supplier e.g. 10X
  • File
Expected Cells Expected Cells Approximately how many cells are expected in this sample?
  • int?
Paired Fastqs for one sample Paired Fastqs for one sample Typically the Forward reads are labelled _R1, barcodes. R2 is Reverse, mRNA sequences. List should contain seq for one biological sample only.
  • File[]
Reference Genome fasta Reference Genome fasta n/a
  • File
Reference annotation Reference annotation n/a
  • File
Sample Sample Short, unique sample name
  • string

Steps

ID Name Description
6 Unzip collection __UNZIP_COLLECTION__
7 Forward/R1/barcode Fastq Concat toolshed.g2.bx.psu.edu/repos/nml/collapse_collections/collapse_dataset/5.1.0
8 Reverse/R2/mRNA Fastq concat toolshed.g2.bx.psu.edu/repos/nml/collapse_collections/collapse_dataset/5.1.0
9 RNA STARSolo toolshed.g2.bx.psu.edu/repos/iuc/rna_starsolo/rna_starsolo/2.7.10b+galaxy2
10 Scanpy Read10x toolshed.g2.bx.psu.edu/repos/ebi-gxa/scanpy_read_10x/scanpy_read_10x/1.8.1+1+galaxy0
11 Inspect AnnData toolshed.g2.bx.psu.edu/repos/iuc/anndata_inspect/anndata_inspect/0.7.5+galaxy1
12 Create text file toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_text_file_with_recurring_lines/1.1.0
13 Text transformation This step replaces the first line with a header. Because the annotated data matrix also had a header, this works. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_sed_tool/1.1.1
14 Loaded Anndata with Sample name AnnData object with supplied sample id. toolshed.g2.bx.psu.edu/repos/iuc/anndata_manipulate/anndata_manipulate/0.7.5+galaxy1

Outputs

ID Name Description Type
RNA STARSolo on input dataset(s): Alignments RNA STARSolo on input dataset(s): Alignments n/a
  • File
Feature Statistic Summaries RNA STARSolo on input dataset(s): Barcode/Feature Statistic Summaries n/a
  • File
RNA STARSolo on input dataset(s): log RNA STARSolo on input dataset(s): log n/a
  • File
RNA STARSolo on input dataset(s): Genes filtered RNA STARSolo on input dataset(s): Genes filtered n/a
  • File
RNA STARSolo on input dataset(s): Barcodes filtered RNA STARSolo on input dataset(s): Barcodes filtered n/a
  • File
RNA STARSolo on input dataset(s): Matrix Gene Counts filtered RNA STARSolo on input dataset(s): Matrix Gene Counts filtered n/a
  • File
AnnData Loaded AnnData Loaded n/a
  • File

Version History

main @ fe052c0 (latest) Created 12th Dec 2023 at 03:08 by Sarah Williams

updated star solo index size to limit RAM useage since galaxy migration


Frozen main fe052c0

main @ a95a4ee (earliest) Created 22nd Jun 2023 at 07:47 by Sarah Williams

counts matrix workflow doc, affiliations


Frozen main a95a4ee
help Creators and Submitter
Creators
  • Sarah Williams
  • Mike Thang
  • Valentine Murigneaux
Submitter
Citation
Williams, S., Thang, M., & Murigneaux, V. (2023). {scRNAseq on galaxy workflows}.
License
Other (Not Open)
Activity

Views: 1615

Created: 22nd Jun 2023 at 07:47

Last updated: 12th Dec 2023 at 03:08

help Tags
help Attributions

None

Total size: 2.95 MB
Powered by
(v.1.14.1)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH