BioimageAIpub - publish Bioimaging data to HuggingFace for AI processing
main @ 706945e

Workflow Type: Jupyter
Work-in-progress

BioimageAIpub

A Python library to publish Bioimaging datasets to HuggingFace in AI-ready fashion.

Installation

git clone https://github.com/German-BioImaging/bioimageaipub/bioimageaipub.git
cd bioimageaipub
pip install -r requirements.txt

Usage

BioimageAIpub is supplied as a Python library. See demo/demo.ipynb for a Python notebook demonstrating the usage.

Citation

Acknowledgments

This project was supported by and is attributable to the German Cancer Research Center (DKFZ) the Helmholtz Metadata Collaboration (HMC), an incubator-platform of the Helmholtz Association within the framework of the Information and Data Science strategic initiative.

Inputs

ID Name Description Type
list_path n/a n/a
  • File
endpoint_url n/a n/a
  • string?
data_dir n/a n/a
  • string?
file_type n/a n/a
  • string?
train_ratio n/a n/a
  • float?
study_id n/a n/a
  • string?
hf_num_fields n/a n/a
  • int?
omexcavator_path n/a n/a
  • string?
mixed_data_type_columns n/a n/a
  • string?
destination_dataset n/a n/a
  • string

Steps

ID Name Description
step1_download_and_convert n/a n/a
step2_split_and_annotate n/a n/a
step3_upload_to_hf n/a n/a

Outputs

ID Name Description Type
final_converted_dir n/a n/a
  • Directory

Version History

main @ 706945e (earliest) Created 12th Dec 2025 at 10:43 by Stefan Dvoretskii

a more granular cwl workflow


Frozen main 706945e
help Creators and Submitter
Creators
Additional credit

German Cancer Research Center (DKFZ), HMC Hub Health

Submitter
Activity

Views: 579   Downloads: 118

Created: 12th Dec 2025 at 10:43

Last updated: 7th Jan 2026 at 10:40

Annotated Properties
Topic annotations
Operation annotations
Scientific disciplines
Computer Science, Biochemistry, Genetics and Molecular Biology
help Attributions

None

Total size: 1.09 MB
Powered by
(v.1.17.3)
Copyright © 2008 - 2026 The University of Manchester and HITS gGmbH