COMPSs GPU Cache Matrix Multiplication

**Name:** Matmul GPU
**Contact Person:** cristian.tatu@bsc.es
**Access Level:** public
**License Agreement:** Apache2
**Platform:** COMPSs
**Machine:** Minotauro-MN4

Matmul running on the GPU, leveraging the COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes).

C = A @ B, where:

A: shape (320, 56_900_000), block_size (10, 11_380_000)
B: shape (56_900_000, 10), block_size (11_380_000, 10)
C: shape (320, 10), block_size (10, 10)

Total dataset size: 291 GB.

Version: dislib-0.9 (https://github.com/bsc-wdc/dislib/tree/release-0.9)
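
To make the block layout concrete, here is a minimal NumPy sketch of how the shapes and block sizes quoted above split into block grids, together with a scaled-down blocked multiplication. It is an illustration only: it does not use dislib or COMPSs, it is not the workflow's actual code, and the helper names `block_grid` and `blocked_matmul` are hypothetical.

```python
import numpy as np


def block_grid(shape, block_size):
    """Number of blocks along each axis (ceiling division)."""
    return tuple(-(-s // b) for s, b in zip(shape, block_size))


def blocked_matmul(a, b, a_block, b_block):
    """Compute a @ b by accumulating products of sub-blocks.

    Hypothetical helper for illustration; not the dislib implementation.
    """
    bm, bk = a_block
    bk_b, bn = b_block
    assert bk == bk_b, "inner block sizes must match"
    m, k = a.shape
    n = b.shape[1]
    c = np.zeros((m, n), dtype=np.result_type(a, b))
    for i in range(0, m, bm):          # block rows of A and C
        for j in range(0, n, bn):      # block columns of B and C
            for p in range(0, k, bk):  # blocks along the shared dimension
                c[i:i + bm, j:j + bn] += a[i:i + bm, p:p + bk] @ b[p:p + bk, j:j + bn]
    return c


# Block grids implied by the shapes and block sizes in the description.
grid_a = block_grid((320, 56_900_000), (10, 11_380_000))
grid_b = block_grid((56_900_000, 10), (11_380_000, 10))
grid_c = block_grid((320, 10), (10, 10))
print(grid_a, grid_b, grid_c)

# Scaled-down stand-in: same 320 x 10 output, much shorter inner dimension.
rng = np.random.default_rng(0)
a_small = rng.random((320, 50))
b_small = rng.random((50, 10))
c_small = blocked_matmul(a_small, b_small, (10, 10), (10, 10))
print(np.allclose(c_small, a_small @ b_small))
```

With the quoted sizes, A splits into a 32 x 5 grid of blocks, B into 5 x 1, and C into 32 x 1, so each block of C is the sum of five block products. The 32 block rows match the number of GPUs used, although the description does not state how tasks are assigned to GPUs.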

Publisher
Barcelona Supercomputing Center (https://ror.org/05sd8tv96)
License
Apache-2.0

Contents