This experiment has been executed using:
- Model: Decision Tree regressor (test.yaml)
- n_samples: 8 (number of ALYA simulations to be run)
- Nodes used in the MN5 supercomputer: 4
- ALYA_PROCS = 56
Adapted version of a workflow used in CAELESTIS for mechanical testing simulation which was developed by Riccardo Cecco (Workflows and Distributed Computing Group, BSC) with the guidance of Gerard Guillamet (Dual Technologies Research Group, BSC) and Aravind Sasikumar (AMADE Research UDG). The workflow ...
Type: COMPSs
Creators: Riccardo Cecco, Gerard Guillamet, Aravind Sasikumar, Fernando Vázquez-Novoa, Jorge Ejarque, Rosa M Badia
Submitter: Raül Sirvent
This experiment has been executed using:
- Model: SVR (test_SVR.yaml)
- n_samples: 8 (number of ALYA simulations to be run)
- Nodes used in the MN5 supercomputer: 2
- ALYA_PROCS = 28
Adapted version of a workflow used in CAELESTIS for mechanical testing simulation which was developed by Riccardo Cecco (Workflows and Distributed Computing Group, BSC) with the guidance of Gerard Guillamet (Dual Technologies Research Group, BSC) and Aravind Sasikumar (AMADE Research UDG). The workflow is distributed ...
Type: COMPSs
Creators: Riccardo Cecco, Gerard Guillamet, Aravind Sasikumar, Fernando Vázquez-Novoa, Jorge Ejarque, Rosa M Badia
Submitter: Raül Sirvent
Name: PhysioNet CascadeCSVM Kfold Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
Kfold to evaluate CascadeCSVM accuracy on PhysioNet dataset (https://b2drop.bsc.es/index.php/s/8Q8MefXX2rrzaWs). This application used dislib-0.9.0
Name: PhysioNet kNN Kfold Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
Kfold to evaluate kNN accuracy on PhysioNet dataset (https://b2drop.bsc.es/index.php/s/8Q8MefXX2rrzaWs). This application used dislib-0.9.0
Name: PhysioNet RF Kfold Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
Kfold to evaluate RandomForest accuracy on PhysioNet dataset (https://b2drop.bsc.es/index.php/s/8Q8MefXX2rrzaWs). This application used dislib-0.9.0
Name: GridSearchCV Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
GridSearch of kNN algorithm for the iris.csv dataset (https://gist.githubusercontent.com/netj/8836201/raw/6f9306ad21398ea43cba4f7d537619d0e07d5ae3/iris.csv). This application used dislib-0.9.0
Name: GridSearchCV Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
GridSearch of kNN algorithm for the iris.csv dataset (https://gist.githubusercontent.com/netj/8836201/raw/6f9306ad21398ea43cba4f7d537619d0e07d5ae3/iris.csv). This application used dislib-0.9.0
Name: Dislib Distributed Training - Cache ON Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4
PyTorch distributed training of CNN on GPU and leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Dataset: Imagenet Version dislib-0.9 Version PyTorch 1.7.1+cu101
Average task execution time: 36 seconds
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: K-Means GPU Cache ON Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4
K-Means running on the GPU leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Parameters used: K=40 and 32 blocks of size (1_000_000, 1200). It creates a block for each GPU. Total dataset shape is (32_000_000, 1200). Version dislib-0.9
Average task execution time: 16 seconds
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: KMeans Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum5
KMEans for clustering the housing.csv dataset (https://github.com/sonarsushant/California-House-Price-Prediction/blob/master/housing.csv). This application used dislib-0.9.0
Name: Dislib Distributed Training - Cache OFF Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4
PyTorch distributed training of CNN on GPU. Launched using 32 GPUs (16 nodes). Dataset: Imagenet Version dislib-0.9 Version PyTorch 1.7.1+cu101
Average task execution time: 84 seconds
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: K-Means GPU Cache OFF Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4
K-Means running on GPUs. Launched using 32 GPUs (16 nodes). Parameters used: K=40 and 32 blocks of size (1_000_000, 1200). It creates a block for each GPU. Total dataset shape is (32_000_000, 1200). Version dislib-0.9
Average task execution time: 194 seconds
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: Matmul GPU Case 1 Cache-ON Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4
Matmul running on the GPU leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Performs C = A @ B Where A: shape (320, 56_900_000) block_size (10, 11_380_000) B: shape (56_900_000, 10) block_size (11_380_000, 10) C: shape (320, 10) block_size ...
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: Matmul GPU Case 1 Cache-OFF Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs 3.3 Machine: Minotauro-MN4
Matmul running on the GPU without Cache. Launched using 32 GPUs (16 nodes). Performs C = A @ B Where A: shape (320, 56_900_000) block_size (10, 11_380_000) B: shape (56_900_000, 10) block_size (11_380_000, 10) C: shape (320, 10) block_size (10, 10) Total dataset size 291 ...
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
Name: Random Forest Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum4 This is an example of Random Forest algorithm from dislib. To show the usage, the code generates a synthetical input matrix. The results are printed by screen. This application used dislib-0.9.0
Name: Lanczos SVD Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum4
Lanczos SVD for computing singular values needed to reach an epsilon of 1e-3 on a matrix of (150000, 150). The input matrix is generated synthetically. This application used dislib-0.9.0
Type: COMPSs
Creators: Fernando Vázquez-Novoa, Workflows and Distributed Computing
Submitter: Fernando Vázquez-Novoa
Name: TruncatedSVD (Randomized SVD) Contact Person: [email protected] Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: MareNostrum4
TruncatedSVD (Randomized SVD) for computing just 456 singular values out of a (3.6M x 1200) size matrix. The input matrix represents a CFD transient simulation of aire moving past a cylinder. This application used dislib-0.9.0
Type: COMPSs
Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)
Submitter: Cristian Tatu
A demonstration workflow for Reduced Order Modeling (ROM) within the eFlows4HPC project, implemented using Kratos Multiphysics, EZyRB, COMPSs, and dislib.
Type: COMPSs
Creators: Jose Raul Bravo Martinez, Sebastian Ares de Parga Regalado, Riccardo Rossi Bernecoli, Jorge Ejarque
Submitter: Raül Sirvent