22 October 2020
Empa
Europe/Zurich timezone

Reproducible High-Throughput HPC Workloads with AiiDA

22 Oct 2020, 10:15
30m
Akademie I (Empa)

Akademie I

Empa

Überlandstrasse 129 8600 Dübendorf

Speaker

Giovanni Pizzi (EPFL)

Description

The ever-growing availability of computing power and the sustained development of advanced computational methods present new challenges driven by the sheer amount of calculations and data to manage. In this talk I will present AiiDA (aiida.net), a robust open-source high-throughput infrastructure addressing the challenges arising from the needs of automated workflow management and data provenance recording. I will discuss the developments and capabilities required to reach sustained performance, with AiiDA supporting throughputs of tens of thousands processes/hour, each of them ranging from local processes to massive HPC jobs. I will also discuss how AiiDA automatically preserves and stores the full data provenance in a relational database making it queryable and traversable, thus enabling high-performance data analytics and reproducibility of the research.

Presentation materials