Skip to content

Latest commit

 

History

History
24 lines (17 loc) · 2.58 KB

README.md

File metadata and controls

24 lines (17 loc) · 2.58 KB

Optimising workflow lifecycle management: development, HPC-ready containers deployment and reproducibility

SC24 tutorial, Atlanta, November 18th, 2024

Abstract

With Exaflop systems already here, the application communities are eager to leverage these large and complex systems. The complexity is further increased by the applications' need to combine different aspects beyond traditional HPC solvers and simulators, with artificial intelligence (AI) and data analytics (DA). The eFlows4HPC project proposed a software stack and the HPC Workflows-as-a-Service (HPCWaaS) methodology to provide tools to simplify the development, deployment, execution, and reuse of workflows. These results are leveraged in the DT-GEO and CAELESTIS projects. These tools also aim to support the reproducibility, portability and ease of use of complex workflows.

The tutorial will focus on a set of tools and methodologies for managing the whole application workflow lifecycle. In particular, the tutorial will cover aspects of developing computational workflows with PyCOMPSs and new extensions to better integrate with AI and DA with examples from DT-GEO and CAELESTIS projects. The tutorial will also describe how to automatically record workflow provenance with PyCOMPSs to share FAIR workflows in public repositories, enabling their reproducibility. Finally, we will explain how to generate specific containers that leverage HPC systems features and use them in the workflow deployment phase. The tutorial will include hands-on sessions on different aspects.

Agenda

Time Session Presenter
08:30 – 08:35 Overview of tutorial agenda Rosa M Badia
8:35 - 9:05 Part 1.1: Hybrid HPC+AI+DA workflow development with PyCOMPSs Rosa M Badia
9:05 - 9:35 Part 1.2: Workflows’ reproducibility through provenance Raül Sirvent
9:35 - 9:50 Part 1.3: HPC ready container images Motivation for architecture specific containers Rosa M Badia
9:50 - 10:00 Hands-on preparation All presenters
10:00 - 10:30 Break (30 min)
10:30 - 11:00 Part 2.1: Hands-on session: Sample workflows with PyCOMPSs and execution with containers Rosa M Badia
11:00 - 11:45 Part 2.2: Hands-on session: How to automatically record workflow provenance and use it to share experiments in WorkflowHub Raül Sirvent
11:45 - 12:00 Tutorial conclusions All presenters