Big data processing on supercomputers
-
4 April 2025
10:00 AM – 12:00 PM - online
This course is held in the Czech language.
Annotation: This course will focus on systems for managing data processing workflows and their interfacing with supercomputers. We will introduce the basics of using supercomputers and different approaches to using them. The LEXIS platform will be introduced as an interface for running complex jobs with automatic data transfer without terminal tools. Participants will also learn about different approaches to transferring and storing data on supercomputing infrastructures in this demonstration. Attendees will experience working with workflows within LEXIS and see a demonstration of the Apache Airflow tool for more advanced tasks.
Benefits for participants
- Interface for easy access to data processing on the supercomputer
- Working with the graphical interface of the LEXIS platform
- Introducing the Apache Airflow GUI
- Issues of data transfer and access to analysis results on supercomputers
Level: beginner, intermediate, advanced
Language: Czech
Prerequisites: Interest in big data processing on a supercomputer
Ing. Martin Golasowski, Ph.D.
Martin Golasowski, Ph.D., is a senior researcher in the Advanced Data Analysis and Simulation Laboratory at the IT4Innovations National Supercomputing Center in the Czech Republic. His research interests are distributed data management, transfer, and analysis, using complex computational pipelines on HPC and in the cloud. He is responsible for the technical development of the LEXIS platform, which provides data management and workflow orchestration as a service. He is involved in several international research projects, including EXA4MIND and OpenWebSearch.eu. He has contributed to H2020 or EuroHPC research projects such as IO-SEA, LEXIS or ANTAREX and has contributed more than 20 publications.
Share event