HPC-Cloud-Big Data Convergent Architectures and Research Data Management The LEXIS Approach
Stephan Hachinger(1), Jan Martinovic(2), Olivier Terzo(3), Cédric Koch-Hofer(4), Martin Golasowski(2), Mohamad Hayek(1), Marc Levrier(4), Alberto Scionti(3), Donato Magarielli(5), Thierry Goubier(6), Antonio Parodi(7), Sean Murphy(8), Florin-Ionut Apopei(9), Carmine D’Amico(3), Simone Ciccia(3), Danijel Schorlemmer(10)
(1) Leibniz Supercomputing Centre (LRZ), Bavarian Academy of Sciences and Humanities, Garching, Germany
(2) IT4Innovations, VŠB – Technical University of Ostrava, Ostrava-Poruba, Czech Republic
(3) Advanced Computing and Applications, LINKS Foundation, Torino, Italy
(4) ATOS, Paris, France
(5) AvioAero, Torino, Italy
(6) CEA, LIST, Paris, France
(7) CIMA Research Foundation, Savona, Italy
(8) Cyclops Labs, Zurich, Switzerland
(9) TESEO, Torino, Italy
(10) GFZ German Research Centre for Geosciences, Potsdam, Germany
The LEXIS project (Large-scale EXecution for Industry & Society, H2020 GA825532) provides a platform for optimized execution of Cloud-HPC workflows, reducing computation time and increasing energy efficiency. The system will rely on advanced, distributed orchestration solutions (Bull Ystia Orchestrator, based on TOSCA and Alien4Cloud technologies), the High-End Application Execution Middleware HEAppE, and new hardware capabilities for maximizing efficiency in data processing, analysis and transfer (e.g. Burst Buffers with GPU- and FPGA-based data reprocessing).
LEXIS handles computation tasks and data from three Pilots, based on representative and demanding HPC/Cloud-Computing use cases in Industry (SMEs) and Science: i) Simulations of complex turbomachinery and gearbox systems in Aeronautics, ii) Tsunami simulations and earthquake loss assessments which are time-constrained to enable immediate warnings and to support well-informed decisions, and iii) Weather and Climate simulations where massive amounts of in-situ data are assimilated to improve forecasts. A user-friendly LEXIS web portal, as a unique entry point, will provide access to data as well as workflow-handling and remote visualization functionality.
As part of its back-end, LEXIS builds an elaborate system for the handling of input, intermediate and result data. At its core, a Distributed Data Infrastructure (DDI) ensures the availability of LEXIS data at all participating HPC sites, which will be federated with a common LEXIS AAI (with unified security model, user database and authorization policy). The DDI leverages best of breed data-management solutions from EUDAT, such as B2SAFE (based on iRODS) and B2HANDLE. REST APIs on top of it will ensure a smooth interaction with LEXIS workflows and the orchestration layer. Last, but not least, the DDI will provide functionalities for Research Data Management following the FAIR principles (“Findable, Interoperable, Accessible, Reusable”), e.g. DOI acquisition, which helps to publish and disseminate open data products.
NOTE: Submitted, publication in 2020 if accepted.