Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis

Bibliographic Details
Title: Generic and ML Workloads in an HPC Datacenter: Node Energy, Job Failures, and Node-Job Analysis
Authors: Chu, Xiaoyu, Hofstatter, Daniel, Ilager, Shashikant, Talluri, Sacheendra, Kampert, Duncan, Podareanu, Damian, Duplyakin, Dmitry, Brandic, Ivona, Iosup, Alexandru
Source: 2024 IEEE 30th International Conference on Parallel and Distributed Systems (ICPADS) ICPADS Parallel and Distributed Systems (ICPADS), 2024 IEEE 30th International Conference on. :710-719 Oct, 2024
Relation: 2024 IEEE 30th International Conference on Parallel and Distributed Systems (ICPADS)
Database: IEEE Xplore Digital Library