Preview |
PDF, English (Masterarbeit Constantin Nicolai)
Download (985kB) | Terms of use |
Abstract
The fast broader adoption of ML applications has caused a surge in their global energy usage, necessitating a comprehensive understanding of the tradeoffs between execution speed and energy consumption. While previous work was focused on time-only or inference-only studies, we provide a more complete picture by covering a wider space of parameters. We contribute: (1) time and energy profiling across inference and training of DNN operations, (2) operations-level and full DNN performance predictions trained on our profiling results and (3) graphical evaluation and validation of profiling and predictor results. Profiling results of Nvidia A30 performance across several core clocks reveal an energy optimum of 900 MHz, aligning with the manufacturer base clock of 930 MHz. This work provides the tools which enable the correct choice of target GPU and clock speed for existing and future models.
Document type: | Master's thesis |
---|---|
Supervisor: | Fröning, Prof. Dr. Holger |
Place of Publication: | Heidelberg |
Date of thesis defense: | 2025 |
Date Deposited: | 09 Sep 2025 09:37 |
Date: | 2025 |
Faculties / Institutes: | Service facilities > Institut f. Technische Informatik (ZITI) Fakultät für Ingenieurwissenschaften > Dekanat der Fakultät für Ingenieurwissenschaften |
DDC-classification: | 004 Data processing Computer science |