Loading…

Evaluating and Comparing the Potentials in Primary Response for GPU and CPU Data Centers

The rapid growth of Large Language Models (LLMs) and Artificial Intelligence (AI) has transformed traditional CPU-centric Data Centers (DaCe) into more power-demanding GPU DaCes. Previous work has explored methods to reduce energy costs and carbon emissions in GPU DaCes. However, there remains a gap...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhou, Yihong, Paredes, Angel, Essayeh, Chaimaa, Morstyn, Thomas
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The rapid growth of Large Language Models (LLMs) and Artificial Intelligence (AI) has transformed traditional CPU-centric Data Centers (DaCe) into more power-demanding GPU DaCes. Previous work has explored methods to reduce energy costs and carbon emissions in GPU DaCes. However, there remains a gap in understanding the potential of GPU DaCes for providing primary response, a crucial ancillary service for stabilizing the power system. Drawing on real-world job traces from a GPU-intensive DaCe operated by SenseTime and a CPU-intensive DaCe at Oak Ridge National Laboratory, we developed a mixed-integer linear programming model to assess the DaCe flexibility potentials considering individual jobs' characteristics. We show that the GPU DaCe possesses a larger flexibility for delivering primary responses compared to the CPU DaCe. Furthermore, the GPU DaCe exhibits lower variability in flexibility across different times of the day and over a 7-month evaluation horizon, making them more dependable and stable sources for offering primary response.
ISSN:1944-9933
DOI:10.1109/PESGM51994.2024.10689061