Loading…

Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling

In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kube...

Full description

Saved in:
Bibliographic Details
Published in:Electronics (Basel) 2024-07, Vol.13 (13), p.2651
Main Authors: Dakić, Vedran, Kovač, Mario, Slovinac, Jurica
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93
container_end_page
container_issue 13
container_start_page 2651
container_title Electronics (Basel)
container_volume 13
creator Dakić, Vedran
Kovač, Mario
Slovinac, Jurica
description In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.
doi_str_mv 10.3390/electronics13132651
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3079025578</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3079025578</sourcerecordid><originalsourceid>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</originalsourceid><addsrcrecordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3079025578</pqid></control><display><type>article</type><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creator><creatorcontrib>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creatorcontrib><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><identifier>ISSN: 2079-9292</identifier><identifier>EISSN: 2079-9292</identifier><identifier>DOI: 10.3390/electronics13132651</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Automation ; Computer centers ; Containerization ; Containers ; Design ; Efficiency ; Hardware ; High performance computing ; Machine learning ; Modularity ; Network latency ; Operating systems ; Placement ; Real time ; RISC ; Scheduling ; Software ; Task complexity ; Workload ; Workloads</subject><ispartof>Electronics (Basel), 2024-07, Vol.13 (13), p.2651</ispartof><rights>2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</cites><orcidid>0000-0001-8638-6044</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3079025578/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3079025578?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,25732,27903,27904,36991,44569,74873</link.rule.ids></links><search><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><title>Electronics (Basel)</title><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><subject>Automation</subject><subject>Computer centers</subject><subject>Containerization</subject><subject>Containers</subject><subject>Design</subject><subject>Efficiency</subject><subject>Hardware</subject><subject>High performance computing</subject><subject>Machine learning</subject><subject>Modularity</subject><subject>Network latency</subject><subject>Operating systems</subject><subject>Placement</subject><subject>Real time</subject><subject>RISC</subject><subject>Scheduling</subject><subject>Software</subject><subject>Task complexity</subject><subject>Workload</subject><subject>Workloads</subject><issn>2079-9292</issn><issn>2079-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</recordid><startdate>20240705</startdate><enddate>20240705</enddate><creator>Dakić, Vedran</creator><creator>Kovač, Mario</creator><creator>Slovinac, Jurica</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L7M</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid></search><sort><creationdate>20240705</creationdate><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><author>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automation</topic><topic>Computer centers</topic><topic>Containerization</topic><topic>Containers</topic><topic>Design</topic><topic>Efficiency</topic><topic>Hardware</topic><topic>High performance computing</topic><topic>Machine learning</topic><topic>Modularity</topic><topic>Network latency</topic><topic>Operating systems</topic><topic>Placement</topic><topic>Real time</topic><topic>RISC</topic><topic>Scheduling</topic><topic>Software</topic><topic>Task complexity</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Electronics (Basel)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dakić, Vedran</au><au>Kovač, Mario</au><au>Slovinac, Jurica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</atitle><jtitle>Electronics (Basel)</jtitle><date>2024-07-05</date><risdate>2024</risdate><volume>13</volume><issue>13</issue><spage>2651</spage><pages>2651-</pages><issn>2079-9292</issn><eissn>2079-9292</eissn><abstract>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/electronics13132651</doi><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2079-9292
ispartof Electronics (Basel), 2024-07, Vol.13 (13), p.2651
issn 2079-9292
2079-9292
language eng
recordid cdi_proquest_journals_3079025578
source Publicly Available Content Database (Proquest) (PQ_SDU_P3)
subjects Automation
Computer centers
Containerization
Containers
Design
Efficiency
Hardware
High performance computing
Machine learning
Modularity
Network latency
Operating systems
Placement
Real time
RISC
Scheduling
Software
Task complexity
Workload
Workloads
title Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T13%3A58%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evolving%20High-Performance%20Computing%20Data%20Centers%20with%20Kubernetes,%20Performance%20Analysis,%20and%20Dynamic%20Workload%20Placement%20Based%20on%20Machine%20Learning%20Scheduling&rft.jtitle=Electronics%20(Basel)&rft.au=Daki%C4%87,%20Vedran&rft.date=2024-07-05&rft.volume=13&rft.issue=13&rft.spage=2651&rft.pages=2651-&rft.issn=2079-9292&rft.eissn=2079-9292&rft_id=info:doi/10.3390/electronics13132651&rft_dat=%3Cproquest_cross%3E3079025578%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3079025578&rft_id=info:pmid/&rfr_iscdi=true