Loading…
Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling
In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kube...
Saved in:
Published in: | Electronics (Basel) 2024-07, Vol.13 (13), p.2651 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93 |
container_end_page | |
container_issue | 13 |
container_start_page | 2651 |
container_title | Electronics (Basel) |
container_volume | 13 |
creator | Dakić, Vedran Kovač, Mario Slovinac, Jurica |
description | In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency. |
doi_str_mv | 10.3390/electronics13132651 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3079025578</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3079025578</sourcerecordid><originalsourceid>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</originalsourceid><addsrcrecordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3079025578</pqid></control><display><type>article</type><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creator><creatorcontrib>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creatorcontrib><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><identifier>ISSN: 2079-9292</identifier><identifier>EISSN: 2079-9292</identifier><identifier>DOI: 10.3390/electronics13132651</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Automation ; Computer centers ; Containerization ; Containers ; Design ; Efficiency ; Hardware ; High performance computing ; Machine learning ; Modularity ; Network latency ; Operating systems ; Placement ; Real time ; RISC ; Scheduling ; Software ; Task complexity ; Workload ; Workloads</subject><ispartof>Electronics (Basel), 2024-07, Vol.13 (13), p.2651</ispartof><rights>2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</cites><orcidid>0000-0001-8638-6044</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3079025578/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3079025578?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,25732,27903,27904,36991,44569,74873</link.rule.ids></links><search><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><title>Electronics (Basel)</title><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><subject>Automation</subject><subject>Computer centers</subject><subject>Containerization</subject><subject>Containers</subject><subject>Design</subject><subject>Efficiency</subject><subject>Hardware</subject><subject>High performance computing</subject><subject>Machine learning</subject><subject>Modularity</subject><subject>Network latency</subject><subject>Operating systems</subject><subject>Placement</subject><subject>Real time</subject><subject>RISC</subject><subject>Scheduling</subject><subject>Software</subject><subject>Task complexity</subject><subject>Workload</subject><subject>Workloads</subject><issn>2079-9292</issn><issn>2079-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</recordid><startdate>20240705</startdate><enddate>20240705</enddate><creator>Dakić, Vedran</creator><creator>Kovač, Mario</creator><creator>Slovinac, Jurica</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L7M</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid></search><sort><creationdate>20240705</creationdate><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><author>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automation</topic><topic>Computer centers</topic><topic>Containerization</topic><topic>Containers</topic><topic>Design</topic><topic>Efficiency</topic><topic>Hardware</topic><topic>High performance computing</topic><topic>Machine learning</topic><topic>Modularity</topic><topic>Network latency</topic><topic>Operating systems</topic><topic>Placement</topic><topic>Real time</topic><topic>RISC</topic><topic>Scheduling</topic><topic>Software</topic><topic>Task complexity</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Electronics (Basel)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dakić, Vedran</au><au>Kovač, Mario</au><au>Slovinac, Jurica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</atitle><jtitle>Electronics (Basel)</jtitle><date>2024-07-05</date><risdate>2024</risdate><volume>13</volume><issue>13</issue><spage>2651</spage><pages>2651-</pages><issn>2079-9292</issn><eissn>2079-9292</eissn><abstract>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/electronics13132651</doi><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2079-9292 |
ispartof | Electronics (Basel), 2024-07, Vol.13 (13), p.2651 |
issn | 2079-9292 2079-9292 |
language | eng |
recordid | cdi_proquest_journals_3079025578 |
source | Publicly Available Content Database (Proquest) (PQ_SDU_P3) |
subjects | Automation Computer centers Containerization Containers Design Efficiency Hardware High performance computing Machine learning Modularity Network latency Operating systems Placement Real time RISC Scheduling Software Task complexity Workload Workloads |
title | Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T13%3A58%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evolving%20High-Performance%20Computing%20Data%20Centers%20with%20Kubernetes,%20Performance%20Analysis,%20and%20Dynamic%20Workload%20Placement%20Based%20on%20Machine%20Learning%20Scheduling&rft.jtitle=Electronics%20(Basel)&rft.au=Daki%C4%87,%20Vedran&rft.date=2024-07-05&rft.volume=13&rft.issue=13&rft.spage=2651&rft.pages=2651-&rft.issn=2079-9292&rft.eissn=2079-9292&rft_id=info:doi/10.3390/electronics13132651&rft_dat=%3Cproquest_cross%3E3079025578%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3079025578&rft_id=info:pmid/&rfr_iscdi=true |