Loading…

Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling

In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kube...

Full description

Saved in:

Bibliographic Details
Published in:	Electronics (Basel) 2024-07, Vol.13 (13), p.2651
Main Authors:	Dakić, Vedran, Kovač, Mario, Slovinac, Jurica
Format:	Article
Language:	English
Subjects:	Automation Computer centers Containerization Containers Design Efficiency Hardware High performance computing Machine learning Modularity Network latency Operating systems Placement Real time RISC Scheduling Software Task complexity Workload Workloads
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites	cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93
container_end_page
container_issue	13
container_start_page	2651
container_title	Electronics (Basel)
container_volume	13
creator	Dakić, Vedran Kovač, Mario Slovinac, Jurica
description	In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.
doi_str_mv	10.3390/electronics13132651
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3079025578</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3079025578</sourcerecordid><originalsourceid>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</originalsourceid><addsrcrecordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3079025578</pqid></control><display><type>article</type><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creator><creatorcontrib>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</creatorcontrib><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><identifier>ISSN: 2079-9292</identifier><identifier>EISSN: 2079-9292</identifier><identifier>DOI: 10.3390/electronics13132651</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Automation ; Computer centers ; Containerization ; Containers ; Design ; Efficiency ; Hardware ; High performance computing ; Machine learning ; Modularity ; Network latency ; Operating systems ; Placement ; Real time ; RISC ; Scheduling ; Software ; Task complexity ; Workload ; Workloads</subject><ispartof>Electronics (Basel), 2024-07, Vol.13 (13), p.2651</ispartof><rights>2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</cites><orcidid>0000-0001-8638-6044</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3079025578/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3079025578?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,25732,27903,27904,36991,44569,74873</link.rule.ids></links><search><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><title>Electronics (Basel)</title><description>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</description><subject>Automation</subject><subject>Computer centers</subject><subject>Containerization</subject><subject>Containers</subject><subject>Design</subject><subject>Efficiency</subject><subject>Hardware</subject><subject>High performance computing</subject><subject>Machine learning</subject><subject>Modularity</subject><subject>Network latency</subject><subject>Operating systems</subject><subject>Placement</subject><subject>Real time</subject><subject>RISC</subject><subject>Scheduling</subject><subject>Software</subject><subject>Task complexity</subject><subject>Workload</subject><subject>Workloads</subject><issn>2079-9292</issn><issn>2079-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNptUU1PwkAQbYwmEuQXeNnEq9X9YBf2iAXFiJFEEo_NsB1osd3F3RbDb_HPWoIHDs5lXmbevMnMi6JrRu-E0PQeSzS1d7YwgQkmuJLsLOpwOtCx5pqfn-DLqBfChrahmRgK2ol-JjtX7gq7JtNincdz9CvnK7AGSeKqbVMfWmOogSRoa_SBfBd1Tl6aJXqLNYZbcjozslDuQ9FWwWZkvLdQFYZ8OP9ZOsjIvASDVStEHiBgRpwlr2DywiKZIXh7WPZucsyasoVX0cUKyoC9v9yNFo-TRTKNZ29Pz8loFhtOeR1zRfvSwBIVaKP6VHFusK9NBpJlkg0HCpnsr4SC5ZCpgQGmNDNMUslRoxbd6OYou_Xuq8FQpxvX-PaQkIr2cZRLORi2LHFkGe9C8LhKt76owO9TRtODD-k_PohfxtKAaw</recordid><startdate>20240705</startdate><enddate>20240705</enddate><creator>Dakić, Vedran</creator><creator>Kovač, Mario</creator><creator>Slovinac, Jurica</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L7M</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid></search><sort><creationdate>20240705</creationdate><title>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</title><author>Dakić, Vedran ; Kovač, Mario ; Slovinac, Jurica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automation</topic><topic>Computer centers</topic><topic>Containerization</topic><topic>Containers</topic><topic>Design</topic><topic>Efficiency</topic><topic>Hardware</topic><topic>High performance computing</topic><topic>Machine learning</topic><topic>Modularity</topic><topic>Network latency</topic><topic>Operating systems</topic><topic>Placement</topic><topic>Real time</topic><topic>RISC</topic><topic>Scheduling</topic><topic>Software</topic><topic>Task complexity</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dakić, Vedran</creatorcontrib><creatorcontrib>Kovač, Mario</creatorcontrib><creatorcontrib>Slovinac, Jurica</creatorcontrib><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Electronics (Basel)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dakić, Vedran</au><au>Kovač, Mario</au><au>Slovinac, Jurica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling</atitle><jtitle>Electronics (Basel)</jtitle><date>2024-07-05</date><risdate>2024</risdate><volume>13</volume><issue>13</issue><spage>2651</spage><pages>2651-</pages><issn>2079-9292</issn><eissn>2079-9292</eissn><abstract>In the past twenty years, the IT industry has moved away from using physical servers for workload management to workloads consolidated via virtualization and, in the next iteration, further consolidated into containers. Later, container workloads based on Docker and Podman were orchestrated via Kubernetes or OpenShift. On the other hand, high-performance computing (HPC) environments have been lagging in this process, as much work is still needed to figure out how to apply containerization platforms for HPC. Containers have many advantages, as they tend to have less overhead while providing flexibility, modularity, and maintenance benefits. This makes them well-suited for tasks requiring a lot of computing power that are latency- or bandwidth-sensitive. But they are complex to manage, and many daily operations are based on command-line procedures that take years to master. This paper proposes a different architecture based on seamless hardware integration and a user-friendly UI (User Interface). It also offers dynamic workload placement based on real-time performance analysis and prediction and Machine Learning-based scheduling. This solves a prevalent issue in Kubernetes: the suboptimal placement of workloads without needing individual workload schedulers, as they are challenging to write and require much time to debug and test properly. It also enables us to focus on one of the key HPC issues—energy efficiency. Furthermore, the application we developed that implements this architecture helps with the Kubernetes installation process, which is fully automated, no matter which hardware platform we use—x86, ARM, and soon, RISC-V. The results we achieved using this architecture and application are very promising in two areas—the speed of workload scheduling and workload placement on a correct node. This also enables us to focus on one of the key HPC issues—energy efficiency.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/electronics13132651</doi><orcidid>https://orcid.org/0000-0001-8638-6044</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2079-9292
ispartof	Electronics (Basel), 2024-07, Vol.13 (13), p.2651
issn	2079-9292 2079-9292
language	eng
recordid	cdi_proquest_journals_3079025578
source	Publicly Available Content Database (Proquest) (PQ_SDU_P3)
subjects	Automation Computer centers Containerization Containers Design Efficiency Hardware High performance computing Machine learning Modularity Network latency Operating systems Placement Real time RISC Scheduling Software Task complexity Workload Workloads
title	Evolving High-Performance Computing Data Centers with Kubernetes, Performance Analysis, and Dynamic Workload Placement Based on Machine Learning Scheduling
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T13%3A58%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evolving%20High-Performance%20Computing%20Data%20Centers%20with%20Kubernetes,%20Performance%20Analysis,%20and%20Dynamic%20Workload%20Placement%20Based%20on%20Machine%20Learning%20Scheduling&rft.jtitle=Electronics%20(Basel)&rft.au=Daki%C4%87,%20Vedran&rft.date=2024-07-05&rft.volume=13&rft.issue=13&rft.spage=2651&rft.pages=2651-&rft.issn=2079-9292&rft.eissn=2079-9292&rft_id=info:doi/10.3390/electronics13132651&rft_dat=%3Cproquest_cross%3E3079025578%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c202t-26045cabe6a9c640622ce49cda51d51876e154f36ab8167ca1691c15052e9e93%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3079025578&rft_id=info:pmid/&rfr_iscdi=true