Loading…

Optimizing Single-Source Graph Execution on NUMA Machines

Graphs are data structures capable of representing problems from different domains, such as logistics and social networks. However, these massive graphs stored in high-performance computing (HPC) servers start processing from distinct source vertices (i.e., single-source: such as a different user or...

Full description

Saved in:
Bibliographic Details
Main Authors: Rocha, Hiago Mayk G. de A., Querol, Vicenc Beltran, Lorenzon, Arthur F., Beck, Antonio Carlos S.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Rocha, Hiago Mayk G. de A.
Querol, Vicenc Beltran
Lorenzon, Arthur F.
Beck, Antonio Carlos S.
description Graphs are data structures capable of representing problems from different domains, such as logistics and social networks. However, these massive graphs stored in high-performance computing (HPC) servers start processing from distinct source vertices (i.e., single-source: such as a different user or message in a social network). Therefore, the amount of vertices and the structure of the sub-graphs to be processed will also change depending on the source, highly influencing the graph algorithm behavior and performance. In this paper, we propose GraphNroll, a framework to extract the full potential of these single-source graph algorithms by exploiting the fact that the majority of the HPC servers are on Non-Uniform Memory Access (NUMA) architectures and, therefore, highly dependent of thread and data mapping. GraphNroll adjusts the thread and data mapping on NUMA machines by using only the embeddings of the source vertices (i.e., vector representations that encode the graph topology) to find the best configuration in terms of thread/data mapping. For that, GraphNroll learns over such embeddings to build a machine learning model capable of predicting the mapping solutions for a new algorithm execution without any profiling. Our results show that GraphNroll is, on average, 8% (up to 18%) faster than the Linux OS Default and Random solutions on three NUMA machines while reducing energy consumption.
doi_str_mv 10.1109/SBESC60926.2023.10324068
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10324068</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10324068</ieee_id><sourcerecordid>10324068</sourcerecordid><originalsourceid>FETCH-LOGICAL-i204t-361d1bbe73c56f08f3870c6c54e14b19600331ff4ce2259610eb0a2a6a2658883</originalsourceid><addsrcrecordid>eNo1j1FLwzAUhaMgOGb_gQ_5A603uUmaPM5S52BzD3XPI423LrJ1pe1A_fUWVDicA9-BA4cxLiATAtxD9VhWhQEnTSZBYiYApQJjr1jicmdRAzolwF6zmZyaNLdO3bJkGD4AYOJaOzVjbtuN8RS_Y_vOq8mOlFbnSx-IL3vfHXj5SeEyxnPLJ73sNgu-8eEQWxru2E3jjwMlfzlnu6fytXhO19vlqlis0yhBjSka8SbqmnIM2jRgG7Q5BBO0IqFq4QwAomgaFUhK7YwAqsFLb7w02lqLc3b_uxuJaN_18eT7r_3_W_wBeMBHrA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Optimizing Single-Source Graph Execution on NUMA Machines</title><source>IEEE Xplore All Conference Series</source><creator>Rocha, Hiago Mayk G. de A. ; Querol, Vicenc Beltran ; Lorenzon, Arthur F. ; Beck, Antonio Carlos S.</creator><creatorcontrib>Rocha, Hiago Mayk G. de A. ; Querol, Vicenc Beltran ; Lorenzon, Arthur F. ; Beck, Antonio Carlos S.</creatorcontrib><description>Graphs are data structures capable of representing problems from different domains, such as logistics and social networks. However, these massive graphs stored in high-performance computing (HPC) servers start processing from distinct source vertices (i.e., single-source: such as a different user or message in a social network). Therefore, the amount of vertices and the structure of the sub-graphs to be processed will also change depending on the source, highly influencing the graph algorithm behavior and performance. In this paper, we propose GraphNroll, a framework to extract the full potential of these single-source graph algorithms by exploiting the fact that the majority of the HPC servers are on Non-Uniform Memory Access (NUMA) architectures and, therefore, highly dependent of thread and data mapping. GraphNroll adjusts the thread and data mapping on NUMA machines by using only the embeddings of the source vertices (i.e., vector representations that encode the graph topology) to find the best configuration in terms of thread/data mapping. For that, GraphNroll learns over such embeddings to build a machine learning model capable of predicting the mapping solutions for a new algorithm execution without any profiling. Our results show that GraphNroll is, on average, 8% (up to 18%) faster than the Linux OS Default and Random solutions on three NUMA machines while reducing energy consumption.</description><identifier>EISSN: 2324-7894</identifier><identifier>EISBN: 9798350394108</identifier><identifier>DOI: 10.1109/SBESC60926.2023.10324068</identifier><language>eng</language><publisher>IEEE</publisher><subject>Graph's High-Level Features ; NUMA Machines ; Single-Source Algorithms ; Thread and Data Mapping</subject><ispartof>2023 XIII Brazilian Symposium on Computing Systems Engineering (SBESC), 2023, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10324068$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10324068$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rocha, Hiago Mayk G. de A.</creatorcontrib><creatorcontrib>Querol, Vicenc Beltran</creatorcontrib><creatorcontrib>Lorenzon, Arthur F.</creatorcontrib><creatorcontrib>Beck, Antonio Carlos S.</creatorcontrib><title>Optimizing Single-Source Graph Execution on NUMA Machines</title><title>2023 XIII Brazilian Symposium on Computing Systems Engineering (SBESC)</title><addtitle>SBESC</addtitle><description>Graphs are data structures capable of representing problems from different domains, such as logistics and social networks. However, these massive graphs stored in high-performance computing (HPC) servers start processing from distinct source vertices (i.e., single-source: such as a different user or message in a social network). Therefore, the amount of vertices and the structure of the sub-graphs to be processed will also change depending on the source, highly influencing the graph algorithm behavior and performance. In this paper, we propose GraphNroll, a framework to extract the full potential of these single-source graph algorithms by exploiting the fact that the majority of the HPC servers are on Non-Uniform Memory Access (NUMA) architectures and, therefore, highly dependent of thread and data mapping. GraphNroll adjusts the thread and data mapping on NUMA machines by using only the embeddings of the source vertices (i.e., vector representations that encode the graph topology) to find the best configuration in terms of thread/data mapping. For that, GraphNroll learns over such embeddings to build a machine learning model capable of predicting the mapping solutions for a new algorithm execution without any profiling. Our results show that GraphNroll is, on average, 8% (up to 18%) faster than the Linux OS Default and Random solutions on three NUMA machines while reducing energy consumption.</description><subject>Graph's High-Level Features</subject><subject>NUMA Machines</subject><subject>Single-Source Algorithms</subject><subject>Thread and Data Mapping</subject><issn>2324-7894</issn><isbn>9798350394108</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNo1j1FLwzAUhaMgOGb_gQ_5A603uUmaPM5S52BzD3XPI423LrJ1pe1A_fUWVDicA9-BA4cxLiATAtxD9VhWhQEnTSZBYiYApQJjr1jicmdRAzolwF6zmZyaNLdO3bJkGD4AYOJaOzVjbtuN8RS_Y_vOq8mOlFbnSx-IL3vfHXj5SeEyxnPLJ73sNgu-8eEQWxru2E3jjwMlfzlnu6fytXhO19vlqlis0yhBjSka8SbqmnIM2jRgG7Q5BBO0IqFq4QwAomgaFUhK7YwAqsFLb7w02lqLc3b_uxuJaN_18eT7r_3_W_wBeMBHrA</recordid><startdate>20231121</startdate><enddate>20231121</enddate><creator>Rocha, Hiago Mayk G. de A.</creator><creator>Querol, Vicenc Beltran</creator><creator>Lorenzon, Arthur F.</creator><creator>Beck, Antonio Carlos S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20231121</creationdate><title>Optimizing Single-Source Graph Execution on NUMA Machines</title><author>Rocha, Hiago Mayk G. de A. ; Querol, Vicenc Beltran ; Lorenzon, Arthur F. ; Beck, Antonio Carlos S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i204t-361d1bbe73c56f08f3870c6c54e14b19600331ff4ce2259610eb0a2a6a2658883</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Graph's High-Level Features</topic><topic>NUMA Machines</topic><topic>Single-Source Algorithms</topic><topic>Thread and Data Mapping</topic><toplevel>online_resources</toplevel><creatorcontrib>Rocha, Hiago Mayk G. de A.</creatorcontrib><creatorcontrib>Querol, Vicenc Beltran</creatorcontrib><creatorcontrib>Lorenzon, Arthur F.</creatorcontrib><creatorcontrib>Beck, Antonio Carlos S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rocha, Hiago Mayk G. de A.</au><au>Querol, Vicenc Beltran</au><au>Lorenzon, Arthur F.</au><au>Beck, Antonio Carlos S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Optimizing Single-Source Graph Execution on NUMA Machines</atitle><btitle>2023 XIII Brazilian Symposium on Computing Systems Engineering (SBESC)</btitle><stitle>SBESC</stitle><date>2023-11-21</date><risdate>2023</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><eissn>2324-7894</eissn><eisbn>9798350394108</eisbn><abstract>Graphs are data structures capable of representing problems from different domains, such as logistics and social networks. However, these massive graphs stored in high-performance computing (HPC) servers start processing from distinct source vertices (i.e., single-source: such as a different user or message in a social network). Therefore, the amount of vertices and the structure of the sub-graphs to be processed will also change depending on the source, highly influencing the graph algorithm behavior and performance. In this paper, we propose GraphNroll, a framework to extract the full potential of these single-source graph algorithms by exploiting the fact that the majority of the HPC servers are on Non-Uniform Memory Access (NUMA) architectures and, therefore, highly dependent of thread and data mapping. GraphNroll adjusts the thread and data mapping on NUMA machines by using only the embeddings of the source vertices (i.e., vector representations that encode the graph topology) to find the best configuration in terms of thread/data mapping. For that, GraphNroll learns over such embeddings to build a machine learning model capable of predicting the mapping solutions for a new algorithm execution without any profiling. Our results show that GraphNroll is, on average, 8% (up to 18%) faster than the Linux OS Default and Random solutions on three NUMA machines while reducing energy consumption.</abstract><pub>IEEE</pub><doi>10.1109/SBESC60926.2023.10324068</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2324-7894
ispartof 2023 XIII Brazilian Symposium on Computing Systems Engineering (SBESC), 2023, p.1-6
issn 2324-7894
language eng
recordid cdi_ieee_primary_10324068
source IEEE Xplore All Conference Series
subjects Graph's High-Level Features
NUMA Machines
Single-Source Algorithms
Thread and Data Mapping
title Optimizing Single-Source Graph Execution on NUMA Machines
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T16%3A34%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Optimizing%20Single-Source%20Graph%20Execution%20on%20NUMA%20Machines&rft.btitle=2023%20XIII%20Brazilian%20Symposium%20on%20Computing%20Systems%20Engineering%20(SBESC)&rft.au=Rocha,%20Hiago%20Mayk%20G.%20de%20A.&rft.date=2023-11-21&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.eissn=2324-7894&rft_id=info:doi/10.1109/SBESC60926.2023.10324068&rft.eisbn=9798350394108&rft_dat=%3Cieee_CHZPO%3E10324068%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i204t-361d1bbe73c56f08f3870c6c54e14b19600331ff4ce2259610eb0a2a6a2658883%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10324068&rfr_iscdi=true