Loading…
Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures
Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 1351 |
container_issue | |
container_start_page | 1350 |
container_title | |
container_volume | |
creator | Gerofi, Balazs Shimada, Akio Hori, Atsushi Ishikawa, Yutaka |
description | Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel. |
doi_str_mv | 10.1109/SC.Companion.2012.181 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6495964</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6495964</ieee_id><sourcerecordid>6495964</sourcerecordid><originalsourceid>FETCH-ieee_primary_64959643</originalsourceid><addsrcrecordid>eNp9js1KxDAUhSMi-NcnEOG-gDXJdNLGXSlKN4OLmf1w7dypkUlSbjJI394Krl2dA9934AjxqGSplLTP267sop8wuBhKLZUuVaMuRGHrRtbGriu7NvZS3KrK1CujVaOvRZHSl5Ry2Rtd2Rvh2o-UGYf8Arv4jXyA94kYswsjbOeUyUObklvKAXq3EB4-3YAn2JCPPMMGA47kKWQ4RoaeMnEcKVA8J2h_5UxDPjOle3F1xFOi4i_vxMPb667rnxwR7Sd2Hnnem-W0NdXqf_oDbmxN_g</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</creator><creatorcontrib>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</creatorcontrib><description>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</description><identifier>ISBN: 1467362182</identifier><identifier>ISBN: 9781467362184</identifier><identifier>EISBN: 9780769549569</identifier><identifier>EISBN: 076954956X</identifier><identifier>DOI: 10.1109/SC.Companion.2012.181</identifier><language>eng</language><publisher>IEEE</publisher><ispartof>2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012, p.1350-1351</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6495964$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54898</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6495964$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Gerofi, Balazs</creatorcontrib><creatorcontrib>Shimada, Akio</creatorcontrib><creatorcontrib>Hori, Atsushi</creatorcontrib><creatorcontrib>Ishikawa, Yutaka</creatorcontrib><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><title>2012 SC Companion: High Performance Computing, Networking Storage and Analysis</title><addtitle>SCC</addtitle><description>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</description><isbn>1467362182</isbn><isbn>9781467362184</isbn><isbn>9780769549569</isbn><isbn>076954956X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNp9js1KxDAUhSMi-NcnEOG-gDXJdNLGXSlKN4OLmf1w7dypkUlSbjJI394Krl2dA9934AjxqGSplLTP267sop8wuBhKLZUuVaMuRGHrRtbGriu7NvZS3KrK1CujVaOvRZHSl5Ry2Rtd2Rvh2o-UGYf8Arv4jXyA94kYswsjbOeUyUObklvKAXq3EB4-3YAn2JCPPMMGA47kKWQ4RoaeMnEcKVA8J2h_5UxDPjOle3F1xFOi4i_vxMPb667rnxwR7Sd2Hnnem-W0NdXqf_oDbmxN_g</recordid><startdate>201211</startdate><enddate>201211</enddate><creator>Gerofi, Balazs</creator><creator>Shimada, Akio</creator><creator>Hori, Atsushi</creator><creator>Ishikawa, Yutaka</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201211</creationdate><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><author>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_64959643</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Gerofi, Balazs</creatorcontrib><creatorcontrib>Shimada, Akio</creatorcontrib><creatorcontrib>Hori, Atsushi</creatorcontrib><creatorcontrib>Ishikawa, Yutaka</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library Online</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gerofi, Balazs</au><au>Shimada, Akio</au><au>Hori, Atsushi</au><au>Ishikawa, Yutaka</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</atitle><btitle>2012 SC Companion: High Performance Computing, Networking Storage and Analysis</btitle><stitle>SCC</stitle><date>2012-11</date><risdate>2012</risdate><spage>1350</spage><epage>1351</epage><pages>1350-1351</pages><isbn>1467362182</isbn><isbn>9781467362184</isbn><eisbn>9780769549569</eisbn><eisbn>076954956X</eisbn><abstract>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</abstract><pub>IEEE</pub><doi>10.1109/SC.Companion.2012.181</doi></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 1467362182 |
ispartof | 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012, p.1350-1351 |
issn | |
language | eng |
recordid | cdi_ieee_primary_6495964 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
title | Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T08%3A57%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Abstract:%20Toward%20Operating%20System%20Assisted%20Hierarchical%20Memory%20Management%20for%20Heterogeneous%20Architectures&rft.btitle=2012%20SC%20Companion:%20High%20Performance%20Computing,%20Networking%20Storage%20and%20Analysis&rft.au=Gerofi,%20Balazs&rft.date=2012-11&rft.spage=1350&rft.epage=1351&rft.pages=1350-1351&rft.isbn=1467362182&rft.isbn_list=9781467362184&rft_id=info:doi/10.1109/SC.Companion.2012.181&rft.eisbn=9780769549569&rft.eisbn_list=076954956X&rft_dat=%3Cieee_6IE%3E6495964%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-ieee_primary_64959643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6495964&rfr_iscdi=true |