Loading…

Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures

Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires...

Full description

Saved in:
Bibliographic Details
Main Authors: Gerofi, Balazs, Shimada, Akio, Hori, Atsushi, Ishikawa, Yutaka
Format: Conference Proceeding
Language:English
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 1351
container_issue
container_start_page 1350
container_title
container_volume
creator Gerofi, Balazs
Shimada, Akio
Hori, Atsushi
Ishikawa, Yutaka
description Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.
doi_str_mv 10.1109/SC.Companion.2012.181
format conference_proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6495964</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6495964</ieee_id><sourcerecordid>6495964</sourcerecordid><originalsourceid>FETCH-ieee_primary_64959643</originalsourceid><addsrcrecordid>eNp9js1KxDAUhSMi-NcnEOG-gDXJdNLGXSlKN4OLmf1w7dypkUlSbjJI394Krl2dA9934AjxqGSplLTP267sop8wuBhKLZUuVaMuRGHrRtbGriu7NvZS3KrK1CujVaOvRZHSl5Ry2Rtd2Rvh2o-UGYf8Arv4jXyA94kYswsjbOeUyUObklvKAXq3EB4-3YAn2JCPPMMGA47kKWQ4RoaeMnEcKVA8J2h_5UxDPjOle3F1xFOi4i_vxMPb667rnxwR7Sd2Hnnem-W0NdXqf_oDbmxN_g</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</creator><creatorcontrib>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</creatorcontrib><description>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</description><identifier>ISBN: 1467362182</identifier><identifier>ISBN: 9781467362184</identifier><identifier>EISBN: 9780769549569</identifier><identifier>EISBN: 076954956X</identifier><identifier>DOI: 10.1109/SC.Companion.2012.181</identifier><language>eng</language><publisher>IEEE</publisher><ispartof>2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012, p.1350-1351</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6495964$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54898</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6495964$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Gerofi, Balazs</creatorcontrib><creatorcontrib>Shimada, Akio</creatorcontrib><creatorcontrib>Hori, Atsushi</creatorcontrib><creatorcontrib>Ishikawa, Yutaka</creatorcontrib><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><title>2012 SC Companion: High Performance Computing, Networking Storage and Analysis</title><addtitle>SCC</addtitle><description>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</description><isbn>1467362182</isbn><isbn>9781467362184</isbn><isbn>9780769549569</isbn><isbn>076954956X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNp9js1KxDAUhSMi-NcnEOG-gDXJdNLGXSlKN4OLmf1w7dypkUlSbjJI394Krl2dA9934AjxqGSplLTP267sop8wuBhKLZUuVaMuRGHrRtbGriu7NvZS3KrK1CujVaOvRZHSl5Ry2Rtd2Rvh2o-UGYf8Arv4jXyA94kYswsjbOeUyUObklvKAXq3EB4-3YAn2JCPPMMGA47kKWQ4RoaeMnEcKVA8J2h_5UxDPjOle3F1xFOi4i_vxMPb667rnxwR7Sd2Hnnem-W0NdXqf_oDbmxN_g</recordid><startdate>201211</startdate><enddate>201211</enddate><creator>Gerofi, Balazs</creator><creator>Shimada, Akio</creator><creator>Hori, Atsushi</creator><creator>Ishikawa, Yutaka</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201211</creationdate><title>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</title><author>Gerofi, Balazs ; Shimada, Akio ; Hori, Atsushi ; Ishikawa, Yutaka</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_64959643</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Gerofi, Balazs</creatorcontrib><creatorcontrib>Shimada, Akio</creatorcontrib><creatorcontrib>Hori, Atsushi</creatorcontrib><creatorcontrib>Ishikawa, Yutaka</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library Online</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gerofi, Balazs</au><au>Shimada, Akio</au><au>Hori, Atsushi</au><au>Ishikawa, Yutaka</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures</atitle><btitle>2012 SC Companion: High Performance Computing, Networking Storage and Analysis</btitle><stitle>SCC</stitle><date>2012-11</date><risdate>2012</risdate><spage>1350</spage><epage>1351</epage><pages>1350-1351</pages><isbn>1467362182</isbn><isbn>9781467362184</isbn><eisbn>9780769549569</eisbn><eisbn>076954956X</eisbn><abstract>Heterogeneous architectures, where a multicore processor is accompanied with a large number of simpler, but more power-efficient CPU cores optimized for parallel workloads, are receiving a lot of attention these days. Currently, these co-processors come with a limited on-board memory, which requires partitioning computational problems manually into pieces that can fit into the device's RAM as well as overlapping computation and communication. In this poster we propose application transparent, operating system (OS) assisted hierarchical memory management system, where the OS orchestrates data movement between the host and the device and updates the process virtual memory address space accordingly. We identify the main scalability issues of frequent address space changes, such as the increasing price of TLB invalidations with the growing number of CPU cores and propose partially separated page tables to overcome the problem. With partially separated page tables each core maintains its own set of mappings of the computation area, enabling the OS to perform address space updates in a scalable manner, and involve a particular CPU core in TLB invalidation only if it is absolutely necessary. Furthermore, we propose dedicated data movement cores in order to efficiently overlap computation and communication. We provide experimental results on stencil computation, a common HPC kernel.</abstract><pub>IEEE</pub><doi>10.1109/SC.Companion.2012.181</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 1467362182
ispartof 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012, p.1350-1351
issn
language eng
recordid cdi_ieee_primary_6495964
source IEEE Electronic Library (IEL) Conference Proceedings
title Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T08%3A57%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Abstract:%20Toward%20Operating%20System%20Assisted%20Hierarchical%20Memory%20Management%20for%20Heterogeneous%20Architectures&rft.btitle=2012%20SC%20Companion:%20High%20Performance%20Computing,%20Networking%20Storage%20and%20Analysis&rft.au=Gerofi,%20Balazs&rft.date=2012-11&rft.spage=1350&rft.epage=1351&rft.pages=1350-1351&rft.isbn=1467362182&rft.isbn_list=9781467362184&rft_id=info:doi/10.1109/SC.Companion.2012.181&rft.eisbn=9780769549569&rft.eisbn_list=076954956X&rft_dat=%3Cieee_6IE%3E6495964%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-ieee_primary_64959643%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6495964&rfr_iscdi=true