Loading…

Hierarchical algorithms on hierarchical architectures

A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on loca...

Full description

Saved in:
Bibliographic Details
Published in:Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences physical, and engineering sciences, 2020-03, Vol.378 (2166), p.1-16
Main Authors: Keyes, D. E., Ltaief, H., Turkiyyah, G.
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 16
container_issue 2166
container_start_page 1
container_title Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences
container_volume 378
creator Keyes, D. E.
Ltaief, H.
Turkiyyah, G.
description A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on locally cached data represent only small costs in time and energy. Hierarchically low-rank matrices realize a rarely achieved combination of optimal storage complexity and high-computational intensity for a wide class of formally dense linear operators that arise in applications for which exascale computers are being constructed. They may be regarded as algebraic generalizations of the fast multipole method. Methods based on these hierarchical data structures and their simpler cousins, tile low-rank matrices, are well proportioned for early exascale computer architectures, which are provisioned for high processing power relative to memory capacity and memory bandwidth. They are ushering in a renaissance of computational linear algebra. A challenge is that emerging hardware architecture possesses hierarchies of its own that do not generally align with those of the algorithm. We describe modules of a software toolkit, hierarchical computations on manycore architectures, that illustrate these features and are intended as building blocks of applications, such as matrix-free higher-order methods in optimization and large-scale spatial statistics. Some modules of this open-source project have been adopted in the software libraries of major vendors. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.
format article
fullrecord <record><control><sourceid>jstor</sourceid><recordid>TN_cdi_jstor_primary_26917445</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26917445</jstor_id><sourcerecordid>26917445</sourcerecordid><originalsourceid>FETCH-jstor_primary_269174453</originalsourceid><addsrcrecordid>eNpjYuA0NDE31DWyNDNiAbKNzUx0TQ2MIzgYuIqLswwMDA3NTI04GUw9MlOLEouSMzKTE3MUEnPS84sySzJyixXy8xQyUKRAjJLU5JLSotRiHgbWtMSc4lReKM3NIOvmGuLsoZtVXJJfFF9QlJmbWFQZb2RmaWhuYmJqTEgeADwDMsw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Hierarchical algorithms on hierarchical architectures</title><source>JSTOR Archival Journals</source><source>Royal Society Publishing Jisc Collections Royal Society Journals Read &amp; Publish Transitional Agreement 2025 (reading list)</source><creator>Keyes, D. E. ; Ltaief, H. ; Turkiyyah, G.</creator><creatorcontrib>Keyes, D. E. ; Ltaief, H. ; Turkiyyah, G.</creatorcontrib><description>A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on locally cached data represent only small costs in time and energy. Hierarchically low-rank matrices realize a rarely achieved combination of optimal storage complexity and high-computational intensity for a wide class of formally dense linear operators that arise in applications for which exascale computers are being constructed. They may be regarded as algebraic generalizations of the fast multipole method. Methods based on these hierarchical data structures and their simpler cousins, tile low-rank matrices, are well proportioned for early exascale computer architectures, which are provisioned for high processing power relative to memory capacity and memory bandwidth. They are ushering in a renaissance of computational linear algebra. A challenge is that emerging hardware architecture possesses hierarchies of its own that do not generally align with those of the algorithm. We describe modules of a software toolkit, hierarchical computations on manycore architectures, that illustrate these features and are intended as building blocks of applications, such as matrix-free higher-order methods in optimization and large-scale spatial statistics. Some modules of this open-source project have been adopted in the software libraries of major vendors. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.</description><identifier>ISSN: 1364-503X</identifier><identifier>EISSN: 1471-2962</identifier><language>eng</language><publisher>Royal Society</publisher><ispartof>Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences, 2020-03, Vol.378 (2166), p.1-16</ispartof><rights>2020 The Authors</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/26917445$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/26917445$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,780,784,58238,58471</link.rule.ids></links><search><creatorcontrib>Keyes, D. E.</creatorcontrib><creatorcontrib>Ltaief, H.</creatorcontrib><creatorcontrib>Turkiyyah, G.</creatorcontrib><title>Hierarchical algorithms on hierarchical architectures</title><title>Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences</title><description>A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on locally cached data represent only small costs in time and energy. Hierarchically low-rank matrices realize a rarely achieved combination of optimal storage complexity and high-computational intensity for a wide class of formally dense linear operators that arise in applications for which exascale computers are being constructed. They may be regarded as algebraic generalizations of the fast multipole method. Methods based on these hierarchical data structures and their simpler cousins, tile low-rank matrices, are well proportioned for early exascale computer architectures, which are provisioned for high processing power relative to memory capacity and memory bandwidth. They are ushering in a renaissance of computational linear algebra. A challenge is that emerging hardware architecture possesses hierarchies of its own that do not generally align with those of the algorithm. We describe modules of a software toolkit, hierarchical computations on manycore architectures, that illustrate these features and are intended as building blocks of applications, such as matrix-free higher-order methods in optimization and large-scale spatial statistics. Some modules of this open-source project have been adopted in the software libraries of major vendors. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.</description><issn>1364-503X</issn><issn>1471-2962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid/><recordid>eNpjYuA0NDE31DWyNDNiAbKNzUx0TQ2MIzgYuIqLswwMDA3NTI04GUw9MlOLEouSMzKTE3MUEnPS84sySzJyixXy8xQyUKRAjJLU5JLSotRiHgbWtMSc4lReKM3NIOvmGuLsoZtVXJJfFF9QlJmbWFQZb2RmaWhuYmJqTEgeADwDMsw</recordid><startdate>20200306</startdate><enddate>20200306</enddate><creator>Keyes, D. E.</creator><creator>Ltaief, H.</creator><creator>Turkiyyah, G.</creator><general>Royal Society</general><scope/></search><sort><creationdate>20200306</creationdate><title>Hierarchical algorithms on hierarchical architectures</title><author>Keyes, D. E. ; Ltaief, H. ; Turkiyyah, G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-jstor_primary_269174453</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Keyes, D. E.</creatorcontrib><creatorcontrib>Ltaief, H.</creatorcontrib><creatorcontrib>Turkiyyah, G.</creatorcontrib><jtitle>Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Keyes, D. E.</au><au>Ltaief, H.</au><au>Turkiyyah, G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hierarchical algorithms on hierarchical architectures</atitle><jtitle>Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences</jtitle><date>2020-03-06</date><risdate>2020</risdate><volume>378</volume><issue>2166</issue><spage>1</spage><epage>16</epage><pages>1-16</pages><issn>1364-503X</issn><eissn>1471-2962</eissn><abstract>A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on locally cached data represent only small costs in time and energy. Hierarchically low-rank matrices realize a rarely achieved combination of optimal storage complexity and high-computational intensity for a wide class of formally dense linear operators that arise in applications for which exascale computers are being constructed. They may be regarded as algebraic generalizations of the fast multipole method. Methods based on these hierarchical data structures and their simpler cousins, tile low-rank matrices, are well proportioned for early exascale computer architectures, which are provisioned for high processing power relative to memory capacity and memory bandwidth. They are ushering in a renaissance of computational linear algebra. A challenge is that emerging hardware architecture possesses hierarchies of its own that do not generally align with those of the algorithm. We describe modules of a software toolkit, hierarchical computations on manycore architectures, that illustrate these features and are intended as building blocks of applications, such as matrix-free higher-order methods in optimization and large-scale spatial statistics. Some modules of this open-source project have been adopted in the software libraries of major vendors. This article is part of a discussion meeting issue ‘Numerical algorithms for high-performance computational science’.</abstract><pub>Royal Society</pub></addata></record>
fulltext fulltext
identifier ISSN: 1364-503X
ispartof Philosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences, 2020-03, Vol.378 (2166), p.1-16
issn 1364-503X
1471-2962
language eng
recordid cdi_jstor_primary_26917445
source JSTOR Archival Journals; Royal Society Publishing Jisc Collections Royal Society Journals Read & Publish Transitional Agreement 2025 (reading list)
title Hierarchical algorithms on hierarchical architectures
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T21%3A08%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hierarchical%20algorithms%20on%20hierarchical%20architectures&rft.jtitle=Philosophical%20transactions%20of%20the%20Royal%20Society%20of%20London.%20Series%20A:%20Mathematical,%20physical,%20and%20engineering%20sciences&rft.au=Keyes,%20D.%20E.&rft.date=2020-03-06&rft.volume=378&rft.issue=2166&rft.spage=1&rft.epage=16&rft.pages=1-16&rft.issn=1364-503X&rft.eissn=1471-2962&rft_id=info:doi/&rft_dat=%3Cjstor%3E26917445%3C/jstor%3E%3Cgrp_id%3Ecdi_FETCH-jstor_primary_269174453%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_jstor_id=26917445&rfr_iscdi=true