Loading…
The BLIS Framework: Experiments in Portability
BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework...
Saved in:
Published in: | ACM transactions on mathematical software 2016-06, Vol.42 (2), p.1-19 |
---|---|
Main Authors: | , , , , , , , , , , , |
Format: | Article |
Language: | English |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3 |
container_end_page | 19 |
container_issue | 2 |
container_start_page | 1 |
container_title | ACM transactions on mathematical software |
container_volume | 42 |
creator | Van Zee, Field G. Smith, Tyler M. Marker, Bryan Low, Tze Meng Geijn, Robert A. Van De Igual, Francisco D. Smelyanskiy, Mikhail Zhang, Xianyi Kistler, Michael Austel, Vernon Gunnels, John A. Killough, Lee |
description | BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain. |
doi_str_mv | 10.1145/2755561 |
format | article |
fullrecord | <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_2755561</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_2755561</sourcerecordid><originalsourceid>FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</originalsourceid><addsrcrecordid>eNotz01LAzEQgOFQFFyrCP6C3jzFzkwyyeaoxdrCQg-t5yWbzVA_SiURxH8vYk_v7YVHqRuEe0TLc_LM7HCiGmT22lPgM9UAhFYbBrhQl7W-AQChx0bd7vZ59titt7NliYf8fSzvV-pc4kfN16dO1cvyabdY6W7zvF48dDph67-0A8CMYgY3Bh-SDZGJ3CABIrlA4xjMYG1ro7hhBHRGUhShAAnYORIzVXf_31SOtZYs_Wd5PcTy0yP0f5T-RDG_FNc3zg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The BLIS Framework: Experiments in Portability</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</creator><creatorcontrib>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</creatorcontrib><description>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</description><identifier>ISSN: 0098-3500</identifier><identifier>EISSN: 1557-7295</identifier><identifier>DOI: 10.1145/2755561</identifier><language>eng</language><ispartof>ACM transactions on mathematical software, 2016-06, Vol.42 (2), p.1-19</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Van Zee, Field G.</creatorcontrib><creatorcontrib>Smith, Tyler M.</creatorcontrib><creatorcontrib>Marker, Bryan</creatorcontrib><creatorcontrib>Low, Tze Meng</creatorcontrib><creatorcontrib>Geijn, Robert A. Van De</creatorcontrib><creatorcontrib>Igual, Francisco D.</creatorcontrib><creatorcontrib>Smelyanskiy, Mikhail</creatorcontrib><creatorcontrib>Zhang, Xianyi</creatorcontrib><creatorcontrib>Kistler, Michael</creatorcontrib><creatorcontrib>Austel, Vernon</creatorcontrib><creatorcontrib>Gunnels, John A.</creatorcontrib><creatorcontrib>Killough, Lee</creatorcontrib><title>The BLIS Framework: Experiments in Portability</title><title>ACM transactions on mathematical software</title><description>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</description><issn>0098-3500</issn><issn>1557-7295</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNotz01LAzEQgOFQFFyrCP6C3jzFzkwyyeaoxdrCQg-t5yWbzVA_SiURxH8vYk_v7YVHqRuEe0TLc_LM7HCiGmT22lPgM9UAhFYbBrhQl7W-AQChx0bd7vZ59titt7NliYf8fSzvV-pc4kfN16dO1cvyabdY6W7zvF48dDph67-0A8CMYgY3Bh-SDZGJ3CABIrlA4xjMYG1ro7hhBHRGUhShAAnYORIzVXf_31SOtZYs_Wd5PcTy0yP0f5T-RDG_FNc3zg</recordid><startdate>20160601</startdate><enddate>20160601</enddate><creator>Van Zee, Field G.</creator><creator>Smith, Tyler M.</creator><creator>Marker, Bryan</creator><creator>Low, Tze Meng</creator><creator>Geijn, Robert A. Van De</creator><creator>Igual, Francisco D.</creator><creator>Smelyanskiy, Mikhail</creator><creator>Zhang, Xianyi</creator><creator>Kistler, Michael</creator><creator>Austel, Vernon</creator><creator>Gunnels, John A.</creator><creator>Killough, Lee</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20160601</creationdate><title>The BLIS Framework</title><author>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Van Zee, Field G.</creatorcontrib><creatorcontrib>Smith, Tyler M.</creatorcontrib><creatorcontrib>Marker, Bryan</creatorcontrib><creatorcontrib>Low, Tze Meng</creatorcontrib><creatorcontrib>Geijn, Robert A. Van De</creatorcontrib><creatorcontrib>Igual, Francisco D.</creatorcontrib><creatorcontrib>Smelyanskiy, Mikhail</creatorcontrib><creatorcontrib>Zhang, Xianyi</creatorcontrib><creatorcontrib>Kistler, Michael</creatorcontrib><creatorcontrib>Austel, Vernon</creatorcontrib><creatorcontrib>Gunnels, John A.</creatorcontrib><creatorcontrib>Killough, Lee</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on mathematical software</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Van Zee, Field G.</au><au>Smith, Tyler M.</au><au>Marker, Bryan</au><au>Low, Tze Meng</au><au>Geijn, Robert A. Van De</au><au>Igual, Francisco D.</au><au>Smelyanskiy, Mikhail</au><au>Zhang, Xianyi</au><au>Kistler, Michael</au><au>Austel, Vernon</au><au>Gunnels, John A.</au><au>Killough, Lee</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The BLIS Framework: Experiments in Portability</atitle><jtitle>ACM transactions on mathematical software</jtitle><date>2016-06-01</date><risdate>2016</risdate><volume>42</volume><issue>2</issue><spage>1</spage><epage>19</epage><pages>1-19</pages><issn>0098-3500</issn><eissn>1557-7295</eissn><abstract>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</abstract><doi>10.1145/2755561</doi><tpages>19</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0098-3500 |
ispartof | ACM transactions on mathematical software, 2016-06, Vol.42 (2), p.1-19 |
issn | 0098-3500 1557-7295 |
language | eng |
recordid | cdi_crossref_primary_10_1145_2755561 |
source | Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list) |
title | The BLIS Framework: Experiments in Portability |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T11%3A08%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20BLIS%20Framework:%20Experiments%20in%20Portability&rft.jtitle=ACM%20transactions%20on%20mathematical%20software&rft.au=Van%20Zee,%20Field%20G.&rft.date=2016-06-01&rft.volume=42&rft.issue=2&rft.spage=1&rft.epage=19&rft.pages=1-19&rft.issn=0098-3500&rft.eissn=1557-7295&rft_id=info:doi/10.1145/2755561&rft_dat=%3Ccrossref%3E10_1145_2755561%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |