Loading…

The BLIS Framework: Experiments in Portability

BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework...

Full description

Saved in:
Bibliographic Details
Published in:ACM transactions on mathematical software 2016-06, Vol.42 (2), p.1-19
Main Authors: Van Zee, Field G., Smith, Tyler M., Marker, Bryan, Low, Tze Meng, Geijn, Robert A. Van De, Igual, Francisco D., Smelyanskiy, Mikhail, Zhang, Xianyi, Kistler, Michael, Austel, Vernon, Gunnels, John A., Killough, Lee
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3
container_end_page 19
container_issue 2
container_start_page 1
container_title ACM transactions on mathematical software
container_volume 42
creator Van Zee, Field G.
Smith, Tyler M.
Marker, Bryan
Low, Tze Meng
Geijn, Robert A. Van De
Igual, Francisco D.
Smelyanskiy, Mikhail
Zhang, Xianyi
Kistler, Michael
Austel, Vernon
Gunnels, John A.
Killough, Lee
description BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.
doi_str_mv 10.1145/2755561
format article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_2755561</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_2755561</sourcerecordid><originalsourceid>FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</originalsourceid><addsrcrecordid>eNotz01LAzEQgOFQFFyrCP6C3jzFzkwyyeaoxdrCQg-t5yWbzVA_SiURxH8vYk_v7YVHqRuEe0TLc_LM7HCiGmT22lPgM9UAhFYbBrhQl7W-AQChx0bd7vZ59titt7NliYf8fSzvV-pc4kfN16dO1cvyabdY6W7zvF48dDph67-0A8CMYgY3Bh-SDZGJ3CABIrlA4xjMYG1ro7hhBHRGUhShAAnYORIzVXf_31SOtZYs_Wd5PcTy0yP0f5T-RDG_FNc3zg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The BLIS Framework: Experiments in Portability</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</creator><creatorcontrib>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</creatorcontrib><description>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</description><identifier>ISSN: 0098-3500</identifier><identifier>EISSN: 1557-7295</identifier><identifier>DOI: 10.1145/2755561</identifier><language>eng</language><ispartof>ACM transactions on mathematical software, 2016-06, Vol.42 (2), p.1-19</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Van Zee, Field G.</creatorcontrib><creatorcontrib>Smith, Tyler M.</creatorcontrib><creatorcontrib>Marker, Bryan</creatorcontrib><creatorcontrib>Low, Tze Meng</creatorcontrib><creatorcontrib>Geijn, Robert A. Van De</creatorcontrib><creatorcontrib>Igual, Francisco D.</creatorcontrib><creatorcontrib>Smelyanskiy, Mikhail</creatorcontrib><creatorcontrib>Zhang, Xianyi</creatorcontrib><creatorcontrib>Kistler, Michael</creatorcontrib><creatorcontrib>Austel, Vernon</creatorcontrib><creatorcontrib>Gunnels, John A.</creatorcontrib><creatorcontrib>Killough, Lee</creatorcontrib><title>The BLIS Framework: Experiments in Portability</title><title>ACM transactions on mathematical software</title><description>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</description><issn>0098-3500</issn><issn>1557-7295</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNotz01LAzEQgOFQFFyrCP6C3jzFzkwyyeaoxdrCQg-t5yWbzVA_SiURxH8vYk_v7YVHqRuEe0TLc_LM7HCiGmT22lPgM9UAhFYbBrhQl7W-AQChx0bd7vZ59titt7NliYf8fSzvV-pc4kfN16dO1cvyabdY6W7zvF48dDph67-0A8CMYgY3Bh-SDZGJ3CABIrlA4xjMYG1ro7hhBHRGUhShAAnYORIzVXf_31SOtZYs_Wd5PcTy0yP0f5T-RDG_FNc3zg</recordid><startdate>20160601</startdate><enddate>20160601</enddate><creator>Van Zee, Field G.</creator><creator>Smith, Tyler M.</creator><creator>Marker, Bryan</creator><creator>Low, Tze Meng</creator><creator>Geijn, Robert A. Van De</creator><creator>Igual, Francisco D.</creator><creator>Smelyanskiy, Mikhail</creator><creator>Zhang, Xianyi</creator><creator>Kistler, Michael</creator><creator>Austel, Vernon</creator><creator>Gunnels, John A.</creator><creator>Killough, Lee</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20160601</creationdate><title>The BLIS Framework</title><author>Van Zee, Field G. ; Smith, Tyler M. ; Marker, Bryan ; Low, Tze Meng ; Geijn, Robert A. Van De ; Igual, Francisco D. ; Smelyanskiy, Mikhail ; Zhang, Xianyi ; Kistler, Michael ; Austel, Vernon ; Gunnels, John A. ; Killough, Lee</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Van Zee, Field G.</creatorcontrib><creatorcontrib>Smith, Tyler M.</creatorcontrib><creatorcontrib>Marker, Bryan</creatorcontrib><creatorcontrib>Low, Tze Meng</creatorcontrib><creatorcontrib>Geijn, Robert A. Van De</creatorcontrib><creatorcontrib>Igual, Francisco D.</creatorcontrib><creatorcontrib>Smelyanskiy, Mikhail</creatorcontrib><creatorcontrib>Zhang, Xianyi</creatorcontrib><creatorcontrib>Kistler, Michael</creatorcontrib><creatorcontrib>Austel, Vernon</creatorcontrib><creatorcontrib>Gunnels, John A.</creatorcontrib><creatorcontrib>Killough, Lee</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on mathematical software</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Van Zee, Field G.</au><au>Smith, Tyler M.</au><au>Marker, Bryan</au><au>Low, Tze Meng</au><au>Geijn, Robert A. Van De</au><au>Igual, Francisco D.</au><au>Smelyanskiy, Mikhail</au><au>Zhang, Xianyi</au><au>Kistler, Michael</au><au>Austel, Vernon</au><au>Gunnels, John A.</au><au>Killough, Lee</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The BLIS Framework: Experiments in Portability</atitle><jtitle>ACM transactions on mathematical software</jtitle><date>2016-06-01</date><risdate>2016</risdate><volume>42</volume><issue>2</issue><spage>1</spage><epage>19</epage><pages>1-19</pages><issn>0098-3500</issn><eissn>1557-7295</eissn><abstract>BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra libraries. We demonstrate how BLIS acts as a productivity multiplier by using it to implement the level-3 BLAS on a variety of current architectures. The systems for which we demonstrate the framework include state-of-the-art general-purpose, low-power, and many-core architectures. We show, with very little effort, how the BLIS framework yields sequential and parallel implementations that are competitive with the performance of ATLAS, OpenBLAS (an effort to maintain and extend the GotoBLAS), and commercial vendor implementations such as AMD’s ACML, IBM’s ESSL, and Intel’s MKL libraries. Although most of this article focuses on single-core implementation, we also provide compelling results that suggest the framework’s leverage extends to the multithreaded domain.</abstract><doi>10.1145/2755561</doi><tpages>19</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0098-3500
ispartof ACM transactions on mathematical software, 2016-06, Vol.42 (2), p.1-19
issn 0098-3500
1557-7295
language eng
recordid cdi_crossref_primary_10_1145_2755561
source Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)
title The BLIS Framework: Experiments in Portability
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T11%3A08%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20BLIS%20Framework:%20Experiments%20in%20Portability&rft.jtitle=ACM%20transactions%20on%20mathematical%20software&rft.au=Van%20Zee,%20Field%20G.&rft.date=2016-06-01&rft.volume=42&rft.issue=2&rft.spage=1&rft.epage=19&rft.pages=1-19&rft.issn=0098-3500&rft.eissn=1557-7295&rft_id=info:doi/10.1145/2755561&rft_dat=%3Ccrossref%3E10_1145_2755561%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c187t-6001e1f3b6d979c49a5226bf90a2692dd93b4484af6bd0163fcaff290c05662f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true