Loading…

List Ranking and List Scan on the CRAY C-90

List ranking and list scan are two primitive operations used in many parallel algorithms that use list, trees, and graph data structures. But vectorizing and parallelizing list ranking is a challenge because it is highly communication intensive and dynamic. In addition, the serial algorithm is very...

Full description

Saved in:
Bibliographic Details
Main Authors: Reid-Miller, Margaret, Blelloch, Guy E
Format: Report
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Reid-Miller, Margaret
Blelloch, Guy E
description List ranking and list scan are two primitive operations used in many parallel algorithms that use list, trees, and graph data structures. But vectorizing and parallelizing list ranking is a challenge because it is highly communication intensive and dynamic. In addition, the serial algorithm is very simple and has very small constants. In order to compete a parallel algorithm must also be simple and have small constants. A parallel algorithm due to Wyllie is such an algorithm, but it is not work efficient-its performance degrades for longer and longer linked lists. In contrast, work efficient PRAM algorithms developed to date have very large constants. We introduce a new fully vectorized and parallelized algorithm that both is work efficient and has small constants. However, it does not achieve O(logn) running time. But we contend that work efficiency and small constants is more important, given that vector and multiprocessor machines are used for problems that are much larger than the number of processors and, therefore, the O(log n) time is never achieved in practice. In particular, to the best of our knowledge our implementation of list ranking and list scan on the CRAY C-90 is the fastest implementation to date. In addition, it is the first implementation of which we are aware that outperforms fast workstations. List ranking, List scan, Prescan, Prefix-sum, Bulk synchronous processor (BSP) model, Parallel algorithm, Vector algorithm, Linked list, Load balancing, Randomized algorithm, Vector multiprocessor, CRAY Y-MPC-90
format report
fullrecord <record><control><sourceid>dtic_1RU</sourceid><recordid>TN_cdi_dtic_stinet_ADA276411</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>ADA276411</sourcerecordid><originalsourceid>FETCH-dtic_stinet_ADA2764113</originalsourceid><addsrcrecordid>eNrjZND2ySwuUQhKzMvOzEtXSMxLUQALBCcn5ink5ymUZKQqOAc5Rio461oa8DCwpiXmFKfyQmluBhk31xBnD92Ukszk-OKSzLzUknhHF0cjczMTQ0NjAtIA6u8kGA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>report</recordtype></control><display><type>report</type><title>List Ranking and List Scan on the CRAY C-90</title><source>DTIC Technical Reports</source><creator>Reid-Miller, Margaret ; Blelloch, Guy E</creator><creatorcontrib>Reid-Miller, Margaret ; Blelloch, Guy E ; CARNEGIE-MELLON UNIV PITTSBURGH PA DEPT OF COMPUTER SCIENCE</creatorcontrib><description>List ranking and list scan are two primitive operations used in many parallel algorithms that use list, trees, and graph data structures. But vectorizing and parallelizing list ranking is a challenge because it is highly communication intensive and dynamic. In addition, the serial algorithm is very simple and has very small constants. In order to compete a parallel algorithm must also be simple and have small constants. A parallel algorithm due to Wyllie is such an algorithm, but it is not work efficient-its performance degrades for longer and longer linked lists. In contrast, work efficient PRAM algorithms developed to date have very large constants. We introduce a new fully vectorized and parallelized algorithm that both is work efficient and has small constants. However, it does not achieve O(logn) running time. But we contend that work efficiency and small constants is more important, given that vector and multiprocessor machines are used for problems that are much larger than the number of processors and, therefore, the O(log n) time is never achieved in practice. In particular, to the best of our knowledge our implementation of list ranking and list scan on the CRAY C-90 is the fastest implementation to date. In addition, it is the first implementation of which we are aware that outperforms fast workstations. List ranking, List scan, Prescan, Prefix-sum, Bulk synchronous processor (BSP) model, Parallel algorithm, Vector algorithm, Linked list, Load balancing, Randomized algorithm, Vector multiprocessor, CRAY Y-MPC-90</description><language>eng</language><subject>ACCESS ; ALGORITHMS ; Computer Hardware ; CONSTANTS ; CONTRAST ; DATA PROCESSING ; DYNAMICS ; EFFICIENCY ; EQUATIONS ; GRAIN SIZE ; GRAPHS ; LOOPS ; MACHINES ; MULTIPROCESSORS ; OPERATION ; OPTIMIZATION ; PARALLEL PROCESSING ; RANKING ; SCANNERS ; SUPERCONDUCTORS ; TEST AND EVALUATION ; TIME ; TREES ; VECTOR ANALYSIS</subject><creationdate>1994</creationdate><rights>Approved for public release; distribution is unlimited.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,776,881,27544,27545</link.rule.ids><linktorsrc>$$Uhttps://apps.dtic.mil/sti/citations/ADA276411$$EView_record_in_DTIC$$FView_record_in_$$GDTIC$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Reid-Miller, Margaret</creatorcontrib><creatorcontrib>Blelloch, Guy E</creatorcontrib><creatorcontrib>CARNEGIE-MELLON UNIV PITTSBURGH PA DEPT OF COMPUTER SCIENCE</creatorcontrib><title>List Ranking and List Scan on the CRAY C-90</title><description>List ranking and list scan are two primitive operations used in many parallel algorithms that use list, trees, and graph data structures. But vectorizing and parallelizing list ranking is a challenge because it is highly communication intensive and dynamic. In addition, the serial algorithm is very simple and has very small constants. In order to compete a parallel algorithm must also be simple and have small constants. A parallel algorithm due to Wyllie is such an algorithm, but it is not work efficient-its performance degrades for longer and longer linked lists. In contrast, work efficient PRAM algorithms developed to date have very large constants. We introduce a new fully vectorized and parallelized algorithm that both is work efficient and has small constants. However, it does not achieve O(logn) running time. But we contend that work efficiency and small constants is more important, given that vector and multiprocessor machines are used for problems that are much larger than the number of processors and, therefore, the O(log n) time is never achieved in practice. In particular, to the best of our knowledge our implementation of list ranking and list scan on the CRAY C-90 is the fastest implementation to date. In addition, it is the first implementation of which we are aware that outperforms fast workstations. List ranking, List scan, Prescan, Prefix-sum, Bulk synchronous processor (BSP) model, Parallel algorithm, Vector algorithm, Linked list, Load balancing, Randomized algorithm, Vector multiprocessor, CRAY Y-MPC-90</description><subject>ACCESS</subject><subject>ALGORITHMS</subject><subject>Computer Hardware</subject><subject>CONSTANTS</subject><subject>CONTRAST</subject><subject>DATA PROCESSING</subject><subject>DYNAMICS</subject><subject>EFFICIENCY</subject><subject>EQUATIONS</subject><subject>GRAIN SIZE</subject><subject>GRAPHS</subject><subject>LOOPS</subject><subject>MACHINES</subject><subject>MULTIPROCESSORS</subject><subject>OPERATION</subject><subject>OPTIMIZATION</subject><subject>PARALLEL PROCESSING</subject><subject>RANKING</subject><subject>SCANNERS</subject><subject>SUPERCONDUCTORS</subject><subject>TEST AND EVALUATION</subject><subject>TIME</subject><subject>TREES</subject><subject>VECTOR ANALYSIS</subject><fulltext>true</fulltext><rsrctype>report</rsrctype><creationdate>1994</creationdate><recordtype>report</recordtype><sourceid>1RU</sourceid><recordid>eNrjZND2ySwuUQhKzMvOzEtXSMxLUQALBCcn5ink5ymUZKQqOAc5Rio461oa8DCwpiXmFKfyQmluBhk31xBnD92Ukszk-OKSzLzUknhHF0cjczMTQ0NjAtIA6u8kGA</recordid><startdate>19940216</startdate><enddate>19940216</enddate><creator>Reid-Miller, Margaret</creator><creator>Blelloch, Guy E</creator><scope>1RU</scope><scope>BHM</scope></search><sort><creationdate>19940216</creationdate><title>List Ranking and List Scan on the CRAY C-90</title><author>Reid-Miller, Margaret ; Blelloch, Guy E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-dtic_stinet_ADA2764113</frbrgroupid><rsrctype>reports</rsrctype><prefilter>reports</prefilter><language>eng</language><creationdate>1994</creationdate><topic>ACCESS</topic><topic>ALGORITHMS</topic><topic>Computer Hardware</topic><topic>CONSTANTS</topic><topic>CONTRAST</topic><topic>DATA PROCESSING</topic><topic>DYNAMICS</topic><topic>EFFICIENCY</topic><topic>EQUATIONS</topic><topic>GRAIN SIZE</topic><topic>GRAPHS</topic><topic>LOOPS</topic><topic>MACHINES</topic><topic>MULTIPROCESSORS</topic><topic>OPERATION</topic><topic>OPTIMIZATION</topic><topic>PARALLEL PROCESSING</topic><topic>RANKING</topic><topic>SCANNERS</topic><topic>SUPERCONDUCTORS</topic><topic>TEST AND EVALUATION</topic><topic>TIME</topic><topic>TREES</topic><topic>VECTOR ANALYSIS</topic><toplevel>online_resources</toplevel><creatorcontrib>Reid-Miller, Margaret</creatorcontrib><creatorcontrib>Blelloch, Guy E</creatorcontrib><creatorcontrib>CARNEGIE-MELLON UNIV PITTSBURGH PA DEPT OF COMPUTER SCIENCE</creatorcontrib><collection>DTIC Technical Reports</collection><collection>DTIC STINET</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Reid-Miller, Margaret</au><au>Blelloch, Guy E</au><aucorp>CARNEGIE-MELLON UNIV PITTSBURGH PA DEPT OF COMPUTER SCIENCE</aucorp><format>book</format><genre>unknown</genre><ristype>RPRT</ristype><btitle>List Ranking and List Scan on the CRAY C-90</btitle><date>1994-02-16</date><risdate>1994</risdate><abstract>List ranking and list scan are two primitive operations used in many parallel algorithms that use list, trees, and graph data structures. But vectorizing and parallelizing list ranking is a challenge because it is highly communication intensive and dynamic. In addition, the serial algorithm is very simple and has very small constants. In order to compete a parallel algorithm must also be simple and have small constants. A parallel algorithm due to Wyllie is such an algorithm, but it is not work efficient-its performance degrades for longer and longer linked lists. In contrast, work efficient PRAM algorithms developed to date have very large constants. We introduce a new fully vectorized and parallelized algorithm that both is work efficient and has small constants. However, it does not achieve O(logn) running time. But we contend that work efficiency and small constants is more important, given that vector and multiprocessor machines are used for problems that are much larger than the number of processors and, therefore, the O(log n) time is never achieved in practice. In particular, to the best of our knowledge our implementation of list ranking and list scan on the CRAY C-90 is the fastest implementation to date. In addition, it is the first implementation of which we are aware that outperforms fast workstations. List ranking, List scan, Prescan, Prefix-sum, Bulk synchronous processor (BSP) model, Parallel algorithm, Vector algorithm, Linked list, Load balancing, Randomized algorithm, Vector multiprocessor, CRAY Y-MPC-90</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_dtic_stinet_ADA276411
source DTIC Technical Reports
subjects ACCESS
ALGORITHMS
Computer Hardware
CONSTANTS
CONTRAST
DATA PROCESSING
DYNAMICS
EFFICIENCY
EQUATIONS
GRAIN SIZE
GRAPHS
LOOPS
MACHINES
MULTIPROCESSORS
OPERATION
OPTIMIZATION
PARALLEL PROCESSING
RANKING
SCANNERS
SUPERCONDUCTORS
TEST AND EVALUATION
TIME
TREES
VECTOR ANALYSIS
title List Ranking and List Scan on the CRAY C-90
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T02%3A56%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-dtic_1RU&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.btitle=List%20Ranking%20and%20List%20Scan%20on%20the%20CRAY%20C-90&rft.au=Reid-Miller,%20Margaret&rft.aucorp=CARNEGIE-MELLON%20UNIV%20PITTSBURGH%20PA%20DEPT%20OF%20COMPUTER%20SCIENCE&rft.date=1994-02-16&rft_id=info:doi/&rft_dat=%3Cdtic_1RU%3EADA276411%3C/dtic_1RU%3E%3Cgrp_id%3Ecdi_FETCH-dtic_stinet_ADA2764113%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true