Loading…
Efficient local memory sequence generation for data parallel programs using permutations
Generating local memory access sequence is a critical issue in distributed-memory implementations of data-parallel languages. In this paper, for arrays distributed block-cyclically on multiple processors, we introduce a novel approach to the local memory access sequence generation using the theory o...
Saved in:
Published in: | Journal of systems architecture 2001-06, Vol.47 (6), p.505-515 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Generating local memory access sequence is a critical issue in distributed-memory implementations of data-parallel languages. In this paper, for arrays distributed block-cyclically on multiple processors, we introduce a novel approach to the local memory access sequence generation using the theory of permutation. By compressing the active elements in a block into an integer, called
compress number, and exploiting the fact that there is a repeating pattern in the access sequence, we obtain the
global block cycle. Then, we show that the
local block cycle can be efficiently enumerated as closed forms using the permutation of global block cycle. After decompressing the compress number in the local block cycle, the local block patterns are restored and the local memory access sequence can be quickly generated. Unlike other works, our approach incurs no run-time overhead. |
---|---|
ISSN: | 1383-7621 1873-6165 |
DOI: | 10.1016/S1383-7621(01)00012-1 |