Loading…
Concurrent algorithm for integrating three-dimensional B-spline functions into machines with shared memory such as GPU
The aim of this paper is to analyze the integration for 3D isogeometric finite element method solvers and its effective scheduling on hierarchical computer architecture. Data necessary for concurrency over elements is independent, so computation on this level is trivially concurrent. However, constr...
Saved in:
Published in: | Computer methods in applied mechanics and engineering 2022-08, Vol.398, p.115201, Article 115201 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The aim of this paper is to analyze the integration for 3D isogeometric finite element method solvers and its effective scheduling on hierarchical computer architecture. Data necessary for concurrency over elements is independent, so computation on this level is trivially concurrent. However, constructing several layers of concurrency for the integration algorithm is challenging. In this work, we propose a multilevel concurrent integration algorithm associated with scheduling that brings one extra degree of possible speedup. Because of one extra degree of possible speedup, we analyze the concurrent integration inside elements. The scheduling algorithm is intended for strongly related hierarchical architectures of a GPU. Using trace theory and Foata Normal Form, we verify integrity of the proposed solution. Summing up, we propose a general method for analyzing concurrency of the integration algorithm. We instantiate this method on a classical element-based integration algorithm, however, this methodology is possible to apply for other integration algorithms, including sum factorization, fast numerical quadrature, or row-wise integration methods.
•Multilayer parallel integrating coefficients of IsoGeometric Analysis IGA equations.•Trace theory model used for the formal verification.•Dikert graph based sub-optimal scheduling on cluster with GPU units.•Efficiency and scalability tests of proposed parallel IGA for L2 projection problem.•Application to sum factorization, fast numerical quadrature, row-wise integration. |
---|---|
ISSN: | 0045-7825 1879-2138 |
DOI: | 10.1016/j.cma.2022.115201 |