dc.contributor.author |
Klyuyeva, Ye.G |
|
dc.contributor.author |
Yavorskij, V.V. |
|
dc.contributor.author |
Adamov, A.A. |
|
dc.contributor.author |
Utepbergenov, I.T. |
|
dc.date.accessioned |
2024-07-08T07:39:04Z |
|
dc.date.available |
2024-07-08T07:39:04Z |
|
dc.date.issued |
2020 |
|
dc.identifier.citation |
Ye.G Klyuyeva, V.V. Yavorskij, A.A. Adamov & I.T. Utepbergenov | (2020) Determination of the optimal shape of matrix elements partitioning on three abstract heterogeneous processors, Cogent Engineering, 7:1, 1769948, DOI: 10.1080/23311916.2020.1769948 |
ru |
dc.identifier.issn |
2331-1916 |
|
dc.identifier.other |
DOI 10.1080/23311916.2020.1769948 |
|
dc.identifier.uri |
http://rep.enu.kz/handle/enu/16054 |
|
dc.description.abstract |
The paper presents the results of a study done to find the optimal shapes of matrix element partitioning on three abstract heterogeneous processors when performing multiplication operations. An abstract processor model allows applying the research results in systems with different heterogeneous architectures. To determine the optimal partitioning shape, the work uses non-rectangular candidate shapes identified by Ashley DeFlumere in her work as a result of applying the technology of redistribution of matrix elements between the processors «push»: Square Corner, Rectangle Corner, Square Rectangle, Block Rectangle, L-Rectangle, Traditional 1D Rectangular. The optimality of shapes is determined for four classes of matrix multiplication algorithms: Serial Communication with Barrier (SCB), Parallel Communication with Barrier (PCB), Serial Communication with Bulk Overlap (SCO) and Parallel Communication with Overlap (PCO). The Hockney model was used to evaluate the communication complexity of algorithms. Mathematical models of the algorithm execution time were introduced in the paper for each considered candidate shape in all algorithms. Based on the developed mathematical models, software was developed that allows to select the form of elements partitioning between processors, depending on the ratio of their speeds and latency of the transmission medium. |
ru |
dc.description.sponsorship |
This work was supported by the Ministry of Education and Science of the Republic of Kazakhstan [grant number АР05133699]. |
ru |
dc.language.iso |
en |
ru |
dc.publisher |
Cogent Engineering |
ru |
dc.relation.ispartofseries |
Volume 7, Issue 1;Article number 1769948 |
|
dc.subject |
data partitioning |
ru |
dc.subject |
heterogeneous parallel systems |
ru |
dc.subject |
hockney model |
ru |
dc.subject |
parallel communication with barrier |
ru |
dc.subject |
parallel communication with overlap |
ru |
dc.subject |
parallel computing |
ru |
dc.subject |
parallel matrix multiplication |
ru |
dc.subject |
serial communication with barrier |
ru |
dc.subject |
serial communication with overlap |
ru |
dc.title |
Determination of the optimal shape of matrix elements partitioning on three abstract heterogeneous processors |
ru |
dc.type |
Article |
ru |