Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11779/1597
Title: Array Bp-Xor Codes for Hierarchically Distributed Matrix Multiplication
Authors: Arslan, Şuayb Şefik
Keywords: Decoding
Codes
Complexity theory
Arrays
Encoding
Task analysis
Iterative decoding
Publisher: IEEE
Source: Arslan, S. S. (02 December 2021). Array BP-XOR Codes for Hierarchically Distributed Matrix Multiplication. IEEE Transactions on Information Theory, pp. 1–17. https://doi.org/10.1109/tit.2021.3132043 ‌ ‌
Abstract: A novel fault-tolerant computation technique based on array Belief Propagation (BP)-decodable XOR (BP-XOR) codes is proposed for distributed matrix-matrix multiplication. The proposed scheme is shown to be configurable and suited for modern hierarchical compute architectures such as Graphical Processing Units (GPUs) equipped with multiple nodes, whereby each has many small independent processing units with increased core-to-core communications. The proposed scheme is shown to outperform a few of the well–known earlier strategies in terms of total end-to-end execution time while in presence of slow nodes, called stragglers. This performance advantage is due to the careful design of array codes which distributes the encoding operation over the cluster (slave) nodes at the expense of increased master-slave communication. An interesting trade-off between end-to-end latency and total communication cost is precisely described. In addition, to be able to address an identified problem of scaling stragglers, an asymptotic version of array BP-XOR codes based on projection geometry is proposed at the expense of some computation overhead. A thorough latency analysis is conducted for all schemes to demonstrate that the proposed scheme achieves order-optimal computation in both the sublinear as well as the linear regimes in the size of the computed product from an end-to-end delay perspective.
URI: https://doi.org/10.1109/tit.2021.3132043
https://hdl.handle.net/20.500.11779/1597
Appears in Collections:Bilgisayar Mühendisliği Bölümü Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:
File Description SizeFormat 
Array_BP-XOR_Codes_for_Hierarchically_Distributed_Matrix_Multiplication.pdf
  Until 2040-01-01
Full Text - Article765.27 kBAdobe PDFView/Open    Request a copy
Show full item record



CORE Recommender

SCOPUSTM   
Citations

1
checked on Nov 16, 2024

WEB OF SCIENCETM
Citations

1
checked on Nov 16, 2024

Page view(s)

42
checked on Nov 18, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.