JOURNAL ARTICLE

Efficient and scalable all-to-all personalized exchange for InfiniBand-based clusters

Abstract

The all-to-all personalized exchange is the most dense collective communication function offered by the MPI specification. The operation involves every process sending a different message to all other participating processes. This collective operation is essential for many parallel scientific applications. With increasing system and message sizes, it becomes challenging to offer a fast, scalable and efficient implementation of this operation. InfiniBand is an emerging modern interconnect. It offers very low latency, high bandwidth and one-sided operations like RDMA write. Its advanced features like RDMA write gather allow us to design and implement all-to-all algorithms much more efficiently than in the past. Our aim in This work is to design efficient and scalable implementations of traditional personalized exchange algorithms. We present two novel approaches towards designing all-to-all algorithms for short and long messages respectively. The hypercube RDMA write gather and direct eager schemes effectively leverage the RDMA and RDMA with write gather mechanisms offered by InfiniBand. Performance evaluation of our design and implementation reveals that it is able to reduce the all-to-all communication time by upto a factor of 3.07 for 32 byte messages on a 16 node InfiniBand cluster. Our analytical models suggest that the proposed designs perform 64% better on InfiniBand clusters with 1024 nodes for 4k message size.

Keywords:
InfiniBand Remote direct memory access Computer science Scalability Implementation Message passing Low latency (capital markets) Distributed computing Operating system Parallel computing Computer network Programming language

Metrics

18
Cited By
2.45
FWCI (Field Weighted Citation Impact)
16
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Interconnection Networks and Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Distributed and Parallel Computing Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Efficient and scalable all-to-all personalized exchange for InfiniBand-based clusters

Sayantan SurHyun‐Wook JinD.K. Panda

Journal:   Proceedings of the International Conference on Parallel Processing Year: 2004 Pages: 275-282
JOURNAL ARTICLE

All-to-all personalized exchange in generalized shuffle-exchange networks

Well Y. ChouChiuyuan Chen

Journal:   Theoretical Computer Science Year: 2009 Vol: 411 (16-18)Pages: 1669-1684
© 2026 ScienceGate Book Chapters — All rights reserved.