Nowshin IslamM. W. RahmanJithin JoseRaghunath RajachandrasekarHuamin WangHari SubramoniChethana R MurthyDhabaleswar K. Panda
Hadoop Distributed File System (HDFS) acts as the primary storage of Hadoop and has been adopted by reputed organizations (Facebook, Yahoo! etc.) due to its portability and fault-tolerance. The existing implementation of HDFS uses Java-socket interface for communication which delivers suboptimal performance in terms of latency and throughput. For data-intensive applications, network performance becomes key component as the amount of data being stored and replicated to HDFS increases. In this paper, we present a novel design of HDFS using Remote Direct Memory Access (RDMA) over InfiniBand via JNI interfaces. Experimental results show that, for 5GB HDFS file writes, the new design reduces the communication time by 87% and 30% over 1Gigabit Ethernet (1GigE) and IP-over-InfiniBand (IPoIB), respectively, on QDR platform (32Gbps). For HBase, the Put operation performance is improved by 26% with our design. To the best of our knowledge, this is the first design of HDFS over InfiniBand networks.
Nowshin IslamM. W. RahmanJithin JoseRaghunath RajachandrasekarH. WangHari SubramoniChethana R MurthyDhabaleswar K. Panda
Dong BuyunPei FangFu XiaoBin LuoZhihong Zhao
Md. Wasi-ur-RahmanNusrat Sharmin IslamXiaoyi LuJithin JoseHari SubramoniHao WangDhabaleswar K. Panda
Jiuxing LiuJiesheng WuSushmitha P. KiniPete WyckoffDhabaleswar K. Panda
Jiuxing LiuJiesheng WuDhabaleswar K. Panda