Accelerating SLIDE: Exploiting Sparsity on Accelerator Architectures

Sho Ko; Alexander Rucker; Yaqi Zhang; Paul Mure; Kunle Olukotun

doi:10.1109/ipdpsw55747.2022.00116

ScienceGate Book Chapters

JOURNAL ARTICLE

Accelerating SLIDE: Exploiting Sparsity on Accelerator Architectures

Sho Ko Alexander Rucker Yaqi Zhang Paul Mure Kunle Olukotun

Year: 2022 Journal: 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Pages: 663-670

DOI: 10.1109/ipdpsw55747.2022.00116

Get Full-Text PDF Get Analytical Report

Abstract

A significant trend in machine learning is sparsifying the training of neural networks to reduce the amount of computation required. Algorithms like Sub-LInear Deep learning Engine (SLIDE) [2] use locality-sensitive hashing (LSH) to create sparsity. These sparse training algorithms were originally developed on multi-threaded multicore CPUs. However, they are not well-studied and optimized for accelerator platforms such as GPUs and reconfigurable dataflow architectures (RDAs). In this paper, we study the different variants of the SLIDE algorithm and investigate accuracy-performance tradeoffs on CPU, GPU, and RDAs. The implementation targeting RDA outperforms the GPU by 7.5×. The performance on a limited-memory RDA is improved further by proposing a smart caching algorithm, which is 2 × faster than the baseline RDA. Furthermore, we are able to achieve another 2 × performance by putting all of the weights on-chip using an RDA with enough memory. We believe our work will pave the road for the future development of both algorithm and hardware architecture for sparse training.

Keywords:

Computer science Dataflow Parallel computing Computer architecture Multi-core processor Deep learning CUDA Locality Computer engineering Computation Computer hardware Artificial intelligence Embedded system Algorithm

Metrics

Cited By

0.07

FWCI (Field Weighted Citation Impact)

Refs

0.26

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Caching and Content Delivery

Physical Sciences → Computer Science → Computer Networks and Communications

Accelerating SLIDE: Exploiting Sparsity on Accelerator Architectures

Abstract

Metrics

Citation History

Topics

Related Documents

Exploiting Sparsity in Automatic Differentiation on Multicore Architectures

HotTiles: Accelerating SpMM with Heterogeneous Accelerator Architectures

Accelerating Convolutional Neural Network by Exploiting Sparsity on GPUs

CAESAR: A CNN Accelerator Exploiting Sparsity and Redundancy Pattern

Exploiting sparsity and equation-free architectures in complex systems