Multi-Tenant Deep Learning Acceleration with Competitive GPU Resource Sharing

Yongbo Yu; Xiang Chen

doi:10.1109/cloudsummit57601.2023.00014

JOURNAL ARTICLE

Year: 2023 Pages: 49-51

Abstract

As Deep Learning (DL) continues to drive a variety of applications in edge and cloud data centers, co-locating multiple DL models on the same GPU has become a widely deployed way to improve resource utilization and achieve acceleration. For example, a self-driving system hosts multiple tasks simultaneously (e.g., detection, classification, and segmentation) and expects them to run concurrently on a single device. However, our analysis demonstrates that deploying compound DNN models for multiple tenants on one GPU raises two issues. First, because the models differ in structure and their data distributions are skewed, they generate highly imbalanced computing workloads. Second, current GPU scheduling methods lack effective resource allocation. To address these issues, we propose a novel resource allocation method, competitive resource sharing, which benefits parallel model execution; the proposed concept of "virtual resource" can effectively characterize and guide practical per-task resource utilization and allocation. Our experiments demonstrate that DNN computing throughput can be increased significantly, by $2.16\times$ to $2.80\times$, in various multi-tenant scenarios.
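To make the workload-imbalance point concrete, the following is a toy sketch and not the paper's method. It assumes a GPU that can be divided into abstract compute units and tasks whose run time scales perfectly linearly with the units they receive. The function names, the workload numbers, and the proportional-share policy are all hypothetical; the abstract does not specify how competitive resource sharing or "virtual resource" is computed, and the toy speedup here is unrelated to the reported results.

```python
def makespan_equal_share(workloads, total_units):
    """Finish time when every tenant gets the same static slice of the GPU.

    Assumption: run time = work / units (perfect linear scaling).
    """
    share = total_units / len(workloads)
    return max(work / share for work in workloads)


def makespan_proportional_share(workloads, total_units):
    """Finish time when each tenant's slice is proportional to its workload.

    This is a stand-in policy for illustration only, not the paper's allocator.
    """
    total_work = sum(workloads)
    shares = [total_units * work / total_work for work in workloads]
    return max(work / share for work, share in zip(workloads, shares))


# Hypothetical imbalanced tenants: one heavy model and two light ones.
workloads = [8.0, 1.0, 1.0]
total_units = 10.0

equal = makespan_equal_share(workloads, total_units)
proportional = makespan_proportional_share(workloads, total_units)
speedup = equal / proportional

print(f"equal share makespan:        {equal:.2f}")
print(f"proportional share makespan: {proportional:.2f}")
print(f"toy speedup:                 {speedup:.2f}x")
```

Under these assumptions the heavy tenant dominates the finish time when slices are equal, while the light tenants leave most of their slices idle. Matching slices to workload removes that idle time, which is the kind of inefficiency the abstract attributes to imbalanced multi-tenant workloads.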

Keywords:

Computer science; Multitenancy; Cloud computing; Distributed computing; Scheduling (production processes); Resource allocation; Shared resource; Deep learning; Resource (disambiguation); Artificial intelligence; Machine learning; Software as a service; Computer network; Operating system

Metrics

FWCI (Field Weighted Citation Impact): 0.88

Citation Normalized Percentile: 0.62

Topics

IoT and Edge/Fog Computing

Physical Sciences → Computer Science → Computer Networks and Communications

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Cloud Computing and Resource Management

Physical Sciences → Computer Science → Information Systems

Related Documents

Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing

Powering Multi-Task Federated Learning with Competitive GPU Resource Sharing

FedMT: Multi-Task Federated Learning with Competitive GPU Resource Sharing

Optimal resource sharing in multi-tenant 5G networks