High-Dimensional Bayesian Optimization with Multi-Task Learning for RocksDB

Sami Alabed; Eiko Yoneki

doi:10.1145/3437984.3458841

JOURNAL ARTICLE

Year: 2021 Pages: 111-119

Abstract

RocksDB is a general-purpose embedded key-value store used in multiple different settings. Its versatility comes at the cost of complex tuning configurations. This paper investigates maximizing the throughput of RocksDB IO operations by auto-tuning ten parameters of varying ranges. Off-the-shelf optimizers struggle with high-dimensional problem spaces and require a large number of training samples. We propose two techniques to tackle this problem: multi-task modeling and dimensionality reduction through a manual grouping of parameters. By incorporating adjacent optimization in the model, the model converged faster and found complicated settings that other tuners could not find. This approach had an additional computational complexity overhead, which we mitigated by manually assigning parameters to each sub-goal through our knowledge of RocksDB. The model is then incorporated in a standard Bayesian Optimization loop to find parameters that maximize RocksDB's IO throughput. Our method achieved a 1.3x improvement when benchmarked against a simulation of Facebook's social graph traffic, and converged in ten optimization steps compared to other state-of-the-art methods that required fifty steps.
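
The "standard Bayesian Optimization loop" mentioned in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' model: it uses a single-task Gaussian process rather than the multi-task, grouped model the paper proposes, and the RBF kernel, the upper-confidence-bound acquisition rule, the candidate sampling scheme and the `toy_throughput` objective (a hypothetical stand-in for a RocksDB benchmark run) are all assumptions made for illustration.

```python
import math
import random

def rbf(a, b, length_scale=0.3):
    """Squared-exponential kernel between two points (assumed kernel choice)."""
    sq = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-0.5 * sq / length_scale ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                # Clamp guards against tiny negative values from rounding.
                L[i][j] = math.sqrt(max(A[i][i] - s, 1e-12))
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L

def solve_lower(L, b):
    """Forward substitution: solve L z = b."""
    z = []
    for i in range(len(b)):
        z.append((b[i] - sum(L[i][k] * z[k] for k in range(i))) / L[i][i])
    return z

def solve_upper_t(L, z):
    """Back substitution: solve L^T x = z."""
    n = len(z)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (z[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def fit_gp(X, y, noise=1e-3):
    """Fit a zero-mean GP; return predict(x) -> (posterior mean, variance)."""
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(X)]
         for i, a in enumerate(X)]
    L = cholesky(K)
    alpha = solve_upper_t(L, solve_lower(L, y))

    def predict(x_new):
        k_star = [rbf(a, x_new) for a in X]
        mean = sum(k * a for k, a in zip(k_star, alpha))
        v = solve_lower(L, k_star)
        var = max(rbf(x_new, x_new) - sum(t * t for t in v), 0.0)
        return mean, var

    return predict

def bayes_opt(objective, dim, n_init=3, n_steps=10, n_candidates=200,
              beta=2.0, seed=0):
    """Maximize objective over the unit cube [0, 1]^dim."""
    rng = random.Random(seed)
    X = [[rng.random() for _ in range(dim)] for _ in range(n_init)]
    y = [objective(x) for x in X]
    for _ in range(n_steps):
        y_mean = sum(y) / len(y)
        predict = fit_gp(X, [v - y_mean for v in y])  # center the targets

        def ucb(c):
            m, var = predict(c)
            return m + beta * math.sqrt(var)

        # Score random candidates and evaluate the most promising one.
        candidates = [[rng.random() for _ in range(dim)]
                      for _ in range(n_candidates)]
        x_next = max(candidates, key=ucb)
        X.append(x_next)
        y.append(objective(x_next))
    best = max(range(len(y)), key=y.__getitem__)
    return X[best], y[best]

def toy_throughput(x):
    """Hypothetical objective with its maximum (0.0) at (0.7, 0.2)."""
    return -((x[0] - 0.7) ** 2 + (x[1] - 0.2) ** 2)

if __name__ == "__main__":
    best_x, best_y = bayes_opt(toy_throughput, dim=2)
    print("best configuration:", best_x, "objective:", best_y)
```

In a real tuning setup each call to the objective would be an expensive benchmark run, which is why the number of optimization steps matters. The sketch refits the surrogate from scratch at every step and scores only random candidates, both chosen for brevity rather than efficiency.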

Metrics

FWCI (Field Weighted Citation Impact): 3.49

Citation Normalized Percentile: 0.93

Topics

Cloud Computing and Resource Management

Physical Sciences → Computer Science → Information Systems

Parallel Computing and Optimization Techniques

Physical Sciences → Computer Science → Hardware and Architecture

Distributed and Parallel Computing Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Related Documents

Multi-Task Learning with High-Dimensional Noisy Images

On Parallelizing Multi-Task Bayesian Optimization