Big Data processing refers to processing, analyzing, and extracting information from very large volumes of data, for which traditional processing techniques are no longer viable. Such data is therefore handled by distributed computing frameworks such as Apache Hadoop, whose MapReduce model is well suited to processing large datasets on cloud platforms. In this work, the deployment of Hadoop is automated on an OpenStack-based private cloud. Auto-scaling refers to dynamically adding nodes to, or removing nodes from, an existing cluster so that resources are used effectively. Quality of Service (QoS) must be maintained while resources are scaled on demand, and the Hadoop cluster must be scaled up when the load increases in order to adhere to Service Level Agreements (SLAs). This paper presents an auto-scaling method for Hadoop clusters that employs predictive algorithms: the proposed model collects CPU-utilization metrics and scales the cluster up or down based on time-series analysis of those metrics. Experimental results on an OpenStack-based private-cloud testbed demonstrate the effectiveness of the proposed mechanism.
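The scaling logic described above can be sketched as follows. This is a minimal illustrative example, not the paper's exact algorithm: the forecast method (single exponential smoothing), the smoothing factor, and the utilization thresholds are all assumptions chosen for clarity.

```python
# Hedged sketch of prediction-driven auto-scaling: forecast the next
# CPU-utilization value from recent samples, then compare it against
# high/low thresholds to decide whether to add or remove a node.
# Thresholds and the smoothing factor are illustrative assumptions.

def forecast_cpu(samples, alpha=0.5):
    """Single exponential smoothing over recent CPU-utilization samples (%)."""
    smoothed = samples[0]
    for x in samples[1:]:
        smoothed = alpha * x + (1 - alpha) * smoothed
    return smoothed

def scaling_decision(samples, high=80.0, low=20.0):
    """Return 'scale_up', 'scale_down', or 'steady' from the forecast."""
    predicted = forecast_cpu(samples)
    if predicted > high:
        return "scale_up"      # predicted load too high: add a node
    if predicted < low:
        return "scale_down"    # predicted load low: remove a node
    return "steady"            # within the QoS band: no change
```

For example, a rising utilization trace such as `[50, 70, 85, 90]` yields a forecast above the high threshold and triggers a scale-up, while a falling trace such as `[30, 25, 15, 10]` triggers a scale-down.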