JOURNAL ARTICLE

Dynamic data replication and placement strategy in geographically distributed data centers

Laila BouhouchClaude TadonkiMostapha Zbakh

Year: 2022 Journal:   Concurrency and Computation Practice and Experience Vol: 35 (14)   Publisher: Wiley

Abstract

Abstract With the evolution of geographically distributed data centers in the Cloud Computing landscape along with the amount of data being processed in these data centers, which is growing at an exponential rate, processing massive data applications become an important topic. Since a given task may require many datasets for its execution and the datasets are spread over several different data centers, finding an efficient way to manage the datasets storage across nodes of a Cloud system is a difficult problem. In fact, the execution time of a task might be influenced by the cost of data transfers, which mainly depends on two criterias. The first one is the initial placement of the input datasets during the build‐time phase, while the second is the replication of the datasets during the runtime phase. The replication is explicitly considered when datasets are being migrated over the data centers in order to make them locally available wherever needed. Data placement and data replication are important challenges in Cloud Computing. Nevertheless, many studies focus on data placement or data replication exclusively. In this paper, a combination of a data placement strategy followed by a dynamic data replication management strategy is proposed, with the purpose of reducing the associated cost of all data transfers between the (distant) data centers. Our proposed data placement approach considers the main characteristics of a data center such as storage capacity and read/write speeds to efficiently store the datasets, while our dynamic data replication management approach considers three parameters: the number of replicas in the system, the dependency between datasets and tasks and the storage capacity of data centers. The decision of when and whether to keep or to delete replicas is determined by the fulfillment of those three parameters. Our approach estimates the total execution time of the tasks as well as the monetary cost, considering the data transfers activity. Our experiments are conducted using Cloudsim simulator. The obtained results show that our proposed strategies produce an efficient data management by reducing the overheads of the data transfers, compared to both a data placement without replication (by 76%) and the selected data replication approach from Kouidri et al. (by 52%), and by improving the financial cost.

Keywords:
Computer science Replication (statistics) Cloud computing Distributed computing Data center Task (project management) Dynamic data Data management Distributed database Data access Database Computer network Operating system

Metrics

9
Cited By
3.42
FWCI (Field Weighted Citation Impact)
34
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management
Physical Sciences →  Computer Science →  Information Systems
Distributed and Parallel Computing Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
Advanced Data Storage Technologies
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Optimal Dynamic Placement of Virtual Machines in Geographically Distributed Cloud Data Centers

Hana TeyebNejib Ben Hadj-AlouaneSamir TataAli Balma

Journal:   International Journal of Cooperative Information Systems Year: 2017 Vol: 26 (03)Pages: 1750001-1750001
JOURNAL ARTICLE

Energy-aware coordinated operation strategy of geographically distributed data centers

S. James ZhouMing ZhouZhaoyuan WuYuyang WangGengyin Li

Journal:   International Journal of Electrical Power & Energy Systems Year: 2024 Vol: 159 Pages: 110032-110032
JOURNAL ARTICLE

The cloud of geographically distributed data centers

Petr FedchenkovAndrey ShevelSergey KhoruzhnikovOleg SadovOleg LazoNikitta Samokhin

Journal:   EPJ Web of Conferences Year: 2019 Vol: 214 Pages: 07007-07007
© 2026 ScienceGate Book Chapters — All rights reserved.