JOURNAL ARTICLE

Exploring Innovative Approaches to Synthetic Tabular Data Generation

Eugenia PapadakiAristidis G. VrahatisSotiris Kotsiantis

Year: 2024 Journal:   Electronics Vol: 13 (10)Pages: 1965-1965   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The rapid advancement of data generation techniques has spurred innovation across multiple domains. This comprehensive review delves into the realm of data generation methodologies, with a keen focus on statistical and machine learning-based approaches. Notably, novel strategies like the divide-and-conquer (DC) approach and cutting-edge models such as GANBLR have emerged to tackle a spectrum of challenges, spanning from preserving intricate data relationships to enhancing interpretability. Furthermore, the integration of generative adversarial networks (GANs) has sparked a revolution in data generation across sectors like healthcare, cybersecurity, and retail. This review meticulously examines how these techniques mitigate issues such as class imbalance, data scarcity, and privacy concerns. Through a meticulous analysis of evaluation metrics and diverse applications, it underscores the efficacy and potential of synthetic data in refining predictive models and decision-making software. Concluding with insights into prospective research trajectories and the evolving role of synthetic data in propelling machine learning and data-driven solutions across disciplines, this work provides a holistic understanding of the transformative power of contemporary data generation methodologies.

Keywords:
Interpretability Data science Transformative learning Computer science Big data Data-driven Generative grammar Artificial intelligence Machine learning Management science Data mining Engineering

Metrics

8
Cited By
5.11
FWCI (Field Weighted Citation Impact)
50
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning in Healthcare
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Federated Generation of Synthetic Tabular Data

Martinez Duarte, Daniela

Journal:   reposiTUm (TU Wien) Year: 2024
DISSERTATION

Generation and Evaluation of Realistic Tabular Synthetic Data

Lautrup, Anton Danholt

University:   University of Southern Denmark Research Portal (University of Southern Denmark) Year: 2025
© 2026 ScienceGate Book Chapters — All rights reserved.