JOURNAL ARTICLE

Feature Engineering and Missing Data Imputation Method of Medical Data Analysis

Abstract

This work provides an alternative way to preprocessing procedure for consolidated data.Two methods are proposed.The first one is used for feature selection based on ensemble of machine learning algorithms.And the second one organizes missing data imputation based on combination of functional dependencies and associative rules.Ensemble methods for processing multimodal data based on a hierarchical classifier, a set of weak classifiers and a number of methods for selecting important characteristics with a much higher value of accuracy on unbalanced data sets compared to existing machine learning methods are developed.The methods are validated on medical dataset.The percentage of recovery data is on 1.2% comparing with associative rules.The proposed missing data imputation method creates additional data values operating a based domain and functional dependencies and includes these values to available training data.The correctness of the filled-in values is proved on the predictor built on the original dataset.The proposed PPD method conducts 12% better than RF and EM models for 30% missing data.

Keywords:
Imputation (statistics) Missing data Computer science Data mining Feature (linguistics) Data science Machine learning

Metrics

1
Cited By
0.38
FWCI (Field Weighted Citation Impact)
25
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Traditional Chinese Medicine Studies
Health Sciences →  Medicine →  Complementary and alternative medicine

Related Documents

JOURNAL ARTICLE

Hybrid Analytic Method for Missing Data Imputation in Medical Big Data

Karima BenhamzaNadjette BenhamidaMohamed Ilyes BOURAHDOUNBilel BOUDJAHEM

Journal:   International Journal of Informatics and Applied Mathematics Year: 2022 Vol: 5 (2)Pages: 1-11
BOOK-CHAPTER

Multi-feature Based Generative Missing Imputation Method for Multivariate Data

Wenli LiuBobin YaoZhi Dong

Lecture notes in electrical engineering Year: 2025 Pages: 367-375
BOOK-CHAPTER

Missing Data Imputation and Analysis

Mark Chang

Statistics in the health sciences Year: 2011 Pages: 117-143
JOURNAL ARTICLE

Feature Engineering for Healthcare Big Data: Approaches to Missing Data Imputation, Dimensionality Reduction, and Time-Series Analysis

Simran Sethi

Journal:   International Journal of Multidisciplinary Research and Growth Evaluation Year: 2020 Vol: 1 (1)Pages: 120-124
BOOK-CHAPTER

Missing Data Imputation

Chapman & Hall/CRC biostatistics series Year: 2011 Pages: 275-290
© 2026 ScienceGate Book Chapters — All rights reserved.