JOURNAL ARTICLE

Optimizing Diabetes Classification with Support Vector Machine and SMOTEENN-based Feature Selection

Abstract

The use of data-driven model in diabetes detection has gained much attention nowadays to improve the globe medical systems due to its cost-effective and less-invasive methods. The common studies implement statistical feature selection such as PCC or PCA with an assumption of linear relationships, which leads to impracticality in real-life diabetic data. In this paper, a proposed SMOTEENN-based univariate feature selection method is proposed in machine learning-based diabetes classification models. It combines the advantages of SMOTEENN oversampling and univariate feature selection to improve the classification rate with lower dimensional input. A more extensive dataset should be taken into consideration and compared to verify further this method's effectiveness in solving this task. The results acquired from this research implies that this proposed method is effective in achieving high classification accuracy, where the Logistic Regression, Random Forest and Support Vector Machine-based models constructed in this research are able to achieve accuracy of over 90% after feature selection; while reducing the computational cost and time required for the classification tasks at the same time.

Keywords:
Feature selection Univariate Random forest Computer science Support vector machine Artificial intelligence Oversampling Machine learning Feature (linguistics) Selection (genetic algorithm) Data mining Logistic regression Statistical classification Multivariate statistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
32
Refs
0.20
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning in Healthcare
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Feature Selection for Cancer Classification Based on Support Vector Machine

Yingxin Li

Journal:   Journal of Computer Research and Development Year: 2005 Vol: 42 (10)Pages: 1796-1796
JOURNAL ARTICLE

Feature Selection based Classification of Spams Using Fuzzy Support Vector Machine

Lovely BansalNirupama Tiwari

Journal:   2020 International Conference on Smart Electronics and Communication (ICOSEC) Year: 2020 Pages: 258-263
BOOK-CHAPTER

Optimization Approach for Feature Selection and Classification with Support Vector Machine

S. ChidambaramK. G. Srinivasagan

Advances in intelligent systems and computing Year: 2015 Pages: 103-111
© 2026 ScienceGate Book Chapters — All rights reserved.