Enterprise Credit Prediction Model Based on SCC-MIC-Boruta Algorithm Feature Selection Algorithm

Kang Xu; Lingnan Xie; Xuechun Liang; Tengfei Cao; Yingjia Chen

doi:10.1109/ispds58840.2023.10235653

Year: 2023 Pages: 589-594

Abstract

This paper proposes a feature selection method that combines the Spearman Correlation Coefficient (SCC), the Maximal Information Coefficient (MIC), and the Boruta algorithm to address the low classification accuracy of traditional machine learning algorithms on enterprise credit data. The method is evaluated with three classifiers: Decision Tree, Extreme Gradient Boosting (XGBoost), and Gradient Boosting Decision Tree (GBDT). First, SCC is used to remove highly correlated features. MIC is then used to identify the features most strongly correlated with the labels. Next, Boruta is run with a Random Forest model to find the optimal feature subset. Finally, the optimal feature subset is applied to the three classification models. Experimental results show that the feature subset selected by this method improves the classification accuracy of the three models by 1.18%, 1.18%, and 3.53%, respectively.
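The first stage described above, removing highly correlated features with SCC, might be sketched as follows. This is a minimal illustration in plain Python, not the authors' implementation: the 0.9 threshold, the keep-first rule for deciding which feature of a correlated pair survives, and the column names are all assumptions made for the example. The MIC and Boruta stages are not shown.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the window over a run of tied values.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant column: treated as uncorrelated (an assumption)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman correlation is the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def drop_correlated(features, threshold=0.9):
    """Keep a feature only if |rho| with every already-kept feature is <= threshold.

    `features` maps a column name to a list of values. The threshold and the
    keep-first rule are illustrative assumptions, not taken from the paper.
    """
    kept = []
    for name, column in features.items():
        if all(abs(spearman(column, features[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


# Toy data with hypothetical column names.
data = {
    "revenue": [10, 20, 30, 40, 50],
    "revenue_squared": [100, 400, 900, 1600, 2500],  # monotone in revenue
    "debt_ratio": [0.5, 0.1, 0.4, 0.2, 0.3],
}
selected = drop_correlated(data, threshold=0.9)
print(selected)
```

Because Spearman correlation works on ranks, `revenue_squared` is flagged as redundant with `revenue` even though the relationship between them is not linear, which is one reason a rank-based coefficient is a reasonable choice for this filtering step.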

Keywords:

Boosting (machine learning); Feature selection; Gradient boosting; Decision tree; Random forest; Computer science; Feature (linguistics); Algorithm; Artificial intelligence; Statistical classification; Correlation coefficient; Pattern recognition (psychology); Data mining; Machine learning

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.17

Topics

Financial Distress and Bankruptcy Prediction

Social Sciences → Business, Management and Accounting → Accounting

Imbalanced Data Classification Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Artificial Intelligence in Healthcare

Health Sciences → Health Professions → Health Information Management

Related Documents

Boruta algorithm: An alternative feature selection method in credit scoring model

Boruta based feature selection model for heart disease prediction

A Path-Based Feature Selection Algorithm for Enterprise Credit Risk Evaluation

Feature selection on educational data using Boruta algorithm
