Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Djelloul Bouchiha; Abdelghani Bouziane; Noureddine Doumi

doi:10.60692/gdj5c-s5616

ScienceGate Book Chapters

JOURNAL ARTICLE

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Djelloul Bouchiha Abdelghani Bouziane Noureddine Doumi

Year: 2023 Journal: Greater South Information System

DOI: 10.60692/gdj5c-s5616

Get Full-Text PDF Get Analytical Report

Abstract

Text classification consists in attributing text (document) to its corresponding class (category). It can be performed using an artificial intelligence technique called machine learning. However, before training the machine learning model that classifies texts, three main steps are also mandatory: (1) Preprocessing, which cleans the text; (2) Feature selection, which chooses the features that significantly represent the text; and (3) Feature weighting, which aims at numerically representing text through feature vector. In this paper, we propose two algorithms for feature selection and feature weighting. Unlike most existing works, our algorithms are sense-based since they use ontology to represent, not the syntax, but the sense of a text as a feature vector. Experiments show that our approach gives encouraging results compared to existing works. However, some additional suggested improvements can make these results more impressive.

Keywords:

Feature selection Feature (linguistics) Weighting Class (philosophy) Ontology Pattern recognition (psychology) Selection (genetic algorithm) Feature extraction

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.40

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Biomedical Text Mining and Ontologies

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Abstract

Metrics

Topics

Related Documents

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Feature Selection for Text Classification Using Machine Learning Approaches

Ontology-Based Feature Weighting for Biomedical Literature Classification