JOURNAL ARTICLE

Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Djelloul BouchihaAbdelghani BouzianeNoureddine Doumi

Year: 2023 Journal:   Journal of Information Technology and Computing Vol: 4 (1)Pages: 1-14

Abstract

Text classification consists in attributing text (document) to its corresponding class (category). It can be performed using an artificial intelligence technique called machine learning. However, before training the machine learning model that classifies texts, three main steps are also mandatory: (1) Preprocessing, which cleans the text; (2) Feature selection, which chooses the features that significantly represent the text; and (3) Feature weighting, which aims at numerically representing text through feature vector. In this paper, we propose two algorithms for feature selection and feature weighting. Unlike most existing works, our algorithms are sense-based since they use ontology to represent, not the syntax, but the sense of a text as a feature vector. Experiments show that our approach gives encouraging results compared to existing works. However, some additional suggested improvements can make these results more impressive.

Keywords:
Feature selection Artificial intelligence Computer science Weighting Feature (linguistics) Preprocessor Syntax Support vector machine Machine learning Selection (genetic algorithm) Feature vector Class (philosophy) Pattern recognition (psychology) Ontology Natural language processing

Metrics

2
Cited By
0.51
FWCI (Field Weighted Citation Impact)
59
Refs
0.65
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.