Ontology based Feature Selection and Weighting for Text classification using Machine Learning

Djelloul Bouchiha; Abdelghani Bouziane; Noureddine Doumi

doi:10.48185/jitc.v4i1.612

JOURNAL ARTICLE

Year: 2023; Journal: Journal of Information Technology and Computing; Vol: 4 (1); Pages: 1-14

Abstract

Text classification consists of assigning a text (document) to its corresponding class (category). It can be performed using machine learning, an artificial intelligence technique. Before training the machine learning model that classifies texts, three main steps are required: (1) preprocessing, which cleans the text; (2) feature selection, which chooses the features that best represent the text; and (3) feature weighting, which represents the text numerically as a feature vector. In this paper, we propose two algorithms for feature selection and feature weighting. Unlike most existing works, our algorithms are sense-based: they use an ontology to represent the sense of a text, rather than its syntax, as a feature vector. Experiments show that our approach gives encouraging results compared to existing works, and some additional suggested improvements could strengthen these results further.
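The three steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithms, which the abstract does not detail. It assumes a hypothetical toy ontology (`TOY_ONTOLOGY`, a plain dictionary from terms to concept labels), a hypothetical stop-word list, and simple relative concept frequency as the weight; all function names are illustrative.

```python
import re
from collections import Counter

# Hypothetical toy "ontology": maps surface terms to a concept (sense) label.
# A real system would query an actual ontology or lexical resource instead.
TOY_ONTOLOGY = {
    "car": "vehicle",
    "automobile": "vehicle",
    "truck": "vehicle",
    "doctor": "medical_professional",
    "physician": "medical_professional",
    "nurse": "medical_professional",
}

# Hypothetical, deliberately tiny stop-word list.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is"}

# Fixed ordering of concepts, so every text maps to a vector of the same length.
VOCABULARY = sorted(set(TOY_ONTOLOGY.values()))


def preprocess(text):
    """Step 1: lowercase, keep alphabetic tokens, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def select_features(tokens):
    """Step 2: keep only tokens known to the ontology, replaced by their concept."""
    return [TOY_ONTOLOGY[t] for t in tokens if t in TOY_ONTOLOGY]


def weight_features(concepts, vocabulary):
    """Step 3: weight each concept by its relative frequency in the text."""
    counts = Counter(concepts)
    total = sum(counts.values())
    return [counts[c] / total if total else 0.0 for c in vocabulary]


def vectorize(text, vocabulary=VOCABULARY):
    """Run the three steps and return a sense-based feature vector."""
    return weight_features(select_features(preprocess(text)), vocabulary)
```

In this sketch, `vectorize("doctor")` and `vectorize("physician")` produce the same vector because both terms map to the same concept, which is the intuition behind representing sense rather than syntax. The resulting vectors could then be passed to any standard classifier.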

Keywords:

Feature selection; Artificial intelligence; Computer science; Weighting; Feature (linguistics); Preprocessor; Syntax; Support vector machine; Machine learning; Selection (genetic algorithm); Feature vector; Class (philosophy); Pattern recognition (psychology); Ontology; Natural language processing

Metrics

FWCI (Field Weighted Citation Impact): 0.51

Citation Normalized Percentile: 0.65

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Web Data Mining and Analysis

Physical Sciences → Computer Science → Information Systems

Related Documents

Feature Selection based Arabic Text Classification using Different Machine Learning Algorithms

Feature Selection for Text Classification Using Machine Learning Approaches

Ontology-Based Feature Weighting for Biomedical Literature Classification