Scalable multi-label Arabic text classification

Nizar Ahmed; Mohammed A. Shehab; Mahmoud Al‐Ayyoub; Ismail Hmeidi

doi:10.1109/iacs.2015.7103229

ScienceGate Book Chapters

JOURNAL ARTICLE

Scalable multi-label Arabic text classification

Nizar Ahmed Mohammed A. Shehab Mahmoud Al‐Ayyoub Ismail Hmeidi

Year: 2015 Pages: 212-217

DOI: 10.1109/iacs.2015.7103229

Get Full-Text PDF Get Analytical Report

Abstract

Multi-label text classification (MTC) is a natural extension of the traditional text classification (TC) in which a possibly large set of labels can be assigned to each document. The dimensionality of labels makes MTC difficult and challenging. Several ways are proposed to ease the classification process and one of them is called the problem transformation (PT) method. It is used to transform the multi-labeled data into a single-label one that is suitable for normal classification. Our paper presents a detailed study about using the supervised approach to address the MTC problem for Arabic text. Moreover, the scalability of such an approach is considered in our experiments. The MEKA system is used to convert the multi-label data into a single-label one using different PT methods: LC, BR and RT. Then, different classifiers commonly used for TC such as SVM, NB, KNN, and Decision Tree, are applied to each dataset. The results show that using SVM on the LC dataset generated the best results with 71% ML-accuracy.

Keywords:

Computer science Artificial intelligence Support vector machine Multi-label classification Decision tree Scalability Pattern recognition (psychology) Extension (predicate logic) Arabic Set (abstract data type) Data mining Transformation (genetics) Machine learning Natural language processing Database

Metrics

Cited By

4.40

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Scalable multi-label Arabic text classification

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-Label Arabic Text Classification: An Overview

Arabic Multi-label Text Classification of News Articles

Multi-Label Arabic Text Classification Based On Deep Learning

Multi-label Arabic text classification in Online Social Networks

A Proposed Arabic Text Classification Model using Multi-Label System