JOURNAL ARTICLE

Arabic Language Text Classification Using Dependency Syntax-Based Feature Selection

Yannis HaralambousYassir ElidrissiPhilippe Lenca

Year: 2014 Journal:   OPAL (Open@LaTrobe) (La Trobe University)   Publisher: La Trobe University

Abstract

We study the performance of Arabic text classification combining various techniques: (a) tfidf vs. dependency syntax, for feature selection and weighting; (b) class association rules vs. support vector machines, for classification. The Arabic text is used in two forms: rootified and lightly stemmed. The results we obtain show that lightly stemmed text leads to better performance than rootified text; that class association rules are better suited for small feature sets obtained by dependency syntax constraints; and, finally, that support vector machines are better suited for large feature sets based on morphological feature selection criteria.

Keywords:
Computer science Dependency (UML) Natural language processing Artificial intelligence Syntax Arabic Feature selection Selection (genetic algorithm) Feature (linguistics) Dependency grammar Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
30
Refs
0.28
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.