JOURNAL ARTICLE

Building a Sentiment Analysis system using automatically generated training Dataset

Abstract

In this paper, we describe a procedure for extracting annotated Arabic negative and positive tweets. We use these extracted annotated tweets to build our sentiment system using Naive Bayes with TF-IDF enhancement. The large size of training data for a highly inflected language is necessary to compensate for the sparseness nature of such languages. We present our techniques and explain our experimental system. We automatically collect 200 thousand annotated tweets. The evaluation shows that our sentiment analysis system has high precision and accuracy measures compared to existing ones.

Keywords:
Computer science Arabic Sentiment analysis Artificial intelligence Naive Bayes classifier Training set Natural language processing Machine learning Support vector machine

Metrics

1
Cited By
0.15
FWCI (Field Weighted Citation Impact)
14
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.