Building a Sentiment Analysis System Using Automatically Generated Training Dataset

Daoud Daoud; Samer Aoudi; M. Samir Abou oudi

doi:10.1145/3328833.3328874

JOURNAL ARTICLE

Year: 2019 Pages: 120-125

Abstract

In this paper, we describe a procedure for extracting annotated negative and positive Arabic tweets. We use the extracted tweets to build our sentiment analysis system using Naive Bayes with TF-IDF enhancement. A large training set is necessary for a highly inflected language to compensate for the sparseness inherent in such languages. We present our techniques and explain our experimental system, for which we automatically collected 200 thousand annotated tweets. The evaluation shows that our sentiment analysis system achieves high precision and accuracy compared with existing systems.
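The abstract names the classifier only as "Naive Bayes with TF-IDF enhancement" and does not spell out the details. As a rough illustration of one common reading of that phrase, the sketch below trains a multinomial Naive Bayes model on TF-IDF-weighted token counts instead of raw counts. Everything in it is an assumption made for illustration: the class name `TfidfNaiveBayes`, the whitespace `tokenize` helper, the smoothed IDF formula, the Laplace smoothing, and the English placeholder texts standing in for automatically labeled Arabic tweets. It is not the authors' implementation.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Whitespace tokenization only (an assumption); a real Arabic pipeline
    # would likely add normalization or stemming.
    return text.split()


class TfidfNaiveBayes:
    """Multinomial Naive Bayes over TF-IDF-weighted token counts (illustrative)."""

    def fit(self, texts, labels):
        docs = [tokenize(t) for t in texts]
        n_docs = len(docs)
        # Document frequency: number of training documents containing each token.
        df = Counter(tok for doc in docs for tok in set(doc))
        # Smoothed IDF, always positive.
        self.idf = {tok: math.log((1 + n_docs) / (1 + c)) + 1 for tok, c in df.items()}
        self.vocab = set(df)
        self.class_counts = Counter(labels)
        # Per-class accumulated TF-IDF weight for each token.
        self.weights = defaultdict(lambda: defaultdict(float))
        self.totals = defaultdict(float)
        for doc, label in zip(docs, labels):
            for tok, tf in Counter(doc).items():
                w = tf * self.idf[tok]
                self.weights[label][tok] += w
                self.totals[label] += w
        return self

    def predict(self, text):
        # Tokens never seen in training are ignored.
        counts = Counter(t for t in tokenize(text) if t in self.vocab)
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            score = math.log(n / n_docs)  # log prior
            denom = self.totals[label] + len(self.vocab)  # Laplace smoothing
            for tok, tf in counts.items():
                likelihood = (self.weights[label].get(tok, 0.0) + 1.0) / denom
                score += tf * self.idf[tok] * math.log(likelihood)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Placeholder data standing in for automatically annotated tweets.
train_texts = [
    "great service love it",
    "love this phone great",
    "terrible service hate it",
    "hate this phone terrible",
]
train_labels = ["pos", "pos", "neg", "neg"]

model = TfidfNaiveBayes().fit(train_texts, train_labels)
print(model.predict("great phone love"))
print(model.predict("terrible hate"))
```

With a corpus on the scale the abstract mentions, the same structure applies, although a library implementation with sparse matrices would be the more practical choice.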

Keywords:

Computer science; Arabic; Sentiment analysis; Artificial intelligence; Naive Bayes classifier; Training set; Natural language processing; Machine learning; Support vector machine

Metrics

FWCI (Field Weighted Citation Impact): 0.15

Citation Normalized Percentile: 0.55

Topics

Sentiment Analysis and Opinion Mining

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Building a Sentiment Analysis System Using Automatically Generated Training Dataset

The Feasibility of Using Synthetic-Generated Dataset for Training Sentiment Analysis Model

Building Detection and Segmentation Using a CNN with Automatically Generated Training Data

Sentiment Classification of Russian Texts Using Automatically Generated Thesaurus

Prompt2Fashion: An automatically generated fashion dataset