Loading…

Challenges in Sentiment Analysis for Arabic Social Networks

There has been an enormous increase in the volume of Arabic posts on many social networks, providing a rich source of opinions on a variety of topics. Opinion mining has attracted much research interest in the last decade, but has mostly been conducted using the English language. Opinion mining from...

Full description

Saved in:
Bibliographic Details
Published in:Procedia computer science 2017, Vol.117, p.89-100
Main Authors: Alwakid, Ghadah, Osman, Taha, Hughes-Roberts, Thomas
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:There has been an enormous increase in the volume of Arabic posts on many social networks, providing a rich source of opinions on a variety of topics. Opinion mining has attracted much research interest in the last decade, but has mostly been conducted using the English language. Opinion mining from Arabic social media is lagging behind, primarily due to challenges with processing the morphologically complex Arabic natural language and the lack of available tools and resources for extracting Arabic sentiments from text. This complex task is further exacerbated when dealing with Saudi dialects that do not abide by the formal grammatical structure of Modern Standard Arabic (MSA). Targeting the use-case of unemployment in Saudi Arabia, this paper investigates the major challenges facing sentiment analysis researchers of the informal Arabic language and documents how these challenges can be initially addressed by means of linguistic pre-processing of raw text and supervised machine learning (ML) to identify sentiment polarity. The initial experimental evaluation yielded satisfactory sentiment classification results while also demonstrating the benefit of using ML to identify the dominant features in deciding sentiment polarity, which can in turn be used in constructing a problem-domain knowledge base that can further improve the accuracy of sentiment classification.
ISSN:1877-0509
1877-0509
DOI:10.1016/j.procs.2017.10.097