ASTD: Arabic sentiment tweets dataset

Mahmoud Nabil, Mohamed Aly, Amir F. Atiya

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

199 Scopus citations


This paper introduces ASTD, an Arabic social sentiment analysis dataset gathered from Twitter. It consists of about 10,000 tweets, each classified as objective, subjective positive, subjective negative, or subjective mixed. We present the properties and statistics of the dataset and run experiments using a standard partitioning of the dataset. Our experiments provide benchmark results for four-way sentiment classification on the dataset.

Original language: English (US)
Title of host publication: Conference Proceedings - EMNLP 2015
Subtitle of host publication: Conference on Empirical Methods in Natural Language Processing
Publisher: Association for Computational Linguistics (ACL)
Number of pages: 5
ISBN (Electronic): 9781941643327
State: Published - 2015
Externally published: Yes
Event: Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Lisbon, Portugal
Duration: Sep 17 2015 to Sep 21 2015


Other: Conference on Empirical Methods in Natural Language Processing, EMNLP 2015

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems

