Paper Title
ASAD: A Twitter-based Benchmark Arabic Sentiment Analysis Dataset
Paper Authors
Paper Abstract
This paper provides a detailed description of ASAD, a new Twitter-based benchmark dataset for Arabic sentiment analysis. The dataset was launched in a competition sponsored by KAUST, which awards 10,000 USD, 5,000 USD, and 2,000 USD to the first-, second-, and third-place winners, respectively. Compared to other publicly released Arabic datasets, ASAD is a large, high-quality annotated dataset (95K tweets) with three-class sentiment labels (positive, negative, and neutral). We present the details of the data collection and annotation processes. In addition, we implement several baseline models for the competition task and report their results as a reference for competition participants.