I-AID：从与灾难有关的推文中识别可操作的信息

论文标题

I-AID：从与灾难有关的推文中识别可操作的信息

I-AID: Identifying Actionable Information from Disaster-related Tweets

论文作者

Zahera, Hamada M., Jalota, Rricha, Sherif, Mohamed A., Ngomo, Axel N.

论文摘要

社交媒体通过提供有关受影响人员，捐赠和帮助要求的宝贵数据在灾难管理中发挥着重要作用。最近的研究强调了需要将信息在社交媒体上过滤到细粒度的内容标签中。但是，在危机期间从大量社交媒体帖子中确定有用的信息是一项艰巨的任务。在本文中，我们提出了I-AID，这是一种多模型方法，将推文自动分类为多标签信息类型，并从大量社交媒体数据中过滤关键信息。 I-AID incorporates three main components: i) a BERT-based encoder to capture the semantics of a tweet and represent as a low-dimensional vector, ii) a graph attention network (GAT) to apprehend correlations between tweets' words/entities and the corresponding information types, and iii) a Relation Network as a learnable distance metric to compute the similarity between tweets and their corresponding information types in a supervised way.我们对两个真正的公共可用数据集进行了几项实验。我们的结果表明，I-AID在TREC-IS数据集和COVID-19推文上的加权平均F1得分分别以 +6％和4％的速度来超过最先进的方法。

Social media plays a significant role in disaster management by providing valuable data about affected people, donations and help requests. Recent studies highlight the need to filter information on social media into fine-grained content labels. However, identifying useful information from massive amounts of social media posts during a crisis is a challenging task. In this paper, we propose I-AID, a multimodel approach to automatically categorize tweets into multi-label information types and filter critical information from the enormous volume of social media data. I-AID incorporates three main components: i) a BERT-based encoder to capture the semantics of a tweet and represent as a low-dimensional vector, ii) a graph attention network (GAT) to apprehend correlations between tweets' words/entities and the corresponding information types, and iii) a Relation Network as a learnable distance metric to compute the similarity between tweets and their corresponding information types in a supervised way. We conducted several experiments on two real publicly-available datasets. Our results indicate that I-AID outperforms state-of-the-art approaches in terms of weighted average F1 score by +6% and +4% on the TREC-IS dataset and COVID-19 Tweets, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题