论文标题
社交网络中的信息信誉:上下文,方法和开放问题
Information Credibility in the Social Web: Contexts, Approaches, and Open Issues
论文作者
论文摘要
在社交网络场景中,大量用户生成的内容(UGC)经常通过社交媒体扩散,几乎没有任何形式的传统信任中介。因此,遇到错误信息的风险不可忽略。因此,评估和挖掘在线信息的信誉构成当今的基本研究问题。信誉(也称为可信度)是个人所感知的素质,他们并不总是能够以自己的认知能力来辨别自己的认知能力,这是一个来自假的人的真实信息。因此,在过去的几年中,已经提出了几种方法来自动评估社交媒体的信誉。其中许多是基于数据驱动的模型,即,它们采用机器学习技术来识别错误信息,但最近也出现了模型驱动的方法,以及针对信誉传播的基于图的方法,以及利用语义Web技术的基于知识的基于知识的方法。研究信息可信度评估的三个主要背景关注:(i)在审查站点中检测垃圾邮件,(ii)在微博中检测假新闻,以及(iii)在线健康相关信息的可信度评估。在本文中,讨论了与社交网络中信息可信度评估有关的主要问题,这些问题是由上述上下文共享的。还提出了对近年来解决这些问题的方法和方法的简洁调查。
In the Social Web scenario, large amounts of User-Generated Content (UGC) are diffused through social media often without almost any form of traditional trusted intermediaries. Therefore, the risk of running into misinformation is not negligible. For this reason, assessing and mining the credibility of online information constitutes nowadays a fundamental research issue. Credibility, also referred as believability, is a quality perceived by individuals, who are not always able to discern, with their own cognitive capacities, genuine information from fake one. Hence, in the last years, several approaches have been proposed to automatically assess credibility in social media. Many of them are based on data-driven models, i.e., they employ machine learning techniques to identify misinformation, but recently also model-driven approaches are emerging, as well as graph-based approaches focusing on credibility propagation, and knowledge-based ones exploiting Semantic Web technologies. Three of the main contexts in which the assessment of information credibility has been investigated concern: (i) the detection of opinion spam in review sites, (ii) the detection of fake news in microblogging, and (iii) the credibility assessment of online health-related information. In this article, the main issues connected to the evaluation of information credibility in the Social Web, which are shared by the above-mentioned contexts, are discussed. A concise survey of the approaches and methodologies that have been proposed in recent years to address these issues is also presented.