论文标题

波斯Instagram用户的行为建模以检测机器人

Behavioral Modeling of Persian Instagram Users to detect Bots

论文作者

Bazm, Muhammad, Asadpour, Masoud

论文摘要

机器人是由计算机程序控制的社交媒体中的用户帐户。与许多其他事物类似,它们都用于善与恶。他们的一个邪恶用例是在网络中传播错误信息或有偏见的数据。基于社交媒体数据进行了许多研究,其结果有效性受到有害数据机器人传播极大的威胁。因此,需要有效的方法和工具来检测机器人,然后删除机器人传播的误导数据。在本研究中,提出了一种检测Instagram机器人的方法。没有包括Instagram机器人和真实帐户的样本的数据集,因此目前的研究已经开始收集有关一般性问题的数据集,因此每个组中都包含1,000个数据点。主要方法是监督机器学习,与深度神经网络相比,经典模型是优选的。最终模型使用从10倍交叉验证开始的多种方法评估。之后,对分类研究的信心,之后是针对模型计算的目标概率的特征重要性分析和特征行为。最后,一个实验旨在测量操作环境中的模型有效性。最后,有力得出的结论是,该模型在所有评估实验中都表现良好。

Bots are user accounts in social media which are controlled by computer programs. Similar to many other things, they are used for both good and evil purposes. One nefarious use-case for them is to spread misinformation or biased data in the networks. There are many pieces of research being performed based on social media data and their results validity is extremely threatened by the harmful data bots spread. Consequently, effective methods and tools are required for detecting bots and then removing misleading data spread by the bots. In the present research, a method for detecting Instagram bots is proposed. There is no data set including samples of Instagram bots and genuine accounts, thus the current research has begun with gathering such a data set with respect to generality concerns such that it includes 1,000 data points in each group. The main approach is supervised machine learning and classic models are preferred compared to deep neural networks. The final model is evaluated using multiple methods starting with 10-fold cross-validation. After that, confidence in classification studies and is followed by feature importance analysis and feature behavior against the target probability computed by the model. In the end, an experiment is designed to measure the models effectiveness in an operational environment. Finally, It is strongly concluded that the model performs very well in all evaluation experiments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源