论文标题
Fadman:跨多个属性网络联合的异常检测
FadMan: Federated Anomaly Detection across Multiple Attributed Networks
论文作者
论文摘要
异常子图检测已在各种应用中广泛使用,从计算机网络中的网络攻击到社交网络中的恶意活动。尽管越来越需要在多个属性网络上进行联合的异常检测,但对于此问题,只有有限数量的方法。联邦异常检测面临两个主要挑战。一个是,在大多数行业中,孤立的数据与其他行业限制有关数据隐私和安全性。另一个是基于数据集成的大多数集中式方法培训。联合异常检测的主要思想是使从服务器中通过公共异常中的属性网络从公共异常的本地数据所有者对私人异常进行对齐,以使联合局部异常。在每个私人属性网络中,检测到的异常子图与公共归因网络中的异常子图对齐。为联邦私人异常选择了重要的公共异常子图,同时又可以防止本地私人数据泄漏。提出的算法FADMAN是一个垂直联合学习框架,用于与许多不同特征的许多私人节点对齐的公共节点,并在两个任务上验证了在多个属性网络上对异常检测进行验证,并在属性网络上使用五个现实World数据集对属性网络进行了异常检测。在第一种情况下,Fadman在10%的噪声水平下的竞争方法的精度至少超过12%。在第二种情况下,通过分析异常节点的分布,我们发现交通异常的节点与同一天的研究生入学期检查有关。
Anomaly subgraph detection has been widely used in various applications, ranging from cyber attack in computer networks to malicious activities in social networks. Despite an increasing need for federated anomaly detection across multiple attributed networks, only a limited number of approaches are available for this problem. Federated anomaly detection faces two major challenges. One is that isolated data in most industries are restricted share with others for data privacy and security. The other is most of the centralized approaches training based on data integration. The main idea of federated anomaly detection is aligning private anomalies from local data owners on the public anomalies from the attributed network in the server through public anomalies to federate local anomalies. In each private attributed network, the detected anomaly subgraph is aligned with an anomaly subgraph in the public attributed network. The significant public anomaly subgraphs are selected for federated private anomalies while preventing local private data leakage. The proposed algorithm FadMan is a vertical federated learning framework for public node aligned with many private nodes of different features, and is validated on two tasks correlated anomaly detection on multiple attributed networks and anomaly detection on an attributeless network using five real-world datasets. In the first scenario, FadMan outperforms competitive methods by at least 12% accuracy at 10% noise level. In the second scenario, by analyzing the distribution of abnormal nodes, we find that the nodes of traffic anomalies are associated with the event of postgraduate entrance examination on the same day.