论文标题
概率公平聚类
Probabilistic Fair Clustering
论文作者
论文摘要
在聚类问题中,在顶点上给中央决策者提供了完整的度量图,并且必须提供顶点的聚类,以最大程度地减少某些目标函数。在公平的聚类问题中,顶点具有颜色(例如,组成员资格),有效聚类的功能也可能包括该聚类中颜色的表示。公平聚类中的先前工作假定会成员的完整知识。在本文中,我们通过假设通过概率分配对小组成员的知识不完善来概括先前的工作。我们在更通用的环境中介绍了群集算法,并保证了近似值。我们还解决了“公制成员资格”的问题,其中不同的组具有秩序和距离的概念。实验是使用我们提出的算法以及基准来验证我们的方法的实验,并且在不确定的群体成员身份时也会表现出细微的关注。
In clustering problems, a central decision-maker is given a complete metric graph over vertices and must provide a clustering of vertices that minimizes some objective function. In fair clustering problems, vertices are endowed with a color (e.g., membership in a group), and the features of a valid clustering might also include the representation of colors in that clustering. Prior work in fair clustering assumes complete knowledge of group membership. In this paper, we generalize prior work by assuming imperfect knowledge of group membership through probabilistic assignments. We present clustering algorithms in this more general setting with approximation ratio guarantees. We also address the problem of "metric membership", where different groups have a notion of order and distance. Experiments are conducted using our proposed algorithms as well as baselines to validate our approach and also surface nuanced concerns when group membership is not known deterministically.