论文标题
实例流:可视化分类器混淆在实例级别的演变
InstanceFlow: Visualizing the Evolution of Classifier Confusion on the Instance Level
论文作者
论文摘要
分类是最重要的监督机器学习任务之一。在训练分类模型的过程中,训练实例多次(在多个时期)中喂入模型,以迭代地提高分类性能。模型的复杂性日益增加,导致对模型可解释性的需求不断增长。现有方法主要集中于训练后最终模型性能的视觉分析,并且通常仅限于汇总性能指标。在本文中,我们介绍了一种新颖的双视觉可视化工具InstancFlow,允许用户在实例级别上分析分类器的学习行为。 Sankey图可视化整个时期实例的流动,并具有按需详细的字形和单个实例的痕迹。表格视图允许用户通过排名和过滤来找到有趣的实例。这样,InstanceFlow桥接了类级和实例级别的性能评估之间的差距,同时使用户能够对培训过程进行完整的时间分析。
Classification is one of the most important supervised machine learning tasks. During the training of a classification model, the training instances are fed to the model multiple times (during multiple epochs) in order to iteratively increase the classification performance. The increasing complexity of models has led to a growing demand for model interpretability through visualizations. Existing approaches mostly focus on the visual analysis of the final model performance after training and are often limited to aggregate performance measures. In this paper we introduce InstanceFlow, a novel dual-view visualization tool that allows users to analyze the learning behavior of classifiers over time on the instance-level. A Sankey diagram visualizes the flow of instances throughout epochs, with on-demand detailed glyphs and traces for individual instances. A tabular view allows users to locate interesting instances by ranking and filtering. In this way, InstanceFlow bridges the gap between class-level and instance-level performance evaluation while enabling users to perform a full temporal analysis of the training process.