论文标题
评估监督机器学习中的合成和实验培训数据应用于量子点的状态检测
Evaluation of synthetic and experimental training data in supervised machine learning applied to charge state detection of quantum dots
论文作者
论文摘要
栅极定义的量子点的自动调整是基于大规模半导体的量子量初始化的必要条件。这些调整程序的重要步骤是基于电荷稳定性图的电荷状态检测。使用监督的机器学习执行此任务需要一个大型数据集,以便进行训练。为了避免手动标记实验数据,已探索了合成数据作为替代方案。与使用实验数据相比,训练数据集的大小可显着增加,但使用合成数据意味着分类器是根据来自调谐过程一部分的实验数据的数据来培训的。在这里,我们评估了一系列基于模拟和实验数据训练的机器学习模型的预测准确性,以及它们在二维电子气体和纳米线设备中推广到实验电荷稳定性图的能力。我们发现,分类器在纯粹的实验或合成和实验训练数据的组合上表现最好,并且在合成数据中添加常见的实验噪声标志并不能显着提高分类精度。这些结果表明,使用监督的机器学习,实验训练数据以及逼真的量子点模拟和噪声模型是负责状态检测至关重要的。
Automated tuning of gate-defined quantum dots is a requirement for large scale semiconductor based qubit initialisation. An essential step of these tuning procedures is charge state detection based on charge stability diagrams. Using supervised machine learning to perform this task requires a large dataset for models to train on. In order to avoid hand labelling experimental data, synthetic data has been explored as an alternative. While providing a significant increase in the size of the training dataset compared to using experimental data, using synthetic data means that classifiers are trained on data sourced from a different distribution than the experimental data that is part of the tuning process. Here we evaluate the prediction accuracy of a range of machine learning models trained on simulated and experimental data and their ability to generalise to experimental charge stability diagrams in two dimensional electron gas and nanowire devices. We find that classifiers perform best on either purely experimental or a combination of synthetic and experimental training data, and that adding common experimental noise signatures to the synthetic data does not dramatically improve the classification accuracy. These results suggest that experimental training data as well as realistic quantum dot simulations and noise models are essential in charge state detection using supervised machine learning.