论文标题
基于深度学习的强大热门识别方法
A Robust Hot Subdwarfs Identification Method Based on Deep Learning
论文作者
论文摘要
Hot Subdwarf Star是一种特殊类型的恒星,对于研究二进制进化和大气扩散过程至关重要。近年来,通过机器学习方法识别热门曲线已经成为一个热门话题,但是自动化和准确性仍然存在局限性。在本文中,我们提出了一种基于卷积神经网络(CNN)的强大识别方法。我们首先使用Lamos DR7-V1的光谱数据构建了数据集。然后,我们构建了一个混合识别模型,包括8级分类模型和二进制分类模型。该模型在测试集中的准确度达到96.17%。为了进一步验证该模型的准确性,我们选择了835个热门细分,而这些细分未参与训练过程中的训练过程(2428,包括重复观测值)作为验证集。达到96.05%的精度。在此基础上,我们使用该模型对Lamost DR7-V1的所有10,640,255个光谱进行过滤和分类,并获得了2393个热门SubdWarf候选者的目录,其中已确认了2067年。通过手动验证,我们在其余候选人中发现了25个新的热门细分。该模型的总体准确性为87.42%。总体而言,本研究中提出的模型可以有效地识别具有稳健结果和高精度的特定光谱,并且可以进一步应用于大规模光谱的分类和搜索特定目标。
Hot subdwarf star is a particular type of star that is crucial for studying binary evolution and atmospheric diffusion processes. In recent years, identifying Hot subdwarfs by machine learning methods has become a hot topic, but there are still limitations in automation and accuracy. In this paper, we proposed a robust identification method based on the convolutional neural network (CNN). We first constructed the dataset using the spectral data of LAMOS DR7-V1. We then constructed a hybrid recognition model including an 8-class classification model and a binary classification model. The model achieved an accuracy of 96.17% on the testing set. To further validate the accuracy of the model, we selected 835 Hot subdwarfs that were not involved in the training process from the identified LAMOST catalog (2428, including repeated observations) as the validation set. An accuracy of 96.05% was achieved. On this basis, we used the model to filter and classify all 10,640,255 spectra of LAMOST DR7-V1, and obtained a catalog of 2393 Hot subdwarf candidates, of which 2067 have been confirmed. We found 25 new Hot subdwarfs among the remaining candidates by manual validation. The overall accuracy of the model is 87.42%. Overall, the model presented in this study can effectively identify specific spectra with robust results and high accuracy, and can be further applied to the classification of large-scale spectra and the search of specific targets.