对COVID-19检测的深度不确定性预测的客观评估

论文标题

对COVID-19检测的深度不确定性预测的客观评估

Objective Evaluation of Deep Uncertainty Predictions for COVID-19 Detection

论文作者

Asgharnezhad, Hamzeh, Shamsi, Afshar, Alizadehsani, Roohallah, Khosravi, Abbas, Nahavandi, Saeid, Sani, Zahra Alizadeh, Srinivasan, Dipti

论文摘要

深度神经网络（DNN）已被广泛应用于医学图像中的COVID-19。现有研究主要采用转移学习和其他数据表示策略来产生准确的点估计。这些网络的概括能力总是值得怀疑的，这是因为使用小型数据集开发并且未能报告其预测性信心。量化与DNN预测相关的不确定性是其在医疗环境中受信任的部署的先决条件。在这里，我们使用胸部X射线（CXR）图像应用并评估三种不确定性定量技术用于COVID-19检测。提出了不确定性混淆矩阵的新型概念，并引入了新的性能指标，以进行不确定性估计的客观评估。通过全面的实验，可以表明，在CXR图像上，网络的表现优于在自然图像数据集（例如ImageNet）上预算的网络。定性和定量评估还表明，错误预测的预测不确定性估计值在统计上比正确的预测高。因此，不确定性量化方法能够标记高不确定性估计的风险预测。我们还观察到集合方法在推断过程中更可靠地捕获不确定性。

Deep neural networks (DNNs) have been widely applied for detecting COVID-19 in medical images. Existing studies mainly apply transfer learning and other data representation strategies to generate accurate point estimates. The generalization power of these networks is always questionable due to being developed using small datasets and failing to report their predictive confidence. Quantifying uncertainties associated with DNN predictions is a prerequisite for their trusted deployment in medical settings. Here we apply and evaluate three uncertainty quantification techniques for COVID-19 detection using chest X-Ray (CXR) images. The novel concept of uncertainty confusion matrix is proposed and new performance metrics for the objective evaluation of uncertainty estimates are introduced. Through comprehensive experiments, it is shown that networks pertained on CXR images outperform networks pretrained on natural image datasets such as ImageNet. Qualitatively and quantitatively evaluations also reveal that the predictive uncertainty estimates are statistically higher for erroneous predictions than correct predictions. Accordingly, uncertainty quantification methods are capable of flagging risky predictions with high uncertainty estimates. We also observe that ensemble methods more reliably capture uncertainties during the inference.

下载PDF全文

下载文献需遵守相关版权规定

论文标题