论文标题
BSODA:一个用于在线疾病诊断的双方可伸缩框架
BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis
论文作者
论文摘要
越来越多的人在网上寻求医疗建议。通常,他们会根据经历的症状诊断其医疗状况,这也称为自我诊断。从机器学习的角度来看,在线疾病诊断是一个顺序的特征(症状)选择和分类问题。强化学习(RL)方法是这种任务的标准方法。通常,当特征空间很小时,它们的表现良好,但经常在具有大量功能(例如自我诊断)的任务中效率低下。为了应对这一挑战,我们提出了一个非RL两分性可伸缩框架,用于在线疾病诊断,称为Bsoda。 Bsoda由两个合作分支组成,分别处理症状症状和疾病诊断。查询分支确定信息理论奖励下一步收集的症状。我们采用专家编码器来显着改善对大量功能的部分观察的处理。此外,我们提出了几种近似方法,以实质上将奖励的计算成本降低到在线服务可接受的水平。此外,我们利用诊断模型更精确地估计奖励。对于诊断分支,我们使用知识引导的自我注意模型来执行预测。特别是,Bsoda确定何时使用查询和诊断模型停止查询和输出预测。我们证明,BSODA的表现优于几个公共数据集上的最新方法。此外,我们提出了一种新的评估方法,以测试症状检查方法从合成到现实世界任务的转移性。与现有的RL基线相比,BSODA更有效地可扩展到大型搜索空间。
A growing number of people are seeking healthcare advice online. Usually, they diagnose their medical conditions based on the symptoms they are experiencing, which is also known as self-diagnosis. From the machine learning perspective, online disease diagnosis is a sequential feature (symptom) selection and classification problem. Reinforcement learning (RL) methods are the standard approaches to this type of tasks. Generally, they perform well when the feature space is small, but frequently become inefficient in tasks with a large number of features, such as the self-diagnosis. To address the challenge, we propose a non-RL Bipartite Scalable framework for Online Disease diAgnosis, called BSODA. BSODA is composed of two cooperative branches that handle symptom-inquiry and disease-diagnosis, respectively. The inquiry branch determines which symptom to collect next by an information-theoretic reward. We employ a Product-of-Experts encoder to significantly improve the handling of partial observations of a large number of features. Besides, we propose several approximation methods to substantially reduce the computational cost of the reward to a level that is acceptable for online services. Additionally, we leverage the diagnosis model to estimate the reward more precisely. For the diagnosis branch, we use a knowledge-guided self-attention model to perform predictions. In particular, BSODA determines when to stop inquiry and output predictions using both the inquiry and diagnosis models. We demonstrate that BSODA outperforms the state-of-the-art methods on several public datasets. Moreover, we propose a novel evaluation method to test the transferability of symptom checking methods from synthetic to real-world tasks. Compared to existing RL baselines, BSODA is more effectively scalable to large search spaces.