论文标题
变性贝叶斯深度操作员网络:用于参数微分方程的数据驱动的贝叶斯求解器
Variational Bayes Deep Operator Network: A data-driven Bayesian solver for parametric differential equations
论文作者
论文摘要
基于神经网络的数据驱动操作员学习方案在计算力学中显示出巨大的潜力。 DeWonet是一种这样的神经网络架构,由于其出色的预测能力,它获得了广泛的欣赏。话虽如此,在确定性框架中设定的deponet体系结构面临过度拟合,概括差和其不变形式的风险,它无法量化与预测相关的不确定性。我们在本文中提出了一种用于操作员学习的跨跨贝叶斯深索纳(VB-Deeponet),可以在很大程度上减轻deeponet架构的这些局限性,并为用户提供有关在预测阶段相关的不确定性的更多信息。贝叶斯框架中设定的神经网络背后的关键思想是,神经网络的权重和偏差被视为概率分布而不是点估计,并且使用贝叶斯推理来更新其先前的分布。现在,为了管理与近似后验分布相关的计算成本,提出的VB-Deeponet使用\ textIt {变异推理}。与马尔可夫链蒙特卡洛方案不同,变分推断有能力考虑高维后分布,同时保持相关的计算成本较低。涵盖力学问题的不同示例,例如扩散反应,重力摆,对流扩散,以说明所提出的VB-Deeponet的性能,并且在确定性框架中也对Deeponet集进行了比较。
Neural network based data-driven operator learning schemes have shown tremendous potential in computational mechanics. DeepONet is one such neural network architecture which has gained widespread appreciation owing to its excellent prediction capabilities. Having said that, being set in a deterministic framework exposes DeepONet architecture to the risk of overfitting, poor generalization and in its unaltered form, it is incapable of quantifying the uncertainties associated with its predictions. We propose in this paper, a Variational Bayes DeepONet (VB-DeepONet) for operator learning, which can alleviate these limitations of DeepONet architecture to a great extent and give user additional information regarding the associated uncertainty at the prediction stage. The key idea behind neural networks set in Bayesian framework is that, the weights and bias of the neural network are treated as probability distributions instead of point estimates and, Bayesian inference is used to update their prior distribution. Now, to manage the computational cost associated with approximating the posterior distribution, the proposed VB-DeepONet uses \textit{variational inference}. Unlike Markov Chain Monte Carlo schemes, variational inference has the capacity to take into account high dimensional posterior distributions while keeping the associated computational cost low. Different examples covering mechanics problems like diffusion reaction, gravity pendulum, advection diffusion have been shown to illustrate the performance of the proposed VB-DeepONet and comparisons have also been drawn against DeepONet set in deterministic framework.