论文标题
Hessians的层次矩阵近似在由PDES支配的反问题中产生
Hierarchical Matrix Approximations of Hessians Arising in Inverse Problems Governed by PDEs
论文作者
论文摘要
在由部分微分方程(PDE)支配的反问题中产生的Hessian操作员在为确定性反问题的牛顿解决方案以及Markov Chain Carlo对贝叶斯环境中后代的牛顿采样方面以及马尔可夫链的蒙特卡洛采样方面都起着至关重要的作用。这些方法需要能够反复在Hessian上执行此类操作,作为使用任意向量,求解线性系统,反转和(反向)平方根的乘法。不幸的是,Hessian是一个(正式的)密集的,隐式定义的操作员,对于实用的反问题明确形成很棘手,需要与反转参数一样多的PDE求解。当数据包含有关参数的有限信息时,低等级近似值是有效的,但是随着数据变得更加有用的信息,则变得越来越高。但是,在实际应用中出现的许多反问题的Hessians可以通过层次级别等级结构的矩阵很好地近似。分层矩阵表示有望克服密集表示的高复杂性,并提供仅具有对数线性复杂性的有效数据结构和矩阵操作。在这项工作中,我们描述了用于构建和更新Hessians的层次矩阵近似值的算法,并在许多代表性的逆问题上说明了它们,这些问题涉及时间依赖时间扩散,以对流为主的传输,频域声波传播和低频麦克斯韦方程,以表明大量较低级别的数量较低级别的均值均高级别。
Hessian operators arising in inverse problems governed by partial differential equations (PDEs) play a critical role in delivering efficient, dimension-independent convergence for both Newton solution of deterministic inverse problems, as well as Markov chain Monte Carlo sampling of posteriors in the Bayesian setting. These methods require the ability to repeatedly perform such operations on the Hessian as multiplication with arbitrary vectors, solving linear systems, inversion, and (inverse) square root. Unfortunately, the Hessian is a (formally) dense, implicitly-defined operator that is intractable to form explicitly for practical inverse problems, requiring as many PDE solves as inversion parameters. Low rank approximations are effective when the data contain limited information about the parameters, but become prohibitive as the data become more informative. However, the Hessians for many inverse problems arising in practical applications can be well approximated by matrices that have hierarchically low rank structure. Hierarchical matrix representations promise to overcome the high complexity of dense representations and provide effective data structures and matrix operations that have only log-linear complexity. In this work, we describe algorithms for constructing and updating hierarchical matrix approximations of Hessians, and illustrate them on a number of representative inverse problems involving time-dependent diffusion, advection-dominated transport, frequency domain acoustic wave propagation, and low frequency Maxwell equations, demonstrating up to an order of magnitude speedup compared to globally low rank approximations.