论文标题
基于快树的正交模型证据
Model Evidence with Fast Tree Based Quadrature
论文作者
论文摘要
高维整合对于许多科学领域至关重要,从粒子物理学到贝叶斯推断。近似这些积分很难,部分原因是从集成域的区域进行定位和采样的难度,这些区域对整体积分做出了重要贡献。在这里,我们提出了一种称为树正交(TQ)的新算法,该算法将此采样问题与使用这些样品产生积分近似的问题分开。 TQ没有关于如何获得的样品的资格,使其可以使用现有集成算法在很大程度上忽略的最新采样算法。给定一组样品,TQ以回归树的形式构建了积分的替代模型,其结构优化以最大化积分精度。该树将集成域分为较小的容器,这些容器是单独集成并汇总以估计整体积分的。任何方法都可以用来整合每个容器,因此现有的集成方法(例如贝叶斯蒙特卡洛)可以与TQ结合使用以提高其性能。在一组基准问题上,我们表明TQ在多达15个维度中为积分提供了准确的近似值;在4及以上,它的表现优于简单的蒙特卡洛和流行的拉斯维加斯方法。
High dimensional integration is essential to many areas of science, ranging from particle physics to Bayesian inference. Approximating these integrals is hard, due in part to the difficulty of locating and sampling from regions of the integration domain that make significant contributions to the overall integral. Here, we present a new algorithm called Tree Quadrature (TQ) that separates this sampling problem from the problem of using those samples to produce an approximation of the integral. TQ places no qualifications on how the samples provided to it are obtained, allowing it to use state-of-the-art sampling algorithms that are largely ignored by existing integration algorithms. Given a set of samples, TQ constructs a surrogate model of the integrand in the form of a regression tree, with a structure optimised to maximise integral precision. The tree divides the integration domain into smaller containers, which are individually integrated and aggregated to estimate the overall integral. Any method can be used to integrate each individual container, so existing integration methods, like Bayesian Monte Carlo, can be combined with TQ to boost their performance. On a set of benchmark problems, we show that TQ provides accurate approximations to integrals in up to 15 dimensions; and in dimensions 4 and above, it outperforms simple Monte Carlo and the popular Vegas method.