多维非线性字段上的物理解释的机器学习算法

论文标题

多维非线性字段上的物理解释的机器学习算法

Physically interpretable machine learning algorithm on multidimensional non-linear fields

论文作者

Mouradi, Rem-Sophia, Goeury, Cédric, Thual, Olivier, Zaoui, Fabrice, Tassi, Pablo

论文摘要

在对机器学习（ML）和有利的数据开发环境的兴趣中，我们在这里提出了一种基于数据的二维物理领域预测的原始方法。多项式混乱扩展（PCE）广泛用于不确定性定量社区（UQ），长期以来一直被用作概率输入到输出映射的强大表示形式。最近在纯ML上下文中对其进行了测试，并显示出与精选预测的经典ML技术一样强大。除了相关的概率框架之外，该方法固有的一些优点是该方法固有的，例如其对小训练集的显性和适应性。同时，降低维度（DR）技术越来越多地用于模式识别和数据压缩，并且由于数据质量的提高而引起了兴趣。在这项研究中，证明了适当的正交分解（POD）对建造统计预测模型的兴趣。 POD和PCE在各自的框架中都充分证明了它们的价值。本文的目的是将它们结合起来以进行基于现场测量的预测。所述步骤也可用于分析数据。在处理多维字段测量时，遇到的一些具有挑战性的问题，例如处理很少的数据时。提出了POD-PCE耦合方法，特别关注输入数据特征和训练集选择。提出了一种用于评估每个物理参数的重要性的简单方法，并将其扩展到PCE耦合。

In an ever-increasing interest for Machine Learning (ML) and a favorable data development context, we here propose an original methodology for data-based prediction of two-dimensional physical fields. Polynomial Chaos Expansion (PCE), widely used in the Uncertainty Quantification community (UQ), has long been employed as a robust representation for probabilistic input-to-output mapping. It has been recently tested in a pure ML context, and shown to be as powerful as classical ML techniques for point-wise prediction. Some advantages are inherent to the method, such as its explicitness and adaptability to small training sets, in addition to the associated probabilistic framework. Simultaneously, Dimensionality Reduction (DR) techniques are increasingly used for pattern recognition and data compression and have gained interest due to improved data quality. In this study, the interest of Proper Orthogonal Decomposition (POD) for the construction of a statistical predictive model is demonstrated. Both POD and PCE have amply proved their worth in their respective frameworks. The goal of the present paper was to combine them for a field-measurement-based forecasting. The described steps are also useful to analyze the data. Some challenging issues encountered when using multidimensional field measurements are addressed, for example when dealing with few data. The POD-PCE coupling methodology is presented, with particular focus on input data characteristics and training-set choice. A simple methodology for evaluating the importance of each physical parameter is proposed for the PCE model and extended to the POD-PCE coupling.

下载PDF全文

下载文献需遵守相关版权规定

论文标题