论文标题

解释的Shapley Value置信区间的归因方差

Shapley value confidence intervals for attributing variance explained

论文作者

Fryer, Daniel, Strumke, Inga, Nguyen, Hien

论文摘要

确定系数,即$ r^2 $,通常用于测量通过多个解释性协变量的仿射组合所解释的差异。经常寻求这种解释性贡献对每个个体协变量的归因,以便推断每个协变量对响应现象的重要性。确定这种归因的最新方法是通过游戏理论沙普利价值分解确定系数。这种分解具有理想的效率,单调性和均等处理特性。在一个弱的假设中,关节分布是伪涡旋,我们获得了沙普利值的渐近正态性。然后,我们利用此结果来构建有关此类数量的置信区间和假设检验。提供了有关我们的结果的蒙特卡洛研究。我们发现,我们的渐近置信区间在计算上优于竞争性的引导方法,并且能够改善此类间隔的性能。在对澳大利亚房地产价格建模的说明性申请中,我们采用Shapley Value置信区间来确定协变量的解释性贡献之间的显着差异(模型之间),否则它们具有大致相同的$ r^2 $值。这些不同的模型基于2019年和2020年同一时期的房地产数据,后者涵盖了新型冠状病毒(Covid-19)到来的早期阶段。

The coefficient of determination, the $R^2$, is often used to measure the variance explained by an affine combination of multiple explanatory covariates. An attribution of this explanatory contribution to each of the individual covariates is often sought in order to draw inference regarding the importance of each covariate with respect to the response phenomenon. A recent method for ascertaining such an attribution is via the game theoretic Shapley value decomposition of the coefficient of determination. Such a decomposition has the desirable efficiency, monotonicity, and equal treatment properties. Under a weak assumption that the joint distribution is pseudo-elliptical, we obtain the asymptotic normality of the Shapley values. We then utilize this result in order to construct confidence intervals and hypothesis tests regarding such quantities. Monte Carlo studies regarding our results are provided. We found that our asymptotic confidence intervals are computationally superior to competing bootstrap methods and are able to improve upon the performance of such intervals. In an expository application to Australian real estate price modelling, we employ Shapley value confidence intervals to identify significant differences between the explanatory contributions of covariates, between models, which otherwise share approximately the same $R^2$ value. These different models are based on real estate data from the same periods in 2019 and 2020, the latter covering the early stages of the arrival of the novel coronavirus, COVID-19.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源