使用分布强大优化的强大分组变量选择

论文标题

使用分布强大优化的强大分组变量选择

Robust Grouped Variable Selection Using Distributionally Robust Optimization

论文作者

Chen, Ruidi, Paschalidis, Ioannis Ch.

论文摘要

我们提出了一个基于Wasserstein的不确定性集，用于在数据扰动下选择线性回归和分类问题的数据，以选择分布强大的优化（DRO）公式。最终的模型为分组的绝对收缩和选择操作员（Glasso）算法提供了鲁棒性解释，并突出了鲁棒性与正则化之间的联系。 We prove probabilistic bounds on the out-of-sample loss and the estimation bias, and establish the grouping effect of our estimator, showing that coefficients in the same group converge to the same value as the sample correlation between covariates approaches 1. Based on this result, we propose to use the spectral clustering algorithm with the Gaussian similarity function to perform grouping on the predictors, which makes our approach applicable without knowing the grouping structure a先验。我们将我们的方法比较了一系列替代方案，并在合成数据和与手术相关的医疗记录的真正大数据集上提供了广泛的数值结果，这表明我们的配方会产生一种可解释和偏爱的模型，从而鼓励在组水平上稀疏，并能够在异常值的存在下实现更好的预测和估计表现。

We propose a Distributionally Robust Optimization (DRO) formulation with a Wasserstein-based uncertainty set for selecting grouped variables under perturbations on the data for both linear regression and classification problems. The resulting model offers robustness explanations for Grouped Least Absolute Shrinkage and Selection Operator (GLASSO) algorithms and highlights the connection between robustness and regularization. We prove probabilistic bounds on the out-of-sample loss and the estimation bias, and establish the grouping effect of our estimator, showing that coefficients in the same group converge to the same value as the sample correlation between covariates approaches 1. Based on this result, we propose to use the spectral clustering algorithm with the Gaussian similarity function to perform grouping on the predictors, which makes our approach applicable without knowing the grouping structure a priori. We compare our approach to an array of alternatives and provide extensive numerical results on both synthetic data and a real large dataset of surgery-related medical records, showing that our formulation produces an interpretable and parsimonious model that encourages sparsity at a group level and is able to achieve better prediction and estimation performance in the presence of outliers.

下载PDF全文

下载文献需遵守相关版权规定

论文标题