论文标题
学习可解释的决策规则集:一种suppodular优化方法
Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach
论文作者
论文摘要
规则集是高度可解释的逻辑模型,其中决策的谓词以分离的正常形式(DNF或ands)表达,或者等效地,总体模型包括无序的决策规则。在本文中,我们考虑了一种基于基于学习规则集的基于superibuliular优化的方法。学习问题被构架为子集选择任务,其中所有可能的规则的子集需要选择以形成准确且可解释的规则集。我们采用了表现出表达性的客观函数,因此可以适合于下二次优化技术。为了克服难以处理指数尺寸的地面规则集引起的困难,搜索规则的子问题被视为另一个询问特征子集的子集选择任务。我们表明,可以为子问题编写诱导的目标函数,作为两个子模函数(DS)函数的差,以使其通过DS优化算法近似解决。总体而言,所提出的方法是简单,可扩展的,并且可能会从进一步研究子解体优化中受益。实际数据集上的实验证明了我们方法的有效性。
Rule sets are highly interpretable logical models in which the predicates for decision are expressed in disjunctive normal form (DNF, OR-of-ANDs), or, equivalently, the overall model comprises an unordered collection of if-then decision rules. In this paper, we consider a submodular optimization based approach for learning rule sets. The learning problem is framed as a subset selection task in which a subset of all possible rules needs to be selected to form an accurate and interpretable rule set. We employ an objective function that exhibits submodularity and thus is amenable to submodular optimization techniques. To overcome the difficulty arose from dealing with the exponential-sized ground set of rules, the subproblem of searching a rule is casted as another subset selection task that asks for a subset of features. We show it is possible to write the induced objective function for the subproblem as a difference of two submodular (DS) functions to make it approximately solvable by DS optimization algorithms. Overall, the proposed approach is simple, scalable, and likely to be benefited from further research on submodular optimization. Experiments on real datasets demonstrate the effectiveness of our method.