论文标题
小组不变词典学习
Group Invariant Dictionary Learning
论文作者
论文摘要
词典学习问题涉及将数据表示为稀疏线性总和的任务,该总和是从较小的基本构建块集合中提取的。在部署此类技术的应用程序域中,我们经常遇到存在某种形式的对称或不变性的数据集。在这一观察结果的推动下,我们开发了一个框架,以在此类对称性下的基本构建块的收集仍然不变的限制下为数据学习词典。我们学习此类词典的程序依赖于表示对称性作为作用于数据的矩阵组的作用,并随后引入了凸惩罚函数,从而在矩阵组元素的收集方面引起了稀疏性。当我们考虑整数变化时,我们的框架专门针对卷积词典学习问题。使用阳性半菲尼族遗传矩阵的属性,我们开发了一个扩展,该扩展是学习在连续移动下不变的字典。我们对合成数据和ECG数据的数值实验表明,当数据集的数据点很少,或者在数据集中表达不充分的对称性时,将诸如先验诸如先验的对称性的合并最有价值。
The dictionary learning problem concerns the task of representing data as sparse linear sums drawn from a smaller collection of basic building blocks. In application domains where such techniques are deployed, we frequently encounter datasets where some form of symmetry or invariance is present. Motivated by this observation, we develop a framework for learning dictionaries for data under the constraint that the collection of basic building blocks remains invariant under such symmetries. Our procedure for learning such dictionaries relies on representing the symmetry as the action of a matrix group acting on the data, and subsequently introducing a convex penalty function so as to induce sparsity with respect to the collection of matrix group elements. Our framework specializes to the convolutional dictionary learning problem when we consider integer shifts. Using properties of positive semidefinite Hermitian Toeplitz matrices, we develop an extension that learns dictionaries that are invariant under continuous shifts. Our numerical experiments on synthetic data and ECG data show that the incorporation of such symmetries as priors are most valuable when the dataset has few data-points, or when the full range of symmetries is inadequately expressed in the dataset.