数据可重复性度量的统计分析

论文标题

数据可重复性度量的统计分析

Statistical Analysis of Data Repeatability Measures

论文作者

Wang, Zeyi, Bridgeford, Eric, Wang, Shangsi, Vogelstein, Joshua T., Caffo, Brian

论文摘要

现代数据收集和处理技术的出现使数据的规模，规模和复杂性呈指数增长。利用这些丰富数据集进行下游推断的开创性步骤是了解可重复的数据的特征 - 数据的各个方面能够在重复的分析中识别。矛盾的是，在这些设置下，传统的可重复性措施的实用性（例如类内相关系数）是有限的。在最近的工作中，在一组受试者进行两次或更多的情况下，已经引入了新的数据可重复性度量，包括：指纹识别，等级总和和内部相关系数的概括。但是，这些措施之间的最佳实践之间的关系在很大程度上是未知的。在此手稿中，我们将一种新颖的可重复性度量（可区分性）形式化。我们表明，它与单变量随机效应模型下的相关系数确定性链接，并且具有使用多元测量值的推理任务具有最佳精度的属性。此外，我们使用理论结果和仿真概述并系统地比较可重复性统计。我们表明，等级总统计量与一致的可区分性估计器链接。在高斯和非高斯设置下，从数值上比较了从这些度量中得出的置换测试的能力，具有和没有模拟批处理效应。在理论和经验结果的动机上，我们为每个基准设置提供方法论建议，以作为未来分析的资源。我们认为，这些建议将在提高功能磁共振成像，基因组学，药理学等领域的可重复性方面发挥重要作用。

The advent of modern data collection and processing techniques has seen the size, scale, and complexity of data grow exponentially. A seminal step in leveraging these rich datasets for downstream inference is understanding the characteristics of the data which are repeatable -- the aspects of the data that are able to be identified under a duplicated analysis. Conflictingly, the utility of traditional repeatability measures, such as the intraclass correlation coefficient, under these settings is limited. In recent work, novel data repeatability measures have been introduced in the context where a set of subjects are measured twice or more, including: fingerprinting, rank sums, and generalizations of the intraclass correlation coefficient. However, the relationships between, and the best practices among these measures remains largely unknown. In this manuscript, we formalize a novel repeatability measure, discriminability. We show that it is deterministically linked with the correlation coefficient under univariate random effect models, and has desired property of optimal accuracy for inferential tasks using multivariate measurements. Additionally, we overview and systematically compare repeatability statistics using both theoretical results and simulations. We show that the rank sum statistic is deterministically linked to a consistent estimator of discriminability. The power of permutation tests derived from these measures are compared numerically under Gaussian and non-Gaussian settings, with and without simulated batch effects. Motivated by both theoretical and empirical results, we provide methodological recommendations for each benchmark setting to serve as a resource for future analyses. We believe these recommendations will play an important role towards improving repeatability in fields such as functional magnetic resonance imaging, genomics, pharmacology, and more.

下载PDF全文

下载文献需遵守相关版权规定

论文标题