论文标题
一种系统的方法,用于识别和评估多元教育,社会和行为研究中缺失的数据模式和机制
A systematic approach to identify and evaluate missing data patterns and mechanisms in multivariate educational, social, and behavioral research
论文作者
论文摘要
解决丢失数据的方法已被应用的研究人员更容易访问。但是,很少有指导可以帮助研究人员系统地确定合理的缺失数据机制,以确保适当地应用这些方法。两个考虑因素激发了本研究。首先,心理学研究通常以大量潜在反应变量为特征,这些变量可能会在多个数据收集浪潮中观察到。与其他领域相比,这种情况使识别合理的缺失数据机制更具挑战性,例如生物统计学中,少数因变量通常具有主要兴趣,而兴趣的主要预测因子在统计上与其他协方差无关。其次,人们对系统方法在心理科学中缺失数据的敏感性分析的重要性越来越认识到。我们开发并应用了一种系统的方法来减少大量观察到的模式,并演示如何使用这些模式来探索多元环境中潜在的缺失数据机制。一项大规模仿真研究用于指导建议的建议最准确,这是样本量,因素数量,指标数量,每个因素的指标数以及缺失数据的比例。这种方法对数据示例的三种应用表明,该方法在实践中似乎有用。
Methods for addressing missing data have become much more accessible to applied researchers. However, little guidance exists to help researchers systematically identify plausible missing data mechanisms in order to ensure that these methods are appropriately applied. Two considerations motivate the present study. First, psychological research is typically characterized by a large number of potential response variables that may be observed across multiple waves of data collection. This situation makes it more challenging to identify plausible missing data mechanisms than is the case in other fields such as biostatistics where a small number of dependent variables is typically of primary interest and the main predictor of interest is statistically independent of other covariates. Second, there is growing recognition of the importance of systematic approaches to sensitivity analyses for treatment of missing data in psychological science. We develop and apply a systematic approach for reducing a large number of observed patterns and demonstrate how these can be used to explore potential missing data mechanisms within multivariate contexts. A large scale simulation study is used to guide suggestions for which approaches are likely to be most accurate as a function of sample size, number of factors, number of indicators per factor, and proportion of missing data. Three applications of this approach to data examples suggest that the method appears useful in practice.