论文标题
比较不同方法以调整混杂因素的生存曲线
A Comparison of Different Methods to Adjust Survival Curves for Confounders
论文作者
论文摘要
治疗特定的生存曲线是说明与事件时间结局的研究中的治疗效果的重要工具。在非随机研究中,未经调整的估计值可能导致由于混杂而导致的偏见。存在多种调整混杂因素生存曲线的方法。但是,目前尚不清楚哪种方法在哪种情况下最合适。我们的目标是比较治疗加权,G形式,倾向评分匹配,经验可能性估计和增强估计器的形式,以及基于伪值的估计值以及基于伪值的对应物,重点是他们的偏见和拟合度。我们使用德国流行病学试验的数据对所有方法进行了简短的审查,并通过与吸烟者和非吸烟者的生存进行对比,以与脚踝 - 桥式索引的数据进行对比。随后,我们使用蒙特卡洛模拟比较方法。我们考虑正确或不正确指定的用于描述治疗分配的模型,并使用不同样本量的情况使用时间。偏见和拟合优度是通过考虑整个生存曲线来确定的。正确使用时,所有方法在中等大型样品中均未显示系统的偏差。然而,基于COX回归的方法显示了小样本中的系统偏差。拟合优度在不同的方法和场景之间差异很大。利用结果模型的方法比其他技术更有效,而使用附加治疗分配模型的增强估计器则是无偏见的,而两种模型都与其他方法相当。在每个考虑的情况下,这些双重稳定方法都具有重要的优势。
Treatment specific survival curves are an important tool to illustrate the treatment effect in studies with time-to-event outcomes. In non-randomized studies, unadjusted estimates can lead to biased depictions due to confounding. Multiple methods to adjust survival curves for confounders exist. However, it is currently unclear which method is the most appropriate in which situation. Our goal is to compare forms of Inverse Probability of Treatment Weighting, the G-Formula, Propensity Score Matching, Empirical Likelihood Estimation and augmented estimators as well as their pseudo-values based counterparts in different scenarios with a focus on their bias and goodness-of-fit. We provide a short review of all methods and illustrate their usage by contrasting the survival of smokers and non-smokers, using data from the German Epidemiological Trial on Ankle-Brachial-Index. Subsequently, we compare the methods using a Monte-Carlo simulation. We consider scenarios in which correctly or incorrectly specified models for describing the treatment assignment and the time-to-event outcome are used with varying sample sizes. The bias and goodness-of-fit is determined by taking the entire survival curve into account. When used properly, all methods showed no systematic bias in medium to large samples. Cox regression based methods, however, showed systematic bias in small samples. The goodness-of-fit varied greatly between different methods and scenarios. Methods utilizing an outcome model were more efficient than other techniques, while augmented estimators using an additional treatment assignment model were unbiased when either model was correct with a goodness-of-fit comparable to other methods. These doubly-robust methods have important advantages in every considered scenario.