论文标题
最佳估计样品标准偏差与五个数字摘要
Optimally estimating the sample standard deviation from the five-number summary
论文作者
论文摘要
在报告临床研究的结果时,一些研究人员可能会选择五个数字摘要(包括样本中位数,第一和第三四分位数以及最小值和最大值),而不是样品均值和标准偏差,尤其是对于偏斜的数据。对于这些研究,当包括在荟萃分析中时,通常希望将五个数字的摘要转换回样品平均值和标准偏差。为此,在最近的文献中提出了几种方法,如今已越来越多地使用它们。在本文中,我们建议通过为样本标准偏差的平滑加权估计器开发出完全利用样本量信息的样本标准偏差,以进一步推进文献。为了易于实施,我们还得出了最佳权重的近似公式,以及用于样品标准偏差的快捷公式。数值结果表明,我们的新估计器为正常数据提供了更准确的估计,并且对非正常数据的表现也有利。与Luo等人的最佳样品平均估计量一起,我们的新方法显着改善了现有的数据转换方法,并且它们能够用作五名摘要报告的研究的荟萃分析中的“经验法则”。最后,为了实际使用,还提供了Excel电子表格和在线计算器,以实现我们的最佳估计器。
When reporting the results of clinical studies, some researchers may choose the five-number summary (including the sample median, the first and third quartiles, and the minimum and maximum values) rather than the sample mean and standard deviation, particularly for skewed data. For these studies, when included in a meta-analysis, it is often desired to convert the five-number summary back to the sample mean and standard deviation. For this purpose, several methods have been proposed in the recent literature and they are increasingly used nowadays. In this paper, we propose to further advance the literature by developing a smoothly weighted estimator for the sample standard deviation that fully utilizes the sample size information. For ease of implementation, we also derive an approximation formula for the optimal weight, as well as a shortcut formula for the sample standard deviation. Numerical results show that our new estimator provides a more accurate estimate for normal data and also performs favorably for non-normal data. Together with the optimal sample mean estimator in Luo et al., our new methods have dramatically improved the existing methods for data transformation, and they are capable to serve as "rules of thumb" in meta-analysis for studies reported with the five-number summary. Finally for practical use, an Excel spreadsheet and an online calculator are also provided for implementing our optimal estimators.