论文标题
在野外进行3D人体姿势估计的Smply基准测试
SMPLy Benchmarking 3D Human Pose Estimation in the Wild
论文作者
论文摘要
从图像中预测3D人类姿势的最新改进。已经引入了甚至可以从单个输入图像中预测姿势和形状的新颖方法,通常依赖于人体的参数模型,例如SMPL。尽管经常显示用于在野外捕获的图像的定性结果,但在这种情况下仍缺少适当的基准,因为在运动捕获室中以外的其他地方获得地面真相3D姿势很麻烦。本文提出了一条管道,可以轻松地生产和验证具有准确的基地真实性的数据集,我们使用该数据集进行了基准,我们在野外进行了近期的3D人姿势估计方法。我们利用最近引入的人体模型挑战数据集,该数据集包含在雕像等行动中冻结的人的野外视频,并利用了人们是静态的,相机可以将SMPL模型准确地适合序列。然后,仅使用在线RGB视频中,从567个场景中选择了总共24,428个带有注册车身型号的帧。我们基于该数据集的基于SMPL的最新人类姿势估计方法。我们的结果强调了挑战仍然存在,特别是对于困难的姿势,或者是部分被部分截断或被遮挡的场景。
Predicting 3D human pose from images has seen great recent improvements. Novel approaches that can even predict both pose and shape from a single input image have been introduced, often relying on a parametric model of the human body such as SMPL. While qualitative results for such methods are often shown for images captured in-the-wild, a proper benchmark in such conditions is still missing, as it is cumbersome to obtain ground-truth 3D poses elsewhere than in a motion capture room. This paper presents a pipeline to easily produce and validate such a dataset with accurate ground-truth, with which we benchmark recent 3D human pose estimation methods in-the-wild. We make use of the recently introduced Mannequin Challenge dataset which contains in-the-wild videos of people frozen in action like statues and leverage the fact that people are static and the camera moving to accurately fit the SMPL model on the sequences. A total of 24,428 frames with registered body models are then selected from 567 scenes at almost no cost, using only online RGB videos. We benchmark state-of-the-art SMPL-based human pose estimation methods on this dataset. Our results highlight that challenges remain, in particular for difficult poses or for scenes where the persons are partially truncated or occluded.