在野外进行3D人体姿势估计的Smply基准测试

论文标题

在野外进行3D人体姿势估计的Smply基准测试

SMPLy Benchmarking 3D Human Pose Estimation in the Wild

论文作者

Leroy, Vincent, Weinzaepfel, Philippe, Brégier, Romain, Combaluzier, Hadrien, Rogez, Grégory

论文摘要

从图像中预测3D人类姿势的最新改进。已经引入了甚至可以从单个输入图像中预测姿势和形状的新颖方法，通常依赖于人体的参数模型，例如SMPL。尽管经常显示用于在野外捕获的图像的定性结果，但在这种情况下仍缺少适当的基准，因为在运动捕获室中以外的其他地方获得地面真相3D姿势很麻烦。本文提出了一条管道，可以轻松地生产和验证具有准确的基地真实性的数据集，我们使用该数据集进行了基准，我们在野外进行了近期的3D人姿势估计方法。我们利用最近引入的人体模型挑战数据集，该数据集包含在雕像等行动中冻结的人的野外视频，并利用了人们是静态的，相机可以将SMPL模型准确地适合序列。然后，仅使用在线RGB视频中，从567个场景中选择了总共24,428个带有注册车身型号的帧。我们基于该数据集的基于SMPL的最新人类姿势估计方法。我们的结果强调了挑战仍然存在，特别是对于困难的姿势，或者是部分被部分截断或被遮挡的场景。

Predicting 3D human pose from images has seen great recent improvements. Novel approaches that can even predict both pose and shape from a single input image have been introduced, often relying on a parametric model of the human body such as SMPL. While qualitative results for such methods are often shown for images captured in-the-wild, a proper benchmark in such conditions is still missing, as it is cumbersome to obtain ground-truth 3D poses elsewhere than in a motion capture room. This paper presents a pipeline to easily produce and validate such a dataset with accurate ground-truth, with which we benchmark recent 3D human pose estimation methods in-the-wild. We make use of the recently introduced Mannequin Challenge dataset which contains in-the-wild videos of people frozen in action like statues and leverage the fact that people are static and the camera moving to accurately fit the SMPL model on the sequences. A total of 24,428 frames with registered body models are then selected from 567 scenes at almost no cost, using only online RGB videos. We benchmark state-of-the-art SMPL-based human pose estimation methods on this dataset. Our results highlight that challenges remain, in particular for difficult poses or for scenes where the persons are partially truncated or occluded.

下载PDF全文

下载文献需遵守相关版权规定

论文标题