论文标题

图形 - pcnn:两级人姿势估计,图形姿势细化

Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement

论文作者

Wang, Jian, Long, Xiang, Gao, Yuan, Ding, Errui, Wen, Shilei

论文摘要

最近,大多数最先进的人姿势估计方法基于热图回归。密钥点的最终坐标是通过直接解码热图获得的。在本文中,我们旨在找到一种更好的方法来获得更准确的本地化结果。我们主要提出了改进的两个建议:1)应应用不同的特征和方法以进行粗糙,准确的定位,2)应考虑关键点之间的关系。具体而言,我们提出了一个基于两阶段的基于图形的和模型的框架框架,称为Graph-PCNN,其本地化子网和图形姿势细化模块添加到了原始的热图回归网络中。在第一阶段,应用热图回归网络以获得粗糙的定位结果,并采样了一组称为指导点的建议关键。在第二阶段,对于每个引导点,定位子网提取不同的视觉特征。图形姿势细化模块探索了指导点之间的关系,以获得更准确的定位结果。实验表明,图形PCNN可用于各种骨干,以大幅度提高性能。如果没有铃铛和口哨声,我们的最佳模型就可以在可可Test-Dev拆分上实现新的76.8%AP。

Recently, most of the state-of-the-art human pose estimation methods are based on heatmap regression. The final coordinates of keypoints are obtained by decoding heatmap directly. In this paper, we aim to find a better approach to get more accurate localization results. We mainly put forward two suggestions for improvement: 1) different features and methods should be applied for rough and accurate localization, 2) relationship between keypoints should be considered. Specifically, we propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN, with a localization subnet and a graph pose refinement module added onto the original heatmap regression network. In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled. In the second stage, for each guided point, different visual feature is extracted by the localization subnet. The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results. Experiments show that Graph-PCNN can be used in various backbones to boost the performance by a large margin. Without bells and whistles, our best model can achieve a new state-of-the-art 76.8% AP on COCO test-dev split.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源