pose

HRFormer for Pose Estimation

Introduction

This is the official implementation of High-Resolution Transformer (HRT) for pose estimation. We present a High-Resolution Transformer (HRT) that learns high-resolution repre-sentations for dense prediction tasks, in contrast to the original Vision Transformerthat produces low-resolution representations and has high memory and computa-tional cost. We take advantage of the multi-resolution parallel design introduced inhigh-resolution convolutional networks (HRNet), along with local-window self-attention that performs self-attention over small non-overlapping image windows,for improving the memory and computation efficiency. In addition, we introduce aconvolution into the FFN to exchange information across the disconnected imagewindows. We demonstrate the effectiveness of the High-Resolution Transformeron human pose estimation and semantic segmentation tasks.

Results and models

2d Human Pose Estimation

Results on COCO `val2017` with detector having human AP of 56.4 on COCO `val2017` dataset

Backbone	Input Size	AP	AP⁵⁰	AP⁷⁵	AR^M	AR^L	AR	ckpt	log	script
HRT-S	256x192	74.0%	90.2%	81.2%	70.4%	80.7%	79.4%	ckpt	log	script
HRT-S	384x288	75.6%	90.3%	82.2%	71.6%	82.5%	80.7%	ckpt	log	script
HRT-B	256x192	75.6%	90.8%	82.8%	71.7%	82.6%	80.8%	ckpt	log	script
HRT-B	384x288	77.2%	91.0%	83.6%	73.2%	84.2%	82.0%	ckpt	log	script

Results on COCO `test-dev` with detector having human AP of 56.4 on COCO `val2017` dataset

Backbone	Input Size	AP	AP⁵⁰	AP⁷⁵	AR^M	AR^L	AR	ckpt	log	script
HRT-S	384x288	74.5%	92.3%	82.1%	70.7%	80.6%	79.8%	ckpt	log	script
HRT-B	384x288	76.2%	92.7%	83.8%	72.5%	82.3%	81.2%	ckpt	log	script

The models are first pre-trained on ImageNet-1K dataset, and then fine-tuned on COCO val2017 dataset.

Citation

If you find this project useful in your research, please consider cite:

@article{YuanFHZCW21,
  title={HRT: High-Resolution Transformer for Dense Prediction},
  author={Yuhui Yuan and Rao Fu and Lang Huang and Chao Zhang and Xilin Chen and Jingdong Wang},
  booktitle={arXiv},
  year={2021}
}

Name		Name	Last commit message	Last commit date
parent directory ..
.github		.github
configs		configs
demo		demo
docker		docker
docs		docs
mmcv_custom		mmcv_custom
mmpose		mmpose
requirements		requirements
resources		resources
tests		tests
tools		tools
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.readthedocs.yml		.readthedocs.yml
LICENSE		LICENSE
LOG		LOG
README.md		README.md
README_CN.md		README_CN.md
README_MMPOSE.md		README_MMPOSE.md
requirements.txt		requirements.txt
run_dist.sh		run_dist.sh
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

HRFormer for Pose Estimation

Introduction

Results and models

2d Human Pose Estimation

Results on COCO `val2017` with detector having human AP of 56.4 on COCO `val2017` dataset

Results on COCO `test-dev` with detector having human AP of 56.4 on COCO `val2017` dataset

Citation

FilesExpand file tree

pose

Directory actions

More options

Directory actions

More options

Latest commit

History

pose

Folders and files

parent directory

README.md

HRFormer for Pose Estimation

Introduction

Results and models

2d Human Pose Estimation

Results on COCO val2017 with detector having human AP of 56.4 on COCO val2017 dataset

Results on COCO test-dev with detector having human AP of 56.4 on COCO val2017 dataset

Citation

Results on COCO `val2017` with detector having human AP of 56.4 on COCO `val2017` dataset

Results on COCO `test-dev` with detector having human AP of 56.4 on COCO `val2017` dataset