Swin-Pose: Swin Transformer Based Human Pose Estimation

Zinan Xiong; Chenxi Wang; Ying Li; Yan Luo; Yu Cao

doi:10.1109/mipr54900.2022.00048

JOURNAL ARTICLE

Year: 2022 Pages: 228-233

Abstract

Convolutional neural networks (CNNs) have been widely used in many computer vision tasks. However, CNNs have a fixed receptive field and lack the capacity for long-range perception, which is crucial to human pose estimation. The Transformer architecture has recently been adopted in computer vision applications and has proven highly effective. We are interested in exploring its capability in human pose estimation, and thus propose a novel transformer-based model enhanced with a feature pyramid fusion structure. More specifically, we use a pre-trained Swin Transformer to extract features, and leverage a feature pyramid structure to extract and fuse feature maps from different stages. Our experimental results demonstrate that the proposed transformer-based model can achieve better performance than state-of-the-art CNN-based models.
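The abstract's idea of fusing feature maps from different backbone stages can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic top-down, FPN-style fusion (nearest-neighbour upsampling followed by element-wise addition) on single-channel toy maps, and it omits the learned projection layers a real model would use. The names `upsample2x`, `add_maps`, `fuse_top_down`, and `stage_maps` are hypothetical and chosen only for illustration.

```python
from typing import List

FeatureMap = List[List[float]]  # a single-channel 2D map stored as rows of floats


def upsample2x(fmap: FeatureMap) -> FeatureMap:
    """Nearest-neighbour upsampling: each value becomes a 2x2 block."""
    out: FeatureMap = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # duplicate each column
        out.append(wide)
        out.append(list(wide))  # duplicate the row
    return out


def add_maps(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Element-wise sum of two maps with identical shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def fuse_top_down(stages: List[FeatureMap]) -> List[FeatureMap]:
    """Fuse maps ordered from finest (largest) to coarsest (smallest).

    Assumption: each stage halves the spatial size of the previous one.
    Returns one fused map per stage, in the same order as the input.
    """
    fused = [stages[-1]]  # the coarsest map is passed through unchanged
    for fmap in reversed(stages[:-1]):
        # Upsample the previously fused (coarser) map and add it to this stage.
        fused.append(add_maps(fmap, upsample2x(fused[-1])))
    fused.reverse()
    return fused


# Toy stand-ins for backbone outputs at three stages (4x4, 2x2, 1x1).
stage_maps = [
    [[1.0] * 4 for _ in range(4)],
    [[10.0] * 2 for _ in range(2)],
    [[100.0]],
]
pyramid = fuse_top_down(stage_maps)
```

In this toy run the finest fused map carries information from all three stages, which is the property a pyramid fusion structure is meant to provide: coarse, wide-context features are propagated down to the high-resolution map used for keypoint localization.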

Keywords:

Artificial intelligence; Computer science; Transformer; Pose; Convolutional neural network; Leverage (statistics); Computer vision; Pattern recognition (psychology); Feature extraction; 3D pose estimation; Architecture; Engineering; Voltage

Metrics

FWCI (Field Weighted Citation Impact): 3.59

Citation Normalized Percentile: 0.92

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Depth-Based 6DoF Object Pose Estimation Using Swin Transformer

Swin-6D: 6D Pose Estimation via 3D Keypoints Voting with Swin Transformer

Swin-AFF: an improved accuracy 6D pose estimation network for high reflection and texture-less workpieces based on Swin transformer

Gaze estimation based on swin transformer

Bilateral Pose Transformer for Human Pose Estimation