Multi-modal Transformer for Indoor Human Action Recognition

Jeonghyeok Do; Munchurl Kim

doi:10.23919/iccas55662.2022.10003914

Year: 2022
Conference: 2022 22nd International Conference on Control, Automation and Systems (ICCAS)
Pages: 1155-1160

Abstract

Indoor human action recognition is used in a variety of fields. In the fitness industry, for example, it can recognize exercise movements, which can help people improve their health. Advances in sensors have made it easy to acquire multiple data modalities (RGB, IR, depth, and skeleton) from the same scene. Because these modalities are complementary, fusing them properly benefits human action recognition. However, existing studies are limited in how well they exploit the advantages of each modality. In this work, we therefore propose a Multi-Modal Transformer (MMT) that uses RGB and skeleton data simultaneously. With its transformer-based structure, MMT can capture the correlation between non-local joints in the skeleton modality. In addition, MMT requires neither additional training phases nor multiple trained networks when the number of people in the scene changes. In experiments on public benchmark datasets, MMT shows comparable results using only eight input frames.
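The abstract's point about non-local joints can be illustrated with a minimal sketch of single-head self-attention over RGB and skeleton tokens. This is not the authors' MMT architecture: the joint count (25), the token width (16), the 32-dimensional per-frame RGB features, the random projection weights, and the helper names `softmax` and `self_attention` are all assumptions made for illustration. Only the choice of eight frames comes from the abstract.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(tokens, w_q, w_k, w_v):
    # Single-head scaled dot-product self-attention over a token sequence.
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)
    return weights @ v, weights

rng = np.random.default_rng(0)

# 8 frames follows the abstract; 25 joints and dim=16 are assumptions.
num_frames, num_joints, dim = 8, 25, 16

# Random stand-ins for real inputs (hypothetical shapes).
skeleton = rng.normal(size=(num_frames, num_joints, 3))   # 3D joint coordinates
rgb_features = rng.normal(size=(num_frames, 32))          # per-frame RGB features

# Project both modalities into a shared token space.
w_joint = rng.normal(size=(3, dim)) * 0.1
w_rgb = rng.normal(size=(32, dim)) * 0.1
joint_tokens = (skeleton @ w_joint).reshape(num_frames * num_joints, dim)
rgb_tokens = rgb_features @ w_rgb

# One sequence holding both modalities: 8 RGB tokens + 200 joint tokens.
tokens = np.concatenate([rgb_tokens, joint_tokens], axis=0)

w_q, w_k, w_v = (rng.normal(size=(dim, dim)) * 0.1 for _ in range(3))
fused, attn = self_attention(tokens, w_q, w_k, w_v)

print(fused.shape, attn.shape)  # (208, 16) (208, 208)
```

Every row of `attn` assigns a nonzero weight to every token, so a joint token can draw on joints that are not adjacent in the skeleton, on joints from other frames, and on RGB tokens in a single step. A graph-based model restricted to skeleton edges would need several layers to relate distant joints.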

Keywords:

Computer science; Modality (human–computer interaction); Transformer; Artificial intelligence; Modalities; RGB color model; Modal; Action recognition; Computer vision; Benchmark (surveying); Sensor fusion; Pattern recognition (psychology); Engineering; Class (philosophy); Voltage

Metrics

FWCI (Field Weighted Citation Impact): 0.14

Citation Normalized Percentile: 0.46

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

CMF-Transformer: cross-modal fusion transformer for human action recognition

Multi Modal Aware Transformer Network for Effective Daily Life Human Action Recognition

Multi-Modal Transformer with Skeleton and Text for Action Recognition

Multi-level Fusion for Multi-modal Human Action Recognition

Hybrid Multi-modal Fusion for Human Action Recognition