Human Activity Recognition (HAR) has gained significant attention in recent years due to its wide-ranging applications. This paper introduces a novel hybrid visual transformer methodology designed to enhance the robust analysis and comprehension of activities. CVTN (Convolution Visual Transformer Network) leverages sensor data represented jointly in the spatial and temporal dimensions to improve the resilience of the HAR process. The proposed technique employs a hybrid model that integrates Convolutional Neural Networks (CNNs) and Visual Transformers (VTs). First, the CNN component learns spatial visual features from diverse sensor data. These learned features are then fed into the transformer segment of the model, where the VT captures temporal insights by observing sensor states across different time points. The efficacy of the CVTN methodology is assessed on the Kinetics dataset, which emulates real-world human activity recognition scenarios. The experimental results show that CVTN clearly outperforms recent baseline HAR solutions, reaffirming its potential for advancing activity analysis.
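The two-stage pipeline described above (a CNN extracting per-timestep spatial features, followed by a transformer attending across time) can be sketched in a few lines of NumPy. This is a minimal illustration under assumed shapes and random weights, not the paper's actual CVTN architecture: the kernel size, projection dimension, and single attention layer are all placeholder choices.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def conv2d(frame, kernel):
    # Valid 2D cross-correlation on one channel: the "CNN" spatial stage.
    h, w = frame.shape
    kh, kw = kernel.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(frame[i:i + kh, j:j + kw] * kernel)
    return out

def self_attention(seq, Wq, Wk, Wv):
    # seq: (T, d) sequence of per-timestep feature vectors.
    # Scaled dot-product attention lets each time step weigh all others:
    # this is the "transformer" temporal stage.
    Q, K, V = seq @ Wq, seq @ Wk, seq @ Wv
    scores = softmax(Q @ K.T / np.sqrt(K.shape[-1]))
    return scores @ V

rng = np.random.default_rng(0)
T, H, W, d = 8, 10, 10, 16          # assumed: 8 time steps, 10x10 "sensor images"
frames = rng.standard_normal((T, H, W))
kernel = rng.standard_normal((3, 3))

# Stage 1 (CNN): spatial features per time step, flattened and projected to d dims.
feats = np.stack([conv2d(f, kernel).ravel() for f in frames])   # (T, 64)
proj = rng.standard_normal((feats.shape[1], d))
seq = feats @ proj                                              # (T, d)

# Stage 2 (VT): self-attention mixes information across the T time steps.
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
out = self_attention(seq, Wq, Wk, Wv)                           # (T, d)
print(out.shape)
```

In a real model the random matrices would be learned parameters, the CNN would stack several convolution and pooling layers, and a classification head over the attended sequence would produce the activity label.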
Asif Iqbal, Muhammad Arslan Rauf, Salim Salim, Mosleh Mahamud, Mian Muhammad Yasir Khalil, Zhen Qin