Atrous Temporal Convolutional Network for Video Action Segmentation

Jiahao Wang; Zhengyin Du; Annan Li; Yunhong Wang

doi:10.1109/icip.2019.8803088

ScienceGate Book Chapters

JOURNAL ARTICLE

Atrous Temporal Convolutional Network for Video Action Segmentation

Jiahao Wang Zhengyin Du Annan Li Yunhong Wang

Year: 2019 Pages: 1585-1589

DOI: 10.1109/icip.2019.8803088

Get Full-Text PDF Get Analytical Report

Abstract

Fine-grained temporal human action segmentation in untrimmed videos is receiving increasing attention due to its extensive applications in surveillance, robotics, and beyond. It is crucial for an action segmentation system to be robust to the temporal scale of different actions since in practical applications the duration of an action can vary from less than a second to tens of minutes. In this paper, we introduce a novel atrous temporal convolutional network (AT-Net), which explicitly generates multiscale video contextual representations by utilizing atrous temporal pyramid pooling (ATPP) and has an architecture of encoder-decoder fully convolutional network. In the decoding stage, AT-Net combines multiscale contextual features with low-level local features to generate high-quality action segmentation results. Experiments on the 50 Salads, GTEA and JIGSAWS benchmarks demonstrate that AT-Net achieves improvement over the state of the art.

Keywords:

Computer science Segmentation Artificial intelligence Pooling Pyramid (geometry) Encoder Pattern recognition (psychology) Convolutional neural network Decoding methods Convolutional code Action (physics) Computer vision Mathematics Algorithm

Metrics

Cited By

0.86

FWCI (Field Weighted Citation Impact)

Refs

0.77

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Atrous Temporal Convolutional Network for Video Action Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-Receptive Atrous Convolutional Network for Semantic Segmentation

Depthwise Separable Temporal Convolutional Network for Action Segmentation

Atrous convolutional feature network for weakly supervised semantic segmentation

Long video-based action segmentation for earthmoving excavators using improved Temporal Convolutional Network models

Stacking-Based Attention Temporal Convolutional Network for Action Segmentation