Yongtao Yu, Tao Jiang, Junyong Gao, Haiyan Guan, Dilong Li, Shangbing Gao, E. Tang, Wenhao Wang, Peng Tang, Jonathan Li
Equipped with multiple channels of laser scanners, multispectral light detection and ranging (MS-LiDAR) devices offer greater potential in earth observation tasks than their single-band counterparts, opening up a competitive solution for land cover mapping. In this paper, we develop a cross-context capsule vision transformer (CapViT) for land cover classification with MS-LiDAR data. Specifically, the CapViT is structured with three streams of capsule transformer encoders, each stacked from capsule transformer (CapFormer) blocks, to exploit long-range global feature interactions at different context scales. These cross-context feature semantics are then effectively fused to supervise accurate land cover type inference. In addition, each CapFormer block runs dual-path multi-head self-attention modules in parallel to model both spatial token correlations and channel feature interdependencies, which significantly enriches the semantics of the feature encodings. With these semantically enhanced encodings boosting the distinctiveness and quality of the feature representations, land cover classification accuracy is effectively improved. The CapViT is thoroughly evaluated on two MS-LiDAR datasets. Both quantitative assessments and comparative analyses demonstrate the competitive capability and advanced performance of the CapViT on land cover classification tasks.
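The dual-path idea described above can be illustrated with a minimal single-head sketch: one attention path operates over spatial tokens (an N-by-N attention map) and a parallel path operates over feature channels (a C-by-C attention map), with the two outputs fused. This is not the paper's implementation; the single-head form, the shared query/key/value input, and the additive fusion are simplifying assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def spatial_attention(X):
    # X: (N tokens, C channels); tokens attend over tokens -> (N, N) map
    A = softmax(X @ X.T / np.sqrt(X.shape[1]), axis=-1)
    return A @ X

def channel_attention(X):
    # channels attend over channels -> (C, C) map
    A = softmax(X.T @ X / np.sqrt(X.shape[0]), axis=-1)
    return X @ A

def dual_path_block(X):
    # parallel spatial and channel paths; additive fusion is an assumption here
    return spatial_attention(X) + channel_attention(X)

X = np.random.default_rng(0).normal(size=(16, 8))  # 16 tokens, 8 channels
Y = dual_path_block(X)
print(Y.shape)  # (16, 8): output keeps the token-by-channel layout
```

Both paths preserve the input shape, so the fused output can feed the next block unchanged; a learned projection and multiple heads would be added in a real model.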