Mask-Pyramid Network: A Novel Panoptic Segmentation Method

Pengfei Xian; Lai-Man Po; Jingjing Xiong; Yuzhi Zhao; Wing-Yin Yu; Kwok-Wai Cheung

doi:10.3390/s24051411

ScienceGate Book Chapters

JOURNAL ARTICLE

Mask-Pyramid Network: A Novel Panoptic Segmentation Method

Pengfei Xian Lai-Man Po Jingjing Xiong Yuzhi Zhao Wing-Yin Yu Kwok-Wai Cheung

Year: 2024 Journal: Sensors Vol: 24 (5)Pages: 1411-1411 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/s24051411

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we introduce a novel panoptic segmentation method called the Mask-Pyramid Network. Existing Mask RCNN-based methods first generate a large number of box proposals and then filter them at each feature level, which requires a lot of computational resources, while most of the box proposals are suppressed and discarded in the Non-Maximum Suppression process. Additionally, for panoptic segmentation, it is a problem to properly fuse the semantic segmentation results with the Mask RCNN-produced instance segmentation results. To address these issues, we propose a new mask pyramid mechanism to distinguish objects and generate much fewer proposals by referring to existing segmented masks, so as to reduce computing resource consumption. The Mask-Pyramid Network generates object proposals and predicts masks from larger to smaller sizes. It records the pixel area occupied by the larger object masks, and then only generates proposals on the unoccupied areas. Each object mask is represented as a H × W × 1 logit, which fits well in format with the semantic segmentation logits. By applying SoftMax to the concatenated semantic and instance segmentation logits, it is easy and natural to fuse both segmentation results. We empirically demonstrate that the proposed Mask-Pyramid Network achieves comparable accuracy performance on the Cityscapes and COCO datasets. Furthermore, we demonstrate the computational efficiency of the proposed method and obtain competitive results.

Keywords:

Segmentation Computer science Pyramid (geometry) Artificial intelligence Softmax function Computer vision Fuse (electrical) Feature (linguistics) Object (grammar) Object detection Image segmentation Scale-space segmentation Pattern recognition (psychology) Deep learning Engineering Mathematics

Metrics

Cited By

1.06

FWCI (Field Weighted Citation Impact)

Refs

0.64

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Mask-Pyramid Network: A Novel Panoptic Segmentation Method

Abstract

Metrics

Citation History

Topics

Related Documents

Mask4Former: Mask Transformer for 4D Panoptic Segmentation

Panoptic segmentation of UAV images with deformable convolution network and mask scoring

YOLO-based Panoptic Segmentation Network

Contour-Aware Panoptic Segmentation Network

Mask-Based Panoptic LiDAR Segmentation for Autonomous Driving