Weakly-Supervised Text Instance Segmentation

Xinyan Zu; Haiyang Yu; Bin Li; Xiangyang Xue

doi:10.1145/3581783.3612243

ScienceGate Book Chapters

JOURNAL ARTICLE

Weakly-Supervised Text Instance Segmentation

Xinyan Zu Haiyang Yu Bin Li Xiangyang Xue

Year: 2023 Pages: 1915-1923

DOI: 10.1145/3581783.3612243

Get Full-Text PDF Get Analytical Report

Abstract

Text segmentation is a challenging computer vision task with many downstream applications. Current text segmentation models need to be trained with pixel-level annotations, which requires a lot of labor cost. In this paper, we take the first attempt to perform weakly-supervised text instance segmentation through bridging text recognition and text segmentation. We observe that text recognition models are able to produce the attention localization of each text instance. Based on this observation, we propose a two-stage Text Adaptive Refinement (TAR) module to generate the pseudo labels based on the attention map of a text recognizer. Meanwhile, we develop a text segmentation module to take the rough attention location as input to predict segmentation masks, which are supervised by the aforementioned pseudo labels. In addition, we introduce a mask-augmented contrastive learning by treating the segmentation result as an augmented version of the input text image, thus improving the visual representation and further enhancing the performance of both recognition and segmentation. The experimental results demonstrate that the proposed method outperforms the state-of-the-art (SOTA) weakly-supervised generic segmentation methods by 18.95% and 17.80% in fgIoU on ICDAR13-FST and TextSeg. On MLT-S, COCO-TS and Total-Text, the proposed method achieves about 82% of the fully-supervised methods' performance. When evaluated on instance segmentation, the proposed method exceeds existing SOTA methods by 23.32% and 21.34% on ICDAR13-FST and TextSeg, respectively. Code and Supplementary Materials are available at https://github.com/FudanVI/FudanOCR/tree/main/weakly-text-segmentation.

Keywords:

Computer science Artificial intelligence Segmentation Image segmentation Scale-space segmentation Pattern recognition (psychology) Natural language processing Computer vision

Metrics

Cited By

1.09

FWCI (Field Weighted Citation Impact)

Refs

0.75

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Web Data Mining and Analysis

Physical Sciences → Computer Science → Information Systems

Weakly-Supervised Text Instance Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

Learning Instance Activation Maps for Weakly Supervised Instance Segmentation

Weakly Supervised Nuclei Segmentation Via Instance Learning

Weakly supervised segmentation via instance-aware propagation

PWISeg: Weakly-Supervised Surgical Instrument Instance Segmentation

Weakly Supervised Instance Segmentation Using Hybrid Networks