JOURNAL ARTICLE

OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection

Sheng ZhangYuliang LiuLianwen JinZhongrong WeiChunhua Shen

Year: 2020 Journal:   IEEE Transactions on Multimedia Vol: 23 Pages: 454-467   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Scene text detection methods have achieved significant progresses. However, stack-omnidirectional text dilemma, under-segmentation of very close text words, and over-segmentation of arbitrary-shape long text lines, are still main challenges. Motivated by these problems, we proposed a two stage method called omnidirectional pyramid mask proposal text detector (OPMP). OPMP removes anchor mechanism that requires heuristic non-maximum suppress processing. Instead, it uses an effective pyramid lengthwise and sidewise residual sequence modeling method to produce arbitrary-shape proposals. To accurately extract the features of text shape, OPMP enhances the backbone layers by a multiple arbitrary-shape fitting mechanism. Finally, a multi-grain text classification module is proposed, which reclassifies each text region robustly. Comprehensive ablation studies demonstrate the effectiveness of each proposed component. In addition, experiments on various benchmarks, including ICDAR2015, MLT, MSRA-TD500, CTW1500, and Total-text, show that our method outperforms previous state-of-the-art methods.

Keywords:
Computer science Segmentation Omnidirectional antenna Artificial intelligence Pyramid (geometry) Image segmentation Pattern recognition (psychology) Computer vision Physics

Metrics

44
Cited By
3.04
FWCI (Field Weighted Citation Impact)
89
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network

Fagui LiuCheng ChenDian GuJingzhong Zheng

Journal:   IEEE Access Year: 2019 Vol: 7 Pages: 44219-44228
JOURNAL ARTICLE

Kernel Proposal Network for Arbitrary Shape Text Detection

Shi-Xue ZhangXiaobin ZhuJie-Bo HouChun YangXu-Cheng Yin

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2022 Vol: 34 (11)Pages: 8731-8742
JOURNAL ARTICLE

Scene Text Detection Using Pyramid-Based Text Proposal Network and Transformation Component Network

A.S. Venkata PraneelDr.T. Srinivasa Rao

Journal:   Indian Journal of Computer Science and Engineering Year: 2023 Vol: 14 (1)Pages: 21-32
JOURNAL ARTICLE

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Shi-Xue ZhangXiaobin ZhuChun YangHongfa WangXu-Cheng Yin

Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Year: 2021 Pages: 1285-1294
© 2026 ScienceGate Book Chapters — All rights reserved.