JOURNAL ARTICLE

Kernel Proposal Network for Arbitrary Shape Text Detection

Shi-Xue ZhangXiaobin ZhuJie-Bo HouChun YangXu-Cheng Yin

Year: 2022 Journal:   IEEE Transactions on Neural Networks and Learning Systems Vol: 34 (11)Pages: 8731-8742   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Segmentation-based methods have achieved great success for arbitrary shape text detection. However, separating neighboring text instances is still one of the most challenging problems due to the complexity of texts in scene images. In this article, we propose an innovative kernel proposal network (dubbed KPN) for arbitrary shape text detection. The proposed KPN can separate neighboring text instances by classifying different texts into instance-independent feature maps, meanwhile avoiding the complex aggregation process existing in segmentation-based arbitrary shape text detection methods. To be concrete, our KPN will predict a Gaussian center map for each text image, which will be used to extract a series of candidate kernel proposals (i.e., dynamic convolution kernel) from the embedding feature maps according to their corresponding keypoint positions. To enforce the independence between kernel proposals, we propose a novel orthogonal learning loss (OLL) via orthogonal constraints. Specifically, our kernel proposals contain important self-information learned by network and location information by position embedding. Finally, kernel proposals will individually convolve all embedding feature maps for generating individual embedded maps of text instances. In this way, our KPN can effectively separate neighboring text instances and improve the robustness against unclear boundaries. To the best of our knowledge, our work is the first to introduce the dynamic convolution kernel strategy to efficiently and effectively tackle the adhesion problem of neighboring text instances in text detection. Experimental results on challenging datasets verify the impressive performance and efficiency of our method. The code and model are available at https://github.com/GXYM/KPN.

Keywords:
Kernel (algebra) Computer science Embedding Robustness (evolution) Segmentation Feature (linguistics) Artificial intelligence Kernel method Graph kernel Pattern recognition (psychology) Kernel embedding of distributions Mathematics Discrete mathematics Support vector machine

Metrics

33
Cited By
4.09
FWCI (Field Weighted Citation Impact)
68
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Shi-Xue ZhangXiaobin ZhuChun YangHongfa WangXu-Cheng Yin

Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Year: 2021 Pages: 1285-1294
JOURNAL ARTICLE

Text kernel calculation for arbitrary shape text detection

Xu HanJunyu GaoYuan YuanQi Wang

Journal:   The Visual Computer Year: 2023 Vol: 40 (4)Pages: 2641-2654
© 2026 ScienceGate Book Chapters — All rights reserved.