JOURNAL ARTICLE

Con-Text: Text Detection for Fine-Grained Object Classification

Sezer KaraoğluRan TaoJan C. van GemertTheo Gevers

Year: 2017 Journal:   IEEE Transactions on Image Processing Vol: 26 (8)Pages: 3965-3980   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper focuses on fine-grained object classification using recognized scene text in natural images. While the state-of-the-art relies on visual cues only, this paper is the first work which proposes to combine textual and visual cues. Another novelty is the textual cue extraction. Unlike the state-of-the-art text detection methods, we focus more on the background instead of text regions. Once text regions are detected, they are further processed by two methods to perform text recognition, i.e., ABBYY commercial OCR engine and a state-of-the-art character recognition algorithm. Then, to perform textual cue encoding, bi- and trigrams are formed between the recognized characters by considering the proposed spatial pairwise constraints. Finally, extracted visual and textual cues are combined for fine-grained classification. The proposed method is validated on four publicly available data sets: ICDAR03, ICDAR13, Con-Text, and Flickr-logo. We improve the state-of-the-art end-to-end character recognition by a large margin of 15% on ICDAR03. We show that textual cues are useful in addition to visual cues for fine-grained classification. We show that textual cues are also useful for logo retrieval. Adding textual cues outperforms visual- and textual-only in fine-grained classification (70.7% to 60.3%) and logo retrieval (57.4% to 54.8%).

Keywords:
Computer science Artificial intelligence Focus (optics) Pattern recognition (psychology) Margin (machine learning) Natural language processing Object (grammar) Sensory cue Machine learning

Metrics

40
Cited By
2.67
FWCI (Field Weighted Citation Impact)
99
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Boosting Fine-Grained Oriented Object Detection via Text Features

Beichen ZhouQi BiJian DingGui-Song Xia

Lecture notes in computer science Year: 2024 Pages: 109-125
BOOK-CHAPTER

Text Mining for Fine-Grained Emotion Detection

Ubeeka JainParminder Singh

Lecture notes in networks and systems Year: 2024 Pages: 423-437
JOURNAL ARTICLE

Fine-grained and coarse-grained contrastive learning for text classification

Shaokang ZhangNing Ran

Journal:   Neurocomputing Year: 2024 Vol: 596 Pages: 128084-128084
JOURNAL ARTICLE

Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

Dheeraj MekalaVarun GangalJingbo Shang

Journal:   Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing Year: 2021 Pages: 583-594
BOOK-CHAPTER

Simple Framework for Interpretable Fine-Grained Text Classification

Munkhtulga BattogtokhG. FluckeCosmin DavidescuRita Borgo

Communications in computer and information science Year: 2024 Pages: 398-425
© 2026 ScienceGate Book Chapters — All rights reserved.