JOURNAL ARTICLE

Using Text and Visual Cues for Fine-Grained Classification

Zaryab ShakerFeng XiaoMuhammad Tahir

Year: 2021 Journal:   International Journal of Advanced Network Monitoring and Controls Vol: 6 (3)Pages: 42-49   Publisher: Exeley Inc

Abstract

Abstract Text is an important invention of humanity, which plays a key role in human life, so far from dark ages. Text in image is closely related to the scene or a product and is widely used in vision based application. In this paper we are addressing the problem of visual understanding with text. The main focus is combining textual cues and visual cues in deep neural network. First the text is recognized and classified from the image. Then we combine the attended word embedding and visual feature vector which are then optimized by CNN for Fine-grained image classification. We carried out the experiments on soft drink dataset in Pakistan. The results shows that the system achieves significant performance which can be potentially beneficial for real world application e.g. product search.

Keywords:
Computer science Artificial intelligence Focus (optics) Feature (linguistics) Word (group theory) Key (lock) Image (mathematics) Embedding Natural language processing Product (mathematics) Pattern recognition (psychology)

Metrics

3
Cited By
0.31
FWCI (Field Weighted Citation Impact)
43
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Fine-Grained Visual Text Prompting

Lingfeng YangXiang LiYueze WangXinlong WangJian Yang

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2024 Vol: 47 (3)Pages: 1594-1609
JOURNAL ARTICLE

SemLa: A Visual Analysis System for Fine-Grained Text Classification

Munkhtulga BattogtokhCosmin DavidescuG. FluckeRita Borgo

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2024 Vol: 38 (21)Pages: 23772-23774
© 2026 ScienceGate Book Chapters — All rights reserved.