JOURNAL ARTICLE

Sarcasm Identification and Classification in Hindi Newspaper Headlines

I. Ahmad, Praveen Gatla, Rajesh Kumar Mundotiya

Year: 2025 Journal: ACM Transactions on Asian and Low-Resource Language Information Processing Vol: 24 (4) Pages: 1-21 Publisher: Association for Computing Machinery

Abstract

Sarcasm identification in textual data is one of the most captivating areas of current research, and it is a challenging task for humans as well as for computers. In this article, we identify sarcasm in the headlines of two of the most-read Hindi newspapers in India, namely Hindustan and Dainik Jagran. We initially collected 88,518 Hindi newspaper headlines and identified 1,945 of them as sarcastic; these are the headlines considered in the present study. They belong to the political domain and were published during some of the recent Legislative Assembly Elections of 2020, 2021, and 2022. Various machine learning and deep learning techniques have been used to develop the baseline models. The study supports the assumption that sarcastic text does not always bear a negative sentiment; it may bear a positive sentiment depending on the context. The article aims to create a dataset of 1,945 Hindi newspaper headlines; to train and test machine learning and deep learning models, namely Extra Trees Classifier, Random Forest Classifier, XGBClassifier, fasttext-stackedTCN, and mBERT-stackedTCN, for sarcasm identification on that dataset; and to compare the results obtained by the models. Of all the chosen models, the Random Forest Classifier performs best, with an \(F_1\) score of 92.11 before data augmentation and 90.68 after data augmentation.
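The abstract compares models by their \(F_1\) score. As a reminder of what that number summarizes, here is a minimal sketch of the binary \(F_1\) computation in plain Python. The function name `f1_score_binary` and the toy labels are hypothetical and are not taken from the article; the abstract also does not say whether the reported scores are binary, macro-averaged, or weighted, so this shows only the simplest positive-class variant. The function returns fractions in [0, 1], whereas the abstract reports scores on a 0 to 100 scale.

```python
def f1_score_binary(gold, predicted, positive=1):
    """Return (precision, recall, F1) for the positive class."""
    pairs = list(zip(gold, predicted))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F1 is the harmonic mean of precision and recall.
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical labels (1 = sarcastic, 0 = not sarcastic); not data from the article.
gold = [1, 1, 1, 0, 0, 0, 1, 0]
predicted = [1, 1, 0, 0, 0, 1, 1, 0]

precision, recall, f1 = f1_score_binary(gold, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

Because \(F_1\) is the harmonic mean of precision and recall, a model cannot score well by favouring one at the expense of the other, which matters for a dataset where sarcastic headlines are a small fraction of those collected.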

Keywords:
Sarcasm, Hindi, Newspaper, Natural language processing, Identification (biology), Artificial intelligence, Computer science, Linguistics, Irony, Media studies, Biology, Botany, Sociology

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 23.94
Refs: 37
Citation Normalized Percentile: 0.96

Topics

Anthropological Studies and Insights
Social Sciences →  Social Sciences →  Anthropology
Crime, Deviance, and Social Control
Social Sciences →  Social Sciences →  Sociology and Political Science
South Asian Studies and Conflicts
Social Sciences →  Social Sciences →  Political Science and International Relations

Related Documents

BOOK-CHAPTER

Sarcasm Detection in Newspaper Headlines

Vishnu Sai Reddy Chilpuri, Saaman Nadeem, Tahir Mehmood, Muhammad Yaqoob

Lecture Notes on Data Engineering and Communications Technologies Year: 2024 Pages: 237-250

JOURNAL ARTICLE

Decoding sarcasm: unveiling nuances in newspaper headlines

D. Suma, M. Raviraja Holla, D. M.

Journal: International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering Year: 2024 Vol: 14 (3) Pages: 3011-3011