JOURNAL ARTICLE

Semi-Supervised Learning for Prompt Classification in ChatGPT

Abstract

Chat Generative Pre-trained Transformer (ChatGPT) is a large language model-based chatbot that can interact with people and hold interesting and interactive conversations.Individuals have the ability to engage in dialogues with the model by submitting input sentences or prompts of their choosing.Over the past months, ChatGPT has been continuously growing in popularity, reaching over one million users in a matter of days and surpassing the one billion visits in less than 5 months.It is clear that ChatGPT has become an important aid for numerous people, as there are various tasks it is used into, such as generation, question answering, rewriting or simple chatting.Such tasks are represented by certain instructions that are encapsulated in the user input sent to the model.Having access to the most common types of user's instructions could help Machine Learning engineers improve current datasets and models and adapt them to better suit human needs.However, obtaining a large amount of annotated data is expensive and time-consuming.In order to address the aforementioned issues, we investigate the usage of semi-supervised learning techniques.In this paper we describe the creation process of a new multi-label classification dataset for i nstruction classification i n C hatGPT u sing u ser-shared c onversations and employ various semi-supervised learning approaches in order to boost our model's performances.The unlabeled data used for semi-supervised learning methods is extracted from the same source as our labeled dataset.This approach increased the weighted F1 score of the model by 3.5%.

Keywords:
Computer science Chatbot Artificial intelligence Machine learning Supervised learning Popularity Rewriting Generative grammar Transformer Process (computing) Generative model Natural language processing Artificial neural network

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
28
Refs
0.20
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Semi-supervised Classification with Metric Learning

Gang ZhangLianglun Cheng

Year: 2010 Vol: 15 Pages: 123-126
DISSERTATION

Semi-supervised learning for image classification

Sandra Ebert

University:   SciDok (Saarland University and State Library) Year: 2012
© 2026 ScienceGate Book Chapters — All rights reserved.