Large Language Model (LLM)-based transformers, such as Bidirectional Encoder Representations from Transformers (BERT), are currently receiving significant attention for various Natural Language Processing (NLP) tasks, such as machine translation, classification, and auto-completion, and they deliver substantial performance improvements for text classification. Multi-label classification typically requires more computation than binary or multi-class classification, and the computational demands grow further when large datasets are involved. Federated Learning (FL) offers a way to train models in a distributed manner while preserving data privacy. This paper proposes a novel approach for building a machine learning model that handles a sizeable textual dataset for multi-label classification by leveraging FL. FL is used to train a compound model constructed by extending BERT with a One-dimensional Convolutional Neural Network (1D CNN). First, the experiment was conducted on a single machine (Central) with the entire dataset. Then, the dataset was split into two groups and the same experiment was performed in a federated fashion (BERT-FL Fusion). The FL setup considerably reduced the computing power required to derive an equivalent global model while increasing accuracy, precision, and F1 score and reducing Hamming loss.
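To make the described compound architecture concrete, the following is a minimal sketch (not the authors' code) of a BERT encoder extended with a 1D CNN head for multi-label classification, using PyTorch and Hugging Face Transformers; the class name, layer sizes, and checkpoint name are illustrative assumptions.

```python
# Illustrative sketch, assuming PyTorch and Hugging Face Transformers.
# Names such as BertCNNClassifier, num_labels, and "bert-base-uncased" are
# assumptions, not taken from the paper.
import torch
import torch.nn as nn
from transformers import BertModel

class BertCNNClassifier(nn.Module):
    def __init__(self, num_labels, conv_channels=128, kernel_size=3):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        hidden = self.bert.config.hidden_size
        # 1D convolution applied over the sequence of BERT token embeddings
        self.conv = nn.Conv1d(hidden, conv_channels, kernel_size, padding=1)
        self.pool = nn.AdaptiveMaxPool1d(1)
        self.classifier = nn.Linear(conv_channels, num_labels)

    def forward(self, input_ids, attention_mask):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        seq = out.last_hidden_state.transpose(1, 2)  # (batch, hidden, seq_len)
        feat = self.pool(torch.relu(self.conv(seq))).squeeze(-1)
        return self.classifier(feat)                 # one logit per label

# For multi-label targets, training would typically use a per-label sigmoid:
# loss = nn.BCEWithLogitsLoss()(logits, multi_hot_targets)
```

In an FL setting such as the one described, each client would train a local copy of this model on its data split and a server would aggregate the resulting weights (e.g., by federated averaging) into the global model.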