Jiyoung Yoon, Muhammad Junaid, Sajid Ali, Jongwuk Lee
AI technology in the legal domain has developed at a rapid pace around the world, but little research has been conducted in the Korean legal field due to language barriers and the high level of expertise required. We make the first attempt at abstractive summarization of Korean legal decision texts and publicly release our collected dataset. We utilize two pretrained language models, BERT2BERT and BART, for our task. Both follow the encoder-decoder Transformer architecture. While BERT2BERT applies BERT pre-training to both the encoder and the decoder, BART combines a BERT-style encoder with a GPT-style decoder. We then evaluate the baseline models and show that, despite the difference in language style, the applied models generate high-quality summaries. We also show that pre-training with both autoencoding and autoregressive objectives yields better performance than using a denoising autoencoder alone.
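The BERT2BERT setup described above (a BERT encoder paired with a BERT-initialized decoder carrying cross-attention) can be sketched with the Hugging Face `EncoderDecoderModel` API. This is a minimal, hedged illustration: the model below is built config-only with randomly initialized weights and deliberately tiny dimensions, which are not the paper's actual settings; in practice one would load pretrained checkpoints, e.g. via `EncoderDecoderModel.from_encoder_decoder_pretrained`.

```python
# Minimal sketch of a BERT2BERT-style encoder-decoder (illustrative sizes,
# random initialization; not the authors' actual configuration).
from transformers import BertConfig, EncoderDecoderConfig, EncoderDecoderModel

enc_cfg = BertConfig(
    vocab_size=30522, hidden_size=128, num_hidden_layers=2,
    num_attention_heads=2, intermediate_size=256,
)
# The decoder reuses the BERT architecture but runs causally and
# cross-attends to the encoder outputs.
dec_cfg = BertConfig(
    vocab_size=30522, hidden_size=128, num_hidden_layers=2,
    num_attention_heads=2, intermediate_size=256,
    is_decoder=True, add_cross_attention=True,
)
config = EncoderDecoderConfig.from_encoder_decoder_configs(enc_cfg, dec_cfg)
model = EncoderDecoderModel(config=config)

print(model.config.decoder.is_decoder)            # True
print(model.config.decoder.add_cross_attention)   # True
```

With pretrained weights loaded instead, the same object can be fine-tuned on (decision text, summary) pairs as a standard sequence-to-sequence summarization model.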