JOURNAL ARTICLE

Improving Text Simplification with Factuality Error Detection

Abstract

In the past few years, the field of text simplification has been dominated by supervised learning approaches thanks to the appearance of large parallel datasets such as Wikilarge and Newsela. However, these datasets suffer from sentence pairs with factuality errors which compromise the models’ performance. So, we proposed a model-independent factuality error detection mechanism, considering bad simplification and bad alignment, to refine the Wikilarge dataset through reducing the weight of these samples during training. We demonstrated that this approach improved the performance of the state-of-the-art text simplification model TST5 by an FKGL reduction of 0.33 and 0.29 on the TurkCorpus and ASSET testing datasets respectively. Our study illustrates the impact of erroneous samples in TS datasets and highlights the need for automatic methods to improve their quality.

Keywords:
Computer science Sentence Artificial intelligence Field (mathematics) Quality (philosophy) Reduction (mathematics) Machine learning Natural language processing Pattern recognition (psychology) Data mining Mathematics

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
25
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Evaluating Factuality in Text Simplification

Ashwin DevarajWilliam P. SheffieldByron WallaceJunyi Jessy Li

Journal:   Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Year: 2022 Vol: 2022 Pages: 7331-7345
DISSERTATION

Medical text simplification and an evaluation of factuality

Devaraj, Ashwin0000-0001-5571-0681

University:   Texas Digital Library (University of Texas) Year: 2022
© 2026 ScienceGate Book Chapters — All rights reserved.