Systematic Generalization by Finetuning? Analyzing Pretrained Language Models Using Constituency Tests

Aishik Chakraborty; Jackie CK Cheung; Timothy J. O’Donnell

doi:10.18653/v1/2023.blackboxnlp-1.27

ScienceGate Book Chapters

JOURNAL ARTICLE

Systematic Generalization by Finetuning? Analyzing Pretrained Language Models Using Constituency Tests

Aishik Chakraborty Jackie CK Cheung Timothy J. O’Donnell

Year: 2023 Pages: 357-366

DOI: 10.18653/v1/2023.blackboxnlp-1.27

Get Full-Text PDF Get Analytical Report

Abstract

Constituents are groups of words that behave as a syntactic unit. Many linguistic phenomena (e.g., question formation, diathesis alternations) require the manipulation and rearrangement of constituents in a sentence. In this paper, we investigate how different finetuning setups affect the ability of pretrained sequence-to-sequence language models such as BART and T5 to replicate constituency tests — transformations that involve manipulating constituents in a sentence. We design multiple evaluation settings by varying the combinations of constituency tests and sentence types that a model is exposed to during finetuning. We show that models can replicate a linguistic transformation on a specific type of sentence that they saw during finetuning, but performance degrades substantially in other settings, showing a lack of systematic generalization. These results suggest that models often learn to manipulate sentences at a surface level unrelated to the constituent-level syntactic structure, for example by copying the first word of a sentence. These results may partially explain the brittleness of pretrained language models in downstream tasks.

Keywords:

Sentence Computer science Natural language processing Artificial intelligence Generalization Copying Replicate Language model Word (group theory) Linguistics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.17

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Systematic Generalization by Finetuning? Analyzing Pretrained Language Models Using Constituency Tests

Abstract

Metrics

Topics

Related Documents

Improving Generalization of Pretrained Language Models

Masking as an Efficient Alternative to Finetuning for Pretrained Language Models

Generalization and Fairness Optimization in Pretrained Language Models

Adaptive BPE Tokenization for Enhanced Vocabulary Adaptation in Finetuning Pretrained Language Models

Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts