JOURNAL ARTICLE

SLOG: A Structural Generalization Benchmark for Semantic Parsing

Abstract

The goal of compositional generalization benchmarks is to evaluate how well models generalize to new complex linguistic expressions. Existing benchmarks often focus on lexical generalization, the interpretation of novel lexical items in syntactic structures familiar from training; structural generalization tasks, where a model needs to interpret syntactic structures that are themselves unfamiliar from training, are often underrepresented, resulting in overly optimistic perceptions of how well models can generalize. We introduce SLOG, a semantic parsing dataset that extends COGS (Kim and Linzen, 2020) with 17 structural generalization cases. In our experiments, the generalization accuracy of Transformer models, including pretrained ones, only reaches 40.6%, while a structure-aware parser only achieves 70.8%. These results are far from the near-perfect accuracy existing models achieve on COGS, demonstrating the role of SLOG in foregrounding the large discrepancy between models’ lexical and structural generalization capacities.

Keywords:
Computer science Generalization Parsing Natural language processing Artificial intelligence Foregrounding Benchmark (surveying) Linguistics Mathematics

Metrics

3
Cited By
0.77
FWCI (Field Weighted Citation Impact)
45
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text Readability and Simplification
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

GENERALIZATION IN RELATION-AWARE TRANSFORMERS FOR SEMANTIC PARSING

MANZAMBI NDONGALA, Nathan

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2025
JOURNAL ARTICLE

Compositional Generalization in Multilingual Semantic Parsing over Wikidata

Ruixiang CuiRahul AralikatteHeather LentDaniel Hershcovich

Journal:   Transactions of the Association for Computational Linguistics Year: 2022 Vol: 10 Pages: 937-955
JOURNAL ARTICLE

GENERALIZATION IN RELATION-AWARE TRANSFORMERS FOR SEMANTIC PARSING

MANZAMBI NDONGALA, Nathan

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2025
© 2026 ScienceGate Book Chapters — All rights reserved.