JOURNAL ARTICLE

Deterministic dependency parsing of English text

Abstract

This paper presents a deterministic dependency parser based on memory-based learning, which parses English text in linear time. When trained and evaluated on the Wall Street Journal section of the Penn Treebank, the parser achieves a maximum attachment score of 87.1%. Unlike most previous systems, the parser produces labeled dependency graphs, using as arc labels a combination of bracket labels and grammatical role labels taken from the Penn Treebank II annotation scheme. The best overall accuracy obtained for identifying both the correct head and the correct arc label is 86.0%, when restricted to grammatical role labels (7 labels), and 84.4% for the maximum set (50 labels).

Keywords:
Treebank Computer science Natural language processing Artificial intelligence Parsing Dependency (UML) Dependency grammar Set (abstract data type) Scheme (mathematics) Annotation Programming language Mathematics

Metrics

222
Cited By
20.08
FWCI (Field Weighted Citation Impact)
34
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Algorithms for Deterministic Incremental Dependency Parsing

Joakim Nivre

Journal:   Computational Linguistics Year: 2008 Vol: 34 (4)Pages: 513-553
JOURNAL ARTICLE

Text-level Discourse Dependency Parsing

Sujian LiLiang WangZhigang CaoWenjie Li

Journal:   Greater South Information System Year: 2014
© 2026 ScienceGate Book Chapters — All rights reserved.