JOURNAL ARTICLE

The acquisition of lexical semantic knowledge from large corpora

Abstract

Machine-readable dictionaries provide the raw material from which to construct computationally useful representations of the generic vocabulary they contain. Many sublanguages, however, are poorly represented in on-line dictionaries, if represented at all. Vocabularies geared to specialized domains are necessary for many applications, such as text categorization and information retrieval. In this paper I describe research devoted to developing techniques for building sublanguage lexicons via syntactic and statistical corpus analysis coupled with analytic techniques based on the tenets of a generative lexicon.

Keywords:
Sublanguage; Computer science; Natural language processing; Artificial intelligence; Lexicon; Construct (python library); Vocabulary; Categorization; Generative grammar; Text categorization; Information retrieval; Linguistics

Metrics

Cited By: 13
FWCI (Field Weighted Citation Impact): 3.30
Refs: 20
Citation Normalized Percentile: 0.91

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Lexicography and Language Studies
Social Sciences →  Arts and Humanities →  Language and Linguistics
Second Language Acquisition and Learning
Social Sciences →  Psychology →  Developmental and Educational Psychology

Related Documents

DISSERTATION

Lexical knowledge acquisition from bilingual corpora

武仁 宇津呂

University: Medical Entomology and Zoology Year: 1994

JOURNAL ARTICLE

Using Web Corpora for the Automatic Acquisition of Lexical-Semantic Knowledge

Sabine Schulte im Walde, Stefan Müller

Journal: LDV-Forum/Journal for language technology and computational linguistics Year: 2013 Vol: 28 (2) Pages: 85-105