The acquisition of lexical semantic knowledge from large corpora

James Pustejovsky

doi:10.3115/1075527.1075581

ScienceGate Book Chapters

JOURNAL ARTICLE

The acquisition of lexical semantic knowledge from large corpora

James Pustejovsky

Year: 1992 Pages: 243-243

DOI: 10.3115/1075527.1075581

Get Full-Text PDF Get Analytical Report

Abstract

Machine-readable dictionaries provide the raw material from which to construct computationally useful representations of the generic vocabulary contained within it. Many sublanguages, however, are poorly represented in on-line dictionaries, if represented at all. Vocabularies geared to specialized domains are necessary for many applications, such as text categorization and information retrieval. In this paper I describe research devoted to developing techniques for building sublanguage lexicons via syntactic and statistical corpus analysis coupled with analytic techniques based on the tenets of a generative lexicon.

Keywords:

Sublanguage Computer science Natural language processing Artificial intelligence Lexicon Construct (python library) Vocabulary Categorization Generative grammar Text categorization Information retrieval Linguistics

Metrics

Cited By

3.30

FWCI (Field Weighted Citation Impact)

Refs

0.91

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Lexicography and Language Studies

Social Sciences → Arts and Humanities → Language and Linguistics

Second Language Acquisition and Learning

Social Sciences → Psychology → Developmental and Educational Psychology

The acquisition of lexical semantic knowledge from large corpora

Abstract

Metrics

Citation History

Topics

Related Documents

Lexical knowledge acquisition from bilingual corpora

Lexical knowledge acquisition from bilingual corpora

Using Web Corpora for the Automatic Acquisition of Lexical-Semantic Knowledge

An application of lexical semantics to knowledge acquisition from corpora

Automatic lexical acquisition from raw corpora