Building a large Chinese corpus annotated with semantic dependency

Li Mingqin; Juanzi Li; Zhendong Dong; Zuoying Wang; Dajin Lu

doi:10.3115/1119250.1119262

JOURNAL ARTICLE

Year: 2003 Vol: 17 Pages: 84-91

Abstract

At present, most corpora are annotated mainly with syntactic knowledge. In this paper, we attempt to build a large corpus and annotate it with semantic knowledge using dependency grammar. We believe that words are the basic units of semantics, and that the structure and meaning of a sentence consist mainly of a series of semantic dependencies between individual words. A corpus on the scale of 1,000,000 words annotated with semantic dependencies has been built. Compared with syntactic knowledge, semantic knowledge is more difficult to annotate because the ambiguity problem is more serious. The paper addresses a strategy for improving consistency and defines congruence as a measure of the consistency of the tagged corpus. Finally, we compare our corpus with other well-known corpora.

Keywords:

Computer science; Natural language processing; Artificial intelligence; Ambiguity; Dependency (UML); Sentence; Consistency (knowledge bases); Dependency grammar; Semantics (computer science); Semantic role labeling; Information retrieval

Metrics

FWCI (Field Weighted Citation Impact): 1.92

Citation Normalized Percentile: 0.88

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Second Language Acquisition and Learning

Social Sciences → Psychology → Developmental and Educational Psychology

Related Documents

Building a large-scale annotated Chinese corpus

Building a Large Annotated Corpus of English: The

Building Chinese Discourse Corpus with Connective-driven Dependency Tree Structure

Building a large syntactically-annotated corpus of Vietnamese

Build a Large-Scale Syntactically Annotated Chinese Corpus