Urdu Noun Phrase Chunking - Hybrid Approach

Shahid Siddiq; Sarmad Hussain; Aasim Ali; Muhammad Kamran Malik; Wajid Ali

doi:10.1109/ialp.2010.71

ScienceGate Book Chapters

JOURNAL ARTICLE

Urdu Noun Phrase Chunking - Hybrid Approach

Shahid Siddiq Sarmad Hussain Aasim Ali Muhammad Kamran Malik Wajid Ali

Year: 2010 Vol: 2 Pages: 69-72

DOI: 10.1109/ialp.2010.71

Get Full-Text PDF Get Analytical Report

Abstract

In this work, chunking is used to mark the noun phrases of Urdu sentences. The approach used in this work is hybrid that combines statistical method and hand crafted rules. The statistical model used in this work is HMM along with IOB chunk annotation. From a POS tagged corpus of 100,000 words, around 90,000 word tokens are used for training and 10,000 word tokens for testing. Several experiments are conducted to achieve high accuracy with different combinations of input, output and rule application patterns. Overall accuracy of 97.52% is achieved using TnT Tagger. It is observed that the input sequence which is successful in this regard is merging of POS annotation with IOB annotation.

Keywords:

Computer science Chunking (psychology) Noun phrase Annotation Natural language processing Artificial intelligence Hidden Markov model Phrase Urdu Word (group theory) Part of speech Noun Nominalization Speech recognition Linguistics

Metrics

Cited By

0.40

FWCI (Field Weighted Citation Impact)

Refs

0.70

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Urdu Noun Phrase Chunking - Hybrid Approach

Abstract

Metrics

Citation History

Topics

Related Documents

Urdu noun phrase chunking: HMM based approach

Noun phrase chunking with APL2

Noun phrase chunking in Hebrew

Noun phrase chunking with APL2

A Semi-supervised Approach for Chinese Noun Phrase Chunking