“Part of Speech Tagging – A Corpus Based Approach”

S Rashmi; M. Hanumanthappa

doi:10.1007/978-981-10-3433-6_11

ScienceGate Book Chapters

BOOK-CHAPTER

“Part of Speech Tagging – A Corpus Based Approach”

S Rashmi M. Hanumanthappa

Year: 2016 Communications in computer and information science Pages: 88-96 Publisher: Springer Science+Business Media

DOI: 10.1007/978-981-10-3433-6_11

Get Full-Text PDF Get Analytical Report

Abstract

POS tagging, an ideal way to augment a corpus is an imperative abstraction for text mining. However with an increase in the amount of linguistic errors and distinctive fashion of language ambiguities, the data filtered by POS tagging is noisier. In this paper, probabilistic tagging and tagging based on Markov models are combined to estimate the association probabilities. Based on this combined approach, error estimation model is defined. Comparison study is made on different corpus available in NLTK such as Crubadan, Brown and INSPEC. The results obtained by the proposed methodologies show a drastic increase in the accuracy rate of about 98% when compared to the existing algorithms which shows an average of 96% accurate. The performance measure is plotted to calculate the error ratio across the maximum-likelihood estimation.

Keywords:

Computer science Probabilistic logic Measure (data warehouse) Hidden Markov model Artificial intelligence Word error rate Abstraction Natural language processing Ideal (ethics) Speech recognition Pattern recognition (psychology) Data mining

Metrics

Cited By

0.33

FWCI (Field Weighted Citation Impact)

Refs

0.60

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Rough Sets and Fuzzy Logic

Physical Sciences → Computer Science → Computational Theory and Mathematics

“Part of Speech Tagging – A Corpus Based Approach”

Abstract

Metrics

Citation History

Topics

Related Documents

Corpus based part-of-speech tagging

Part of Speech Tagging for Hindi Corpus

Part of speech tagging for Mongolian corpus

An Efficient Corpus Based Part-of-Speech Tagging with GEP

Part-of-Speech Tagging of Hindi Corpus Using Rule-Based Method