JOURNAL ARTICLE

Entity Relation Extraction from geological text using Conditional Random Fields and subsequence kernels

Abstract

An important research field in text mining is Entity Relation Extraction. Extracting various relations between geological entities is of immense benefit to developing intelligent search tools for geology researchers. In this paper Conditional Random Fields (CRFs) as well as sequence kernels are used for extracting relations between entities from a geological corpus. A geological corpus was developed from a collection of scientific reports and articles on the geology of the Indian subcontinent. The training set, consisting of more than 200K words, has been annotated with a named entity tag set of seventeen tags and with labeled instances of part-of and nearby relations. The system is able to recognize part-of and near-by relations with 71.57% and 77.27% F-measure values for T-CRF, and 78.25% and 83.71% for subsequence kernels. The extracted relations were used for query expansion in a retrieval system to achieve a gain of 10.86% for T-CRF, and 10.58% for subsequence kernels over the baseline Mean Average Precision.

Keywords:
Conditional random field Subsequence CRFS Computer science Set (abstract data type) Relation (database) Information extraction Field (mathematics) Named-entity recognition Sequence (biology) Natural language processing Information retrieval Longest common subsequence problem Artificial intelligence Relationship extraction Random forest Longest increasing subsequence Data mining Algorithm Mathematics

Metrics

6
Cited By
0.00
FWCI (Field Weighted Citation Impact)
25
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.