JOURNAL ARTICLE

High-Performance Unsupervised Relation Extraction from Large Corpora

Binjamin RozenfeldRonen Feldman

Year: 2006 Journal:   Proceedings Pages: 1032-1037   Publisher: Institute of Electrical and Electronics Engineers

Abstract

We present URIES - an unsupervised relation identification and extraction system. The system automatically identifies interesting binary relations between entities in the input corpus, and then proceeds to extract a large number of instances of these relations. The system discovers relations by clustering frequently co- occuring pairs of entities, based on the contexts in which they appear. Its complex pattern-based representation of the contexts allows the clustering step to achieve very high precision, sufficient for the clusters to perform as sets of seeds for bootstrapping a high-recall relation extraction process. In a series of experiments we demonstrate the successful performance of URIES and compare it to the two existing systems - a weakly supervised high-recall Web relation extraction system called SRES, and an unsupervised relation identification system that uses a simpler bag-ofwords representation of contexts. The experiments show that URIES performs comparably to SRES, but without any supervision, and that such performance is due to the power of its complex contexts representation and to its novel candidate selection method.

Keywords:
Computer science Relationship extraction Cluster analysis Relation (database) Bootstrapping (finance) Representation (politics) Identification (biology) Artificial intelligence Precision and recall Process (computing) Data mining Recall Machine learning Pattern recognition (psychology) Mathematics

Metrics

22
Cited By
4.10
FWCI (Field Weighted Citation Impact)
15
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Towards Large-Scale Unsupervised Relation Extraction from the Web

Bonan MinShuming ShiRalph GrishmanChin-Yew Lin

Journal:   International Journal on Semantic Web and Information Systems Year: 2012 Vol: 8 (3)Pages: 1-23
JOURNAL ARTICLE

Quootstrap: Scalable Unsupervised Extraction of Quotation-Speaker Pairs from Large News Corpora via Bootstrapping

Dario PavlloTiziano PiccardiRobert West

Journal:   Proceedings of the International AAAI Conference on Web and Social Media Year: 2018 Vol: 12 (1)
JOURNAL ARTICLE

Ensemble Semantics for Large-scale Unsupervised Relation Extraction

Bonan MinShuming ShiRalph GrishmanChin-Yew Lin

Journal:   Empirical Methods in Natural Language Processing Year: 2012 Vol: 2013 Pages: 1027-1037
© 2026 ScienceGate Book Chapters — All rights reserved.