JOURNAL ARTICLE

Matching Attributes across Overlapping Heterogeneous Data Sources Using Mutual Information

Huimin Zhao

Year: 2010 Journal:   Journal of Database Management Vol: 21 (4)Pages: 91-110   Publisher: IGI Global

Abstract

Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.

Keywords:
Computer science Matching (statistics) Mutual information Data mining Measure (data warehouse) Artificial intelligence Statistics Mathematics

Metrics

5
Cited By
0.38
FWCI (Field Weighted Citation Impact)
54
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Semantic matching across heterogeneous data sources

Huimin Zhao

Journal:   Communications of the ACM Year: 2007 Vol: 50 (1)Pages: 45-50
BOOK-CHAPTER

Entity Matching Across Multiple Heterogeneous Data Sources

Chao KongMing GaoXu ChenWeining QianAoying Zhou

Lecture notes in computer science Year: 2016 Pages: 133-146
© 2026 ScienceGate Book Chapters — All rights reserved.