JOURNAL ARTICLE

Supporting top-K keyword search in XML databases

Abstract

Keyword search is considered to be an effective information discovery method for both structured and semi-structured data. In XML keyword search, query semantics is based on the concept of Lowest Common Ancestor (LCA). However, naive LCA-based semantics leads to exponential computation and result size. In the literature, LCA-based semantic variants (e.g., ELCA and SLCA) were proposed, which define a subset of all the LCAs as the results. While most existing work focuses on algorithmic efficiency, top-K processing for XML keyword search is an important issue that has received very little attention. Existing algorithms focusing on efficiency are designed to optimize the semantic pruning and are incapable of supporting top-K processing. On the other hand, straightforward applications of top-K techniques from other areas (e.g., relational databases) generate LCAs that may not be the results and unnecessarily expand efforts in the semantic pruning. In this paper, we propose a series of join-based algorithms that combine the semantic pruning and the top-K processing to support top-K keyword search in XML databases. The algorithms essentially reduce the keyword query evaluation to relational joins, and incorporate the idea of the top-K join from relational databases. Extensive experimental evaluations show the performance advantages of our algorithms.

Keywords:
Computer science Information retrieval Pruning XML database XML Semantics (computer science) Relational database Joins Database Data mining World Wide Web Programming language

Metrics

81
Cited By
14.38
FWCI (Field Weighted Citation Impact)
41
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications
Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

A Top-K Keyword Search for Supporting Semantics in Relational Databases

Bin WangXiaochun YangGuoren Wang

Journal:   Journal of Software Year: 2008 Vol: 19 (9)Pages: 2362-2375
BOOK-CHAPTER

Scalable Top-k Keyword Search in Relational Databases

Yanwei XuJihong GuanYoshiharu Ishikawa

Lecture notes in computer science Year: 2012 Pages: 65-80
JOURNAL ARTICLE

Efficient Top-k Keyword Search Over Multidimensional Databases

Ziqiang YuXiaohui YuYang Liu

Journal:   International Journal of Data Warehousing and Mining Year: 2013 Vol: 9 (3)Pages: 1-21
© 2026 ScienceGate Book Chapters — All rights reserved.