Keyword search in relational databases allows the user to search information without knowing database schema and using structural query language. As results needed by user are assembled from connected tuples of multiple relations, ranking keyword queries are needed to retrieve relevant results. For a given keyword query, the authors first generate candidate networks and also produce connected tuple trees according to the generated candidate networks by reducing the size of intermediate joining results. They then model the generated connected tuple trees as a document and evaluate score for each document to estimate its relevance. Finally, the authors retrieve top-k keyword queries by ranking the results. In this paper, the authors propose a new ranking method based on virtual document. They also propose Top-k CTT algorithm by using the frequency threshold value. The experimental results are shown by comparison of the proposed ranking method and the previous ranking methods on IMDB and DBLP datasets.
Xiangfu MengLongbing CaoXiaoyan ZhangJingyu Shao
Zhong ZengMong Li LeeTok Wang Ling
Sayan RanuMinh HoangAmbuj K. Singh
Lei ChenJianliang XuXin LinChristian S. JensenHaibo Hu
Alessandro D’AtriMarina MoscariniNicolas Spyratos