JOURNAL ARTICLE

High quality SimRank-based similarity search

Yu, WeirenMcCann, Julie A

Year: 2015 Journal:   Spiral (Imperial College London)   Publisher: Imperial College London

Abstract

SimRank is an influential link-based similarity measure that has been used in many fields of Web search and sociometry. The best-of-breed method by Kusumoto et al. [7], however, does not always deliver high-quality results, since it fails to accurately obtain its diagonal correction matrix D. Besides, SimRank is also limited by an unwanted“connectivity trait”: increasing the number of paths between nodes a and b often incurs a decrease in score s(a, b). The best-known solution, SimRank++ [1], cannot resolve this problem, since a revised score will be zero if a and b have no common in-neighbors. In this paper, we consider high-quality similarity search. Our scheme, SR#, is efficient and semantically meaningful: (1) We first formulate the exact D, and devise a “varied-D” method to accurately compute SimRank in linear memory. Moreover, by grouping computation, we also reduce the time of [7] from quadratic to linear in the number of iterations. (2) We design a “kernel-based”model to improve the quality of SimRank, and circumvent the “connectivity trait” issue. (3) We give mathematical insights to the semantic difference between SimRank and its variant, and correct an argument in [7]: “if D is replaced by a scaled identity matrix (1−γ)I, top-K rankings will not be affected much”. The experiments confirm that SR# can accurately extract high-quality scores, and is much faster than the state-of-the-art competitors.

Keywords:
Similarity (geometry) Diagonal Quality (philosophy) Measure (data warehouse) Similarity measure Matrix similarity Quadratic equation Matrix (chemical analysis)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.58
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Historical Studies on Reproduction, Gender, Health, and Societal Changes
Social Sciences →  Arts and Humanities →  History
Medical History and Research
Social Sciences →  Arts and Humanities →  History
Medical History and Innovations
Social Sciences →  Arts and Humanities →  History

Related Documents

JOURNAL ARTICLE

An experimental evaluation of simrank-based similarity search algorithms

Zhipeng ZhangYingxia ShaoBin CuiCe Zhang

Journal:   Proceedings of the VLDB Endowment Year: 2017 Vol: 10 (5)Pages: 601-612
JOURNAL ARTICLE

Efficient SimRank-Based Similarity Join

Weiguo ZhengLei ZouLei ChenDongyan Zhao

Journal:   ACM Transactions on Database Systems Year: 2017 Vol: 42 (3)Pages: 1-37
© 2026 ScienceGate Book Chapters — All rights reserved.