Jingbei Li, Chengyu Guo, Zichao Wei
Document ranking aims to rank and return a set of documents according to their relevance to a query. Traditional document ranking methods represent queries and documents as sparse vectors and rank documents by the similarity between those vectors. In recent years, with advances in big data technology such as the emergence of knowledge graphs (KGs), entities have come to be treated as essential pivots connecting queries and documents, and have been used to improve the document ranking process. However, state-of-the-art entity embedding methods usually place entities with close proximity or similar contexts into the same region of the embedding space, which does not suit document ranking, where entity relevance plays the more important role. We therefore propose to enhance document ranking with relevance-based entity embeddings. In particular, we introduce a neural network for training such embeddings, with the objective that, given an information need (i.e., a set of query entities), the entities occurring frequently in the top retrieved documents should be predicted. We then train the model on Wikipedia articles and use it to improve a baseline document ranking framework. Empirical experiments validate the superiority of the proposed method.
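The training objective described above, predicting the entities that occur frequently in top retrieved documents given the query entities, can be sketched with a skip-gram-style setup. The snippet below is a minimal illustration, not the authors' implementation: the vocabulary size, embedding dimension, negative-sampling scheme, and the toy "relevance" signal are all assumptions made for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical tiny entity vocabulary and embedding size (assumptions).
n_entities, dim = 50, 16

# Query-side (input) and document-side (output) embedding tables,
# analogous to the two matrices in skip-gram word2vec.
W_in = rng.normal(scale=0.1, size=(n_entities, dim))
W_out = rng.normal(scale=0.1, size=(n_entities, dim))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_pair(q, pos, negs, lr=0.05):
    """One SGD step with negative sampling: push query entity q toward
    an entity pos that frequently co-occurs in top retrieved documents,
    and away from randomly sampled negative entities negs."""
    v = W_in[q]
    # Positive pair: maximize log sigmoid(v . u_pos).
    u = W_out[pos]
    g = sigmoid(v @ u) - 1.0          # gradient factor for the positive pair
    grad_v = g * u
    W_out[pos] -= lr * g * v
    # Negative samples: minimize sigmoid(v . u_neg).
    for n in negs:
        un = W_out[n]
        gn = sigmoid(v @ un)
        grad_v += gn * un
        W_out[n] -= lr * gn * v
    W_in[q] -= lr * grad_v

# Toy relevance signal (assumed): entity 1 frequently appears in the top
# documents retrieved for queries mentioning entity 0.
for _ in range(500):
    train_pair(q=0, pos=1, negs=rng.integers(2, n_entities, size=5))

# After training, the relevance-linked pair scores higher than a random pair.
score_related = W_in[0] @ W_out[1]
score_random = W_in[0] @ W_out[25]
print(score_related > score_random)
```

In this framing, embeddings are shaped by retrieval-time co-occurrence rather than by proximity of context alone, which is the distinction the abstract draws against standard entity embedding methods.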
Eric Nalisnick, Bhaskar Mitra, Nick Craswell, Rich Caruana
Benjamin Großmann, Alexandru Todor, Adrian Paschke
Ayman Alhelbawy, Robert Gaizauskas