Loading…

Learning to Co-Embed Queries and Documents

Learning to Rank (L2R) methods that utilize machine learning techniques to solve the ranking problems have been widely studied in the field of information retrieval. Existing methods usually concatenate query and document features as training input, without explicit understanding of relevance betwee...

Full description

Saved in:

Bibliographic Details
Published in:	Electronics (Basel) 2022-11, Vol.11 (22), p.3694
Main Authors:	Wu, Yuehong, Lu, Bowen, Tian, Lin, Liang, Shangsong
Format:	Article
Language:	English
Subjects:	Algorithms Datasets Documents Embedding Information retrieval Machine learning Methods Normal distribution Performance evaluation Queries Query processing Ranking Ranking and selection (Statistics) Ratings & rankings Recommender systems Relevance Retrieval performance measures Semantics
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Learning to Rank (L2R) methods that utilize machine learning techniques to solve the ranking problems have been widely studied in the field of information retrieval. Existing methods usually concatenate query and document features as training input, without explicit understanding of relevance between queries and documents, especially in pairwise based ranking approach. Thus, it is an interesting question whether we can devise an algorithm that effectively describes the relation between queries and documents to learn a better ranking model without incurring huge parameter costs. In this paper, we present a Gaussian Embedding model for Ranking (GERank), an architecture for co-embedding queries and documents, such that each query or document is represented by a Gaussian distribution with mean and variance. Our GERank optimizes an energy-based loss based on the pairwise ranking framework. Additionally, the KL-divergence is utilized to measure the relevance between queries and documents. Experimental results on two LETOR datasets and one TREC dataset demonstrate that our model obtains a remarkable improvement in the ranking performance compared with the state-of-the-art retrieval models.
ISSN:	2079-9292 2079-9292
DOI:	10.3390/electronics11223694