ORBIT

🏆 ClueWeb-Reco Leaderboard

Candidate Ranking Results

ModelRecall@10NDCG@10Recall@50NDCG@50Recall@100NDCG@100
GPT-3.5-Turbo-QueryGen0.00680.00270.01760.00500.03120.0072
GPT-4o-QueryGen0.00680.00420.01460.00580.02640.0077
Gemini-QueryGen0.00680.00420.01460.00580.02640.0077
TASTE0.00200.00150.00390.00190.00390.0019

Prompt Construction for Query Generation

To assess the generalization power of LLM-based recommenders, ClueWeb-Reco includes a query generation task. Browsing history titles are formatted into a prompt, and LLMs are asked to infer the next likely interest without rephrasing. The generated query is then embedded and matched to the candidate pool via dense retrieval.

Prompt construction visual