Loading paper
Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models | Tomesphere