Loading paper
Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment | Tomesphere