Efficient course recommendations with T5-based ranking and summarization
Thijmen Bijl, Niels van Weeren, Suzan Verberne

TL;DR
This paper presents a two-stage T5-based ranking and summarization system for course recommendations that significantly improves ranking quality and speed, demonstrating the potential of transformer models in online educational platforms.
Contribution
The paper introduces a novel two-stage retrieval pipeline using RankT5 and summarization models, with quantization for efficiency, tailored for online course recommendation systems.
Findings
Significant improvement in nDCG@10 scores over baseline BM25.
40% speed-up achieved through quantization of RankT5.
User questionnaire confirmed improved ranking quality, though A/B test results varied.
Abstract
In this paper, we implement and evaluate a two-stage retrieval pipeline for a course recommender system that ranks courses for skill-occupation pairs. The in-production recommender system BrightFit provides course recommendations from multiple sources. Some of the course descriptions are long and noisy, while retrieval and ranking in an online system have to be highly efficient. We developed a two-step retrieval pipeline with RankT5 finetuned on MSMARCO as re-ranker. We compare two summarizers for course descriptions: a LongT5 model that we finetuned for the task, and a generative LLM (Vicuna) with in-context learning. We experiment with quantization to reduce the size of the ranking model and increase inference speed. We evaluate our rankers on two newly labelled datasets, with an A/B test, and with a user questionnaire. On the two labelled datasets, our proposed two-stage ranking with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Intelligent Tutoring Systems and Adaptive Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
