Forecasting NCAA Basketball Outcomes with Deep Learning: A Comparative Study of LSTM and Transformer Models

Md Imtiaz Habib

arXiv:2508.02725·cs.LG·August 6, 2025

Forecasting NCAA Basketball Outcomes with Deep Learning: A Comparative Study of LSTM and Transformer Models

Md Imtiaz Habib

PDF

TL;DR

This study compares LSTM and Transformer deep learning models for predicting NCAA basketball tournament outcomes, highlighting the impact of different architectures and loss functions on predictive accuracy and calibration.

Contribution

It introduces a comprehensive framework combining feature engineering and model evaluation for sports outcome prediction using deep learning.

Findings

01

Transformers with BCE achieve highest AUC of 0.8473.

02

LSTM with Brier loss has best probabilistic calibration.

03

Model choice impacts prediction accuracy and reliability.

Abstract

In this research, I explore advanced deep learning methodologies to forecast the outcomes of the 2025 NCAA Division 1 Men's and Women's Basketball tournaments. Leveraging historical NCAA game data, I implement two sophisticated sequence-based models: Long Short-Term Memory (LSTM) and Transformer architectures. The predictive power of these models is augmented through comprehensive feature engineering, including team quality metrics derived from Generalized Linear Models (GLM), Elo ratings, seed differences, and aggregated box-score statistics. To evaluate the robustness and reliability of predictions, I train each model variant using both Binary Cross-Entropy (BCE) and Brier loss functions, providing insights into classification performance and probability calibration. My comparative analysis reveals that while the Transformer architecture optimized with BCE yields superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.