FlatFormer: A Flat Transformer Knowledge Tracing Model Based on Cognitive Bias Injection

Xiao-li Xia; Hou-biao Li

arXiv:2512.06629·cs.AI·December 9, 2025

FlatFormer: A Flat Transformer Knowledge Tracing Model Based on Cognitive Bias Injection

Xiao-li Xia, Hou-biao Li

PDF

Open Access

TL;DR

FlatFormer introduces a simplified flat Transformer architecture with cognitive bias injection mechanisms, achieving state-of-the-art knowledge tracing performance with reduced complexity and faster inference.

Contribution

The paper presents a novel flat Transformer model with information injection techniques, avoiding hierarchical complexity while maintaining high cognitive fidelity in knowledge tracing.

Findings

01

Achieves state-of-the-art AUC on large-scale datasets.

02

Uses less than 15% of parameters compared to hierarchical models.

03

Increases inference speed by approximately three times.

Abstract

Knowledge Tracing (KT) models face a critical ``Performance-Complexity Trap'': capturing complex cognitive dynamics like learning sessions and memory decay typically requires deep hierarchical architectures, which incur prohibitive computational costs for real-time deployment. To resolve this, we propose FlatFormer, a streamlined architecture based on the novel design paradigm of ``Information Injection over Structural Stacking.'' Unlike parameter-heavy hierarchical models, FlatFormer leverages a standard flat Transformer augmented with two lightweight injection mechanisms: (i) a hybrid input encoding strategy combining learnable session identifiers with fixed sinusoidal step embeddings; and (ii) a pre-computed power-law bias integrated directly into attention logits to explicitly model the forgetting curve. Extensive experiments on four large-scale datasets (e.g., EdNet, Junyi) show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning in Healthcare · Personal Information Management and User Behavior