SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance

Divij Dudeja; Mayukha Pal

arXiv:2512.21280·cs.CL·December 25, 2025

SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance

Divij Dudeja, Mayukha Pal

PDF

Open Access

TL;DR

SMART SLM introduces a hierarchical, structured approach with a compact transformer and memory system to improve accuracy and efficiency in document assistance tasks involving complex engineering manuals.

Contribution

It presents a novel hierarchical model combining syntax-aware fact extraction, indexed memory, and a small transformer, achieving higher accuracy with fewer parameters than larger models.

Findings

01

Achieves 21.3% higher accuracy than GPT-2.

02

Uses 64% fewer parameters than GPT-2 and BERT.

03

Provides fast inference for known documents and effective retrieval for new uploads.

Abstract

The user of Engineering Manuals (EM) finds it difficult to read EM s because they are long, have a dense format which includes written documents, step by step procedures, and standard parameter lists for engineering equipment. Off the shelf transformers, especially compact ones, treat this material as a flat stream of tokens. This approach leads to confident but incorrect numeric answers and forces the models to memorize separate facts inefficiently. SMART (Structured Memory and Reasoning Transformer) offers a different and practical solution to the above problem. SMART structures its processing by using a hierarchical approach, and is based upon three main job categories (1) A syntax-aware Fact Extractor (Grammarian) Tree LSTM which extracts facts as subject relation object relations from EM sentences (2) A compact indexed memory MANN (Memory Augmented Neural Network) that indexes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Handwritten Text Recognition Techniques · Advanced Graph Neural Networks