Deterministic Preprocessing and Interpretable Fuzzy Banding for Cost-per-Student Reporting from Extracted Records
Shane Lee, Stella Ng

TL;DR
This paper introduces a deterministic, rule-based workflow for processing administrative data extracts to compute and interpret cost-per-student ratios, enhancing transparency and reproducibility in educational budgeting reports.
Contribution
It presents a novel, transparent method combining deterministic preprocessing with fuzzy banding for interpretability of cost ratios from extracted records.
Findings
Supports snapshot-matched recomputation with SHA-256 hashing
Provides fuzzy band labels (Low, Medium, High) for cost ratios
Includes reproducible example and supplementary materials
Abstract
Administrative extracts are often exchanged as spreadsheets and may be read as reports in their own right during budgeting, workload review, and governance discussions. When an exported workbook becomes the reference snapshot for such decisions, the transformation can be checked by recomputation against a clearly identified input. A deterministic, rule-governed, file-based workflow is implemented in cad_processor.py. The script ingests a Casual Academic Database (CAD) export workbook and aggregates inclusive on-costs and student counts into subject-year and school-year totals, from which it derives cost-per-student ratios. It writes a processed workbook with four sheets: Processing Summary (run record and counters), Trend Analysis (schoolyear cost-per-student matrix), Report (wide subject-level table), and Fuzzy Bands (per-year anchors, membership weights, and band labels). The run…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Educational Assessment and Pedagogy · Spreadsheets and End-User Computing
