Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models

Han Zhang; Jiamin Su; Li liu

arXiv:2603.14891·cs.CL·March 17, 2026

Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models

Han Zhang, Jiamin Su, Li liu

PDF

Open Access

TL;DR

This paper introduces Decision-Level Ordinal Modeling (DLOM), a novel approach for multimodal and text-only automated essay scoring that explicitly models ordinal trait scores using large language models, improving accuracy and interpretability.

Contribution

The paper proposes DLOM, a new method that makes essay scoring an explicit ordinal decision, with specialized modules for multimodal and text-only scenarios, enhancing performance over existing generation-based methods.

Findings

01

DLOM outperforms generation-based baselines on the EssayJudge dataset.

02

DLOM-GF improves scores when modality relevance varies.

03

DLOM-DA enhances performance on text-only benchmarks.

Abstract

Automated essay scoring (AES) predicts multiple rubric-defined trait scores for each essay, where each trait follows an ordered discrete rating scale. Most LLM-based AES methods cast scoring as autoregressive token generation and obtain the final score via decoding and parsing, making the decision implicit. This formulation is particularly sensitive in multimodal AES, where the usefulness of visual inputs varies across essays and traits. To address these limitations, we propose Decision-Level Ordinal Modeling (DLOM), which makes scoring an explicit ordinal decision by reusing the language model head to extract score-wise logits on predefined score tokens, enabling direct optimization and analysis in the score space. For multimodal AES, DLOM-GF introduces a gated fusion module that adaptively combines textual and multimodal score logits. For text-only AES, DLOM-DA adds a distance-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Mental Health via Writing