DOCUEVAL: An LLM-based AI Engineering Tool for Building Customisable Document Evaluation Workflows

Hao Zhang; Qinghua Lu; and Liming Zhu

arXiv:2511.05496·cs.IR·November 11, 2025

DOCUEVAL: An LLM-based AI Engineering Tool for Building Customisable Document Evaluation Workflows

Hao Zhang, Qinghua Lu, and Liming Zhu

PDF

Open Access

TL;DR

DOCUEVAL is a flexible AI engineering tool that leverages large language models to create customizable, traceable document evaluation workflows, improving evaluation accuracy, scalability, and comparability in practical settings.

Contribution

The paper introduces DOCUEVAL, a novel tool that enables customizable, theory-grounded document evaluation workflows with comprehensive logging and source attribution.

Findings

01

Enables systematic comparison of evaluation strategies.

02

Supports custom reviewer roles and evaluation criteria.

03

Demonstrated effectiveness in academic peer review case.

Abstract

Foundation models, such as large language models (LLMs), have the potential to streamline evaluation workflows and improve their performance. However, practical adoption faces challenges, such as customisability, accuracy, and scalability. In this paper, we present DOCUEVAL, an AI engineering tool for building customisable DOCUment EVALuation workflows. DOCUEVAL supports advanced document processing and customisable workflow design which allow users to define theory-grounded reviewer roles, specify evaluation criteria, experiment with different reasoning strategies and choose the assessment style. To ensure traceability, DOCUEVAL provides comprehensive logging of every run, along with source attribution and configuration management, allowing systematic comparison of results across alternative setups. By integrating these capabilities, DOCUEVAL directly addresses core software…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Scientific Computing and Data Management · Natural Language Processing Techniques