TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning

Morris Yu-Chao Huang; Zhen Tan; Mohan Zhang; Pingzhi Li; Zhuo Zhang; Tianlong Chen

arXiv:2510.20188·cs.AI·October 24, 2025

TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning

Morris Yu-Chao Huang, Zhen Tan, Mohan Zhang, Pingzhi Li, Zhuo Zhang, Tianlong Chen

PDF

Open Access 3 Reviews

TL;DR

TRUST introduces a decentralized, transparent framework for auditing large language models' reasoning processes, enhancing scalability, robustness, and privacy while ensuring accountability and detecting flaws across diverse tasks.

Contribution

It presents a novel decentralized auditing framework with blockchain-based transparency, consensus among diverse auditors, and privacy-preserving reasoning segmentation, addressing key limitations of centralized methods.

Findings

01

Effectively detects reasoning flaws in multiple LLMs.

02

Maintains robustness against adversarial auditors.

03

Scales to complex reasoning tasks with hierarchical DAG decomposition.

Abstract

Large Language Models generate complex reasoning chains that reveal their decision-making, yet verifying the faithfulness and harmlessness of these intermediate steps remains a critical unsolved problem. Existing auditing methods are centralized, opaque, and hard to scale, creating significant risks for deploying proprietary models in high-stakes domains. We identify four core challenges: (1) Robustness: Centralized auditors are single points of failure, prone to bias or attacks. (2) Scalability: Reasoning traces are too long for manual verification. (3) Opacity: Closed auditing undermines public trust. (4) Privacy: Exposing full reasoning risks model theft or distillation. We propose TRUST, a transparent, decentralized auditing framework that overcomes these limitations via: (1) A consensus mechanism among diverse auditors, guaranteeing correctness under up to $30%$ malicious…

Peer Reviews

Decision·ICLR 2026 Conference Withdrawn Submission

Reviewer 01Rating 4Confidence 3

Strengths

1. The paper introduces a decentralized framework that decomposes reasoning into five-level HDAGs and routes segments to heterogeneous auditors, enabling parallel, modular verification while recording outcomes on-chain and storing raw traces off-chain—balancing scalability with transparency. 2. TRUST uses segmentation so each auditor only sees partial context, plus a commit–reveal voting scheme and PoS-style incentives on a blockchain. 3. The framework claims correctness with up to ~30% maliciou

Weaknesses

1. Narrow empirical scope of the human study. The only human-in-the-loop experiment involves 15 PhD students auditing 10 GSM8K math problems, which limits external validity across domains, task types, and auditor populations. 2. The design relies on IPFS, a blockchain ledger for immutable records, and a commit–reveal voting protocol. The paper does not report end-to-end latency, throughput, or cost under realistic workloads, leaving deployability uncertain. 3. Although the paper states experimen

Reviewer 02Rating 2Confidence 4

Strengths

The paper explores a novel and timely direction by focusing on decentralized, semantics-level auditing of LLM reasoning. The work presents solid empirical evaluations alongside clear theoretical analysis, including statistical guarantees and an incentive model.

Weaknesses

The paper frames its contribution as auditing the semantics of the reasoning process, yet the practical objective in most deployments is high-quality, policy-compliant outputs, which are fully observable to end users. The authors do not convincingly justify why auditing internal reasoning semantics is necessary or preferable to auditing outcomes and observable behaviors. If the real concern is billing fairness or “token inflation,” the proposed framework does not verify usage-based charges and c

Reviewer 03Rating 4Confidence 3

Strengths

- This paper tackles an important problem (LLM auditing) and provides a novel solution that allows for decentralized / human-machine collaboration in the auditing process. - Breaking down the trace into atomic segments using the HDAG structure is novel and interesting. - The incentive mechanism design and its analysis is a very nice addition bridging economics theory and practice. - The experimental results show that the proposed system is indeed effective against "flipped" auditors also shows g

Weaknesses

- While the framework itself sounds nice, there needs to be more consideration of practicallity of such a system. The addition of blockchain seems superficial in the current work. Modern blockchain systems still suffer from scalability issues (i.e. high latency, gas price etc). A more detailed analysis of the actual system performance is needed to understand if this is a viable solution. - If this system is to be actually deployed on-chain, I think the incentive analysis needs to include gas p

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Scientific Computing and Data Management · Explainable Artificial Intelligence (XAI)