Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges

Guanxi Lu; Hao Mark Chen; Zhiqiang Que; Wayne Luk; Hongxiang Fan

arXiv:2511.22483·cs.LG·December 1, 2025

Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges

Guanxi Lu, Hao Mark Chen, Zhiqiang Que, Wayne Luk, Hongxiang Fan

PDF

Open Access

TL;DR

This paper investigates how quantization affects the trustworthiness of large language models, revealing challenges and proposing a mixed-precision ensemble method that improves trustworthiness metrics in high-stakes applications.

Contribution

It systematically analyzes the impact of quantization on trustworthiness metrics and introduces a novel mixed-precision ensemble voting approach to enhance trustworthiness.

Findings

01

Quantization impacts adversarial robustness, fairness, ethics, and out-of-distribution robustness.

02

The proposed ensemble method improves trustworthiness metrics by up to 5.8%.

03

Identifies instability across compression ratios and quantization methods.

Abstract

Large language models (LLMs) have shown promising performance across various tasks. However, their autoregressive decoding process poses significant challenges for efficient deployment on existing AI hardware. Quantization alleviates memory and compute pressure by compressing weights, activations, and KV caches to low precisions while preserving generation quality. However, existing quantization frameworks typically focus on perplexity or classification accuracy, often omitting critical trustworthiness metrics. This gap introduces risks when applying quantized LLMs to downstream high-stakes domains such as finance and healthcare. In this work, we systematically investigate the impact of quantization on four trustworthiness metrics (adversarial robustness, fairness, machine ethics, and out-of-distribution robustness) and identify the instability across compression ratios and quantization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications