Knowledge-Augmented Reasoning for EUAIA Compliance and Adversarial   Robustness of LLMs

Tomas Bueno Momcilovic; Dian Balta; Beat Buesser; Giulio Zizzo; Mark; Purcell

arXiv:2410.09078·cs.CL·October 15, 2024

Knowledge-Augmented Reasoning for EUAIA Compliance and Adversarial Robustness of LLMs

Tomas Bueno Momcilovic, Dian Balta, Beat Buesser, Giulio Zizzo, Mark, Purcell

PDF

Open Access

TL;DR

This paper proposes a knowledge-augmented reasoning architecture to help developers and auditors ensure large language models are both compliant with EU regulations and resistant to adversarial attacks, enhancing trustworthiness.

Contribution

It introduces a novel reasoning layer that integrates regulatory compliance and adversarial robustness using knowledge augmentation, bridging legal and technical requirements.

Findings

01

Demonstrates a functional architecture combining compliance and robustness.

02

Supports developers and auditors with a reasoning layer based on knowledge augmentation.

03

Provides a new direction for trustworthy LLM deployment in the EU.

Abstract

The EU AI Act (EUAIA) introduces requirements for AI systems which intersect with the processes required to establish adversarial robustness. However, given the ambiguous language of regulation and the dynamicity of adversarial attacks, developers of systems with highly complex models such as LLMs may find their effort to be duplicated without the assurance of having achieved either compliance or robustness. This paper presents a functional architecture that focuses on bridging the two properties, by introducing components with clear reference to their source. Taking the detection layer recommended by the literature, and the reporting layer required by the law, we aim to support developers and auditors with a reasoning layer based on knowledge augmentation (rules, assurance cases, contextual mappings). Our findings demonstrate a novel direction for ensuring LLMs deployed in the EU are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning