RegGuard: AI-Powered Retrieval-Enhanced Assistant for Pharmaceutical Regulatory Compliance
Siyuan Yang, Xihan Bian, and Jiayin Tang

TL;DR
RegGuard is an AI assistant that automates interpretation of complex regulatory texts for pharmaceutical compliance, improving relevance and reducing errors with novel semantic segmentation and reranking components.
Contribution
The paper introduces RegGuard, featuring HiSACC and ReLACE, novel components that enhance document segmentation and candidate reranking for regulatory compliance automation.
Findings
Improves answer relevance, groundedness, and focus.
Reduces hallucination risk in AI-generated responses.
Built for auditability with provenance and access controls.
Abstract
The increasing frequency and complexity of regulatory updates present a significant burden for multinational pharmaceutical companies. Compliance teams must interpret evolving rules across jurisdictions, formats, and agencies, often manually, at high cost and risk of error. We introduce RegGuard, an industrial-scale AI assistant designed to automate the interpretation of heterogeneous regulatory texts and align them with internal corporate policies. The system ingests heterogeneous document sources through a secure pipeline and enhances retrieval and generation quality with two novel components: HiSACC (Hierarchical Semantic Aggregation for Contextual Chunking) semantically segments long documents into coherent units while maintaining consistency across non-contiguous sections. ReLACE (Regulatory Listwise Adaptive Cross-Encoder for Reranking), a domain-adapted cross-encoder built on an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Scientific Computing and Data Management
