CCFC: Core & Core-Full-Core Dual-Track Defense for LLM Jailbreak Protection

Jiaming Hu; Haoyu Wang; Debarghya Mukherjee; Ioannis Ch. Paschalidis

arXiv:2508.14128·cs.CR·August 21, 2025

CCFC: Core & Core-Full-Core Dual-Track Defense for LLM Jailbreak Protection

Jiaming Hu, Haoyu Wang, Debarghya Mukherjee, Ioannis Ch. Paschalidis

PDF

Open Access

TL;DR

CCFC is a dual-track prompt-level defense framework that significantly reduces jailbreak attack success rates on LLMs by isolating query semantics and evaluating responses through complementary safety checks.

Contribution

This paper introduces CCFC, a novel dual-track prompt-level defense mechanism that enhances LLM safety against prompt injection and structure-aware jailbreak attacks.

Findings

01

Reduces attack success rates by 50-75% against strong adversaries

02

Outperforms existing prompt-level defenses in safety and robustness

03

Maintains response quality on benign queries

Abstract

Jailbreak attacks pose a serious challenge to the safe deployment of large language models (LLMs). We introduce CCFC (Core & Core-Full-Core), a dual-track, prompt-level defense framework designed to mitigate LLMs' vulnerabilities from prompt injection and structure-aware jailbreak attacks. CCFC operates by first isolating the semantic core of a user query via few-shot prompting, and then evaluating the query using two complementary tracks: a core-only track to ignore adversarial distractions (e.g., toxic suffixes or prefix injections), and a core-full-core (CFC) track to disrupt the structural patterns exploited by gradient-based or edit-based attacks. The final response is selected based on a safety consistency check across both tracks, ensuring robustness without compromising on response quality. We demonstrate that CCFC cuts attack success rates by 50-75% versus state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience · Security and Verification in Computing · Network Security and Intrusion Detection