Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Xiaoyun Zhang; Jingqing Ruan; Xing Ma; Yawen Zhu; Jiansong Chen; Ke Zeng; Xunliang Cai

arXiv:2507.01327·cs.LG·July 3, 2025

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Xiaoyun Zhang, Jingqing Ruan, Xing Ma, Yawen Zhu, Jiansong Chen, Ke Zeng, Xunliang Cai

PDF

Open Access

TL;DR

This paper introduces APARL, a reinforcement learning framework that uses adaptive, perplexity-aware sampling and curriculum learning to improve abnormal event detection in customer service dialogues, especially in out-of-domain scenarios.

Contribution

The paper presents a novel dual-loop curriculum learning architecture that enhances the reasoning and out-of-domain generalization of large language models for event detection.

Findings

01

Achieves a 17.19% improvement in F1 score on food delivery dialogue tasks.

02

Enhances out-of-domain transferability by 9.59%.

03

Demonstrates robustness and adaptability in real-world industrial scenarios.

Abstract

Detecting abnormal events in real-world customer service dialogues is highly challenging due to the complexity of business data and the dynamic nature of customer interactions. Moreover, models must demonstrate strong out-of-domain (OOD) generalization to enable rapid adaptation across different business scenarios and maximize commercial value. In this work, we propose a novel Adaptive Perplexity-Aware Reinforcement Learning (APARL) framework that leverages the advanced reasoning capabilities of large language models for abnormal event detection. APARL introduces a dual-loop dynamic curriculum learning architecture, enabling the model to progressively focus on more challenging samples as its proficiency increases. This design effectively addresses performance bottlenecks and significantly enhances OOD transferability. Extensive evaluations on food delivery dialogue tasks show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Topic Modeling · Software System Performance and Reliability