dataRLsec: Safety, Security, and Reliability With Robust Offline Reinforcement Learning for DPAs

Shriram KS Pandian; Naresh Kshetri

arXiv:2601.01289·cs.CR·January 6, 2026

dataRLsec: Safety, Security, and Reliability With Robust Offline Reinforcement Learning for DPAs

Shriram KS Pandian, Naresh Kshetri

PDF

Open Access

TL;DR

This paper investigates data poisoning attacks on AI systems, analyzes existing defenses, and proposes a robust offline reinforcement learning approach with weighted hash verification and density-ratio weighted behavioral cloning to enhance safety and security.

Contribution

It introduces a novel offline RL framework with four-stage algorithms and combines DWBC with data defense strategies to counter poisoning attacks in AI training data.

Findings

01

Analyzed risks of DPAs in AI systems.

02

Proposed a four-stage offline RL algorithm for robustness.

03

Demonstrated improved safety and security in experiments.

Abstract

Data poisoning attacks (DPAs) are becoming popular as artificial intelligence (AI) algorithms, machine learning (ML) algorithms, and deep learning (DL) algorithms in this artificial intelligence (AI) era. Hackers and penetration testers are excessively injecting malicious contents in the training data (and in testing data too) that leads to false results that are very hard to inspect and predict. We have analyzed several recent technologies used (from deep reinforcement learning to federated learning) for the DPAs and their safety, security, & countermeasures. The problem setup along with the problem estimation is shown in the MuJoCo environment with performance of HalfCheetah before the dataset is poisoned and after the dataset is poisoned. We have analyzed several risks associated with the DPAs and falsification in medical data from popular poisoning data attacks to some popular data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Advanced Malware Detection Techniques