Structured Reinforcement Learning for Incentivized Stochastic Covert   Optimization

Adit Jain; Vikram Krishnamurthy

arXiv:2405.07415·cs.LG·May 14, 2024

Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization

Adit Jain, Vikram Krishnamurthy

PDF

Open Access

TL;DR

This paper introduces a structured reinforcement learning approach to control stochastic gradient algorithms in distributed optimization, aiming to hide the stationary point estimate from eavesdroppers through incentivized obfuscation.

Contribution

It formulates the covert optimization problem as a finite-horizon MDP and develops algorithms to find optimal threshold-based policies for obfuscation.

Findings

01

Optimal policies have a monotone threshold structure.

02

Proposed algorithms effectively hide stationary points in federated learning.

03

Numerical results demonstrate the approach's effectiveness.

Abstract

This paper studies how a stochastic gradient algorithm (SG) can be controlled to hide the estimate of the local stationary point from an eavesdropper. Such problems are of significant interest in distributed optimization settings like federated learning and inventory management. A learner queries a stochastic oracle and incentivizes the oracle to obtain noisy gradient measurements and perform SG. The oracle probabilistically returns either a noisy gradient of the function} or a non-informative measurement, depending on the oracle state and incentive. The learner's query and incentive are visible to an eavesdropper who wishes to estimate the stationary point. This paper formulates the problem of the learner performing covert optimization by dynamically incentivizing the stochastic oracle and obfuscating the eavesdropper as a finite-horizon Markov decision process (MDP). Using conditions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Privacy-Preserving Technologies in Data · Blockchain Technology Applications and Security