online and lightweight kernel-based approximated policy iteration for   dynamic p-norm linear adaptive filtering

Yuki Akiyama; Minh Vu; Konstantinos Slavakis

arXiv:2210.11755·cs.LG·October 24, 2022

online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering

Yuki Akiyama, Minh Vu, Konstantinos Slavakis

PDF

Open Access

TL;DR

This paper presents an online, kernel-based reinforcement learning approach for adaptive filtering that dynamically selects the optimal p-norm to handle outliers without prior knowledge of their distribution.

Contribution

It introduces a novel Bellman mapping on RKHSs and an approximate policy iteration framework using random Fourier features to efficiently adapt the p-norm in real-time.

Findings

01

Outperforms existing non-RL and KBRL methods in synthetic outlier scenarios.

02

Effectively adapts to outliers without prior distribution knowledge.

03

Reduces computational complexity with random Fourier features.

Abstract

This paper introduces a solution to the problem of selecting dynamically (online) the ``optimal'' p-norm to combat outliers in linear adaptive filtering without any knowledge on the probability density function of the outliers. The proposed online and data-driven framework is built on kernel-based reinforcement learning (KBRL). To this end, novel Bellman mappings on reproducing kernel Hilbert spaces (RKHSs) are introduced. These mappings do not require any knowledge on transition probabilities of Markov decision processes, and are nonexpansive with respect to the underlying Hilbertian norm. The fixed-point sets of the proposed Bellman mappings are utilized to build an approximate policy-iteration (API) framework for the problem at hand. To address the ``curse of dimensionality'' in RKHSs, random Fourier features are utilized to bound the computational complexity of the API. Numerical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Adaptive Filtering Techniques · Water Systems and Optimization · Adaptive Dynamic Programming Control