Nonsmooth optimal value and policy functions in mechanical systems   subject to unilateral constraints

Bora S. Banjanin; Samuel A. Burden

arXiv:1710.06745·cs.RO·August 29, 2019

Nonsmooth optimal value and policy functions in mechanical systems subject to unilateral constraints

Bora S. Banjanin, Samuel A. Burden

PDF

TL;DR

This paper demonstrates that in contact-rich mechanical systems, optimal value and policy functions are inherently nonsmooth, challenging the effectiveness of traditional smooth approximation and gradient-based methods in such contexts.

Contribution

It reveals the fundamental nonsmooth nature of value and policy functions in contact-rich mechanical systems, highlighting limitations of existing smooth optimization approaches.

Findings

01

Value and policy functions are generally nonsmooth in contact-rich systems.

02

Traditional smooth approximation methods may not be suitable for such systems.

03

Implications for designing control algorithms for robots with contact dynamics.

Abstract

State-of-the-art approaches to optimal control use smooth approximations of value and policy functions and gradient-based algorithms for improving approximator parameters. Unfortunately, we show that value and policy functions that arise in optimal control of mechanical systems subject to unilateral constraints -- i.e. the contact-rich dynamics of robot locomotion and manipulation -- are generally nonsmooth due to the underlying dynamics exhibiting discontinuous or piecewise-differentiable trajectory outcomes. Simple mechanical systems are used to illustrate this result and the implications for optimal control of contact-rich robot dynamics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.