Boosting Image Recognition with Non-differentiable Constraints
Xuan Li, Yuchen Lu, Peng Xu, Jizong Peng, Christian Desrosiers, Xue, Liu

TL;DR
This paper introduces a reinforcement learning method to incorporate non-differentiable, rule-based constraints into image recognition models, significantly improving accuracy and robustness in digit sequence recognition tasks.
Contribution
It proposes a novel policy gradient reinforcement learning approach to effectively integrate non-differentiable rules into image recognition models, enhancing performance with limited data.
Findings
Increased accuracy by up to 23.6% on MNIST-based datasets.
Improved robustness against synthesized adversarial examples.
Effective rule-based inductive bias with limited training data.
Abstract
In this paper, we study the problem of image recognition with non-differentiable constraints. A lot of real-life recognition applications require a rich output structure with deterministic constraints that are discrete or modeled by a non-differentiable function. A prime example is recognizing digit sequences, which are restricted by such rules (e.g., \textit{container code detection}, \textit{social insurance number recognition}, etc.). We investigate the usefulness of adding non-differentiable constraints in learning for the task of digit sequence recognition. Toward this goal, we synthesize six different datasets from MNIST and Cropped SVHN, with three discrete rules inspired by real-life protocols. To deal with the non-differentiability of these rules, we propose a reinforcement learning approach based on the policy gradient method. We find that incorporating this rule-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsTest
