Reinforcement Learning for Efficient Returns Management
Pascal Linden, Nathalie Paul, Tim Wirtz, Stefan Wrobel

TL;DR
This paper introduces a reinforcement learning method for real-time product reallocation in warehouses, significantly reducing storage time and costs while maintaining near-optimal revenue outcomes.
Contribution
It presents a novel RL approach to solve the online multiple knapsack problem for warehouse returns management, improving efficiency over traditional offline methods.
Findings
Storage time reduced by 96%
Performance gap of only 3% compared to offline methods
Significant cost savings in warehouse management
Abstract
In retail warehouses, returned products are typically placed in an intermediate storage until a decision regarding further shipment to stores is made. The longer products are held in storage, the higher the inefficiency and costs of the returns management process, since enough storage area has to be provided and maintained while the products are not placed for sale. To reduce the average product storage time, we consider an alternative solution where reallocation decisions for products can be made instantly upon their arrival in the warehouse allowing only a limited number of products to still be stored simultaneously. We transfer the problem to an online multiple knapsack problem and propose a novel reinforcement learning approach to pack the items (products) into the knapsacks (stores) such that the overall value (expected revenue) is maximized. Empirical evaluations on simulated data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
