Novel Multi-Agent Action Masked Deep Reinforcement Learning for General Industrial Assembly Lines Balancing Problems
Ali Mohamed Ali, Luca Tirel, Hashim A. Hashim

TL;DR
This paper presents a novel multi-agent deep reinforcement learning framework with action masking for optimizing industrial assembly line scheduling, achieving faster convergence and scalability without restrictive assumptions.
Contribution
It introduces a new MDP-based model for assembly lines, a multi-agent DRL approach with action masking, and a scalable training framework for real-time industrial scheduling.
Findings
Faster convergence to optimal solutions compared to model-based methods.
Effective reduction of training time through action masking.
Scalable multi-agent architecture suitable for large industrial systems.
Abstract
Efficient planning of activities is essential for modern industrial assembly lines to uphold manufacturing standards, prevent project constraint violations, and achieve cost-effective operations. While exact solutions to such challenges can be obtained through Integer Programming (IP), the dependence of the search space on input parameters often makes IP computationally infeasible for large-scale scenarios. Heuristic methods, such as Genetic Algorithms, can also be applied, but they frequently produce suboptimal solutions in extensive cases. This paper introduces a novel mathematical model of a generic industrial assembly line formulated as a Markov Decision Process (MDP), without imposing assumptions on the type of assembly line a notable distinction from most existing models. The proposed model is employed to create a virtual environment for training Deep Reinforcement Learning (DRL)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
