# Optimized multi agent reinforcement learning algorithms with hybrid BiLSTM for cost efficient EV charging scheduling

**Authors:** Urvashi Khekare, Rajay Vedaraj I. S.

PMC · DOI: 10.3389/frai.2025.1700664 · Frontiers in Artificial Intelligence · 2026-01-05

## TL;DR

This paper introduces a new EV charging scheduling framework using optimized multi-agent reinforcement learning and improved forecasting for cost efficiency.

## Contribution

The novel integration of POA-tuned BiLSTM forecasting with a CTDE-based MARL framework for EV charging scheduling is introduced.

## Key findings

- The proposed method reduces charging cost by 12.34% compared to conventional baselines.
- Forecasting accuracy improves by 8.46% and simulation time decreases by 0.456 seconds.

## Abstract

With the fast development of electric vehicles, the demand for intelligent charging management strategies in order to minimize operational costs, ensure grid stability, and enhance user satisfaction. This paper proposes a new framework that embeds multi-MARL algorithm tuned by the Pelican optimization algorithm (POA) bidirectional long short-term memory for anticipatory energy forecasting scheduling in EV charging stations—EVCS. Unlike previous works that treat forecasting, the proposed method seamlessly unifies these steps, which were hitherto considered as separate entities: optimization and then scheduling. Components within a Markov decision process formulation. The framework employs publicly available Indian Energy Exchange (IEX) day-ahead market data, where POA-tuned BiLSTM forecasts electricity price and demand with improved accuracy, feeding into the MARL controller for dynamic scheduling. Experimental results demonstrate that the proposed method reduces charging cost by 12.34%, improves state-of-charge (SOC) satisfaction by 10.25%, and increases forecasting accuracy by 8.46% compared to conventional GA, PSO, MARL, and deep learning baselines. Furthermore, simulation time is reduced by 0.456 s, confirming computational efficiency. This study presents integrated frameworks that combine POA-tuned BiLSTM forecasting with a CTDE-based MARL architecture for anticipatory EV charging scheduling.

## Full-text entities

- **Genes:** TMC8 (transmembrane channel like 8) [NCBI Gene 147138] {aka EV2, EVER2, EVIN2}, TMC6 (transmembrane channel like 6) [NCBI Gene 11322] {aka EV1, EVER1, EVIN1, LAK-4P, TNRC6C-AS1, lnc}
- **Diseases:** POA (MESH:D007859)
- **Chemicals:** MDPs (-), carbon (MESH:D002244), lithium (MESH:D008094)
- **Species:** Pelecanidae (pelicans, family) [taxon 30444]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12812875/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12812875/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12812875/full.md

---
Source: https://tomesphere.com/paper/PMC12812875