GITSR: Graph Interaction Transformer-based Scene Representation for Multi Vehicle Collaborative Decision-making
Xingyu Hu, Lijun Zhang, Dejian Meng, Ye Han, and Lisha Yuan

TL;DR
GITSR introduces a novel graph interaction transformer framework for multi-vehicle scene understanding, enhancing collaborative decision-making in mixed traffic environments with improved accuracy and interaction modeling.
Contribution
The paper presents a new GITSR framework combining transformer-based scene representation with graph neural networks for better multi-vehicle decision-making.
Findings
Outperforms baseline methods in simulation metrics.
Effectively captures spatial interactions among vehicles.
Enhances decision-making accuracy in highway off-ramp scenarios.
Abstract
In this study, we propose GITSR, an effective framework for Graph Interaction Transformer-based Scene Representation for multi-vehicle collaborative decision-making in intelligent transportation system. In the context of mixed traffic where Connected Automated Vehicles (CAVs) and Human Driving Vehicles (HDVs) coexist, in order to enhance the understanding of the environment by CAVs to improve decision-making capabilities, this framework focuses on efficient scene representation and the modeling of spatial interaction behaviors of traffic states. We first extract features of the driving environment based on the background of intelligent networking. Subsequently, the local scene representation, which is based on the agent-centric and dynamic occupation grid, is calculated by the Transformer module. Besides, feasible region of the map is captured through the multi-head attention mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Graph Theory and Algorithms · Data Management and Algorithms
MethodsAttention Is All You Need · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Residual Connection · Linear Layer · Byte Pair Encoding · Dropout · Absolute Position Encodings · Softmax
