Enhancing Visual Dialog State Tracking through Iterative Object-Entity Alignment in Multi-Round Conversations