Multi-Modal Video Dialog State Tracking in the Wild