Ensemble based discriminative models for Visual Dialog Challenge 2018

Shubham Agarwal; Raghav Goyal

arXiv:2001.05865 · cs.CV · January 17, 2020 · 1 citation

TL;DR

This paper presents an ensemble of three discriminative models, each with a different encoder and decoder, for the Visual Dialog Challenge 2018. The best-performing model scores 55.46 NDCG and 63.77 MRR on the test-std split, placing third in the challenge.

Contribution

Introduces an ensemble of three discriminative models with different encoders and decoders for visual dialog, which placed third in the challenge.

Findings

1. NDCG score of 55.46 on the test-std split
2. MRR value of 63.77 on the test-std split
3. Third position in the challenge

Abstract

This manuscript describes our approach for the Visual Dialog Challenge 2018. We use an ensemble of three discriminative models with different encoders and decoders for our final submission. Our best-performing model achieves an NDCG score of 55.46 and an MRR value of 63.77 on the 'test-std' split, securing third position in the challenge.
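
The abstract does not say how the three models' outputs are combined, so the following is only a minimal sketch under a stated assumption: each model assigns one score per candidate answer, and the ensemble averages those scores before ranking. The function names (`ensemble_scores`, `rank_of`, `mean_reciprocal_rank`) and the toy scores are hypothetical and not taken from the paper. The sketch also shows how MRR follows from the rank of the ground-truth answer; the reported 63.77 appears to be this quantity on a 0 to 100 scale.

```python
from typing import List, Sequence


def ensemble_scores(model_scores: Sequence[Sequence[float]]) -> List[float]:
    """Average per-candidate scores across models.

    Averaging is an assumed combination rule, not the paper's stated method.
    """
    n_models = len(model_scores)
    n_candidates = len(model_scores[0])
    return [
        sum(scores[i] for scores in model_scores) / n_models
        for i in range(n_candidates)
    ]


def rank_of(scores: Sequence[float], gt_index: int) -> int:
    """1-based rank of the ground-truth candidate (higher score ranks first)."""
    gt_score = scores[gt_index]
    return 1 + sum(1 for s in scores if s > gt_score)


def mean_reciprocal_rank(ranks: Sequence[int]) -> float:
    """Mean of 1/rank over all evaluated questions."""
    return sum(1.0 / r for r in ranks) / len(ranks)


# Toy example: three hypothetical models scoring three candidate answers.
model_a = [0.1, 0.7, 0.2]
model_b = [0.3, 0.4, 0.3]
model_c = [0.2, 0.5, 0.3]
combined = ensemble_scores([model_a, model_b, model_c])

# Suppose candidate 1 is correct for one question and candidate 2 for another.
ranks = [rank_of(combined, 1), rank_of(combined, 2)]
print(mean_reciprocal_rank(ranks))  # 0.75
```

Averaging scores is only one option; rank averaging or weighted voting would fit the same interface by swapping out `ensemble_scores`.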

Taxonomy

Topics: Multimodal Machine Learning Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning