Adversarial Robustness of Visual Dialog
Lu Yu, Verena Rieser

TL;DR
This paper investigates the robustness of visually grounded dialog models against textual adversarial attacks, revealing the importance of dialog history and multimodal context in model vulnerability and defense.
Contribution
It is the first study to analyze adversarial robustness in visual dialog models, highlighting the role of dialog history and multimodal context in model resilience.
Findings
Models encoding dialog history are more robust.
Attacking dialog history increases model uncertainty.
Both textual and visual context are crucial for generating plausible adversarial examples.
Abstract
Adversarial robustness evaluates the worst-case performance scenario of a machine learning model to ensure its safety and reliability. This study is the first to investigate the robustness of visually grounded dialog models towards textual attacks. These attacks represent a worst-case scenario where the input question contains a synonym which causes the previously correct model to return a wrong answer. Using this scenario, we first aim to understand how multimodal input components contribute to model robustness. Our results show that models which encode dialog history are more robust, and when launching an attack on history, model prediction becomes more uncertain. This is in contrast to prior work which finds that dialog history is negligible for model performance on this task. We also evaluate how to generate adversarial test examples which successfully fool the model but remain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsTest
