ImageCaptioner$^2$: Image Captioner for Image Captioning Bias Amplification Assessment
Eslam Mohamed Bakr, Pengzhan Sun, Li Erran Li, Mohamed Elhoseiny

TL;DR
This paper introduces $ImageCaptioner^2$, a new bias assessment metric for image captioning that considers visual signals and bias amplification, validated across multiple architectures and datasets with strong correlation to human judgment.
Contribution
The paper presents $ImageCaptioner^2$, a novel bias metric that incorporates visual context and measures bias amplification, improving over existing caption-only bias evaluation methods.
Findings
$ImageCaptioner^2$ outperforms LIC in human alignment by 80%.
The metric is validated on 11 architectures and 3 datasets.
It effectively measures bias amplification related to protected attributes.
Abstract
Most pre-trained learning systems are known to suffer from bias, which typically emerges from the data, the model, or both. Measuring and quantifying bias and its sources is a challenging task and has been extensively studied in image captioning. Despite the significant effort in this direction, we observed that existing metrics lack consistency in the inclusion of the visual signal. In this paper, we introduce a new bias assessment metric, dubbed , for image captioning. Instead of measuring the absolute bias in the model or the data, pay more attention to the bias introduced by the model w.r.t the data bias, termed bias amplification. Unlike the existing methods, which only evaluate the image captioning algorithms based on the generated captions only, incorporates the image while measuring the bias. In addition, we design a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Digital Imaging for Blood Diseases
