CREMD: Crowd-Sourced Emotional Multimodal Dogs Dataset
Jinho Baek, Houwei Cao, Kate Blackwell

TL;DR
This paper introduces CREMD, a new multimodal dataset for dog emotion recognition, analyzing how context, audio, and annotator background affect labeling accuracy and confidence.
Contribution
CREMD is the first comprehensive dataset exploring multimodal cues and annotator factors influencing dog emotion perception, with detailed analysis of annotation agreement and confidence.
Findings
Visual context improves annotation agreement.
Audio cues increase annotator confidence for certain emotions.
Professionals and non-owners show higher agreement levels.
Abstract
Dog emotion recognition plays a crucial role in enhancing human-animal interactions, veterinary care, and the development of automated systems for monitoring canine well-being. However, accurately interpreting dog emotions is challenging due to the subjective nature of emotional assessments and the absence of standardized ground truth methods. We present the CREMD (Crowd-sourced Emotional Multimodal Dogs Dataset), a comprehensive dataset exploring how different presentation modes (e.g., context, audio, video) and annotator characteristics (e.g., dog ownership, gender, professional experience) influence the perception and labeling of dog emotions. The dataset consists of 923 video clips presented in three distinct modes: without context or audio, with context but no audio, and with both context and audio. We analyze annotations from diverse participants, including dog owners,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Animal Interaction Studies · Rabies epidemiology and control · Animal Behavior and Welfare Studies
