Preliminary Use of Vision Language Model Driven Extraction of Mouse Behavior Towards Understanding Fear Expression
Paimon Goulart, Jordan Steinhauser, Kylene Shuler, Edward Korzus, Jia Chen, Evangelos E. Papalexakis

TL;DR
This study introduces a vision-language model that classifies mouse behaviors from videos with high accuracy, producing detailed behavioral datasets without fine-tuning, aiding interdisciplinary research.
Contribution
It demonstrates the effective use of a vision-language model with prompts and in-context learning for behavior classification without model fine-tuning.
Findings
High F1 scores across behaviors, including rare ones
Enhanced performance through prompts, ICL, and preprocessing
Minimal user input needed for accurate classification
Abstract
Integration of diverse data will be a pivotal step towards improving scientific explorations in many disciplines. This work establishes a vision-language model (VLM) that encodes videos with text input in order to classify various behaviors of a mouse existing in and engaging with their environment. Importantly, this model produces a behavioral vector over time for each subject and for each session the subject undergoes. The output is a valuable dataset that few programs are able to produce with as high accuracy and with minimal user input. Specifically, we use the open-source Qwen2.5-VL model and enhance its performance through prompts, in-context learning (ICL) with labeled examples, and frame-level preprocessing. We found that each of these methods contributes to improved classification, and that combining them results in strong F1 scores across all behaviors, including rare classes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroendocrine regulation and behavior · Emotion and Mood Recognition · Memory and Neural Mechanisms
