High-throughput data analysis in behavior genetics
Anat Sakov, Ilan Golani, Dina Lipkind, Yoav Benjamini

TL;DR
This paper presents robust computational methods for high-throughput analysis of behavioral data in genetics, addressing noise, outliers, and protocol deviations to improve accuracy in tracking and measuring rodent behavior.
Contribution
It introduces a combination of robust statistical techniques and algorithms tailored for analyzing large behavioral datasets in genetics research.
Findings
Effective estimation of mouse location, velocity, and acceleration.
Robust detection of arena boundary and center.
Automatic correction for protocol deviations.
Abstract
In recent years, a growing need has arisen in different fields for the development of computational systems for automated analysis of large amounts of data (high-throughput). Dealing with nonstandard noise structure and outliers, that could have been detected and corrected in manual analysis, must now be built into the system with the aid of robust methods. We discuss such problems and present insights and solutions in the context of behavior genetics, where data consists of a time series of locations of a mouse in a circular arena. In order to estimate the location, velocity and acceleration of the mouse, and identify stops, we use a nonstandard mix of robust and resistant methods: LOWESS and repeated running median. In addition, we argue that protection against small deviations from experimental protocols can be handled automatically using statistical methods. In our case, it is of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
