# Real-Time Visual Anomaly Detection in High-Speed Motorsport: An Entropy-Driven Hybrid Retrieval- and Cache-Augmented Architecture

**Authors:** Rubén Juárez Cádiz, Fernando Rodríguez-Sela

PMC · DOI: 10.3390/jimaging12020060 · 2026-01-28

## TL;DR

This paper introduces a real-time system for detecting visual anomalies in high-speed motorsport using a hybrid architecture that balances speed and accuracy.

## Contribution

The novel hybrid cache–retrieval architecture reduces latency while maintaining high detection accuracy in real-time motorsport monitoring.

## Key findings

- The system achieves 55.3% lower latency compared to a retrieval-only baseline.
- It attains a Macro-F1 score of 0.89 for detecting anomalies like tire degradation and suspension issues.
- The framework uses entropy-based routing to stabilize decisions under uncertainty.

## Abstract

At 300 km/h, an end-to-end vision delay of 100 ms corresponds to 8.3 m of unobserved travel; therefore, real-time anomaly monitoring must balance sensitivity with strict tail-latency constraints at the edge. We propose a hybrid cache–retrieval inference architecture for visual anomaly detection in high-speed motorsport that exploits lap-to-lap spatiotemporal redundancy while reserving local similarity retrieval for genuinely uncertain events. The system combines a hierarchical visual encoder (a lightweight backbone with selective refinement via a Nested U-Net for texture-level cues) and an uncertainty-driven router that selects between two memory pathways: (i) a static cache of precomputed scene embeddings for track/background context and (ii) local similarity retrieval over historical telemetry–vision patterns to ground ambiguous frames, improve interpretability, and stabilize decisions under high uncertainty. Routing is governed by an entropy signal computed from prediction and embedding uncertainty: low-entropy frames follow a cache-first path, whereas high-entropy frames trigger retrieval and refinement to preserve decision stability without sacrificing latency. On a high-fidelity closed-circuit benchmark with synchronized onboard video and telemetry and controlled anomaly injections (tire degradation, suspension chatter, and illumination shifts), the proposed approach reduces mean end-to-end latency to 21.7 ms versus 48.6 ms for a retrieval-only baseline (55.3% reduction) while achieving Macro-F1 = 0.89 at safety-oriented operating points. The framework is designed for passive monitoring and decision support, producing advisory outputs without actuating ECU control strategies.

## Full-text entities

- **Genes:** ETFA (electron transfer flavoprotein subunit alpha) [NCBI Gene 2108] {aka EMA, GA2, MADD}, INTS8 (integrator complex subunit 8) [NCBI Gene 55656] {aka C8orf52, INT8, NEDCHS}, NFKB1 (nuclear factor kappa B subunit 1) [NCBI Gene 4790] {aka CVID12, EBP-1, KBF1, NF-kB, NF-kB1, NF-kappa-B1}, NBN (nibrin) [NCBI Gene 4683] {aka AT-V1, AT-V2, ATV, NBS, NBS1, P95}, PPP1R10 (protein phosphatase 1 regulatory subunit 10) [NCBI Gene 5514] {aka CAT53, FB19, PNUTS, PP1R10, R111, p99}
- **Diseases:** injury to (MESH:D014947), Anomaly (MESH:D000013), visual anomaly (MESH:D014786), Drop Burstiness (MESH:D020427), Air-Gapped Retrieval (MESH:D004618), Granularity (MESH:D016586), hallucination (MESH:D006212)
- **Chemicals:** IP (MESH:C041508), CAG (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

39 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12942198/full.md

---
Source: https://tomesphere.com/paper/PMC12942198