JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments