Fine-grained Spatiotemporal Grounding on Egocentric Videos