VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos