Vision-Language Integration for Zero-Shot Scene Understanding in Real-World Environments