Loading paper
Seeing to Act, Prompting to Specify: A Bayesian Factorization of Vision Language Action Policy | Tomesphere