Loading paper
Object Referring in Visual Scene with Spoken Language | Tomesphere