Loading paper
GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents | Tomesphere