Loading paper
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs | Tomesphere