Loading paper
Low-latency vision transformers via large-scale multi-head attention | Tomesphere