Loading paper
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions | Tomesphere