Loading paper
LLM-FP4: 4-Bit Floating-Point Quantized Transformers | Tomesphere