Loading paper
A 28nm 0.22{\mu}J/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semantic-segmentation | Tomesphere