Loading paper
Do Transformers Use their Depth Adaptively? Evidence from a Relational Reasoning Task | Tomesphere