Loading paper
BERT's output layer recognizes all hidden layers? Some Intriguing Phenomena and a simple way to boost BERT | Tomesphere