Loading paper
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding | Tomesphere