Loading paper
Unifying Event Detection and Captioning as Sequence Generation via Pre-Training | Tomesphere