Loading paper
X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning | Tomesphere