TexRel: a Green Family of Datasets for Emergent Communications on Relations
Hugh Perkins

TL;DR
TexRel is a new, efficient dataset for studying emergent communication of relations, offering realistic, large-scale data that enables rapid training and comprehensive analysis of communication architectures and compositionality.
Contribution
Introduces TexRel, a large, efficient dataset for emergent relations communication, and provides baseline evaluations and case studies demonstrating its utility.
Findings
TexRel enables fast training and experimentation.
Multitask learning improves accuracy and clustering.
Increasing latent space size enhances compositionality metrics.
Abstract
We propose a new dataset TexRel as a playground for the study of emergent communications, in particular for relations. By comparison with other relations datasets, TexRel provides rapid training and experimentation, whilst being sufficiently large to avoid overfitting in the context of emergent communications. By comparison with using symbolic inputs, TexRel provides a more realistic alternative whilst remaining efficient and fast to learn. We compare the performance of TexRel with a related relations dataset Shapeworld. We provide baseline performance results on TexRel for sender architectures, receiver architectures and end-to-end architectures. We examine the effect of multitask learning in the context of shapes, colors and relations on accuracy, topological similarity and clustering precision. We investigate whether increasing the size of the latent meaning space improves metrics of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Topic Modeling · Multimodal Machine Learning Applications
