Capturing Structural Locality in Non-parametric Language Models

Frank F. Xu; Junxian He; Graham Neubig; Vincent J. Hellendoorn

arXiv:2110.02870·cs.CL·February 2, 2022

Capturing Structural Locality in Non-parametric Language Models

Frank F. Xu, Junxian He, Graham Neubig, Vincent J. Hellendoorn

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to incorporate structural locality into non-parametric language models, enhancing their ability to retrieve relevant local information and improve performance across different domains.

Contribution

The paper proposes a simple approach to embed locality information into non-parametric language models, demonstrating its effectiveness in code and text datasets.

Findings

01

Locality features improve model performance over non-local models.

02

The approach works across different domains like Java code and Wikipedia.

03

Traditional similarity metrics are insufficient to capture locality structures.

Abstract

Structural locality is a ubiquitous feature of real-world datasets, wherein data points are organized into local hierarchies. Some examples include topical clusters in text or project hierarchies in source code repositories. In this paper, we explore utilizing this structural locality within non-parametric language models, which generate sequences that reference retrieved examples from an external source. We propose a simple yet effective approach for adding locality information into such models by adding learned parameters that improve the likelihood of retrieving examples from local neighborhoods. Experiments on two different domains, Java source code and Wikipedia text, demonstrate that locality features improve model efficacy over models without access to these features, with interesting differences. We also perform an analysis of how and where locality features contribute to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Capturing Structural Locality in Non-parametric Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis