Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank   Adaptation

Xiang Luo; Zhiwen Tang; Jin Wang; Xuejie Zhang

arXiv:2407.21633·cs.CL·August 1, 2024

Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation

Xiang Luo, Zhiwen Tang, Jin Wang, Xuejie Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces DualLoRA, a novel low-rank adaptation method that enhances zero-shot dialogue state tracking across domains by effectively integrating prompts into transformer models without increasing inference latency.

Contribution

The paper proposes DualLoRA, a dual low-rank adaptation architecture that improves prompt influence throughout transformer layers for zero-shot DST, outperforming existing methods.

Findings

01

Outperforms baseline methods on MultiWOZ and SGD datasets

02

Enhances prompt influence across all transformer layers

03

Maintains inference efficiency without latency increase

Abstract

Zero-shot dialogue state tracking (DST) seeks to enable dialogue systems to transition to unfamiliar domains without manual annotation or extensive retraining. Prior research has approached this objective by embedding prompts into language models (LMs). Common methodologies include integrating prompts at the input layer or introducing learnable variables at each transformer layer. Nonetheless, each strategy exhibits inherent limitations. Prompts integrated at the input layer risk underutilization, with their impact potentially diminishing across successive transformer layers. Conversely, the addition of learnable variables to each layer can complicate the training process and increase inference latency. To tackle the issues mentioned above, this paper proposes Dual Low-Rank Adaptation (DualLoRA), a plug-and-play architecture designed for zero-shot DST. DualLoRA incorporates two distinct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

suntea233/duallora
pytorchOfficial

Videos

Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation· underline

Taxonomy

TopicsCognitive Radio Networks and Spectrum Sensing · Seismology and Earthquake Studies · Energy Efficient Wireless Sensor Networks

MethodsDynamic Sparse Training · Stochastic Gradient Descent