What's New in My Data? Novelty Exploration via Contrastive Generation

Masaru Isonuma; Ivan Titov

arXiv:2410.14765·cs.LG·October 22, 2024

What's New in My Data? Novelty Exploration via Contrastive Generation

Masaru Isonuma, Ivan Titov

PDF

Open Access 1 Video 3 Reviews

TL;DR

This paper introduces Contrastive Generative Exploration (CGE), a method for identifying novel properties in fine-tuning datasets by contrasting pre-trained and fine-tuned models, with an iterative approach to enhance diversity and effectiveness.

Contribution

The paper proposes a novel contrastive generation method for novelty discovery in datasets without direct data access, including an iterative version to improve diversity of generated examples.

Findings

01

CGE effectively detects novel content like toxic language and new languages.

02

The iterative CGE enhances diversity and captures a broader range of dataset properties.

03

CGE remains effective even with privacy-preserving fine-tuning methods.

Abstract

Fine-tuning is widely used to adapt language models for specific goals, often leveraging real-world data such as patient records, customer-service interactions, or web content in languages not covered in pre-training. These datasets are typically massive, noisy, and often confidential, making their direct inspection challenging. However, understanding them is essential for guiding model deployment and informing decisions about data cleaning or suppressing any harmful behaviors learned during fine-tuning. In this study, we introduce the task of novelty discovery through generation, which aims to identify novel properties of a fine-tuning dataset by generating examples that illustrate these properties. Our approach, Contrastive Generative Exploration (CGE), assumes no direct access to the data but instead relies on a pre-trained model and the same model after fine-tuning. By contrasting…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 4

Strengths

- Shifting Focus from Data Inspection to Model Behavior: This is a neat transition because direct data access is often restricted due to privacy concerns or the sheer scale of the datasets involved. CGE, therefore, offers a practical and potentially more scalable solution for understanding novelties in fine-tuning data. - Contrastive Decoding for Dataset Exploration: This technique, originally developed for controlling text generation properties, is cleverly adapted to distinguish between in-dis

Weaknesses

- Missing an honest baseline like n-gram: This paper misses basic baselines such as n-gram distribution change, or top-k nearest neighbour distance in the embedding space. - This could quite naturally be done in the RPJ experiments. - Can also be adapted in the DP-SGD experiments by returning privatized n-gram stats. - You might like the infini-gram paper as a tool to enable such analysis - Constraint: The paper writes that one of the constraints for the method to work effectively is

Reviewer 02Rating 6Confidence 4

Strengths

1. The experimental setting are soild. They conduct experiments on multiple base LLM and mulitple datasets. The authors report the experimental results on multiple evaluation metrics. 2. To achieve contrastive generative exploration, the methods consists of static approach, iterative approach, and so on. The design of methods seem reasonable. 3. The task (direction) this paper focusing on sounds interesting. Achieve a indirect and effective tine-tuning is curical in LLM and generalized methods

Weaknesses

1. The baseline methods are kind of old and weak. It would be better to compare methods proposed in 2023 or 2024, but the baselines used in this submission are proposed before 2021. 2. To compare with DP-based methods, the stardard setting is to compare over different \epsilon instead of the noise multiplier. \epsilon indicates the strength of the privacy protection but noise multiplier is not so clear to show the privacy protection performance since the scale of noises varies in different mode

Reviewer 03Rating 6Confidence 3

Strengths

- Identifying what kind of examples in the fine-tuning set that are not present in the pretraining set is interesting.

Weaknesses

- The writing is poor, and the story and motivation are vague and confusing. The authors keep switching between writing "novelty exploration", "novel characteristics", and "novel domains" which makes it difficult to understand what the actual task is. After much digging, this work is basically about identifying example classes that are in the fine-tuning set and not in the pretraining set. Perhaps the title could be, "identifying new classes in the finetuning set". - It is not clear what the go

Videos

What's New in My Data? Novelty Exploration via Contrastive Generation· slideslive

Taxonomy

TopicsScientific Computing and Data Management · Data Analysis with R · Time Series Analysis and Forecasting