Quantifying Memorization and Detecting Training Data of Pre-trained Language Models using Japanese Newspaper
Shotaro Ishihara, Hiromu Takahashi

TL;DR
This paper investigates how Japanese newspaper domain-specific GPT-2 models memorize training data and demonstrates that such models can reveal their training data through membership inference attacks, highlighting privacy concerns.
Contribution
It extends understanding of memorization and data leakage in pre-trained language models to Japanese, showing similar patterns as in English and emphasizing privacy risks.
Findings
Memorization correlates with data duplication, model size, and prompt length.
Membership inference attacks can detect training data in Japanese PLMs.
Japanese PLMs exhibit data copying similar to English models.
Abstract
Dominant pre-trained language models (PLMs) have demonstrated the potential risk of memorizing and outputting the training data. While this concern has been discussed mainly in English, it is also practically important to focus on domain-specific PLMs. In this study, we pre-trained domain-specific GPT-2 models using a limited corpus of Japanese newspaper articles and evaluated their behavior. Experiments replicated the empirical finding that memorization of PLMs is related to the duplication in the training data, model size, and prompt length, in Japanese the same as in previous English studies. Furthermore, we attempted membership inference attacks, demonstrating that the training data can be detected even in Japanese, which is the same trend as in English. The study warns that domain-specific PLMs, sometimes trained with valuable private data, can ''copy and paste'' on a large scale.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Attention Dropout · Cosine Annealing · Dropout · Residual Connection · Softmax · Weight Decay · Discriminative Fine-Tuning · Byte Pair Encoding
