On Extracting Specialized Code Abilities from Large Language Models: A   Feasibility Study

Zongjie Li; Chaozheng Wang; Pingchuan Ma; Chaowei Liu; Shuai Wang,; Daoyuan Wu; Cuiyun Gao; Yang Liu

arXiv:2303.03012·cs.SE·November 1, 2023·6 cites

On Extracting Specialized Code Abilities from Large Language Models: A Feasibility Study

Zongjie Li, Chaozheng Wang, Pingchuan Ma, Chaowei Liu, Shuai Wang,, Daoyuan Wu, Cuiyun Gao, Yang Liu

PDF

Open Access

TL;DR

This study investigates the feasibility of extracting specialized coding abilities from commercial large language models through imitation attacks, revealing potential security threats and practical attack methods using medium-sized models.

Contribution

The paper introduces a novel approach to slice black-box LLMs by training medium-sized models to imitate their specialized code abilities, highlighting security risks.

Findings

01

Imitation attacks can successfully replicate code synthesis and translation abilities.

02

Effective training of medium-sized models requires a reasonable number of queries.

03

Response checks improve the quality of imitation outputs.

Abstract

Recent advances in large language models (LLMs) significantly boost their usage in software engineering. However, training a well-performing LLM demands a substantial workforce for data collection and annotation. Moreover, training datasets may be proprietary or partially open, and the process often requires a costly GPU cluster. The intellectual property value of commercial LLMs makes them attractive targets for imitation attacks, but creating an imitation model with comparable parameters still incurs high costs. This motivates us to explore a practical and novel direction: slicing commercial black-box LLMs using medium-sized backbone models. In this paper, we explore the feasibility of launching imitation attacks on LLMs to extract their specialized code abilities, such as"code synthesis" and "code translation." We systematically investigate the effectiveness of launching code ability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques