SecDTD: Dynamic Token Drop for Secure Transformers Inference
Yifei Cai, Zhuoran Li, Yizhou Feng, Qiao Zhang, Hongyi Wu, Danella Zhao, Chunsheng Xin

TL;DR
SecDTD introduces a dynamic token drop method for secure Transformer inference, significantly reducing computational costs while maintaining accuracy, by shifting token dropping to earlier stages and employing novel scoring and selection techniques.
Contribution
The paper presents SecDTD, a novel secure inference scheme that enables early token dropping in encrypted Transformer models, reducing costs with minimal accuracy loss.
Findings
Achieves 4.47x inference acceleration without accuracy degradation.
Introduces Max-Centric Normalization for efficient early token dropping.
Develops OMSel, a fast oblivious median selection protocol.
Abstract
The rapid adoption of Transformer-based AI has been driven by accessible models such as ChatGPT, which provide API-based services for developers and businesses. However, as these online inference services increasingly handle sensitive inputs, privacy concerns have emerged as a significant challenge. To address this, secure inference frameworks have been proposed, but their high computational and communication overhead often limit practical deployment. In plaintext settings, token drop is an effective technique for reducing inference cost; however, our analysis reveals that directly applying such methods to ciphertext scenarios is suboptimal due to distinct cost distributions in secure computation. We propose SecDTD, a dynamic token drop scheme tailored for secure Transformer inference. SecDTD advances token drop by shifting the dropping to earlier inference stages, effectively reducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Cryptography and Data Security
