Confidential Boosting with Random Linear Classifiers for Outsourced User-generated Data
Sagar Sharma, Keke Chen

TL;DR
SecureBoost is a privacy-preserving boosting framework that enables data owners to train predictive models on encrypted user data using random linear classifiers, leveraging cryptographic techniques for security and efficiency.
Contribution
This paper introduces SecureBoost, a novel confidential boosting framework utilizing random linear classifiers and cryptographic protocols to protect user data in outsourced learning.
Findings
SecureBoost achieves high model quality with encrypted data.
The framework effectively balances security and computational efficiency.
Experimental results demonstrate practical applicability for large-scale user data.
Abstract
User-generated data is crucial to predictive modeling in many applications. With a web/mobile/wearable interface, a data owner can continuously record data generated by distributed users and build various predictive models from the data to improve their operations, services, and revenue. Due to the large size and evolving nature of users data, data owners may rely on public cloud service providers (Cloud) for storage and computation scalability. Exposing sensitive user-generated data and advanced analytic models to Cloud raises privacy concerns. We present a confidential learning framework, SecureBoost, for data owners that want to learn predictive models from aggregated user-generated data but offload the storage and computational burden to Cloud without having to worry about protecting the sensitive data. SecureBoost allows users to submit encrypted or randomly masked data to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Data Security · Privacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting
