Serverless GPU Architecture for Enterprise HR Analytics: A Production-Scale BDaaS Implementation
Guilin Zhang, Wulan Guo, Ziqi Tan, Srinivas Vippagunta, Suchitra Raman, Shreeshankar Chatterjee, Ju Lin, Shang Liu, Mary Schladenhauffen, Jeffrey Luo, Hailong Jiang

TL;DR
This paper introduces a serverless GPU-based architecture for enterprise HR analytics that offers high throughput, low latency, and cost efficiency, while ensuring interpretability and compliance in regulated environments.
Contribution
It presents a novel production-scale BDaaS blueprint integrating serverless GPU runtime with TabNet for interpretable, compliant, and cost-effective analytics in regulated sectors.
Findings
GPU pipelines outperform Spark with 4.5x higher throughput
Latency is reduced by 98x compared to CPU baselines
Cost per 1K inferences is 90% lower with GPU approach
Abstract
Industrial and government organizations increasingly depend on data-driven analytics for workforce, finance, and regulated decision processes, where timeliness, cost efficiency, and compliance are critical. Distributed frameworks such as Spark and Flink remain effective for massive-scale batch or streaming analytics but introduce coordination complexity and auditing overheads that misalign with moderate-scale, latency-sensitive inference. Meanwhile, cloud providers now offer serverless GPUs, and models such as TabNet enable interpretable tabular ML, motivating new deployment blueprints for regulated environments. In this paper, we present a production-oriented Big Data as a Service (BDaaS) blueprint that integrates a single-node serverless GPU runtime with TabNet. The design leverages GPU acceleration for throughput, serverless elasticity for cost reduction, and feature-mask…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI and HR Technologies · Financial Distress and Bankruptcy Prediction · Cloud Computing and Resource Management
