Dynamic Fog Computing for Enhanced LLM Execution in Medical Applications

Philipp Zagar; Vishnu Ravi; Lauren Aalami; Stephan Krusche; Oliver; Aalami; Paul Schmiedmayer

arXiv:2408.04680·cs.CL·December 16, 2024

Dynamic Fog Computing for Enhanced LLM Execution in Medical Applications

Philipp Zagar, Vishnu Ravi, Lauren Aalami, Stephan Krusche, Oliver, Aalami, Paul Schmiedmayer

PDF

TL;DR

This paper introduces a decentralized fog computing architecture and SpeziLLM framework to improve privacy, trust, and cost-efficiency in deploying large language models for medical applications.

Contribution

It proposes a novel fog computing approach for LLM deployment in healthcare and introduces an open-source framework to facilitate its integration.

Findings

01

SpeziLLM enables seamless LLM deployment across multiple healthcare scenarios.

02

Decentralized execution improves data privacy and trust in medical AI.

03

Cost reduction achieved by shifting from cloud to edge and fog environments.

Abstract

The ability of large language models (LLMs) to transform, interpret, and comprehend vast quantities of heterogeneous data presents a significant opportunity to enhance data-driven care delivery. However, the sensitive nature of protected health information (PHI) raises valid concerns about data privacy and trust in remote LLM platforms. In addition, the cost associated with cloud-based artificial intelligence (AI) services continues to impede widespread adoption. To address these challenges, we propose a shift in the LLM execution environment from opaque, centralized cloud providers to a decentralized and dynamic fog computing architecture. By executing open-weight LLMs in more trusted environments, such as the user's edge device or a fog layer within a local network, we aim to mitigate the privacy, trust, and financial challenges associated with cloud-based LLMs. We further present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.