FlexModel: A Framework for Interpretability of Distributed Large   Language Models

Matthew Choi; Muhammad Adil Asif; John Willes; David Emerson

arXiv:2312.03140·cs.LG·December 7, 2023·1 cites

FlexModel: A Framework for Interpretability of Distributed Large Language Models

Matthew Choi, Muhammad Adil Asif, John Willes, David Emerson

PDF

Open Access 1 Repo

TL;DR

FlexModel is a software framework that simplifies interaction with distributed large language models, making interpretability and research more accessible for users without extensive distributed computing expertise.

Contribution

It introduces a user-friendly interface that integrates with existing distributed model tools, enabling easier interpretability and interaction with large-scale models.

Findings

01

Enhances accessibility for model interpretability in distributed settings

02

Bridges the gap between distributed and single-device model paradigms

03

Facilitates inclusive research in large-scale neural networks

Abstract

With the growth of large language models, now incorporating billions of parameters, the hardware prerequisites for their training and deployment have seen a corresponding increase. Although existing tools facilitate model parallelization and distributed training, deeper model interactions, crucial for interpretability and responsible AI techniques, still demand thorough knowledge of distributed computing. This often hinders contributions from researchers with machine learning expertise but limited distributed computing background. Addressing this challenge, we present FlexModel, a software package providing a streamlined interface for engaging with models distributed across multi-GPU and multi-node configurations. The library is compatible with existing model distribution libraries and encapsulates PyTorch models. It exposes user-registerable HookFunctions to facilitate straightforward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vectorinstitute/flex_model
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications

MethodsLib