PyTorchPipe: a framework for rapid prototyping of pipelines combining language and vision
Tomasz Kornuta

TL;DR
PyTorchPipe is a flexible, component-based framework built on PyTorch that enables rapid prototyping of complex multi-modal machine learning pipelines using human-readable configurations.
Contribution
The paper introduces PyTorchPipe, a novel framework that simplifies building and training multi-modal models with a pipeline approach and YAML configuration files.
Findings
Supports complex language and vision models
Enables efficient use of CPUs and GPUs
Provides a user-friendly, modular pipeline structure
Abstract
Access to vast amounts of data along with affordable computational power stimulated the reincarnation of neural networks. The progress could not be achieved without adequate software tools, lowering the entry bar for the next generations of researchers and developers. The paper introduces PyTorchPipe (PTP), a framework built on top of PyTorch. Answering the recent needs and trends in machine learning, PTP facilitates building and training of complex, multi-modal models combining language and vision (but is not limited to those two modalities). At its core, PTP employs a component-oriented approach and relies on the concept of a pipeline, defined as a directed acyclic graph of loosely coupled components. A user defines a pipeline using yaml-based (thus human-readable) configuration files, whereas PTP provides generic workers for their loading, training, and testing using all the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Topic Modeling
