A C++ library for Multimodal Deep Learning
Jian Jin

TL;DR
MDL is a versatile C++ library designed for multimodal deep learning, supporting various models and compatible across Linux, Mac, and Unix platforms, with dependencies on OpenCV.
Contribution
This paper introduces MDL, a new C++ library that facilitates multimodal deep learning with broad platform support and integration with OpenCV.
Findings
Supports multiple deep learning models
Runs on Linux, Mac, and Unix platforms
Depends on OpenCV for image processing
Abstract
MDL, Multimodal Deep Learning Library, is a deep learning framework that supports multiple models, and this document explains its philosophy and functionality. MDL runs on Linux, Mac, and Unix platforms. It depends on OpenCV.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Handwritten Text Recognition Techniques
MethodsMinimum Description Length
