AutoTrain: No-code training for state-of-the-art models

Abhishek Thakur

arXiv:2410.15735·cs.AI·November 15, 2024

AutoTrain: No-code training for state-of-the-art models

Abhishek Thakur

PDF

Open Access 1 Repo 1 Models

TL;DR

AutoTrain is a versatile, open-source, no-code tool that simplifies training and fine-tuning models across various modalities and tasks, making it accessible for both local and cloud environments.

Contribution

It introduces AutoTrain Advanced, a comprehensive library that supports multi-modal, multi-task model training without coding, integrating best practices and broad model compatibility.

Findings

01

Supports diverse modalities and tasks including LLMs, VLMs, and tabular data.

02

Enables training on local or cloud environments.

03

Integrates with Hugging Face Hub models.

Abstract

With the advancements in open-source models, training (or finetuning) models on custom datasets has become a crucial part of developing solutions which are tailored to specific industrial or open-source applications. Yet, there is no single tool which simplifies the process of training across different types of modalities or tasks. We introduce AutoTrain (aka AutoTrain Advanced) -- an open-source, no code tool/library which can be used to train (or finetune) models for different kinds of tasks such as: large language model (LLM) finetuning, text classification/regression, token classification, sequence-to-sequence task, finetuning of sentence transformers, visual language model (VLM) finetuning, image classification/regression and even classification and regression tasks on tabular data. AutoTrain Advanced is an open-source library providing best practices for training models on custom…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huggingface/autotrain-advanced
pytorchOfficial

Models

🤗
kasim90/turkish
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Scientific Computing and Data Management · Model-Driven Software Engineering Techniques

MethodsLib