Tricks and Plugins to GBM on Images and Sequences

Biyi Fang; Jean Utke; Diego Klabjan

arXiv:2203.00761·cs.LG·March 3, 2022

Tricks and Plugins to GBM on Images and Sequences

Biyi Fang, Jean Utke, Diego Klabjan

PDF

Open Access

TL;DR

This paper introduces new boosting algorithms for CNNs and transformers that simplify architecture search, improve performance, and reduce training time on image and sequence tasks.

Contribution

It proposes a novel boosting framework with subgrid selection and importance sampling for deep networks, enhancing efficiency and accuracy.

Findings

01

Outperforms benchmarks on fine-grained classification tasks

02

Reduces manual effort in architecture tuning

03

Achieves superior performance with lower training time

Abstract

Convolutional neural networks (CNNs) and transformers, which are composed of multiple processing layers and blocks to learn the representations of data with multiple abstract levels, are the most successful machine learning models in recent years. However, millions of parameters and many blocks make them difficult to be trained, and sometimes several days or weeks are required to find an ideal architecture or tune the parameters. Within this paper, we propose a new algorithm for boosting Deep Convolutional Neural Networks (BoostCNN) to combine the merits of dynamic feature selection and BoostCNN, and another new family of algorithms combining boosting and transformers. To learn these new models, we introduce subgrid selection and importance sampling strategies and propose a set of algorithms to incorporate boosting weights into a deep learning architecture based on a least squares…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Data Classification · Advanced Neural Network Applications

MethodsFeature Selection