Tricks and Plugins to GBM on Images and Sequences
Biyi Fang, Jean Utke, Diego Klabjan

TL;DR
This paper introduces new boosting algorithms for CNNs and transformers that simplify architecture search, improve performance, and reduce training time on image and sequence tasks.
Contribution
It proposes a novel boosting framework with subgrid selection and importance sampling for deep networks, enhancing efficiency and accuracy.
Findings
Outperforms benchmarks on fine-grained classification tasks
Reduces manual effort in architecture tuning
Achieves superior performance with lower training time
Abstract
Convolutional neural networks (CNNs) and transformers, which are composed of multiple processing layers and blocks to learn the representations of data with multiple abstract levels, are the most successful machine learning models in recent years. However, millions of parameters and many blocks make them difficult to be trained, and sometimes several days or weeks are required to find an ideal architecture or tune the parameters. Within this paper, we propose a new algorithm for boosting Deep Convolutional Neural Networks (BoostCNN) to combine the merits of dynamic feature selection and BoostCNN, and another new family of algorithms combining boosting and transformers. To learn these new models, we introduce subgrid selection and importance sampling strategies and propose a set of algorithms to incorporate boosting weights into a deep learning architecture based on a least squares…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning and Data Classification · Advanced Neural Network Applications
MethodsFeature Selection
