Deep Collective Learning: Learning Optimal Inputs and Weights Jointly in Deep Neural Networks
Xiang Deng, Zhongfei (Mark) Zhang

TL;DR
This paper introduces deep collective learning, jointly optimizing inputs and network weights in deep neural networks, with a novel lookup table approach for computer vision, showing promising results on multiple benchmarks.
Contribution
It proposes the paradigm of deep collective learning for end-to-end input and weight optimization in vision tasks, a concept largely unexplored in computer vision.
Findings
Lookup-VNets outperform traditional models on benchmark datasets.
Joint input and weight learning reveals surprising characteristics.
Deep collective learning shows promising potential for improved vision models.
Abstract
It is well observed that in deep learning and computer vision literature, visual data are always represented in a manually designed coding scheme (eg., RGB images are represented as integers ranging from 0 to 255 for each channel) when they are input to an end-to-end deep neural network (DNN) for any learning task. We boldly question whether the manually designed inputs are good for DNN training for different tasks and study whether the input to a DNN can be optimally learned end-to-end together with learning the weights of the DNN. In this paper, we propose the paradigm of {\em deep collective learning} which aims to learn the weights of DNNs and the inputs to DNNs simultaneously for given tasks. We note that collective learning has been implicitly but widely used in natural language processing while it has almost never been studied in computer vision. Consequently, we propose the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
