Knowledge Matters: Importance of Prior Information for Optimization
\c{C}a\u{g}lar G\"ul\c{c}ehre, Yoshua Bengio

TL;DR
Introducing prior intermediate-level information into neural networks significantly improves learning on complex tasks where other algorithms fail, highlighting the importance of curriculum-like supervision.
Contribution
This work demonstrates that pre-training intermediate concepts enables neural networks to learn difficult tasks that traditional algorithms cannot, emphasizing the role of prior knowledge.
Findings
Two-tiered MLP with intermediate pre-training learns perfectly
Traditional algorithms perform no better than chance
Intermediate supervision helps overcome optimization challenges
Abstract
We explore the effect of introducing prior information into the intermediate level of neural networks for a learning task on which all the state-of-the-art machine learning algorithms tested failed to learn. We motivate our work from the hypothesis that humans learn such intermediate concepts from other individuals via a form of supervision or guidance using a curriculum. The experiments we have conducted provide positive evidence in favor of this hypothesis. In our experiments, a two-tiered MLP architecture is trained on a dataset with 64x64 binary inputs images, each image with three sprites. The final task is to decide whether all the sprites are the same or one of them is different. Sprites are pentomino tetris shapes and they are placed in an image with different locations using scaling and rotation transformations. The first part of the two-tiered MLP is pre-trained with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Language and cultural evolution · Image Retrieval and Classification Techniques
