Learning to Teach with Dynamic Loss Functions
Lijun Wu, Fei Tian, Yingce Xia, Yang Fan, Tao Qin, Jianhuang Lai,, Tie-Yan Liu

TL;DR
This paper introduces a novel framework where a teacher model dynamically generates loss functions to improve the training of student models, leading to better performance across tasks like image classification and translation.
Contribution
It proposes a new method, L2T-DLF, enabling the teacher to adaptively produce loss functions during training, which enhances model learning efficiency and effectiveness.
Findings
Significant performance improvements on image classification tasks.
Enhanced neural machine translation results.
Efficient gradient-based optimization for the teacher model.
Abstract
Teaching is critical to human society: it is with teaching that prospective students are educated and human civilization can be inherited and advanced. A good teacher not only provides his/her students with qualified teaching materials (e.g., textbooks), but also sets up appropriate learning objectives (e.g., course projects and exams) considering different situations of a student. When it comes to artificial intelligence, treating machine learning models as students, the loss functions that are optimized act as perfect counterparts of the learning objective set by the teacher. In this work, we explore the possibility of imitating human teaching behaviors by dynamically and automatically outputting appropriate loss functions to train machine learning models. Different from typical learning settings in which the loss function of a machine learning model is predefined and fixed, in our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Intelligent Tutoring Systems and Adaptive Learning · Topic Modeling
