TL;DR
This survey comprehensively reviews multiple instance learning (MIL), identifying key problem characteristics, analyzing their impact on algorithm performance, and providing insights and recommendations for future research and benchmarking.
Contribution
It formally characterizes MIL problem types, analyzes their influence on algorithm performance, and offers guidance for future research and benchmarking efforts.
Findings
MIL problem characteristics significantly affect algorithm performance
Performance varies across different data distributions and ambiguity levels
Experiments highlight the importance of problem characteristics in MIL applications
Abstract
Multiple instance learning (MIL) is a form of weakly supervised learning where training instances are arranged in sets, called bags, and a label is provided for the entire bag. This formulation is gaining interest because it naturally fits various problems and allows to leverage weakly labeled data. Consequently, it has been used in diverse application fields such as computer vision and document classification. However, learning from bags raises important challenges that are unique to MIL. This paper provides a comprehensive survey of the characteristics which define and differentiate the types of MIL problems. Until now, these problem characteristics have not been formally identified and described. As a result, the variations in performance of MIL algorithms from one data set to another are difficult to explain. In this paper, MIL problem characteristics are grouped into four broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
