# Exploring the Challenges towards Lifelong Fact Learning

**Authors:** Mohamed Elhoseiny, Francesca Babiloni, Rahaf Aljundi, Marcus Rohrbach,, Manohar Paluri, Tinne Tuytelaars

arXiv: 1812.10524 · 2018-12-31

## TL;DR

This paper introduces a large-scale, more natural lifelong learning setup focusing on diverse, structured, and semantically related visual facts, using new benchmarks with over 900,000 images to evaluate existing methods.

## Contribution

It presents a novel large-scale, realistic LLL benchmark with structured, semantically related facts and long-tail distributions, advancing the evaluation of lifelong learning models.

## Key findings

- State-of-the-art methods are evaluated on the new benchmarks.
- The setup highlights challenges in learning structured and semantically related facts.
- The benchmarks include over 900,000 images and 165,150 facts.

## Abstract

So far life-long learning (LLL) has been studied in relatively small-scale and relatively artificial setups. Here, we introduce a new large-scale alternative. What makes the proposed setup more natural and closer to human-like visual systems is threefold: First, we focus on concepts (or facts, as we call them) of varying complexity, ranging from single objects to more complex structures such as objects performing actions, and objects interacting with other objects. Second, as in real-world settings, our setup has a long-tail distribution, an aspect which has mostly been ignored in the LLL context. Third, facts across tasks may share structure (e.g., <person, riding, wave> and <dog, riding, wave>). Facts can also be semantically related (e.g., "liger" relates to seen categories like "tiger" and "lion"). Given the large number of possible facts, a LLL setup seems a natural choice. To avoid model size growing over time and to optimally exploit the semantic relations and structure, we combine it with a visual semantic embedding instead of discrete class labels. We adapt existing datasets with the properties mentioned above into new benchmarks, by dividing them semantically or randomly into disjoint tasks. This leads to two large-scale benchmarks with 906,232 images and 165,150 unique facts, on which we evaluate and analyze state-of-the-art LLL methods.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.10524/full.md

## Figures

49 figures with captions in the complete paper: https://tomesphere.com/paper/1812.10524/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1812.10524/full.md

---
Source: https://tomesphere.com/paper/1812.10524