Dagstuhl Seminar 23122
Deep Continual Learning
( Mar 19 – Mar 24, 2023 )
- Bing Liu (University of Illinois - Chicago, US)
- Vincenzo Lomonaco (University of Pisa, IT)
- Tinne Tuytelaars (KU Leuven, BE)
- Gido van de Ven (KU Leuven, BE)
- Marsha Kleinbauer (for scientific matters)
- Jutka Gasiorowski (for administrative matters)
As part of the mandatory documentation, participants are asked to submit their talk abstracts, working group results, etc. for publication in our series Dagstuhl Reports via the Dagstuhl Reports Submission System.
- Upload (Use personal credentials as created in DOOR to log in)
Dagstuhl Seminar Wiki
- Dagstuhl Seminar Wiki (Use personal credentials as created in DOOR to log in)
- Dagstuhl Materials Page (Use personal credentials as created in DOOR to log in)
Continual learning, also referred to as incremental learning or lifelong learning, is a sub-field of machine learning focusing on the challenging setting where data distributions and/or task specifications vary over time. This includes learning a sequence of tasks as well as learning from data streams. This calls for learning algorithms that can acquire new knowledge over time, with minimal forgetting of what they have learned previously, transfer knowledge across tasks, and smoothly adapt to new circumstances as needed. This contrasts with the traditional setting of machine learning, which largely builds on the premise that all data, both for training and testing, are sampled i.i.d. from a single, stationary data distribution.
Deep learning models in particular are in need of continual learning capabilities. A first reason for this is the strong data-dependence of these models. When trained on a stream of data whose underlying distribution changes over time, deep learning models tend to fully adapt to the most recently seen data, thereby "catastrophically" forgetting the skills they had learned earlier in their training process. Another reason that continual learning capabilities could be especially beneficial for deep learning models, is that they can help deal with the very long training times of these models. The current practice applied in industry, where models are completely re-trained on a regular basis to avoid being outdated, is time inefficient, unsustainable and sub-optimal. Freezing the feature extraction layers is not an option, as the power of deep learning in many challenging applications, be it in computer vision, NLP or audio processing, hinges on the learned representations.
Open research questions we would like to address in this Dagstuhl Seminar include:
- How do we tackle continual learning at scale on real-world problems, where domain shifts may be unpredictable and data can be long-tailed?
- To what extent can recent advances in representation learning and insights in model generalisability help continual learning?
- Rather than relying on tools developed and optimized for machine learning under i.i.d. conditions, should we consider completely different learning strategies?
- In case old data can be revisited, are there more efficient strategies than simply retraining on all available data over and over again?
- What are the open challenges in open world learning and automated continual learning, where the agent discovers new tasks by itself, collects its own training data and incrementally learns the new tasks?
- What can we learn from related fields such as online learning, meta-learning, Bayesian deep learning, robotics and neuroscience?
By aiming to bring together world-class researchers in the field of deep continual learning, as well as in the related fields of online learning, meta-learning, Bayesian deep learning, robotics and neuroscience to discuss and to brainstorm, we plan to set the research agenda for years to come.
- Rahaf Aljundi (Toyota Motor Europe - Zaventem, BE) [dblp]
- Matthias Bethge (Universität Tübingen, DE) [dblp]
- Andrea Cossu (University of Pisa, IT) [dblp]
- Fabian Fumagalli (Universität Bielefeld, DE) [dblp]
- Joao Gama (INESC TEC - Porto, PT) [dblp]
- Alexander Geppert (Hochschule für Angewandte Wissenschaften Fulda, DE) [dblp]
- Tyler Hayes (NAVER Labs Europe - Meylan, FR) [dblp]
- Paul Hofman (LMU München, DE)
- Eyke Hüllermeier (LMU München, DE) [dblp]
- Christopher Kanan (University of Rochester, US) [dblp]
- Tatsuya Konishi (KDDI - Saitama, JP) [dblp]
- Dhireesha Kudithipudi (University of Texas - San Antonio, US) [dblp]
- Christoph H. Lampert (IST Austria - Klosterneuburg, AT) [dblp]
- Bing Liu (University of Illinois - Chicago, US) [dblp]
- Vincenzo Lomonaco (University of Pisa, IT) [dblp]
- Martin Mundt (TU Darmstadt, DE) [dblp]
- Razvan Pascanu (DeepMind - London, GB) [dblp]
- Adrian Popescu (CEA LIST - Nano-INNOV, FR) [dblp]
- James M. Rehg (Georgia Institute of Technology - Atlanta, US) [dblp]
- Andreas Tolias (Baylor College of Medicine - Houston, US) [dblp]
- Tinne Tuytelaars (KU Leuven, BE) [dblp]
- Gido van de Ven (KU Leuven, BE) [dblp]
- Joost van de Weijer (Computer Vision Center - Barcelona, ES) [dblp]
- Eli Verwimp (KU Leuven, BE) [dblp]
- Michal Zajac (Jagiellonian University - Kraków, PL) [dblp]
- Shai Ben-David (University of Waterloo, CA) [dblp]
- Machine Learning
- continual learning
- incremental learning