Dagstuhl Seminar 25202: Generative Models for 3D Vision

Dagstuhl Seminar 25202

Generative Models for 3D Vision

( May 11 – May 16, 2025 )

(Click in the middle of the image to enlarge)

Permalink

Please use the following short url to reference this page: https://www.dagstuhl.de/25202

Organizers

Bernhard Egger (Friedrich-Alexander-Universität Erlangen-Nürnberg, DE)
Adam Kortylewski (Universität Freiburg, DE and MPI für Informatik - Saarbrücken, DE)
William Smith (University of York, GB)
Stefanie Wuhrer (INRIA - Grenoble, FR)

Contact

Marsha Kleinbauer (for scientific matters)
Simone Schilke (for administrative matters)

Publications

Laura Neschen, Bernhard Egger, Adam Kortylewski, William Smith, and Stefanie Wuhrer. Generative Models for 3D Vision (Dagstuhl Seminar 25202). In Dagstuhl Reports, Volume 15, Issue 5, pp. 96-113, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2025)

Schedule

Schedule

Summary

Show Summary

The rise of purely data-driven generative models, in particular generative adversarial networks, auto-regressive models, neural fields and diffusion models, has led to a step change in image synthesis quality. It is now possible to create photorealistic images with high level semantic control and solve many desirable use cases such as 2D inpainting. Whilst prior models were object specific (e.g. 3D Morphable Models of Faces), we now have generative models for images and videos that can represent various object classes and generate a huge variety of objects and scenes, even in different styles. The drawback of purely data-driven approaches is that the control and explainability provided by 3D and physically-based parameters is lost. It is also difficult (and perhaps prohibitively inefficient) to learn 3D consistent representations without prior models purely from 2D data alone.

For this seminar, a total of 58 researchers were invited, and 25 of them attended. Participants came from both academia and industry and at varying stages of their careers. Thirteen participants presented their work in around 15-30 minute presentations, and an abstract of each presentation is included in this report. We started the seminar with a short introduction of each participant. Everyone was given one slide to introduce themselves and asked to prepare a question, challenge or goal to discuss during the seminar.

In addition to traditional presentations, multiple slots were left for research discussions with the full group or sub-groups of the participants. The first set of these slots was filled with topics that participants proposed before the start of the seminar. Five participants led research discussions of about 1 hour each about a topic or a problem they considered important. Some of these discussions were led with the full group, while others were discussed in sub-groups, and the resulting conclusions were shared with the full group afterwards. Additionally, two 2 hour discussion slots were initially reserved to be filled with suggestions that came up during the seminar. These two long discussions concerned research questions that were identified as being important for the research community in the course of the seminar, namely the topics of metrics and capture, and hard problems in the research community that merit being studied more. All proposed topics led to lively discussions about various problems around generative models. Summaries of the results of these flexible sessions are contained in the Dagstuhl report. In addition to these organized discussions, there were numerous informal discussions during both the Wednesday outing and free time slots that are not summarized in this report.

Creative Commons BY 4.0

Bernhard Egger, Adam Kortylewski, Laura Neschen, William Smith, and Stefanie Wuhrer

Motivation

Show Motivation

Very recently, the community has begun to explore how to combine these two philosophies. 3D computer vision tasks can benefit from the visual prior provided by generative image models. Generative models can learn powerful image priors with some notion of view-point consistency from solely 2D data and then be used to synthesize training data for 3D vision models. Physically-based priors from 3D vision can be used to guide generative image models as a strong explicit inductive prior towards more data-efficient and accurate visual representations of the world. On the other hand, modern generative models rely on huge training datasets and compute resources that, increasingly, are only available to large industrial research labs.

This Dagstuhl Seminar seeks to bring together communities of researchers from computer graphics, computer vision and machine learning in both industry and academia at this extremely timely moment in the progress of the field.

Creative Commons BY 4.0

Bernhard Egger, Adam Kortylewski, William Smith, and Stefanie Wuhrer

Participants

Show Participants

Andreea Ardelean (Universität Erlangen-Nürnberg, DE) [dblp]
Timotei Ardelean (Universität Erlangen-Nürnberg, DE)
Thabo Beeler (Google - Zürich, CH) [dblp]
Timo Bolkart (Google Research - Zürich, CH) [dblp]
Neill Campbell (University of Bath, GB) [dblp]
Rishabh Dabral (MPI für Informatik - Saarbrücken, DE)
Olaf Dünkel (MPI für Informatik - Saarbrücken, DE) [dblp]
Bernhard Egger (Friedrich-Alexander-Universität Erlangen-Nürnberg, DE) [dblp]
James Gardner (University of York, GB)
Samara Ghrer (University of Grenoble, FR)
Marilyn Keller (MPI für Intelligente Systeme - Tübingen, DE) [dblp]
Ron Kimmel (Technion - Haifa, IL) [dblp]
Tobias Kirschstein (TU München - Garching, DE) [dblp]
Adam Kortylewski (Universität Freiburg, DE and MPI für Informatik - Saarbrücken, DE) [dblp]
Jan Eric Lenssen (MPI für Informatik - Saarbrücken, DE)
Ruoshi Liu (Columbia University - New York, US) [dblp]
Laura Neschen (INRIA Rhône-Alpes, FR)
Or Patashnik (Tel Aviv University, IL) [dblp]
Ryan Po (Stanford University, US) [dblp]
Shunsuke Saito (Codec Avatars Lab - Pittsburgh, US) [dblp]
William Smith (University of York, GB) [dblp]
Christian Theobalt (MPI für Informatik - Saarbrücken, DE) [dblp]
Gül Varol (ENPC - Marne-la-Vallée, FR) [dblp]
Yaniv Wolf (Technion - Haifa, IL)
Stefanie Wuhrer (INRIA - Grenoble, FR) [dblp]

Related Seminars

Dagstuhl Seminar 19102: 3D Morphable Models (2019-03-03 - 2019-03-08) (Details)
Dagstuhl Seminar 22121: 3D Morphable Models and Beyond (2022-03-20 - 2022-03-25) (Details)

Classification

Computer Vision and Pattern Recognition
Graphics
Machine Learning

Keywords

Generative Models
Implicit Representation
Diffusion Models
Neural Rendering
Inverse Rendering

Seminar 25202

Search the Dagstuhl Website

Schloss Dagstuhl Services

Seminars

Within this website:

External resources:

Publishing

Within this website:

External resources:

dblp

Within this website:

External resources:

Dagstuhl Seminar 25202

Generative Models for 3D Vision

( May 11 – May 16, 2025 )

Permalink

Organizers

Contact

Publications

Schedule

Summary

Motivation

Participants

Related Seminars

Classification

Keywords