Dagstuhl Seminar 22382
Machine Learning for Science: Bridging Data-Driven and Mechanistic Modelling
( Sep 18 – Sep 23, 2022 )
Permalink
Organizers
- Philipp Berens (Universität Tübingen, DE)
- Kyle Cranmer (University of Wisconsin - Madison, US)
- Neil D. Lawrence (University of Cambridge, GB)
- Ulrike von Luxburg (Universität Tübingen, DE)
Coordinator
- Jessica Montgomery (University of Cambridge, GB)
Contact
- Michael Gerke (for scientific matters)
- Susanne Bach-Bernhard (for administrative matters)
Shared Documents
- Dagstuhl Materials Page (Use personal credentials as created in DOOR to log in)
Impacts
- The passive symmetries of machine learning - Villar, Soledad; Hogg, David W.; Yao, Weichi; Kevrekidis, George A.; Schölkopf, Bernhard - Cornell University : arXiv.org, 2023. - 14 pp..
- AI for science : an emerging agenda - Berens, Philipp; Cranmer, Kyle; Lawrence, Neil D.; Luxburg, Ulrike von; Montgomery, Jessica - Cornell University : arXiv.org, 2023. - 44 pp..
Schedule
Machine learning has the potential to transform research and innovation. Today’s machine learning methods are already being applied to advance the frontiers of science, helping researchers better understand how the world around us works – from interactions between atoms, to the ways that proteins fold, interactions between cells, the dynamics of Earth’s systems and the discovery of exoplanets. These contributions are the foothills of the wider transformation that machine learning could bring for science and the scientific workflow.
Recent successes in the deployment of machine learning for scientific discovery point to the potential of a new generation of machine learning methods for science. These tools would combine data-derived insights with existing domain knowledge or theory, creating more powerful analytical tools. They would enhance researchers’ ability to simulate the systems they study, testing new ideas or identifying new areas for investigation; and they would support researchers to understand not only what patterns can be found in data, but why and how such patterns have emerged.
Creating this new generation of machine learning methods requires further efforts to bridge the current gap between data-driven and mechanistic modelling. Recent successes in the field suggest a route to create these hybrid approaches. Through further development of machine learning approaches that encode domain knowledge in data-driven systems, that enable simulation and emulation of complex real-world systems, and that allow causal inference in data-enabled systems, machine learning research could create more powerful tools for scientific discovery.
This Dagstuhl Seminar will seek to articulate a roadmap for bridging the gap between data-driven and mechanistic modelling approaches. It will consider the lessons that recent work at the interface of machine learning and science provides for the future development of the field, and it will review emerging research directions at this interface. In so doing, it will identify a set of common interests where further research could unlock progress in the use of machine learning for scientific discovery.
Machine learning methods have already been successfully adopted in a variety of scientific domains. This seminar will review recent experiences of – and lessons learned from – efforts to deploy machine learning to advance:
- Healthcare and biomedical sciences including neuroscience
- Climatology and environmental sciences
- Theoretical and experimental physics
By reviewing these recent experiences, the seminar will identify emerging research directions and best practices in:
- Encoding domain knowledge in machine learning systems, reviewing methods for leveraging insights from data while embedding the knowledge contained in mechanistic modelling approaches.
- Simulation and emulation, investigating how innovations in the mathematics of emulation and techniques for understanding uncertainty propagation can support more effective machine learning tool.
- Approaches to causality in machine learning, exploring how techniques from statistical inference and uncertainty quantification can be combined to create a new mathematics of causality.

- Bubacarr Bah (AIMS South Africa - Cape Town, ZA)
- Jessica Beasley (Collective Next - Boston, US)
- Philipp Berens (Universität Tübingen, DE) [dblp]
- Maren Büttner (Helmholtz Zentrum München & Universität Bonn)
- Thomas G. Dietterich (Oregon State University - Corvallis, US) [dblp]
- Carl Henrik Ek (University of Cambridge, GB)
- Asja Fischer (Ruhr-Universität Bochum, DE) [dblp]
- Philipp Hennig (Universität Tübingen, DE) [dblp]
- David W. Hogg (New York University, US)
- Christian Igel (University of Copenhagen, DK) [dblp]
- Samuel Kaski (Aalto University, FI) [dblp]
- Ieva Kazlauskaite (University of Cambridge, GB)
- Hans Kersting (INRIA - Paris, FR) [dblp]
- Niki Kilbertus (TU München, DE & Helmholtz AI München, DE)
- Neil D. Lawrence (University of Cambridge, GB) [dblp]
- Gilles Louppe (University of Liège, BE)
- Jakob Macke (Universität Tübingen, DE)
- Siddharth Mishra-Sharma (MIT - Cambridge, US)
- Jessica Montgomery (University of Cambridge, GB)
- Jonas Peters (University of Copenhagen, DK) [dblp]
- Markus Reichstein (MPI für Biogeochemistry - Jena, DE) [dblp]
- Bernhard Schölkopf (MPI für Intelligente Systeme - Tübingen, DE) [dblp]
- Soledad Villar (Johns Hopkins University - Baltimore, US)
- Ulrike von Luxburg (Universität Tübingen, DE) [dblp]
- Verena Wolf (Universität des Saarlandes - Saarbrücken, DE) [dblp]
- Mauricio A Álvarez (University of Manchester, GB)
- Kyle Cranmer (University of Wisconsin - Madison, US)
- Stuart Feldman (Schmidt Futures - New York, US)
- Vidhi Lalchand (University of Cambridge, GB)
- Dina Machuve (DevData Analytics - A, TZ)
- Eric Meissner (University of Cambridge, GB)
- Aditya Ravuri (University of Cambridge, GB)
- Francisco Vargas (University of Cambridge, GB)
Classification
- Artificial Intelligence
- Machine Learning
Keywords
- Machine learning
- AI
- scientific discovery