Dagstuhl Seminar 27231: Reinforcement Learning for Optimization

Dagstuhl Seminar 27231

Reinforcement Learning for Optimization

( Jun 06 – Jun 11, 2027 )

Permalink

Please use the following short url to reference this page: https://www.dagstuhl.de/27231

Organizers

Quentin Cappart (UCLouvain, BE & Polytechnique Montréal, CA)
Nguyen Dang (University of St Andrews, GB)
Carola Doerr (CNRS and Sorbonne University - Paris, FR)
Kevin Tierney (Universität Wien, AT)

Contact

Michael Gerke (for scientific matters)
Christina Schwarz (for administrative matters)

Motivation

Show Motivation

Reinforcement learning (RL) has become a powerful paradigm for sequential decision-making, with major successes in domains such as robotics, game playing, and language modelling. In optimization, however, its impact is still uneven. While RL has shown promise in improving algorithmic components and in learning solution strategies end-to-end, consistent and transferable gains across problem classes remain difficult to achieve. Key challenges include sparse and delayed rewards, highly structured or combinatorial search spaces, expensive evaluations, and the need to generalize across diverse instance distributions.

Over the past years, research at the intersection of RL and optimization has emerged in three largely independent communities:

Evolutionary algorithms and metaheuristics, where RL is used to control and adapt search components or even construct new algorithms;
General-purpose solving paradigms (e.g., constraint programming, mixed-integer programming, SAT/SMT), where RL is integrated into highly engineered solving pipelines to learn branching, variable selection, restart, or cutting strategies; and
Neural combinatorial optimization, where RL is used to learn solution construction or improvement policies directly.

Despite their different development histories and methodologies, it is very clear that these communities face a set of common challenges when applying RL to optimization problems. Among them are the design of effective representations, the stability and scalability of learning procedures, and the ability to generalize across problem families.

The goal of this Dagstuhl Seminar is to bring together leading researchers from these three domains to foster exchange, identify common principles, and accelerate progress toward a unified understanding of RL for optimization. The seminar will focus on a set of concrete topics intended to stimulate cross-domain discussion and to define a shared research agenda for RL in optimization, including:

Shared challenges across all domains when using RL, particularly deep RL, including representation learning, stability, scalability, and generalization.
Domain-specific difficulties, and whether solutions from one area (e.g., evolutionary computation) can be transferred to another (e.g., CP or MIP).
Common methodologies, benchmarks, and theoretical frameworks that could unify the study of RL for optimization.
Best practices and lessons learned, including the integration of negative results and empirical insights that rarely appear in publications but are crucial for scientific progress.

Creative Commons BY 4.0

Quentin Cappart, Nguyen Dang, Carola Doerr, and Kevin Tierney

LZI Junior Researchers

Show LZI Junior Researchers

This seminar qualifies for Dagstuhl's LZI Junior Researchers program. Schloss Dagstuhl wishes to enable the participation of junior scientists with a specialisation fitting for this Dagstuhl Seminar, even if they are not on the radar of the organizers. Applications by outstanding junior scientists are possible until Friday, September 18, 2026.

Classification

Artificial Intelligence
Machine Learning
Neural and Evolutionary Computing

Keywords

optimization
reinforcement learning
evolutionary computation
general-purpose solvers
neural combinatorial optimization

Seminar 27231

Search the Dagstuhl Website

Schloss Dagstuhl Services

Seminars

Within this website:

External resources:

Publishing

Within this website:

External resources:

dblp

Within this website:

External resources:

Dagstuhl Seminar 27231

Reinforcement Learning for Optimization

( Jun 06 – Jun 11, 2027 )

Permalink

Organizers

Contact

Motivation

LZI Junior Researchers

Classification

Keywords