Dagstuhl Seminar 24372

Explainable AI for Sequential Decision Making

(Sep 08 – Sep 11, 2024)

Permalink
Please use the following short URL to reference this page: https://www.dagstuhl.de/24372

Organizers

  • Hendrik Baier (TU Eindhoven, NL)
  • Mark T. Keane (University College Dublin, IE)
  • Sarath Sreedharan (Colorado State University - Fort Collins, US)
  • Silvia Tulli (Sorbonne University - Paris, FR)
  • Abhinav Verma (Pennsylvania State University - University Park, US)

Summary

We work with AI and rely on AI for more and more decisions that influence our lives. To serve increasingly urgent goals such as enabling transparency, enhancing collaboration, and increasing trust in AI, the research area of explainable AI (XAI) has rapidly developed in recent years. However, the focus of XAI to date has largely been on explaining the input-output mappings of "black box" models such as neural networks, which have been seen as the central problem for the explainability of AI systems. While these models are certainly important, intelligent behavior often extends over time and needs to be explained and understood as such. The challenge of explaining sequential decision making (SDM), such as that of robots collaborating with humans or software agents engaged in complex ongoing tasks, has only recently gained attention. We may have AIs that can beat us in Go, but can they teach us how to play? We may have search and rescue robots, but can we effectively communicate with them and coordinate missions in the field?
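
To make the contrast concrete, here is a minimal, purely illustrative sketch of the kind of single input-output explanation that most XAI work to date has targeted: an occlusion-style feature attribution for one prediction of a stand-in "black box". The model, weights, and function names below are hypothetical choices of ours, not anything presented at the seminar.

    # Illustrative only: explaining one input-output mapping by measuring
    # how the score changes when each feature is replaced by a baseline.
    import numpy as np

    def black_box(x):
        # Stand-in for an opaque model: maps a feature vector to one score.
        w = np.array([0.5, -2.0, 1.5])  # weights hidden from the user
        return float(w @ x)

    def feature_attributions(x, baseline):
        # Importance of feature i = score change when x[i] is set to baseline[i].
        scores = []
        for i in range(len(x)):
            x_pert = x.copy()
            x_pert[i] = baseline[i]
            scores.append(black_box(x) - black_box(x_pert))
        return scores

    x = np.array([1.0, 1.0, 1.0])
    print(feature_attributions(x, np.zeros(3)))  # -> [0.5, -2.0, 1.5]

Note that such an attribution explains a single prediction in isolation; an agent acting over time has no one such mapping to explain, which is exactly the gap discussed here.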

Initial attempts at making the behavior of SDM algorithms more understandable have recently appeared in fields such as classical AI planning, reinforcement learning, multi-agent systems, and logic-based argumentation – but these attempts have often focused on ad hoc solutions to specific problems, with an emphasis on area-specific terminology and concepts developed in isolation from other fields. Many of these approaches are also restricted in scope, for example to explanations of isolated, single actions that do not address the full complexity of SDM, or to summaries of entire agent policies, which are often too high-level to be helpful. To truly trust an AI agent and work with it collaboratively towards human goals, and to increase successful AI adoption and acceptance in fields from robotics to logistics and from production planning to smart cities, we need considerable progress in this new field of XAI for SDM (or X-SDM).

Under-researched challenges for X-SDM include, for example: XAI for complex decisions, e.g., on plans or policies instead of single output labels; conversational XAI that continuously interacts with users over time, aiming to understand and support them; contestable and collaborative XAI, which can work successfully with users in areas where neither user nor AI is omniscient or infallible; and flexible decision-making for XAI, able to adapt to users, respect their autonomy, and go beyond one-size-fits-all explanations. This Dagstuhl Seminar aimed to identify and clarify the challenges that are unique to, or of particular relevance to, explainability in sequential decision-making settings. It drew on the complementary perspectives of researchers from communities such as reinforcement learning, planning, game AI, robotics, and cognitive science, which historically use different theoretical foundations and computational approaches. The aim of the seminar was to move towards a shared understanding of the field by first building a taxonomy that unifies these perspectives, and then developing a common roadmap for moving X-SDM forward.
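
As a hypothetical illustration of the first of these challenges, the following sketch derives a minimal contrastive answer to "why this action rather than that one?" from the Q-values of a toy corridor MDP, computed by standard value iteration. The environment, the function names, and the explanation template are illustrative assumptions of ours, not methods developed at the seminar.

    # Hypothetical sketch: a contrastive explanation for one decision
    # inside a sequential policy, based on expected returns (Q-values).
    import numpy as np

    N_STATES, LEFT, RIGHT, GOAL = 5, 0, 1, 4

    def step(s, a):
        # Deterministic corridor: reward 1.0 only for reaching the goal.
        s2 = max(0, s - 1) if a == LEFT else min(N_STATES - 1, s + 1)
        return s2, (1.0 if s2 == GOAL and s != GOAL else 0.0)

    def value_iteration(gamma=0.9, sweeps=200):
        # Standard value iteration; the goal state is treated as absorbing.
        q = np.zeros((N_STATES, 2))
        for _ in range(sweeps):
            v = q.max(axis=1)
            for s in range(N_STATES):
                for a in (LEFT, RIGHT):
                    s2, r = step(s, a)
                    q[s, a] = r + (0.0 if s == GOAL else gamma * v[s2])
        return q

    def explain_choice(q, s, chosen, alternative):
        # Contrast expected returns rather than explaining an output label.
        return (f"In state {s}, the chosen action has expected return "
                f"{q[s, chosen]:.2f} vs. {q[s, alternative]:.2f} for the alternative.")

    q = value_iteration()
    print(explain_choice(q, s=1, chosen=RIGHT, alternative=LEFT))

Even this toy hints at why one-size-fits-all answers fall short: the same Q-value gap may call for very different explanations depending on what a user already knows and wants to contest.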

The seminar was organized in two parts. In the first part, breakout groups were formed based on the participants' primary research communities; these groups were encouraged to identify key terminology and definitions in their areas, which were later summarized in plenary sessions and combined into a first sketch of a shared taxonomy. Building on this understanding of the field of explainable sequential decision making, breakout groups in the second part were formed by participants interested in a particular aspect of future work, and their results were again summarized and combined in later plenary sessions to develop a first sketch of a roadmap for the field. The seminar was accompanied by spotlight talks throughout, giving insight into the work of individual participants. This report gives an overview of the talks, outlines the discussions of all breakout groups, and summarizes the sketches of the taxonomy and the roadmap. Interest in the seminar was high, and participants were eager to contribute and to connect across research fields; the most frequently expressed regret was that the seminar did not last long enough to flesh out more details of the work we started. As organizers, we therefore consider this seminar a great success and look forward to the resulting collaborations and their impact on the field.

Copyright Hendrik Baier, Mark T. Keane, Sarath Sreedharan, Silvia Tulli, and Abhinav Verma

Motivation

As we work with AI and rely on AI for more and more decisions that influence our lives, the research area of explainable AI (XAI) has rapidly developed, with goals such as increasing trust, enhancing collaboration, and enabling transparency in AI. However, to date, the focus of XAI has largely been on explaining the input-output mappings of “black box” models like neural networks, which have been seen as the central problem for the explainability of AI systems. While these models are certainly important, intelligent behavior often extends over time and needs to be explained and understood as such. The challenge of explaining sequential decision-making (SDM), such as that of robots collaborating with humans or software agents engaged in complex ongoing tasks, has only recently gained attention. We may have AIs that can beat us in Go, but can they teach us how to play? We may have search and rescue robots, but can we effectively communicate with them, and coordinate missions with them, in the field?

Initial attempts to make the behavior of SDM algorithms more understandable have recently appeared in fields such as classical AI planning, reinforcement learning, multi-agent systems, and logic-based argumentation – but these attempts have often focused on ad hoc solutions to specific problems, with an emphasis on area-specific terminology and concepts developed in isolation from other fields. Many of these approaches are also restricted in scope, for example to explanations of isolated, single actions that do not address the full complexity of SDM, or to summaries of entire agent policies, which are often too high-level to be helpful. To truly trust an AI agent and work with it collaboratively towards human goals, and to increase successful AI adoption and acceptance in fields from robotics to logistics and from production planning to smart cities, we need considerable progress in this new field of XAI for SDM (or X-SDM), which we will focus on developing in this Dagstuhl Seminar.

The seminar will focus on under-researched challenges that are unique to, or of particular relevance to, explainability in sequential decision-making settings. We will seek to identify and clarify such challenges, making use of the complementary perspectives of researchers from different communities such as reinforcement learning, planning, recommender systems, and multi-agent systems, which historically use different theoretical foundations and computational approaches. While the participants will form working groups based on their own research interests and priorities, topics to be discussed can include, for example: XAI for complex decisions, e.g., on plans or policies instead of single output labels; conversational XAI that continuously interacts with users over time, aiming to understand and support them; contestable and collaborative XAI, which can work successfully with users in areas where neither user nor AI is omniscient or infallible; and flexible decision-making for XAI, able to adapt to users, respect their autonomy, and go beyond one-size-fits-all explanations. The aim of the seminar is to move towards a shared understanding of the field and to develop a common roadmap for moving it forward.

Copyright Hendrik Baier, Mark T. Keane, Sarath Sreedharan, Silvia Tulli, and Abhinav Verma

Participants

  • David Abel (Google DeepMind - London, GB)
  • Hendrik Baier (TU Eindhoven, NL)
  • Ruth Mary Josephine Byrne (Trinity College Dublin, University of Dublin, IE)
  • Rebecca Eifler (LAAS - Toulouse, FR)
  • Claudia Goldman (The Hebrew University of Jerusalem, IL)
  • Bradley Hayes (University of Colorado - Boulder, US)
  • Tobias Huber (TH Ingolstadt, DE)
  • Mark T. Keane (University College Dublin, IE)
  • Khimya Khetarpal (Google DeepMind - Seattle, US)
  • Benjamin Krarup (King's College London, GB)
  • Pat Langley (ISLE - Palo Alto, US)
  • Simon M. Lucas (Queen Mary University of London, GB)
  • Anna Lukina (TU Delft, NL)
  • Samer Nashed (University of Montreal, CA & MILA - Quebec AI Institute, CA)
  • Sriraam Natarajan (University of Texas at Dallas - Richardson, US)
  • Ann Nowé (Free University of Brussels, BE)
  • Ron Petrick (Heriot-Watt University - Edinburgh, GB)
  • Mark Riedl (Georgia Institute of Technology - Atlanta, US)
  • Silvia Rossi (University of Naples, IT)
  • Wojciech Samek (Fraunhofer HHI - Berlin, DE)
  • Lindsay Sanneman (MIT - Cambridge, US)
  • Julian Siber (CISPA - Saarbrücken, DE)
  • Sarath Sreedharan (Colorado State University - Fort Collins, US)
  • Mohan Sridharan (University of Edinburgh, GB)
  • Silvia Tulli (Sorbonne University - Paris, FR)
  • Stylianos Loukas Vasileiou (Washington University - St. Louis, US)
  • Abhinav Verma (Pennsylvania State University - University Park, US)

Classification
  • Artificial Intelligence

Keywords
  • explainable artificial intelligence
  • XAI
  • sequential decision making