Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Within this website:
External resources:
Within this website:
External resources:
  • the dblp Computer Science Bibliography

Dagstuhl Seminar 11421

Foundations of distributed data management

( Oct 16 – Oct 21, 2011 )

(Click in the middle of the image to enlarge)

Please use the following short url to reference this page:





The Web has brought fundamentally new challenges to data management. Web data management differs from traditional database management in a number of ways. First, Web data differ in their structure: trees with links (usually described by mark-up languages such as XML) instead of tables. Also, Web data are by nature distributed, often on a large number of autonomous servers. Finally, Web data are typically very dynamic and imprecise.

Unlike for the classical relational database model, there is still no commonly accepted model for data management over the Web. The lack of a clean, simple, mathematical model further prevents us from designing general solutions to typical data management problems, such as building indexes, optimizing queries, and guaranteeing certain properties of applications.

As witnessed by the two seminars that previously occurred in Dagstuhl on this topic (Seminar 01361 in 2001 and Seminar 05061 in 2005, both entitled "Foundations of Semistructured Data"), most of the recent research efforts have concentrated on adapting traditional database techniques to the XML setting. In particular, foundational research on XML focused on the tree structure of XML documents, applying well-developed techniques based on logic and automata for trees. These lines of research have been very successful. However, they do not address all the facets of Web data. In particular distribution, dynamicity, incompleteness and reliability had received limited attention in past work, but play a central role in a Web setting. The aim of Seminar 11421 was to bring together researchers covering this spectrum of relevant areas, to report on recent progress in terms of both results as well as new, relevant research questions. It was organized at the initiative of members of the EU funded research projects FoX ( and Webdam ( that are acknowledged for their support.

The workshop brought together 51 researchers from complementary areas of database theory, logic, and theoretical computer science in general, all with an established record of excellence in Web data management. The participant pool comprised both senior and junior researchers, including several advanced PhD students.

Participants were invited to present their own work, and/or survey state-of-the-art advances and challenges in the field. Thirty-four talks were given, which included four (60-90 minute) tutorials and thirty regular (30 minute) talks. All presentations were scheduled prior to the workshop, and due to the flood of volunteered talks, the organizers had to cap the number of slots. Talks were chosen so as to represent well the aspects of Web data management described above. The talks are listed below, classified by the covered topics. The classification is necessarily rough, as many talks crossed the boundaries between areas, in keeping with the seminar's intent. To the organizers' pleasant surprise, some of the results established surprising bridges between fields previously seen as unrelated (such as Machine Learning and Data Exchange), and brought in techniques from novel areas (such as Nominal Sets).

Due to the rich coverage of the area of foundations of Web data management, as achieved by both the presentations and the informal interactions, the organizers regard the seminar as a great success. The weeklong format was well-suited to such an ambitious topic. The topic was well-received, as witnessed by the high rate of accepted invitations, and the exemplary degree of involvement by the paricipants. These volunteered such a high number of exceptional-quality talks that the organizers were faced with not being able to accommodate demand. Bringing together researchers from different areas of data management, programming languages, theoretical computer science and logic fostered valuable interactions and led to fruitful collaborations, as reflected also by the very positive feedback from the audience. The organizers wish to express their gratitude toward the Scientific Directorate of the Center for its support of this seminar, and hope to continue this seminar series on Web data management.

  • Serge Abiteboul (ENS - Cachan, FR) [dblp]
  • Tom Ameloot (Hasselt University - Diepenbeek, BE)
  • Emilien Antoine (University of Paris South XI, FR)
  • Timos Antonopoulos (Hasselt University - Diepenbeek, BE)
  • Marcelo Arenas (Pontificia Universidad Catolica de Chile, CL) [dblp]
  • Pablo Barcelo (University of Chile - Santiago de Chile, CL) [dblp]
  • Meghyn Bienvenu (University Paris-Sud, FR) [dblp]
  • Mikolaj Bojanczyk (University of Warsaw, PL) [dblp]
  • Pierre Bourhis (University of Oxford, GB) [dblp]
  • Claire David (University Paris-Est - Marne-la-Vallée, FR) [dblp]
  • Alin Deutsch (University of California - San Diego, US) [dblp]
  • Diego Figueira (University of Edinburgh, GB) [dblp]
  • Robert Fink (University of Oxford, GB)
  • Amélie Gheerbrant (University of Edinburgh, GB) [dblp]
  • Giorgio Ghelli (University of Pisa, IT)
  • Florent Jacquemard (ENS - Cachan, FR) [dblp]
  • Ahmet Kara (TU Dortmund, DE) [dblp]
  • Wojtek Kazana (ENS - Cachan, FR)
  • Evgeny Kharlamov (Free University of Bozen-Bolzano, IT)
  • Pekka Kilpeläinen (University of Kuopio, FI)
  • Christoph Koch (EPFL - Lausanne, CH) [dblp]
  • Phokion G. Kolaitis (University of California - Santa Cruz, US) [dblp]
  • Slawomir Lasota (University of Warsaw, PL) [dblp]
  • Leonid Libkin (University of Edinburgh, GB) [dblp]
  • Sebastian Maneth (Universität Leipzig, DE) [dblp]
  • Wim Martens (Universität Bayreuth, DE) [dblp]
  • Maarten Marx (University of Amsterdam, NL)
  • Tova Milo (Tel Aviv University, IL) [dblp]
  • Filip Murlak (University of Warsaw, PL) [dblp]
  • Anca Muscholl (University of Bordeaux, FR) [dblp]
  • Frank Neven (Hasselt University - Diepenbeek, BE) [dblp]
  • Matthias Niewerth (TU Dortmund, DE) [dblp]
  • Dan Olteanu (University of Oxford, GB) [dblp]
  • Pawel Parys (University of Warsaw, PL)
  • Juan L. Reutter (University of Edinburgh, GB) [dblp]
  • Marie-Christine Rousset (University of Grenoble, FR)
  • Anne Schuth (University of Amsterdam, NL)
  • Nicole Schweikardt (Goethe-Universität - Frankfurt a. M., DE) [dblp]
  • Thomas Schwentick (TU Dortmund, DE) [dblp]
  • Luc Segoufin (ENS - Cachan, FR) [dblp]
  • Helmut Seidl (TU München, DE) [dblp]
  • Pierre Senellart (Telecom Paris Tech, FR) [dblp]
  • Cristina Sirangelo (ENS - Cachan, FR)
  • Tony Tan (University of Edinburgh, GB)
  • Balder Ten Cate (University of California - Santa Cruz, US) [dblp]
  • Sophie Tison (Lille I University, FR) [dblp]
  • Szymon Torunczyk (ENS - Cachan, FR) [dblp]
  • Jan Van den Bussche (Hasselt University - Diepenbeek, BE) [dblp]
  • Stijn Vansummeren (University of Brussels, BE) [dblp]
  • Domagoj Vrgoc (University of Edinburgh, GB) [dblp]
  • Thomas Zeume (TU Dortmund, DE) [dblp]

  • Data management
  • Distributed Processing
  • Web
  • Internet

  • XML
  • query language
  • distribution
  • incompleteness