TOP
Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Seminars
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Publishing
Within this website:
External resources:
dblp
Within this website:
External resources:
  • the dblp Computer Science Bibliography


Dagstuhl Seminar 24102

Shapes in Graph Data: Theory and Implementation

( Mar 03 – Mar 08, 2024 )

Permalink
Please use the following short url to reference this page: https://www.dagstuhl.de/24102

Organizers

Contact

Dagstuhl Seminar Wiki

Shared Documents

Schedule
  • Upload (Use personal credentials as created in DOOR to log in)

Motivation

Over the past decade, graph databases such as RDF and property graphs have gained significant traction. The time is ripe to bring people together around shapes, and, more generally, flexible, and expressive schema and constraint languages for graph databases. We explain our motivation in what follows. One of the main reasons for the success of graph databases is that they do not require an elaborate database schema, with accompanying integrity constraints, to be set up in advance. Of course, the principles of conceptual and logical database design remain as valuable as ever for critical enterprise applications. In these classical applications, constraints and schemas are mainly descriptive, having as purpose to support the mental map from the real world to the data to be managed in the database. In contrast, the emergence of graph databases is accompanied by a paradigm shift towards new applications where schemas and constraints are used for a prescriptive purpose. Here, the goal is to establish a contract between the database and its users, which provides guarantees on the structure and form of data provided, and imposes restrictions required for data governance. There is no single schema; instead, schemas are developed “as-you-go” and are adapted depending on the usage of the data.

The need for flexible languages for writing prescriptive schemas was felt rather quickly. Such languages need to have a sound formal underpinning, if they are to be used for improving and ensuring data quality, and for static analysis and verification of graph database transformations. Soon, a new class of formalisms based on the notion of shape emerged. Shapes are constraints on nodes in the data graph that impose or forbid structural patterns (involving paths, edges, labels, and constant values). Importantly, shapes can refer to other shapes. Naturally, then, a novel, prescriptive notion of schema emerges, consisting of a set of shapes, together with an assignment or targeting mechanism that specifies which nodes should satisfy which shapes. In the world of RDF graphs, two main shape-based formalisms have been proposed: SHACL (Shapes Constraint Language), standardized by the W3C, and ShEx (Shape Expression schemas). In the world of property graphs, different systems have their own data definition languages, such as Cypher or GSQL. The ISO/IEC working group is currently engaged in standardizing property graphs within the SQL/PGQ and GQL project. Together with the Linked Data Benchmark Council, they are developing property graph schema formalisms that feature a shape-based type system.

Our aim is to bring together the leading researchers on shapes, schemas, and constraints for graph data, both from academia and industry, to discuss the many open problems. The purpose of this Dagstuhl Seminar is to inform each other on how we perceive the research area; to report on brand new results; to discuss open problems and future directions; and to initiate new research.

Focus topics vary along different research axes that arise from shapes, constraints, and schemas for graph databases, and especially on:

  • Expressive power and complexity
  • Implementation and processing strategies
  • Automated reasoning about shapes
  • Explaining and handling violations

Participants are encouraged to provide details regarding their research interests and preferred discussion topics before the seminar starts. Furthermore, it is suggested that they explore the information shared by the other participants beforehand.

Copyright Shqiponja Ahmetaj, Slawomir Staworko, and Jan Van den Bussche

Participants

Classification
  • Databases
  • Logic in Computer Science

Keywords
  • data for the Semantic Web
  • schema languages
  • constraint languages
  • graph data