https://www.dagstuhl.de/18403

September 30 – October 5 , 2018, Event 18403

Interoperability of Metadata Standards in Cross-Domain Science, Health, and Social Science Applications

Organizers

Simon Cox (CSIRO – South Clayton, AU)
Simon Hodson (CODATA – Paris, FR)
Steven McEachern (Australian National University – Canberra, AU)
Joachim Wackerow (GESIS – Mannheim, DE)

For support, please contact

Heike Clemens

Documents

External Homepage

Press Room

Details

Short Description

The goal of the workshop is a better understanding of how metadata specifications can be aligned to support cross-discipline (or cross domain) data integration and analysis, with a consideration of standards from the social, health and environmental sciences, along with generic all-purpose metadata standards. It will do so with specific reference on the one hand to the standards listed below and on the other hand to specific case studies in infectious disease, disaster risk and resilient cities.

Background

Standards are a vital tool enabling integration and semantic linking of data within and between disciplines. However, standards tend to get developed and adopted within disciplines or application domains with little consideration of cross-discipline requirements and technologies, so data integration can often only be easily achieved within and between closely allied fields. Addressing global scientific challenges that depend on cross-discipline integration remains difficult. The challenge is to make cross-discipline data integration a routine aspect of data-driven science.

Metadata support data discovery, selection, access and use, and are critical for data integration. Data from different sources/domains should be described in a way that cross-discipline discovery can detect and access the relevant data collections, and so that transformations and analyses can be automated. The use of cross-discipline data should become efficient, scalable and reproducible, enabling discipline-neutral data processing and analysis tools to be applied. Furthermore it would be possible to apply (meta-)data mining approaches and reasoning. In sum, new opportunities of insights and realization will develop.

A CODATA initiative on interdisciplinary data integration is seeking to explore these challenges and opportunities in relation to three specific case studies in interdisciplinary research into infectious disease outbreaks, disaster risk and resilient cities. These case studies provide a concrete focus for exploring the potential of interoperability and data integration through metadata alignment.

Focus of the Workshop

The workshop will build on a platform provided particularly by the following activities: (i) two previous workshops on DDI and interoperability with other specifications ( 2016, 2015 ), (ii) work to extend and refine DCAT by the W3C Dataset Exchange Working Group ( DXWG ), and (iii) the three detailed case studies and pilots from the CODATA initiative mentioned above. Metadata activities in the Research Data Alliance provide additional background and context.

There are several different areas where metadata comes into play:

  • Description of studies or data collections for discovery purposes.
  • Descriptions of provenance and scientific context for purposes.
  • Description of data variables or dimensions for analysis purposes.
  • Description of data transformation steps for recording purposes (possibly also for reusing the transformation steps on similar data).
  • Controlled vocabularies to ensure standardized and agreed concepts (in relation to variables, collections, measurements, techniques and procedures etc.).

The capability to express discoverable and structured metadata must be automatic and achieved as far as possible using tools that are familiar and in common use.

Topics for Discussion and Possible Outcomes

Areas of exploration and discussion will identify and describe following:

  • Common rules for metadata specifications
  • Advantages and limitations of generic approaches
  • Techniques for profiling or specializing generic standards for specific applications
  • Best practices for setting up domain-specific data/metadata for cross-domain use
  • Controlled Vocabularies, domain-independent and useful domain-specific ones
  • Contact points/overlaps of specifications, crosswalks and transformations
  • Identification of gaps. Possible workarounds, possible areas for future specifications

The output of the workshop will likely be reports and working documents on one or more of these topics.

Motivation text license
  Creative Commons BY 3.0 DE
  Simon Cox, Simon Hodson, Steven McEachern, and Joachim Wackerow

Event Series

Online Publications

We offer several possibilities to publish the results of your event. Please contact publishing(at)dagstuhl.de if you are interested.

Dagstuhl's Impact

Please inform us when a publication was published as a result from your seminar. These publications are listed in the category Dagstuhl's Impact and are presented on a special shelf in the library.