Dagstuhl-Seminar 26272: Open Music Data for Music Processing Research

Dagstuhl-Seminar 26272

Open Music Data for Music Processing Research

( 28. Jun – 03. Jul, 2026 )

(zum Vergrößern in der Bildmitte klicken)

Permalink

Bitte benutzen Sie folgende Kurz-Url zum Verlinken dieser Seite: https://www.dagstuhl.de/26272

Organisatoren

Magdalena Fuentes (New York University - Brooklyn, US)
Dasaem Jeong (Sogang University - Seoul, KR)
Meinard Müller (Universität Erlangen-Nürnberg, DE)

Kontakt

Michael Gerke (für wissenschaftliche Fragen)
Christina Schwarz (für administrative Fragen)

Dagstuhl Reports

As part of the mandatory documentation, participants are asked to submit their talk abstracts, working group results, etc. for publication in our series Dagstuhl Reports via the Dagstuhl Reports Submission System.

Upload (Use personal credentials as created in DOOR to log in)

Dagstuhl Seminar Wiki

Dagstuhl Seminar Wiki (Use personal credentials as created in DOOR to log in)

Gemeinsame Dokumente

Dagstuhl Materials Page (Use personal credentials as created in DOOR to log in)

Programm

Programm
Upload (Use personal credentials as created in DOOR to log in)

Motivation

Show Motivation

Over the past decades, Music Information Retrieval (MIR) has developed into a multidisciplinary field connecting signal processing, machine learning, musicology, and the digital humanities. MIR engages with melody, harmony, rhythm, timbre, and cultural diversity across audio recordings, symbolic scores, lyrics, videos, and metadata. At its core, MIR is data-driven: progress depends on reliable, diverse, and representative datasets. Yet despite advances in artificial intelligence and deep learning, open and sustainable music data resources remain scarce, fragmented, and difficult to share.

This Dagstuhl Seminar addresses one of the central challenges in the field: how to build a more open, reliable, and inclusive ecosystem for music data. While computer vision and natural language processing benefit from large-scale benchmark datasets, MIR still faces persistent barriers. Existing datasets are often narrow in scope, focusing on Western or popular music while neglecting other traditions. Copyright restrictions, unstable hosting platforms, and inconsistent annotations further hinder accessibility, reproducibility, and sustainability. These issues not only slow progress but also reinforce inequalities, as groups with privileged data access gain advantages while newcomers and underrepresented communities are left behind.

The seminar aims to bring together researchers, developers, educators, and practitioners from MIR, machine learning, and the computational humanities. Key topics include:

Complexity and Representation: Capturing the richness of music and aligning multimodal data such as audio, symbolic, and textual sources.
Annotation and Bias: Developing reliable annotation practices, addressing subjectivity, and mitigating cultural and stylistic bias.
Legal and Ethical Barriers: Navigating copyright and licensing while considering the roles of public domain, Creative Commons, and synthetic music data.
Reproducibility and Sustainability: Building infrastructures, standards, and documentation practices for long-term usability.
Community and Collaboration: Creating shared frameworks, open-source tools, and recognition mechanisms such as citation standards, dataset papers, and community awards that properly value dataset curation and foster inclusivity.

The seminar will emphasize discussion and collaboration over formal presentations. Plenary sessions, breakout groups, and hands-on demos will provide space to exchange perspectives, present tools and datasets, and explore solutions. Creative and social activities, including informal music-making, will strengthen community bonds and highlight the cultural dimensions of music research.

The seminar aims to define practical steps toward more transparent, sustainable, and inclusive music data. By connecting expertise across disciplines, we hope to lay the groundwork for lasting resources that strengthen research and creativity in MIR and beyond.

Creative Commons BY 4.0

Magdalena Fuentes, Dasaem Jeong, and Meinard Müller

LZI Junior Researchers

Show LZI Junior Researchers

This seminar qualifies for Dagstuhl's LZI Junior Researchers program. Schloss Dagstuhl wishes to enable the participation of junior scientists with a specialisation fitting for this Dagstuhl Seminar, even if they are not on the radar of the organizers. Applications by outstanding junior scientists are possible until Friday, November 21, 2025.

Teilnehmer

Zeige Teilnehmer

Please log in to DOOR to see more details.

Stefan Balke
Axel Berndt
Rachel Bittner
Míklós Both
Carlos Eduardo Cancino-Chacon
Rafael Caro Repetto
Seungheon Doh
Hao-Wen Dong
Magdalena Fuentes
Mark Gotham
Masataka Goto
Edward Guo
Johannes Hentschel
Cheng-Zhi Anna Huang
Dasaem Jeong
Anna Kruspe
Stefan Lattner
Cynthia Liem
Brian McFee
Meinard Müller
Juhan Nam
Néstor Nápoles López
Emilia Parada Cabaleiro
Genís Plaja-Roglans
Martín Rocamora
Xavier Serra
Sebastian Strahl
Bob L. T. Sturm
Li Su
Christof Weiß
Huan Zhang

Klassifikation

Databases
Machine Learning
Sound

Schlagworte

music information retrieval
audio signal processing
deep learning
open source
user interaction and interfaces

Seminar 26272

Suche auf der Schloss Dagstuhl Webseite

Schloss Dagstuhl Services

Seminare

Innerhalb dieser Seite:

Externe Seiten:

Publishing

Innerhalb dieser Seite:

Externe Seiten:

dblp

Innerhalb dieser Seite:

Externe Seiten:

Dagstuhl-Seminar 26272

Open Music Data for Music Processing Research

( 28. Jun – 03. Jul, 2026 )

Permalink

Organisatoren

Kontakt

Dagstuhl Reports

Dagstuhl Seminar Wiki

Gemeinsame Dokumente

Programm

Motivation

LZI Junior Researchers

Teilnehmer

Klassifikation

Schlagworte