NFDI4DataScience –|National Research Data Infrastructure|for Data Science & Artificial Intelligence


The vision of NFDI4DataScience (NFDI4DS) is to support all steps of the complex and interdisciplinary research data lifecycle, including collecting/creating, processing, analysing, publishing, archiving and reusing resources in Data Science and Artificial Intelligence.
The past years have seen a paradigm shift, with computational methods increasingly relying on data-driven and often deep learning-based approaches, leading to the establishment and ubiquity of Data Science as a discipline driven by advances in the field of Computer Science but being of relevance to most scientific disciplines. Transparency, reproducibility and fairness have become crucial challenges for Data Science and Artificial Intelligence due to the complexity of contemporary Data Science methods, often relying on a combination of code, models and data used for training.
Taking into account the increasing importance of Data Science and Artificial Intelligence methods for Computer Science as well as a broad range of scientific disciplines, NFDI4DS will promote FAIR and open research data infrastructures supporting all involved resources such as code, models, data, or publications through an integrated approach. The overarching objective of NFDI4DS is the development, establishment and sustainment of a national research data infrastructure for the Data Science and Artificial Intelligence community in Germany. This will also deliver benefits for a wider community requiring data analytics solutions, within the NFDI and beyond. The key idea is to work towards increasing the transparency, reproducibility and fairness of Data Science and Artificial Intelligence projects, by making all digital artefacts available, by interlinking them, and by offering innovative tools and services. Based on the reuse of these digital objects, new and innovative research will be enabled.
NFDI4DS intends to represent the Data Science and Artificial Intelligence community in academia, which is an interdisciplinary field rooted in Computer Science. We aim to reuse existing solutions, and to collaborate closely with the other NFDI consortia and beyond. In the initial phase, NFDI4DS will focus on four Data Science intense application areas: language technology, biomedical sciences, information sciences and social sciences. The expertise available in NFDI4DS related to Data Science and Artificial Intelligence, infrastructure development and further domain knowledge ensures that metadata standards are interoperable across domains, and that new ways of dealing with digital objects arise.



This consortium is a cooperation of:

  • Schloss Dagstuhl – Leibniz-Zentrum für Informatik
  • Fraunhofer-Institut für Offene Kommunikationssysteme (FOKUS)
  • Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI) GmbH
  • Fraunhofer-Institut für Angewandte Informationstechnik (FIT)
  • FIZ Karlsruhe – Leibniz-Institut für Informationsinfrastruktur
  • GESIS – Leibniz-Institut für Sozialwissenschaften
  • Hamburger Informatik Technologie-Center (HITeC) e. V.
  • RWTH Aachen
  • Leibniz Universität Hannover
  • TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften
  • Technische Universität Berlin
  • Technische Universität Dresden
  • Universität Leipzig
  • ZB MED – Informationszentrum Lebenswissenschaften
  • ZBW – Leibniz-Informationszentrum Wirtschaft

The following further institutions participate:

  • Alfred-Wegener-Institut (AWI)
  • Universität Bremen
  • Fritz-Haber-Institut der Max-Planck-Gesellschaft
  • Wikimedia Deutschland e.V.




Project Funding

The consortium is funded by the German federation and states as part of the programme to build the National Research Data Infrastructure (NFDI).


Project Duration

Oktober, 2021 – September, 2026

Web links

Project Partners