06.07.14 - 09.07.14, Seminar 14282

Crowdsourcing and the Semantic Web

This seminar description was published on our web pages before the seminar and was used in the invitation to the seminar.

Motivation

Semantic technologies provide flexible and scalable solutions to master and make sense of an increasingly vast and complex data landscape. However, while this potential has been acknowledged for various application scenarios and domains, and a number of success stories exist, it is equally clear that the development and deployment of semantic technologies will always remain reliant on human input and intervention. This is due to the very nature of some of the tasks associated with the semantic data management life cycle, which are known for their knowledge-intensive and/or context-specific character; examples range from conceptual modeling in almost any flavor, to labeling resources (in different languages), describing their content using ontological terms, or recognizing similar concepts and entities. For this reason, the Semantic Web community has always looked into applying the latest theories, methods and tools from CSCW, participatory design, Web 2.0, social computing, and, more recently, crowdsourcing to find ways to engage with users and encourage their involvement in the execution of technical tasks. Existing approaches include the use of wikis as semantic content authoring environments and the leveraging of folksonomies to create formal ontologies, as well as human computation approaches such as games with a purpose or micro-tasks.

The seminar will focus on three categories of topics. First and foremost, we aim to look into existing crowdsourcing approaches and how these could be, or have been, applied to solve traditional semantic data management tasks. Particular attention will be paid to the core components of a crowdsourcing-enabled data management and processing system, including methods for quality assurance and spam detection, resource, task and workflow management, as well as interfaces, and the way these components can be assembled into coherent frameworks. A second category of topics to be addressed during the seminar reaches out to other disciplines such as economics, social sciences, and design, with the aim of understanding how theories and techniques from these fields could be used to build better crowdsourcing-enabled data management systems for the Semantic Web. Last, but not least, we will discuss the use of semantic technologies within generic crowdsourcing scenarios, most notably as a means to describe data, resources and specific components.

The seminar organizers have the ambition of making this a most influential workshop; we aim to lay the foundations for a scientific community at the intersection of crowdsourcing and semantic technologies. The goal of the seminar is to shape the evolution and further development of this emerging community by devising a research roadmap that will outline the future of the field, and by publishing a special issue in a high-quality journal, or an edited book, summarizing the most important lines of research and the results of our interactions during the seminar. To achieve these goals, the seminar will proceed as follows:

  • Presentations by the participants on what they perceive to be the greatest challenges to a successful confluence of the Semantic Social Web and crowdsourcing;
  • An affinity mapping exercise to group and order the challenges along a variety of dimensions yet to be identified;
  • A writing session where the participants jointly compose a first draft of the roadmap.

Besides a comprehensive overview of the scientific challenges which have so far remained underexplored, the results of this community-building exercise should also include a collection of basic vocabularies and services that Semantic Web researchers could exploit in order to build effective crowd-sourced and crowd-moderated systems. As a community which values technologies for open access and interoperability, we need to work towards the definition of metadata vocabularies to describe data created through crowdsourcing, and to publish this data for further reuse and repurposing. This seminar, in its community-formative role, could be the starting point for the emergence of working groups that would jointly address such problems. The first two days of the seminar will be dedicated to presentations and working groups on topics related to challenges identified during the talks and Q&A sessions. The third day will focus on consolidating the results of the working groups in written form and on defining next steps and follow-up activities.