http://www.dagstuhl.de/17301

July 23 – 28 , 2017, Dagstuhl Seminar 17301

User-Generated Content in Social Media

Organizers

Tat-Seng Chua (National University of Singapore, SG)
Norbert Fuhr (Universität Duisburg-Essen, DE)
Gregory Grefenstette (INRIA Saclay – Île-de-France – Gif sur Yvette, FR)
Kalervo Järvelin (University of Tampere, FI)
Jaakko Peltonen (Aalto University, FI)

For support, please contact

Jutka Gasiorowski for administrative matters

Marc Herbstritt for scientific matters

Documents

Dagstuhl Seminar Schedule (Upload here)

(Use seminar number and access code to log in)

Motivation

Social media play a central role in many people’s lives, and they also have a profound impact on businesses and society. Users post vast amounts of content (text, photos, audio, video) every minute. This user generated content (UGC) has become increasingly multimedia in nature. It documents users’ lives, revealing in real time their interests and concerns and activities in society. The analysis of UGC can offer insights to individual and societal concerns and could be beneficial to a wide range of applications, for example, tracking mobility in cities, identifying citizen’s issues, opinion mining, and much more.

In contrast to classical media, social media thrive by allowing anyone to publish content with few constraints and no oversight. Social media posts thus show great variation in length, content, quality, language, speech and other aspects. This heterogeneity poses new challenges for standard content access and analysis methods. On the other hand, UGC is often related to other public information (e.g. product reviews or discussion of news articles), and there often is rich contextual information linking, which allows for new types of analyses.

In this seminar, we aim at discussing the specific properties of UGC, the general research tasks currently operating on this type of content, identifying their limitations and lacunae, and imagining new types of applications made possible by the availability of vast amounts of UGC.

We will identify specific properties of UGC such as presentation quality and style, bias and subjectivity of content, credibility of sources, contradictory statements, and heterogeneity of language and media.

We will discuss current applications exploiting UGC, like e. g. sentiment analysis, noise removal, indexing and retrieving UGC, recommendation and selection methods, summarization methods, credibility and reliability estimation, topic detection and tracking, topic development analysis and prediction, community detection, modeling of content and user interest trends, collaborative content creation, cross media and cross lingual analysis, multi-source and multi-task analysis, social media sites. Live and real-time analysis of streaming data, and machine learning for big data analytics of UGC. These applications and methods involve contributions from several data analysis and machine learning research directions.

We will imagine new applications exploiting UGC in areas such as e.g. marketing, political campaigns, crisis management, eHealth, customer reviews for shopping, socio-political analyses, smart city, user mobility analysis, and user profiling.

The general goal of this seminar is to collect, discuss and understand the state-of-the-art in research on UGC, to identify unresolved problems and to define a research agenda for further work in this area. We are especially interested in properties, tasks, and applications that span multiple content types or different social media systems.

About three months before the seminar, we will ask invited participants to fill out an online questionnaire, describing their specific interest in the topic, naming crucial and urgent research issues, stating their willingness to give a survey talk or give a more specific presentation, as well as proposing working group topics. Based on this input, we will schedule about 4-5 longer survey talks, each followed by a few short, specialized presentations from the respective areas. In addition, we will reserve at least two half days for working group sessions on topics chosen by the participants.

License
  Creative Commons BY 3.0 DE
  Tat-Seng Chua and Norbert Fuhr and Gregory Grefenstette and Kalervo Järvelin and Jaakko Peltonen

Classification

  • Data Bases / Information Retrieval
  • Multimedia
  • World Wide Web / Internet

Keywords

  • Social media
  • Information extraction
  • Multimedia retrieval and annotation
  • Trend detection
  • E-reputation

Book exhibition

Books from the participants of the current Seminar 

Book exhibition in the library, ground floor, during the seminar week.

Documentation

In the series Dagstuhl Reports each Dagstuhl Seminar and Dagstuhl Perspectives Workshop is documented. The seminar organizers, in cooperation with the collector, prepare a report that includes contributions from the participants' talks together with a summary of the seminar.

 

Download overview leaflet (PDF).

Publications

Furthermore, a comprehensive peer-reviewed collection of research papers can be published in the series Dagstuhl Follow-Ups.

Dagstuhl's Impact

Please inform us when a publication was published as a result from your seminar. These publications are listed in the category Dagstuhl's Impact and are presented on a special shelf on the ground floor of the library.