TOP
Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Seminars
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Publishing
Within this website:
External resources:
dblp
Within this website:
External resources:
  • the dblp Computer Science Bibliography


Research Meeting 15273

Retreat SFB 1102: Information Density and Linguistic Encoding

( Jun 28 – Jun 30, 2015 )

(Click in the middle of the image to enlarge)

Permalink
Please use the following short url to reference this page: https://www.dagstuhl.de/15273

Organizer

Contact

Motivation

Beim ersten Retreat im Juni 2015 kommen alle rund 40 Mitglieder zusammen, die im SFB 1102 die Informationsdichte von sprachlichen Äußerungen erforschen. Im Rahmen der dreitägigen Veranstaltung werden erste Forschungsergebnisse vorgestellt und diskutiert. Dieser rege Austausch der Wissenschaftlerinnen und Wissenschaftler aus allen 14 SFB-Teilprojekten bietet neben neuen Impulsen für die Forschung auch die Gelegenheit, die ersten Monate intensiver Zusammenarbeit im SFB 1102 in der gesamten Gruppe noch einmal Revue passieren zu lassen.


Zum SFB 1102: Information Density and Linguistic Encoding

Language provides not only the expressiveness needed to communicate, but also offers speakers a multitude of choices regarding how they may encode their messages – from the choice of words, structuring of syntactic elements, and arranging sentences in discourse. The CRC addresses the hypothesis that language variation and language use can be better understood in terms of the goal of speakers to modulate the amount of information conveyed in an utterance. While previous efforts have sought to understand language systems and their use in terms of complexity, the definition of this notion is often imprecise and specific to particular linguistic levels. Recently, however, there is evidence that the ease of processing linguistic material is correlated with its contextually determined predictability. This has lead to the hypothesis that complexity may be appropriately indexed by Shannon’s notion of information, referred to in recent linguistic work as surprisal. The CRC investigates the hypothesis that (i) processing complexity is indexed by surprisal across linguistic levels, and (ii) that variation in language use may be characterised by the optimal distribution of information across the linguistic signal. Under this view, speakers exploit possible variation in their linguistic encoding – modulating the order, density and specificity of their expressions – so as to avoid informational peaks and troughs that result in inefficient communication. This view naturally extends to all aspects and levels of linguistic communication, thus offering the potential for a deeper understanding of the relationship between the nature of variation offered by our linguistic systems and the way it is exploited in language use. Crucially, just as the surprisal of linguistic material can be determined at different levels of granularity, from phonemes to phrases to entire propositions, so do speakers have encoding choices that span these levels – from varying properties of the acoustic realisation to the broader structuring of the discourse. The aim of the CRC is thus to investigate the extent to which notions of surprisal and the optimal distribution of information offer a unifying explanation of observed patterns of variation in language use within and across linguistic levels, and in a range of communicative settings.


Related Seminars
  • Research Meeting 16283: Retreat SFB 1102: Information Density and Linguistic Encoding (2016-07-13 - 2016-07-15) (Details)
  • Research Meeting 18143: Retreat SFB 1102: Information Density and Linguistic Encoding (2018-04-03 - 2018-04-06) (Details)