Update: Feb. 18, 2014
14.00 – 14.30 h |
Felix Bildhauer & Roland Schäfer (Freie Universität Berlin) |
Web data as a challenge for theoretical linguistics and corpus design (Introduction) |
14.30 – 15.00 h |
Sonja Müller (Universität Bielefeld) |
How webdata can challenge traditional generalizations: a case study of the order of modal particles in German |
15.00 – 15.30 h |
Susanne Flach (Freie Universität Berlin) |
Solving the rare phenomenon problem? 'Quasi-serial' verb constructions in English |
15.30 – 16.00 h |
Dirk Goldhahn & Uwe Quasthoff (Universität Leipzig) |
Using corpus-based statistics for linguistic typology |
16.00 – 16.30 h |
Coffee break |
|
16.30 – 17.00 h |
Adrien Barbaresi (ENS Lyon) |
For a few points more: improving decision processes in web corpus construction |
17.00 – 17.30 h |
Lea Helmers (Freie Universität Berlin) |
Named entity recognition on German web corpora |
17.30 – 18.00 h |
Vladimír Benko (Slovak Academy of Sciences) |
Near-duplicate data in web corpora |
18.00 h |
Jack Grieve (Aston University), Asnaghi Costanza (Università Cattolica del Sacro Cuore) & Tom Ruette (Humboldt-Universität zu Berlin) |
Googleology is good science |
9.00 – 10.00 h |
Stefan Evert (invited talk) |
An NLP approach to the evaluation of web corpora |
10.00 – 10.30 h |
Ines Rehbein (Universität Potsdam) |
Using Twitter for linguistic purposes – three case studies |
10.30 – 11.00 h |
Kazuya Abe (Atomi University) |
Twitter corpus and collection of German phrases |
11.00 – 11.30 h |
Coffee break |
|
11.30 – 12.00 h |
Tom Ruette (Humboldt-Universität zu Berlin) & Jack Grieve (Aston University) |
Cognitive sociolinguistics with Twitter: why do the Dutch swear with diseases? |
12.00 h |
Peter Grube (Martin-Luther-Universität Halle-Wittenberg) |
A diachronic corpus of personal weblogs: possibilities and current constraints |