Crowdsourcing Swiss Dialect Transcriptions for Assessing Factors in Writing Variations

Research output: Contributions to collected editions/works > Article in conference proceedings > Research > peer-review

Authors

  • Simon Clematide
  • Karina Frick
  • Noëmi Aeppli
  • Jean-Philippe Goldmann
In this paper, we systematically analyze writing variations of Swiss German in two existing corpora with standard German glosses, a corpus of 10,000 short text messages and a corpus of transcribed oral history recordings (90,000 tokens). We show that neither resource is sufficient for assessing factors in writing variations of users and describe a data collection project involving a citizen science community for solving this problem. Laymen will independently and redundantly transcribe 1,200 short samples (15-20 seconds) of audio material in Swiss German according to their own best practice.
Original language: English
Title of host publication: Proceedings of the 13th Conference on Natural Language Processing (KONVENS): Bochum, Germany, September 19–21, 2016
Editors: Stefanie Dipper, Friedrich Neubarth, Heike Zinsmeister
Number of pages: 6
Place of Publication: Bochum
Publisher: Ruhr-Universität Bochum
Publication date: 01.09.2016
Pages: 62-67
Publication status: Published - 01.09.2016
Externally published: Yes
Event: 13th Conference on Natural Language Processing (KONVENS) - Linguistics Department / Ruhr-Universität Bochum, Bochum, Germany
Duration: 19.09.2016 - 21.09.2016
https://www.linguistics.rub.de/konvens16/
