Crowdsourcing Swiss Dialect Transcriptions for Assessing Factors in Writing Variations

Publication: Contributions to collected editions > Articles in conference proceedings > Research > Peer-reviewed

Authors

  • Simon Clematide
  • Karina Frick
  • Noëmi Aeppli
  • Jean-Philippe Goldmann
In this paper, we systematically analyze writing variations of Swiss German in two existing corpora with Standard German glosses: a corpus of 10,000 short text messages and a corpus of transcribed oral history recordings (90,000 tokens). We show that neither resource is sufficient for assessing factors in writing variations of users and describe a data collection project involving a citizen science community for solving this problem. Laymen will independently and redundantly transcribe 1,200 short samples (15-20 seconds) of audio material in Swiss German according to their own best practice.
Original language: English
Title: Proceedings of the 13th Conference on Natural Language Processing (KONVENS): Bochum, Germany, September 19–21, 2016
Editors: Stefanie Dipper, Friedrich Neubarth, Heike Zinsmeister
Number of pages: 6
Place of publication: Bochum
Publisher: Ruhr-Universität Bochum
Publication date: 01.09.2016
Pages: 62-67
Publication status: Published - 01.09.2016
Externally published: Yes
Event: 13th Conference on Natural Language Processing (KONVENS) - Linguistics Department / Ruhr-Universität Bochum, Bochum, Germany
Duration: 19.09.2016 - 21.09.2016
https://www.linguistics.rub.de/konvens16/
