Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.
Translated title of the contributionErforschung der Verwendung des Pronomen Ich in deutschen akademischen Texten mit maschinellem Lernen
Original languageEnglish
Title of host publicationInformatik 2020 - Back to the future : 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual
EditorsRalf H. Reussner, Anne Koziolek, Robert Heinrich
Number of pages7
Place of PublicationBonn
PublisherGesellschaft für Informatik e.V.
Publication date2020
Pages1327-1333
ISBN (electronic)978-3-88579-701-2
DOIs
Publication statusPublished - 2020
Event50th Annual Conference of the German Informatics Society - INFORMATIK 2020: Back to the Future - Online, Karlsruhe, Germany
Duration: 28.09.202002.10.2020
Conference number: 50
https://informatik2020.gi.de/

Bibliographical note

Funding Information:
Melanie Andresen’s work on this paper was funded by the Landesforschungsförderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universitčt Hamburg.

Publisher Copyright:
© 2020 Gesellschaft fur Informatik (GI). All rights reserved.

    Research areas

  • Language Studies - annotation, Academic language, German, machine learning, classification
  • Academic language, Annotation, Classification, German, Machine learning

DOI

Recently viewed

Publications

  1. Biomass derived ionic liquids
  2. where paintings live
  3. Facing complex crime
  4. Where Paintings Live
  5. Fettflecken untersuchen mit Graphical Analysis
  6. Correlation between Isometric Maximum Strength and One Repetition Maximum in the Calf Muscle in Extended and Bended Knee Joint
  7. "to expose, to show, to demonstrate, to inform, to offer. Artistic Practices around 1990"
  8. Mechanisms of dialectical change
  9. Hot deformation behavior and processing map of Mg-3Sn-2Ca-0.4Al-0.4Zn alloy
  10. A(l)gora: the Mindscape
  11. Accumulation and Subjectivity
  12. Briefe schreiben in der Sekundarstufe I
  13. Large trees are keystone structures in urban parks
  14. The development of an eco-label for software products
  15. Preservice teachers’ competency development and opportunities to learn in teaching multilingual learners in Germany
  16. Destinationsmanagement 3.0 – Auf dem Weg zu einem neuen Aufgabenverständnis
  17. What do we know about new venture investment time patterns?
  18. Towards an Extended Enterprise Architecture Meta-Model for Big Data
  19. Mikrounternehmen
  20. Carbocyclic cis-[1.1.1]-tris-σ-homobenzenes - Syntheses by triple epoxide → cyclopropane conversions, structural data, [σ2s+σ2s+σ2s] cycloreversions
  21. Moderators of the ego depletion effect
  22. Initial evidence for a systematic link between core values and emotional experiences in environmental situations
  23. Interventionen im Datenraum
  24. Hydrological tracers for assessing transport and dissipation processes of pesticides in a model constructed wetland system
  25. Mapping and analysing historical indicators of ecosystem services in Germany
  26. Toward a better understanding of the mindsets of negotiators
  27. Researching collaborative interdisciplinary teams
  28. ETL ensembles for chunking, NER and SRL