Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.
Translated title of the contributionErforschung der Verwendung des Pronomen Ich in deutschen akademischen Texten mit maschinellem Lernen
Original languageEnglish
Title of host publicationInformatik 2020 - Back to the future : 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual
EditorsRalf H. Reussner, Anne Koziolek, Robert Heinrich
Number of pages7
Place of PublicationBonn
PublisherGesellschaft für Informatik e.V.
Publication date2020
Pages1327-1333
ISBN (electronic)978-3-88579-701-2
DOIs
Publication statusPublished - 2020
Event50th Annual Conference of the German Informatics Society - INFORMATIK 2020: Back to the Future - Online, Karlsruhe, Germany
Duration: 28.09.202002.10.2020
Conference number: 50
https://informatik2020.gi.de/

Bibliographical note

Funding Information:
Melanie Andresen’s work on this paper was funded by the Landesforschungsförderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universitčt Hamburg.

Publisher Copyright:
© 2020 Gesellschaft fur Informatik (GI). All rights reserved.

    Research areas

  • Language Studies - annotation, Academic language, German, machine learning, classification
  • Academic language, Annotation, Classification, German, Machine learning

DOI

Recently viewed

Researchers

  1. Yuk Hui

Publications

  1. Qualitätssicherung und Entwicklung in der Elementarpädagogik
  2. Information Extraction from Invoices
  3. Integrating indigenous and local knowledge in management and research on coastal ecosystems in the Global South
  4. The bidirectional relationship between ESG performance and earnings management
  5. Probing turbulent superstructures in Rayleigh-Bénard convection by Lagrangian trajectory clusters
  6. Anticipated imitation of multiple agents
  7. Robust Adaptive Soft Landing Control of an Electromagnetic Valve Actuator for Camless Engines
  8. A black box identification in frequency domain
  9. Learning to collaborate while collaborating
  10. Dematerialization
  11. Reiseanalyse 2013:
  12. What a difference a Y makes
  13. Applying standard network analysis to hypermedia systems
  14. The patterns of curriculum change processes that embed sustainability in higher education institutions
  15. Schreiben Englisch
  16. The effect of neighbor species' phylogenetic and trait difference on tree growth in subtropical forests
  17. Special issue: Frameworks for Sustainability Management
  18. Tackling the knowledge-action gap in sustainable consumption
  19. Modelling and simulation of dynamic microstructure evolution of aluminium alloys during thermomechanically coupled extrusion process
  20. High-Volume Resistance Training Improves Double-Poling Peak Oxygen Uptake in Youth Elite Cross-Country Skiers and Biathletes
  21. Machine Learning-Supported Planning of Lead Times in Job Shop Manufacturing
  22. The Right to Liberty and Security, Public Health and Disease Control
  23. How to Limit the Spillover from the 2021 Inflation Surge to Inflation Expectations?
  24. The China puzzle
  25. On the Problems of Honorary Work in German Sports Clubs – A Qualitative-Dominated Crossover Mixed Methods Study