Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.
Translated title of the contributionErforschung der Verwendung des Pronomen Ich in deutschen akademischen Texten mit maschinellem Lernen
Original languageEnglish
Title of host publicationInformatik 2020 - Back to the future : 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual
EditorsRalf H. Reussner, Anne Koziolek, Robert Heinrich
Number of pages7
Place of PublicationBonn
PublisherGesellschaft für Informatik e.V.
Publication date2020
Pages1327-1333
ISBN (electronic)978-3-88579-701-2
DOIs
Publication statusPublished - 2020
Event50th Annual Conference of the German Informatics Society - INFORMATIK 2020: Back to the Future - Online, Karlsruhe, Germany
Duration: 28.09.202002.10.2020
Conference number: 50
https://informatik2020.gi.de/

Bibliographical note

Funding Information:
Melanie Andresen’s work on this paper was funded by the Landesforschungsförderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universitčt Hamburg.

Publisher Copyright:
© 2020 Gesellschaft fur Informatik (GI). All rights reserved.

    Research areas

  • Language Studies - annotation, Academic language, German, machine learning, classification
  • Academic language, Annotation, Classification, German, Machine learning

DOI

Recently viewed

Researchers

  1. Anke Haarmann

Publications

  1. Ideological Foundations of Perceived Contract Breach Associated With Downsizing
  2. Hot deformation behavior and processing map of Mg-3Sn-2Ca-0.4Al-0.4Zn alloy
  3. Modernisierung und Partizipation
  4. Exploring intrinsic, instrumental and relational values for sustainable management of social-ecological systems
  5. New methods for the analysis of links between international firm activities and firm performance
  6. FRAMEWORK CONDITIONS AND STRATEGIES FOR ATTRACTING YOUNG WOMEN TO ENGINEERING IN TIMES OF DIGITAL AND GLOBAL TRANSFORMATION
  7. Determinants of promotions in an internal labour market
  8. Die geometry influence on the texture and microstructure development during extrusion of AZ31 and ZK60 magnesium alloy chips
  9. CSR and tax avoidance: A review of empirical research
  10. Meta-analytic cointegrating rank tests for dependent panels
  11. Messung von Markenvorstellungen
  12. Plant functional trait response to environmental drivers across European temperate forest understorey communities
  13. Workshop: 20 years health promotion research in and on settings
  14. Psychological distance modulates goal-based versus movement-based imitation
  15. Intentionalisten vs. Strukturalisten
  16. Biodegradability of some antibiotics, elimination of the genotoxicity and affection of wastewater bacteria in a simple test
  17. Die Erinnerung im Gepäck
  18. One-third Codetermination at Company Supervisory Boards and Firm Performance in German Manufacturing Industries
  19. Effectiveness of One Videoconference-Based Exposure and Response Prevention Session at Home in Adjunction to Inpatient Treatment in Persons With Obsessive-Compulsive Disorder
  20. Flexible and Adaptable Restoration
  21. Sowing different mixtures in dry acidic grassland produced priority effects of varying strength
  22. Robert Walser lieben
  23. Consequence evaluations and moral concerns about climate change
  24. Logistik-Controlling
  25. Tourismusräume
  26. Including software aspects in green IT
  27. Influence of Torsion on Precipitation and Hardening Effects during Aging of an Extruded AZ91 Alloy
  28. Timing, fragmentation of work and income inequality
  29. Digital health literacy and information-seeking on the internet in relation to COVID-19 among university students in Greece
  30. The Psychological Study of Positive Behavior Across Group Boundaries
  31. Starker Bär und schneller Hirsch
  32. Theorizing path dependence
  33. Tree diversity promotes generalist herbivore community patterns in a young subtropical forest experiment
  34. On the Power of an Open Scientific Approach to Actions
  35. Reading the 2011 Riots
  36. Teacher collaboration, inclusive education and differentiated instruction
  37. Grassroots Innovations for Inclusive Development
  38. Diffusion of environmental management accounting for cleaner production
  39. The programme on ecosystem change and society (PECS) – a decade of deepening social-ecological research through a place-based focus
  40. Differences in isoprenoid-mediated energy dissipation pathways between coastal and interior Douglas-fir seedlings in response to drought
  41. Soziologische Aspekte des Spiels
  42. The shadow of the family
  43. Results from the project 'Acceptance of CO2 capture and storage