Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Standard

Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. / Andresen, Melanie; Knorr, Dagmar.
Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. ed. / Ralf H. Reussner; Anne Koziolek; Robert Heinrich. Bonn: Gesellschaft für Informatik e.V., 2020. p. 1327-1333 (Lecture Notes in Informatics (LNI) – Proceedings; Vol. P307).

Harvard

Andresen, M & Knorr, D 2020, Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. in RH Reussner, A Koziolek & R Heinrich (eds), Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. Lecture Notes in Informatics (LNI) – Proceedings, vol. P307, Gesellschaft für Informatik e.V., Bonn, pp. 1327-1333, 50th Annual Conference of the German Informatics Society - INFORMATIK 2020, Karlsruhe, Baden-Württemberg, Germany, 28.09.20. https://doi.org/10.18420/inf2020_124

APA

Andresen, M., & Knorr, D. (2020). Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. In R. H. Reussner, A. Koziolek, & R. Heinrich (Eds.), Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual (pp. 1327-1333). (Lecture Notes in Informatics (LNI) – Proceedings; Vol. P307). Gesellschaft für Informatik e.V.. https://doi.org/10.18420/inf2020_124

Vancouver

Andresen M, Knorr D. Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. In Reussner RH, Koziolek A, Heinrich R, editors, Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. Bonn: Gesellschaft für Informatik e.V. 2020. p. 1327-1333. (Lecture Notes in Informatics (LNI) – Proceedings). doi: 10.18420/inf2020_124

Bibtex

@inbook{aeeb656a92ff4a459da9122c74c22472,
title = "Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning",
abstract = "The use of the pronoun ich ({\textquoteleft}I{\textquoteright}) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.",
keywords = "Language Studies, Korpuslinguistik, Academic language, Annotation, Classification, German, Machine learning",
author = "Melanie Andresen and Dagmar Knorr",
note = "Funding Information: Melanie Andresen{\textquoteright}s work on this paper was funded by the Landesforschungsf{\"o}rderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universit{\"a}t Hamburg. Publisher Copyright: {\textcopyright} 2020 Gesellschaft f{\"u}r Informatik (GI). All rights reserved.; 50th Annual Conference of the German Informatics Society - INFORMATIK 2020 : Back to the Future, INFORMATIK 2020 ; Conference date: 28-09-2020 Through 02-10-2020",
year = "2020",
doi = "10.18420/inf2020_124",
language = "English",
series = "Lecture Notes in Informatics (LNI) – Proceedings",
publisher = "Gesellschaft f{\"u}r Informatik e.V.",
pages = "1327--1333",
editor = "Reussner, {Ralf H.} and Anne Koziolek and Robert Heinrich",
booktitle = "Informatik 2020 - Back to the future",
address = "Germany",
url = "https://informatik2020.gi.de/",

}

RIS

TY - CHAP

T1 - Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

AU - Andresen, Melanie

AU - Knorr, Dagmar

N1 - Conference code: 50

PY - 2020

Y1 - 2020

AB - The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.

KW - Language Studies

KW - Korpuslinguistik

KW - annotation

KW - Academic language

KW - German

KW - machine learning

KW - classification

UR - http://www.scopus.com/inward/record.url?scp=85127357898&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/74ed2ea8-4157-36bd-9677-24ac22c76c5e/

U2 - 10.18420/inf2020_124

DO - 10.18420/inf2020_124

M3 - Article in conference proceedings

T3 - Lecture Notes in Informatics (LNI) – Proceedings

SP - 1327

EP - 1333

BT - Informatik 2020 - Back to the future

A2 - Reussner, Ralf H.

A2 - Koziolek, Anne

A2 - Heinrich, Robert

PB - Gesellschaft für Informatik e.V.

CY - Bonn

T2 - 50th Annual Conference of the German Informatics Society - INFORMATIK 2020

Y2 - 28 September 2020 through 2 October 2020

ER -
