Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

Melanie Andresen; Dagmar Knorr

doi:10.18420/inf2020_124

Publication: Contributions to collected works › Papers in conference proceedings › Research › peer-reviewed

Standard

Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. / Andresen, Melanie; Knorr, Dagmar.
Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. Ed. / Ralf H. Reussner; Anne Koziolek; Robert Heinrich. Bonn: Gesellschaft für Informatik e.V., 2020. pp. 1327-1333 (Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI); Vol. P-307).

Harvard

Andresen, M & Knorr, D 2020, Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. in RH Reussner, A Koziolek & R Heinrich (eds), Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI), vol. P-307, Gesellschaft für Informatik e.V., Bonn, pp. 1327-1333, 50. Jahrestagung der Gesellschaft für Informatik - INFORMATIK 2020, Karlsruhe, Baden-Württemberg, Germany, 28.09.20. https://doi.org/10.18420/inf2020_124

APA

Andresen, M., & Knorr, D. (2020). Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. In R. H. Reussner, A. Koziolek, & R. Heinrich (Eds.), Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual (pp. 1327-1333). (Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI); Vol. P-307). Gesellschaft für Informatik e.V. https://doi.org/10.18420/inf2020_124

Vancouver

Andresen M, Knorr D. Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning. In: Reussner RH, Koziolek A, Heinrich R, editors, Informatik 2020 - Back to the future: 50. Jahrestagung der Gesellschaft für Informatik vom 28. September - 2. Oktober 2020, virtual. Bonn: Gesellschaft für Informatik e.V. 2020. p. 1327-1333. (Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI)). doi: 10.18420/inf2020_124

Bibtex

@inbook{aeeb656a92ff4a459da9122c74c22472,

title = "Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning",

abstract = "The use of the pronoun ich ({\textquoteleft}I{\textquoteright}) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.",

keywords = "Language Studies, Corpus linguistics, Academic language, Annotation, Classification, German, Machine learning",

author = "Melanie Andresen and Dagmar Knorr",

note = "Funding Information: Melanie Andresen{\textquoteright}s work on this paper was funded by the Landesforschungsf{\"o}rderung Hamburg in the context of the project hermA [Ga17] (LFF-FV 35) at Universit{\"a}t Hamburg. Publisher Copyright: {\textcopyright} 2020 Gesellschaft f{\"u}r Informatik (GI). All rights reserved.; 50th Annual Conference of the German Informatics Society - INFORMATIK 2020 : Back to the Future, INFORMATIK 2020 ; Conference date: 28-09-2020 Through 02-10-2020",

year = "2020",

doi = "10.18420/inf2020_124",

language = "English",

series = "Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft f{\"u}r Informatik (GI)",

publisher = "Gesellschaft f{\"u}r Informatik e.V.",

pages = "1327--1333",

editor = "Reussner, {Ralf H.} and Anne Koziolek and Robert Heinrich",

booktitle = "Informatik 2020 - Back to the future",

address = "Germany",

url = "https://informatik2020.gi.de/",

}

RIS

TY - CHAP

T1 - Exploring the Use of the Pronoun I in German Academic Texts with Machine Learning

AU - Andresen, Melanie

AU - Knorr, Dagmar

N1 - Conference code: 50

PY - 2020

Y1 - 2020

N2 - The use of the pronoun ich (‘I’) in academic language is a source of constant debate and a frequent cause of insecurity for students. We explore manually annotated instances of I from a German learner corpus. Using machine learning techniques, we investigate to what extent it is possible to automatically distinguish between different types of I usage (author I vs. narrator I). We additionally inspect which context words are good indicators of one type or the other. The results show that an automatic classification is not straightforward, but the distinctive features are in line with previous research. The results of the automatic classification are not perfect, but would greatly facilitate manual annotation. The distinctive words are in line with previous research and indicate that the author I is a more homogeneous class.

KW - Language Studies

KW - Corpus linguistics

KW - annotation

KW - Academic language

KW - German

KW - machine learning

KW - classification

UR - http://www.scopus.com/inward/record.url?scp=85127357898&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/74ed2ea8-4157-36bd-9677-24ac22c76c5e/

U2 - 10.18420/inf2020_124

DO - 10.18420/inf2020_124

M3 - Article in conference proceedings

T3 - Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI)

SP - 1327

EP - 1333

BT - Informatik 2020 - Back to the future

A2 - Reussner, Ralf H.

A2 - Koziolek, Anne

A2 - Heinrich, Robert

PB - Gesellschaft für Informatik e.V.

CY - Bonn

T2 - 50th Annual Conference of the German Informatics Society - INFORMATIK 2020

Y2 - 28 September 2020 through 2 October 2020

ER -

Further publications by these authors

Schreibberatung: Eine Systematik

Knorr, D., 2025, Vienna: Böhlau Verlag Wien. 376 pp. (Schreibwissenschaft; Vol. 4)

Publication: Books and anthologies › Monographs › Research › peer-reviewed

Sprach(en)sensibilität: Eine linguistische Perspektive

Knorr, D., 07.08.2024, 40 Begriffe für eine Schreibwissenschaft: Konzeptuelle Perspektiven auf Praxis und Praktiken des Schreibens. Karsten, A. & Haacke-Werron, S. (Eds.). Bielefeld: wbv Media GmbH & Co. KG, pp. 253–259, 7 pp. (Theorie und Praxis der Schreibwissenschaft; Vol. 17).

Publication: Contributions to collected works › Chapters in collected works › Research › peer-reviewed

Toward a Systematic Approach to Developing Professional Roles: What Writing Tutors Need to Know and Know How to Do

Knorr, D. & Edlich, M. G. P., 2024, In: Hermes (Denmark). 64, pp. 271-285, 15 pp.

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Sprache in Wissenschaft: Sprachliche Anforderungen wissenschaftlicher Texte

Knorr, D. & Tilmans, A., 2023, Wissenschaftliches Schreiben in den MINT-Fächern: Der Schreibratgeber für alle Texte im Studium. Herfurth, S. & Kaufholz-Soldat, E. (Eds.). Tübingen: Expert Verlag, pp. 195–207, 13 pp. (utb; Vol. 5951).

Publication: Contributions to collected works › Chapters in collected works › Teaching › peer-reviewed

Wissenschaftliches Schreiben im Zeitalter von KI gemeinsam verantworten: Eine schreibwissenschaftliche Perspektive auf Implikationen für Akteur*innen an Hochschulen

Brommer, S., Berendes, J., Bohle-Jurok, U., Buck, I., Girgensohn, K., Grieshammer, E., Gröner, C., Gürtl, F., Hollosi-Boiger, C., Klamm, C., Knorr, D., Limburg, A., Mundorf, M., Stahlberg, N. & Unterpertinger, E., 2023, Berlin: Hochschulforum Digitalisierung, 21 pp. (Diskussionspapier; Vol. 27).

Publication: Working or discussion papers and reports › Working or discussion papers

DOI

https://doi.org/10.18420/inf2020_124
Final published version
