Automated scoring in the era of artificial intelligence: An empirical study with Turkish essays

Publikation: Beiträge in ZeitschriftenZeitschriftenaufsätzeForschungbegutachtet

Authors

Automated scoring (AS) has gained significant attention as a tool to enhance the efficiency and reliability of assessment processes. Yet, its application in under-represented languages, such as Turkish, remains limited. This study addresses this gap by empirically evaluating AS for Turkish using a zero-shot approach with a rubric powered by OpenAI's GPT-4o. A dataset of 590 essays written by learners of Turkish as a second language was scored by professional human raters and an artificial intelligence (AI) model integrated via a custom-built interface. The scoring rubric, grounded in the Common European Framework of Reference for Languages, assessed six dimensions of writing quality. Results revealed a strong alignment between human and AI scores with a Quadratic Weighted Kappa of 0.72, Pearson correlation of 0.73, and an overlap measure of 83.5 %. Analysis of rater effects showed minimal influence on score discrepancies, though factors such as experience and gender exhibited modest effects. These findings demonstrate the potential of AI-driven scoring in Turkish, offering valuable insights for broader implementation in under-represented languages, such as the possible source of disagreements between human and AI scores. Conclusions from a specific writing task with a single human rater underscore the need for future research to explore diverse inputs and multiple raters.

OriginalspracheEnglisch
Aufsatznummer103784
ZeitschriftSystem
Jahrgang133
Anzahl der Seiten12
ISSN0346-251X
DOIs
PublikationsstatusErschienen - 10.2025

Bibliographische Notiz

Publisher Copyright:
© 2025 The Authors

DOI

Zuletzt angesehen

Publikationen

  1. How problem-based or direct instructional case-based learning environments influence pre-service teachers’ cognitive load, motivation and emotions
  2. Доля на внутрішньому ринку“ для України в рамках Угоди про асоціацію між Україною та ЄС
  3. A sensorless control using a sliding-mode observer for an electromagnetic valve actuator in automotive applications
  4. The bispecific SDF1-GPVI fusion protein preserves myocardial function after transient ischemia in mice.
  5. Maschinenbelegungsplanung mit evolutionären Algorithmen
  6. Editorial overview
  7. Systemic Risks from Different Perspectives
  8. Transculturality in Top Model
  9. High quality extrudates from aluminum chips by new billet compaction and deformation routes
  10. Assuring a safe, secure and sustainable
  11. Geheime Verwandtschaften
  12. The ESAFORM benchmark 2023
  13. Pricing effects when competitors arrive
  14. An Introduction to Corporate Environmental Management
  15. Regional powers and the politics of scale
  16. Attention and Information Acquisition
  17. Cinephilia in transition
  18. Was tun, Herr Luhmann?
  19. Bodenlos.
  20. Removal of the anti-cancer drug methotrexate from water by advanced oxidation processes
  21. Elections in Asia and the Pacific: a data handbook
  22. Exploring the influence of testimonial source on attitudes towards e-mental health interventions among university students
  23. A Web-Based Stress Management Intervention for University Students in Indonesia (Rileks)
  24. § 2 Zur Konzeption des Handbuchs
  25. Schlechte Haut