Language Model Transformers as Evaluators for Open-domain Dialogues

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Computer-based systems for communication with humans are a cornerstone of AI research since the 1950s. So far, the most effective way to assess the quality of the dialogues produced by these systems is to use resource-intensive manual labor instead of automated means. In this work, we investigate whether language models (LM) based on transformer neural networks can indicate the quality of a conversation. In a general sense, language models are methods that learn to predict one or more words based on an already given context. Due to their unsupervised nature, they are candidates for efficient, automatic indication of dialogue quality. We demonstrate that human evaluators have a positive correlation between the output of the language models and scores. We also provide some insights into their behavior and inner-working in a conversational context.

Original languageEnglish
Title of host publicationCOLING 2020 - 28th International Conference on Computational Linguistics : Proceedings of the Conference
EditorsDonia Scott, Nuria Bel, Chengqing Zong
Number of pages12
PublisherAssociation for Computational Linguistics (ACL)
Publication date01.01.2020
Pages6797-6808
ISBN (electronic)9781952148279
DOIs
Publication statusPublished - 01.01.2020
Externally publishedYes
Event28th International Conference on Computational Linguistics, COLING 2020 - Virtual, Online, Spain
Duration: 08.12.202013.12.2020
https://coling2020.org
https://coling2020.org/COLING2020programme.pdf

Bibliographical note

We acknowledge the support of the EU projects Cleopatra (GA 812997) and TAILOR (GA 952215), the Federal Ministry for Economic Affairs and Energy (BMWi) project SPEAKER (FKZ 01MK20011A), the German Federal Ministry of Education and Research (BMBF) projects and excellence clusters ML2R (FKZ 01 15 18038 A/B/C), MLwin (01S18050 D/F), ScaDS.AI (01/S18026A) as well as the Fraunhofer Zukunftsstiftung project JOSEPH.

Publisher Copyright:
© 2020 COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference. All rights reserved.

Recently viewed

Projects

  1. Promos 2019

Publications

  1. Third International Mathematics and Science Study and Trends in Mathematics and Science Studies (TIMSS)
  2. A Robust Decoupling Estimator to Indentify Electrical Parameters for Three-Phase Permanent Magnet Synchronous Motors
  3. Using LLMs in sensory service research
  4. The relation of flow-experience and physiological arousal under stress - can u shape it?
  5. Responsibility and environment
  6. Case study on delivery time determination using a machine learning approach in small batch production companies
  7. Introduction
  8. Systematic risk behavior in cyclical industries
  9. On the Existence of Digital Objects
  10. Usage pattern-based exposure screening as a simple tool for the regional priority-setting in environmental risk assessment of veterinary antibiotics
  11. The importance of product lifetime labelling for purchase decisions
  12. Separable models for interconnected production-inventory systems
  13. Release of monomers from four different composite materials after halogen and LED curing
  14. HAWK@QALD5 - Trying to answer hybrid questions with various simple ranking techniques
  15. Paired case research design and mixed-methods approach
  16. Excellence in Teaching and Learning
  17. Schreibt Ihr Unternehmen auch "grüne" Zahlen?
  18. One Fits Them All?
  19. Mapping the vegetation of southern mongolian protected areas: application of GIS and remote sensing techniques
  20. Principals between exploitation and exploration
  21. How data on transformation products can support the redesign of sulfonamides towards better biodegradability in the environment