Language Model Transformers as Evaluators for Open-domain Dialogues

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Computer-based systems for communication with humans are a cornerstone of AI research since the 1950s. So far, the most effective way to assess the quality of the dialogues produced by these systems is to use resource-intensive manual labor instead of automated means. In this work, we investigate whether language models (LM) based on transformer neural networks can indicate the quality of a conversation. In a general sense, language models are methods that learn to predict one or more words based on an already given context. Due to their unsupervised nature, they are candidates for efficient, automatic indication of dialogue quality. We demonstrate that human evaluators have a positive correlation between the output of the language models and scores. We also provide some insights into their behavior and inner-working in a conversational context.

Original languageEnglish
Title of host publicationCOLING 2020 - 28th International Conference on Computational Linguistics : Proceedings of the Conference
EditorsDonia Scott, Nuria Bel, Chengqing Zong
Number of pages12
PublisherAssociation for Computational Linguistics (ACL)
Publication date01.01.2020
Pages6797-6808
ISBN (electronic)9781952148279
DOIs
Publication statusPublished - 01.01.2020
Externally publishedYes
Event28th International Conference on Computational Linguistics, COLING 2020 - Virtual, Online, Spain
Duration: 08.12.202013.12.2020
https://coling2020.org
https://coling2020.org/COLING2020programme.pdf

Bibliographical note

We acknowledge the support of the EU projects Cleopatra (GA 812997) and TAILOR (GA 952215), the Federal Ministry for Economic Affairs and Energy (BMWi) project SPEAKER (FKZ 01MK20011A), the German Federal Ministry of Education and Research (BMBF) projects and excellence clusters ML2R (FKZ 01 15 18038 A/B/C), MLwin (01S18050 D/F), ScaDS.AI (01/S18026A) as well as the Fraunhofer Zukunftsstiftung project JOSEPH.

Publisher Copyright:
© 2020 COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference. All rights reserved.

Recently viewed

Publications

  1. Transcending Methodological Nationalism through a Transversal Method?
  2. Entry, Exit and Productivity
  3. Using LLMs in sensory service research
  4. Changing societies, changing journalism
  5. Enterprise Architecture Management Support for Digital Transformation Projects in Very Large Enterprises
  6. Considering Teachers’ Beliefs, Motivation, and Emotions Regarding Teaching Mathematics With Digital Tools
  7. Performance Saga: Interview 06
  8. The relation of flow-experience and physiological arousal under stress - can u shape it?
  9. Responsibility and environment
  10. Spectral Kinetic Simulation of the Ideal Multipole Resonance Probe
  11. Construal level theory
  12. A Transatlantic Symposium on the Restatement (Fourth)
  13. Optimising patterns of life conduct
  14. Time-varying persistence in real oil prices and its determinant
  15. Development and characterisation of a new interface for coupling capillary LC with collision-cell ICPMS and its application for phosphorylation profiling of tryptic protein digests
  16. A hybrid hydraulic piezo actuator modeling and hysteresis effect identification for control in camless internal combustion engines
  17. Exploring the uncanny valley effect in affective social robotics
  18. CAN BUSINESS MODEL COMPONENTS EXPLAIN DIGITAL START-UP SUCCESS?
  19. Sliding Mode Control for a Vertical Dynamics in the Presence of Nonlinear Friction
  20. Release of monomers from four different composite materials after halogen and LED curing
  21. System and action theory
  22. Paired case research design and mixed-methods approach
  23. Schreibt Ihr Unternehmen auch "grüne" Zahlen?
  24. Mapping the vegetation of southern mongolian protected areas: application of GIS and remote sensing techniques
  25. How many organic compounds are graph-theoretically nonplanar?
  26. Survey Response and Observed Behavior
  27. Essential ecosystem service variables for monitoring progress towards sustainability
  28. On the optimal design of insurance contracts with guarantees
  29. Sustainability Science with Ozzy Osbourne, Julia Roberts and Ai Weiwei
  30. Multiobjective optimal control of fluid mixing
  31. Sustainable Development
  32. Transformation products in the water cycle and the unsolved problem of their proactive assessment
  33. Rethinking the Spatiality of Spatial Planning
  34. Application of Software and Web-Based Tools for Sustainability Management in Small and Medium-Sized Enterprises
  35. Investigating quality raters' performance using interface evaluation methods
  36. Co-production of nature's contributions to people
  37. The Mobile Phone: From an Instrument of Microcoordination to a Universal Control Device