Language Model Transformers as Evaluators for Open-domain Dialogues

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Computer-based systems for communication with humans have been a cornerstone of AI research since the 1950s. So far, the most effective way to assess the quality of the dialogues produced by these systems has been resource-intensive manual labor rather than automated means. In this work, we investigate whether language models (LMs) based on transformer neural networks can indicate the quality of a conversation. In a general sense, language models are methods that learn to predict one or more words based on an already given context. Due to their unsupervised nature, they are candidates for efficient, automatic indication of dialogue quality. We demonstrate that the output of the language models correlates positively with the scores assigned by human evaluators. We also provide some insights into their behavior and inner workings in a conversational context.
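The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: a toy add-one-smoothed bigram model stands in for a transformer language model, and the function names (`train_bigram`, `avg_log_prob`, `pearson`), the tiny corpus, and the human scores are all hypothetical. The sketch only shows the general pattern of scoring a response by its average log-probability and then checking how those scores correlate with human ratings.

```python
import math
from collections import Counter


def train_bigram(corpus):
    """Count bigrams and their left contexts over whitespace-tokenized sentences."""
    contexts, bigrams, vocab = Counter(), Counter(), set()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.lower().split()
        vocab.update(tokens)
        contexts.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    return contexts, bigrams, vocab


def avg_log_prob(response, model):
    """Mean add-one-smoothed log-probability per token of `response`."""
    contexts, bigrams, vocab = model
    size = len(vocab) + 1  # one extra slot shared by all unseen words
    tokens = ["<s>"] + response.lower().split()
    pairs = list(zip(tokens, tokens[1:]))
    total = sum(
        math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + size))
        for prev, cur in pairs
    )
    return total / max(len(pairs), 1)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical data: a tiny training corpus, three responses, invented ratings.
model = train_bigram(["i like tea", "i like coffee", "do you like tea"])
responses = ["i like tea", "you like coffee", "tea like i"]
human_scores = [5.0, 4.0, 1.0]  # assumed ratings, for illustration only
lm_scores = [avg_log_prob(r, model) for r in responses]
correlation = pearson(lm_scores, human_scores)
```

With a real transformer language model, the per-token log-probabilities would come from the model's output distribution instead of bigram counts; the correlation step would stay the same.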

Original language: English
Title of host publication: COLING 2020 - 28th International Conference on Computational Linguistics: Proceedings of the Conference
Editors: Donia Scott, Nuria Bel, Chengqing Zong
Number of pages: 12
Publisher: Association for Computational Linguistics (ACL)
Publication date: 01.01.2020
Pages: 6797-6808
ISBN (electronic): 9781952148279
Publication status: Published - 01.01.2020
Externally published: Yes
Event: 28th International Conference on Computational Linguistics, COLING 2020 - Virtual, Online, Spain
Duration: 08.12.2020 - 13.12.2020
https://coling2020.org
https://coling2020.org/COLING2020programme.pdf

Bibliographical note

We acknowledge the support of the EU projects Cleopatra (GA 812997) and TAILOR (GA 952215), the Federal Ministry for Economic Affairs and Energy (BMWi) project SPEAKER (FKZ 01MK20011A), the German Federal Ministry of Education and Research (BMBF) projects and excellence clusters ML2R (FKZ 01 15 18038 A/B/C), MLwin (01S18050 D/F), ScaDS.AI (01/S18026A) as well as the Fraunhofer Zukunftsstiftung project JOSEPH.

Publisher Copyright:
© 2020 COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference. All rights reserved.
