Proxy Indicators for the Quality of Open-domain Dialogues

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The automatic evaluation of open-domain dialogues remains a largely unsolved challenge. Thus, despite the abundance of work done in the field, human judges have to evaluate dialogues' quality. As a consequence, performing such evaluations at scale is usually expensive. This work investigates using a deep-learning model trained on the General Language Understanding Evaluation (GLUE) benchmark to serve as a quality indication of open-domain dialogues. The aim is to use the various GLUE tasks as different perspectives on judging the quality of conversation, thus reducing the need for additional training data or responses that serve as quality references. Due to this nature, the method can infer various quality metrics and derive a component-based overall score. We achieve statistically significant correlation coefficients of up to 0.7.

Original languageEnglish
Title of host publicationEMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings
EditorsMarie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Number of pages22
PublisherAssociation for Computational Linguistics (ACL)
Publication date01.01.2021
Pages7834-7855
ISBN (electronic)9781955917094
DOIs
Publication statusPublished - 01.01.2021
Externally publishedYes
Event2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021 - ONLINE, Punta Cana, Dominican Republic
Duration: 07.11.202111.11.2021
https://2021.emnlp.org

Bibliographical note

Publisher Copyright:
© 2021 Association for Computational Linguistics

Recently viewed

Activities

  1. Artistic Utopian Spaces and the Promise of Urban Development
  2. Workshop Gold, Weihrauch und Malerei. Notion and Representation of Value in Art - 2013
  3. Mental Parsing as A Mixed Blessing for Integrative Agreements: When Parsing Multiple Issues into Separate Mental Accounts Helps Versus Hurts Negotiators.
  4. Types of institutional proxy representatives for future generations in democracies: A comparative empirical analysis
  5. Acceleration and Reflection
  6. Nonlinear dynamics and opinion formation in time varying networks
  7. Verification of Measuring the Bearing Clearance Using Kurtosis, Recurrences and Neural Networks and Comparison of These Approaches
  8. Workshop mit David Bates: "Compossible Worlds"
  9. Empirical Insights into Working in Research-Practice Partnerships: New Findings on Motivation, Co-Constructive Collaboration and Learning Effects
  10. The temporal dynamics of ambidextrous leadership for innovation: A diary study
  11. Linking Teaching and Learning Formats with Student Development of Key Sustainability Competencies
  12. Digital Games Lab Lecture Series - 2018
  13. Experiencing Nature of Science – Discover your own understanding of NOS, mit Kerstin Oschatz
  14. PhD Workshop 2022 - Empirical Microeconomics
  15. Paper, pegboard, software: Elements of a media theory of organization
  16. Bridging the Curricular Divide. Integrating sustainability and EFL instruction in a project (week) context for secondary school learners of English and Science
  17. That is not enough–Or is it? A qualitative investigation of reference points in negotiations
  18. Combining SMC and MTPA Using an EKF to estimate parameters and states of an interior PMSM
  19. From e-learning to the acquirement of competencies: wiki-based knowledge management and complex problem solving

Publications

  1. Playing in the Spaces: Anarchism in the Classroom
  2. A Comparative Study for Fisheye Image Classification
  3. Reality-Based Tasks with Complex-Situations
  4. Perfectly nested or significantly nested - an important difference for conservation management
  5. Student Game Design for Language Learning
  6. Pressure fault recognition and compensation with an adaptive feedforward regulator in a controlled hybrid actuator within engine applications
  7. Influence of Long-Lasting Static Stretching Intervention on Functional and Morphological Parameters in the Plantar Flexors
  8. The impact of goal focus, task type and group size on synchronous net-based collaborative learning discourses
  9. Masked Autoencoder Pretraining for Event Classification in Elite Soccer
  10. The role of task meaning on output in groups
  11. Efficacy of an internet and app-based gratitude intervention in reducing repetitive negative thinking and mechanisms of change in the intervention's effect on anxiety and depression
  12. Perception and Inference
  13. Action Errors, Error Management, and Learning in Organizations
  14. Quantifying diffuse and point inputs of perfluoroalkyl acids in a nonindustrial river catchment
  15. Segment Introduction
  16. Comparison of EKF and TSO for Health Monitoring of a Textile-Based Heater Structure and its Control
  17. A Voxel-based technique to estimate the volume of trees from terrestrial laser scanner data
  18. Optimal scheduling of AGVs in a reentrant blocking job-shop
  19. BUSINESS MODELS IN BANKING: A CLUSTER ANALYSIS USING ARCHIVAL DATA
  20. Assessing authenticity in modelling test items: deriving a theoretical model
  21. A longitudinal multilevel CFA-MTMM model for interchangeable and structurally different methods
  22. Using EEG movement tagging to isolate brain responses coupled to biological movements
  23. How to support students-learning in mathematical bridging-courses using ITS? Remedial Scenarios in the EU-Project Math-Bridge
  24. An Adaptive Resonance Regulator for an Actuator using Periodic Signals in Camless Engine Systems
  25. Soil conditions modify species diversity effects on tree functional trait expression