Treating dialogue quality evaluation as an anomaly detection problem

Publication: Contributions to collected works, Articles in conference proceedings, Research, peer-reviewed

Dialogue systems for interaction with humans have become increasingly popular in both research and industry. To this day, the best way to estimate their success is human evaluation rather than automated approaches, despite the abundance of work in the field. In this paper, we investigate the effectiveness of treating dialogue evaluation as an anomaly detection task. The paper examines four dialogue modeling approaches and how their objective functions correlate with human annotation scores. At a high level, the results are negative. However, a more in-depth look shows limited potential for using anomaly detection to evaluate dialogues.

Original language: English
Title: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Number of pages: 5
Publisher: European Language Resources Association (ELRA)
Publication date: 2020
Pages: 508-512
ISBN (electronic): 9791095546344
Publication status: Published - 2020
Externally published: Yes
Event: 12th International Conference on Language Resources and Evaluation, LREC 2020 - Le Palais du Pharao, Marseille, France
Duration: 11.05.2020 to 16.05.2020
https://lrec2020.lrec-conf.org/en/about/organizers/index.html

Bibliographic note

Publisher Copyright:
© European Language Resources Association (ELRA), licensed under CC-BY-NC
