Treating dialogue quality evaluation as an anomaly detection problem

Publication: Contributions to collected editions › Article in conference proceedings › Research › peer-reviewed

Dialogue systems for interaction with humans have become increasingly popular in both research and industry. To this day, the best way to estimate their success is human evaluation rather than automated approaches, despite the abundance of work done in the field. In this paper, we investigate the effectiveness of treating dialogue evaluation as an anomaly detection task. The paper looks into four dialogue modeling approaches and how their objective functions correlate with human annotation scores. A high-level perspective exhibits negative results. However, a more in-depth look shows limited potential for using anomaly detection for evaluating dialogues.

Original language: English
Title of host publication: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Number of pages: 5
Publisher: European Language Resources Association (ELRA)
Publication date: 2020
Pages: 508-512
ISBN (electronic): 9791095546344
Publication status: Published - 2020
Externally published: Yes
Event: 12th International Conference on Language Resources and Evaluation, LREC 2020 - Le Palais du Pharao, Marseille, France
Duration: 11.05.2020 - 16.05.2020
https://lrec2020.lrec-conf.org/en/about/organizers/index.html

Bibliographical note

Publisher Copyright:
© European Language Resources Association (ELRA), licensed under CC-BY-NC
