Treating dialogue quality evaluation as an anomaly detection problem

Publication: Contributions to collected works > Articles in conference proceedings > Research > peer-reviewed

Authors

Dialogue systems for interaction with humans have become increasingly popular in both research and industry. To this day, the best way to estimate their success is human evaluation rather than automated approaches, despite the abundance of work done in the field. In this paper, we investigate the effectiveness of treating dialogue evaluation as an anomaly detection task. The paper examines four dialogue modeling approaches and how their objective functions correlate with human annotation scores. A high-level perspective yields negative results. However, a more in-depth look shows limited potential for using anomaly detection to evaluate dialogues.

Original language: English
Title: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Number of pages: 5
Publisher: European Language Resources Association (ELRA)
Publication date: 2020
Pages: 508-512
ISBN (electronic): 9791095546344
Publication status: Published - 2020
Externally published: Yes
Event: 12th International Conference on Language Resources and Evaluation, LREC 2020 - Le Palais du Pharao, Marseille, France
Duration: 11.05.2020 - 16.05.2020
https://lrec2020.lrec-conf.org/en/about/organizers/index.html

Bibliographical note

Publisher Copyright:
© European Language Resources Association (ELRA), licensed under CC-BY-NC
