FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publication: Contributions to collected editions, Articles in conference proceedings, Research, peer-reviewed

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially for popular applications, such systems tend to generate a large number of crash reports, a significant portion of which are duplicates. Because of this high submission volume, crash report deduplication is in practice supported by automatic systems, for which efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up stack trace comparison. In contrast to state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance on all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.
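
The abstract does not spell out the heuristic, so the following is only a minimal sketch of what "independently aligning identical frames in linear time" could look like, not the FaST algorithm itself. It assumes that each trace is a list of frame names and that the k-th occurrence of a name in one trace is paired with the k-th occurrence in the other. The function names and the scoring rule (matched frames divided by the longer trace length) are hypothetical.

```python
from collections import defaultdict, deque


def align_identical_frames(trace_a, trace_b):
    """Pair identical frames of two traces in expected linear time.

    Assumption (not taken from the paper): the k-th occurrence of a frame
    name in trace_a is paired with its k-th occurrence in trace_b.
    Returns a list of (index_in_a, index_in_b) pairs.
    """
    # Queue the positions of every frame name seen in trace_a.
    positions_a = defaultdict(deque)
    for i, frame in enumerate(trace_a):
        positions_a[frame].append(i)

    pairs = []
    for j, frame in enumerate(trace_b):
        queue = positions_a.get(frame)
        if queue:  # the name exists in trace_a and has an unmatched occurrence
            pairs.append((queue.popleft(), j))
    return pairs


def similarity(trace_a, trace_b):
    """Hypothetical score in [0, 1]: matched frames over the longer length."""
    longest = max(len(trace_a), len(trace_b))
    if longest == 0:
        return 1.0  # two empty traces are treated as identical
    return len(align_identical_frames(trace_a, trace_b)) / longest


if __name__ == "__main__":
    a = ["main", "parse", "read", "parse"]
    b = ["main", "read", "parse", "exit"]
    print(align_identical_frames(a, b))
    print(similarity(a, b))
```

Each frame is visited once and every dictionary or queue operation takes expected constant time, so the cost grows with the sum of the two trace lengths. Optimal sequence alignment, by contrast, typically fills a table whose size is the product of the lengths.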

Original language: English
Title: The 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Number of pages: 12
Place of publication: New York
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 23.05.2022
Pages: 549-560
ISBN (print): 9781665452106
ISBN (electronic): 978-1-4503-9303-4
Publication status: Published - 23.05.2022
Event: 19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, United States
Duration: 23.05.2022 to 24.05.2022
Conference number: 19
https://conf.researchr.org/home/msr-2022

Bibliographical note

Publisher Copyright:
© 2022 ACM.
