FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

OriginalspracheEnglisch
TitelThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Anzahl der Seiten12
ErscheinungsortNew York
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum17.10.2022
Seiten549-560
ISBN (Print)9781665452106
ISBN (elektronisch)978-1-4503-9303-4
DOIs
PublikationsstatusErschienen - 17.10.2022
Veranstaltung19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, USA / Vereinigte Staaten
Dauer: 23.05.202224.05.2022
Konferenznummer: 19
https://conf.researchr.org/home/msr-2022

Bibliographische Notiz

Publisher Copyright:
© 2022 ACM.

DOI

Zuletzt angesehen

Publikationen

  1. Towards a Bayesian Student Model for Detecting Decimal Misconceptions
  2. Mining positional data streams
  3. Universal Threshold Calculation for Fingerprinting Decoders using Mixture Models
  4. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  5. Real-time RDF extraction from unstructured data streams
  6. Combining a PI Controller with an Adaptive Feedforward Control in PMSM
  7. “Ideation is Fine, but Execution is Key”
  8. Age effects on controlling tools with sensorimotor transformations
  9. Applications of the Simultaneous Modular Approach in the Field of Material Flow Analysis
  10. Assessing Effects Through Semi-Field and Field Toxicity Testing
  11. Understanding reading as a form of language-use
  12. A new way of assessing the interaction of a metallic phase precursor with a modified oxide support substrate as a source of information for predicting metal dispersion
  13. Computing regression statistics from grouped data
  14. HAWK - hybrid question answering using linked data
  15. A Line with Variable Direction, which Traces No Contour, and Delimits No Form
  16. Identification of conductive fiber parameters with transcutaneous electrical nerve stimulation signal using RLS algorithm
  17. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  18. Scaffolding argumentation in mathematics with CSCL scripts
  19. A localized boundary element method for the floating body problem
  20. TARGET SETTING FOR OPERATIONAL PERFORMANCE IMPROVEMENTS - STUDY CASE -
  21. Structure and dynamics laboratory testing of an indirectly controlled full variable valve train for camless engines
  22. Users’ handedness and performance when controlling integrated input devices
  23. OKBQA framework towards an open collaboration for development of natural language question-answering systems over knowledge bases
  24. Supporting discourse in a synchronous learning environment
  25. A Quality Data Model Based on Asset Administration Shell Technology to Enable Autonomous Quality Control Loops
  26. From Knowledge to Application
  27. Learning from Erroneous Examples: When and How do Students Benefit from them?