FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

OriginalspracheEnglisch
TitelThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Anzahl der Seiten12
ErscheinungsortNew York
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum17.10.2022
Seiten549-560
ISBN (Print)9781665452106
ISBN (elektronisch)978-1-4503-9303-4
DOIs
PublikationsstatusErschienen - 17.10.2022
Veranstaltung19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, USA / Vereinigte Staaten
Dauer: 23.05.202224.05.2022
Konferenznummer: 19
https://conf.researchr.org/home/msr-2022

Bibliographische Notiz

Publisher Copyright:
© 2022 ACM.

DOI

Zuletzt angesehen

Publikationen

  1. Understanding the properties of isospectral points and pairs in graphs
  2. Analyzing math teacher students' sensitivity for aspects of the complexity of problem oriented mathematics instruction
  3. Trait correlation network analysis identifies biomass allocation traits and stem specific length as hub traits in herbaceous perennial plants
  4. The signal location task as a method quantifying the distribution of attention
  5. Applications of the Simultaneous Modular Approach in the Field of Material Flow Analysis
  6. Generating Energy Optimal Powertrain Force Trajectories with Dynamic Constraints
  7. Universal Threshold Calculation for Fingerprinting Decoders using Mixture Models
  8. Understanding reading as a form of language-use
  9. Towards a Bayesian Student Model for Detecting Decimal Misconceptions
  10. A statistical study of the spatial evolution of shock acceleration efficiency for 5 MeV protons and subsequent particle propagation
  11. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  12. “Ideation is Fine, but Execution is Key”
  13. Performance analysis for loss systems with many subscribers and concurrent services
  14. Simulating X-ray beam energy and detector signal processing of an industrial CT using implicit neural representations
  15. A new way of assessing the interaction of a metallic phase precursor with a modified oxide support substrate as a source of information for predicting metal dispersion
  16. Stimulating Computing
  17. Improving students’ science text comprehension through metacognitive self-regulation when applying learning strategies
  18. Identification of conductive fiber parameters with transcutaneous electrical nerve stimulation signal using RLS algorithm
  19. Introducing split orders and optimizing operational policies in robotic mobile fulfillment systems
  20. A localized boundary element method for the floating body problem
  21. Foundations and applications of computer based material flow networks for einvironmental management
  22. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  23. TARGET SETTING FOR OPERATIONAL PERFORMANCE IMPROVEMENTS - STUDY CASE -
  24. Dynamic priority based dispatching of AGVs in flexible job shops
  25. An analytical approach to evaluating bivariate functions of fuzzy numbers with one local extremum
  26. Stability analysis of a linear model predictive control and its application in a water recovery process
  27. From Knowledge to Application
  28. Neural correlates of the enactment effect in the brain
  29. What can conservation strategies learn from the ecosystem services approach?
  30. Computer als Medium
  31. Analysis of long-term statistical data of cobalt flows in the EU
  32. Scaffolding argumentation in mathematics with CSCL scripts
  33. Simulation based optimization of lot sizes for opposing logistic objectives
  34. Robust feedback linearization control of a throttle plate by using an approximated pd regulator
  35. Text Comprehension as a Mediator in Solving Mathematical Reality-Based Tasks