FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

OriginalspracheEnglisch
TitelThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Anzahl der Seiten12
ErscheinungsortNew York
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum17.10.2022
Seiten549-560
ISBN (Print)9781665452106
ISBN (elektronisch)978-1-4503-9303-4
DOIs
PublikationsstatusErschienen - 17.10.2022
Veranstaltung19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, USA / Vereinigte Staaten
Dauer: 23.05.202224.05.2022
Konferenznummer: 19
https://conf.researchr.org/home/msr-2022

Bibliographische Notiz

Publisher Copyright:
© 2022 ACM.

DOI

Zuletzt angesehen

Publikationen

  1. Towards a Bayesian Student Model for Detecting Decimal Misconceptions
  2. Real-time RDF extraction from unstructured data streams
  3. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  4. Age effects on controlling tools with sensorimotor transformations
  5. Considerations on efficient touch interfaces - How display size influences the performance in an applied pointing task
  6. An analytical approach to evaluating bivariate functions of fuzzy numbers with one local extremum
  7. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  8. Foundations and applications of computer based material flow networks for einvironmental management
  9. Robust feedback linearization control of a throttle plate by using an approximated pd regulator
  10. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  11. Integration of laser scanning and projection speckle pattern for advanced pipeline monitoring
  12. Artificial Intelligence Algorithms for Collaborative Book Recommender Systems
  13. Partitioned beta diversity patterns of plants across sharp and distinct boundaries of quartz habitat islands
  14. Using Fuzzy PD Controllers for Soft Motions in a Car-like Robot
  15. Switching from a Managing to a Monitoring Function on the Board
  16. The fuzzy relationship of intelligence and problem solving in computer simulations
  17. Performance concepts and performance theory
  18. A Structure and Content Prompt-based Method for Knowledge Graph Question Answering over Scholarly Data
  19. Changes of Perception
  20. Digging into the roots
  21. For a return to the forgotten formula: 'Data 1 + Data 2 > Data 1'
  22. Errors in Training Computer Skills
  23. Using augmented video to test in-car user experiences of context analog HUDs
  24. GENESIS - A generic RDF data access interface
  25. Factor structure and measurement invariance of the Students’ Self-report Checklist of Social and Learning Behaviour (SSL)
  26. Model predictive control for switching gain adaptation in a sliding mode controller of a DC drive with nonlinear friction
  27. Semantic Evaluation Services for Web-Based Exercises
  28. More input, better output
  29. How Much Home Office is Ideal? A Multi-Perspective Algorithm
  30. Optimizing price levels in e-commerce applications with respect to customer lifetime values
  31. Correlation of Microstructure and Local Mechanical Properties Along Build Direction for Multi-layer Friction Surfacing of Aluminum Alloys
  32. Emergency detection based on probabilistic modeling in AAL-environments
  33. Sliding-Mode-Based Input-Output Linearization of a Peltier Element for Ice Clamping Using a State and Disturbance Observer
  34. A general structural property in wavelet packets for detecting oscillation and noise components in signal analysis