FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publication: Contributions to collected editions; Article in conference proceedings; Research; peer-reviewed

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially for popular applications, such systems tend to generate a large number of crash reports, a significant portion of which are duplicates. Because of this high submission volume, crash report deduplication is in practice handled by automatic systems, for which efficiency is a critical constraint. In this paper, we focus on improving the throughput of deduplication systems by speeding up stack trace comparison. In contrast to state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance on all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.
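As a rough illustration of what aligning identical frames independently in linear time could look like, the following Python sketch groups the positions of each frame with hash maps and pairs the i-th occurrence of a frame in one trace with its i-th occurrence in the other. This is not the scoring function from the paper: the function names, the position-based weight, and the normalization are all assumptions made for this example.

```python
from collections import defaultdict


def positions_by_frame(trace):
    """Map each frame name to the positions where it occurs (top of stack = 0)."""
    index = defaultdict(list)
    for pos, frame in enumerate(trace):
        index[frame].append(pos)
    return index


def frame_weight(pos, alpha=0.5):
    """Hypothetical weight: frames closer to the top of the stack count more."""
    return 1.0 / (1.0 + pos) ** alpha


def align_similarity(trace_a, trace_b, alpha=0.5):
    """Similarity in [0, 1] computed in expected O(len(trace_a) + len(trace_b)).

    Assumed heuristic: each frame name is aligned independently of all
    others, and its occurrences are paired in order of appearance.
    """
    idx_a = positions_by_frame(trace_a)
    idx_b = positions_by_frame(trace_b)

    matched = 0.0
    for frame, pos_a in idx_a.items():
        pos_b = idx_b.get(frame, ())
        # zip stops at the shorter list, so surplus occurrences stay unmatched.
        for pa, pb in zip(pos_a, pos_b):
            matched += 2.0 * min(frame_weight(pa, alpha), frame_weight(pb, alpha))

    total = sum(frame_weight(p, alpha) for p in range(len(trace_a)))
    total += sum(frame_weight(p, alpha) for p in range(len(trace_b)))
    return matched / total if total else 0.0


if __name__ == "__main__":
    a = ["main", "parse", "read_token", "fail"]
    b = ["main", "parse", "read_char", "fail"]
    print(align_similarity(a, b))
```

Because every frame is visited a constant number of times, the cost grows with the combined length of the two traces, whereas optimal sequence alignment by dynamic programming fills a table whose size is the product of the two lengths.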

Original language: English
Title: The 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Number of pages: 12
Place of publication: New York
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 17.10.2022
Pages: 549-560
ISBN (print): 9781665452106
ISBN (electronic): 978-1-4503-9303-4
Publication status: Published - 17.10.2022
Event: 19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, United States
Duration: 23.05.2022 - 24.05.2022
Conference number: 19
https://conf.researchr.org/home/msr-2022

Bibliographic note

Publisher Copyright:
© 2022 ACM.
