FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

OriginalspracheEnglisch
TitelThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Anzahl der Seiten12
ErscheinungsortNew York
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum23.05.2022
Seiten549-560
ISBN (Print)9781665452106
ISBN (elektronisch)978-1-4503-9303-4
DOIs
PublikationsstatusErschienen - 23.05.2022
Veranstaltung19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, USA / Vereinigte Staaten
Dauer: 23.05.202224.05.2022
Konferenznummer: 19
https://conf.researchr.org/home/msr-2022

Bibliographische Notiz

Publisher Copyright:
© 2022 ACM.

DOI

Zuletzt angesehen

Publikationen

  1. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  2. Improving students’ science text comprehension through metacognitive self-regulation when applying learning strategies
  3. A new way of assessing the interaction of a metallic phase precursor with a modified oxide support substrate as a source of information for predicting metal dispersion
  4. Computing regression statistics from grouped data
  5. Performance analysis for loss systems with many subscribers and concurrent services
  6. Stimulating Computing
  7. TARGET SETTING FOR OPERATIONAL PERFORMANCE IMPROVEMENTS - STUDY CASE -
  8. Integration of laser scanning and projection speckle pattern for advanced pipeline monitoring
  9. Comments on "Tracking Control of Robotic Manipulators With Uncertain Kinematics and Dynamics"
  10. Analysis of long-term statistical data of cobalt flows in the EU
  11. Simulation based optimization of lot sizes for opposing logistic objectives
  12. Gaussian processes for dispatching rule selection in production scheduling
  13. Exploring the limits of graph invariant- and spectrum-based discrimination of (sub)structures.
  14. Learning Analytics with Matlab Grader in Undergraduate Engineering Courses
  15. Neural network-based estimation and compensation of friction for enhanced deep drawing process control
  16. Teaching methods for modelling problems and students’ task-specific enjoyment, value, interest and self-efficacy expectations
  17. Self-regulation in error management training: emotion control and metacognition as mediators of performance effects
  18. Does thinking-aloud affect learning, visual information processing and cognitive load when learning with seductive details as expected from self-regulation perspective?
  19. For a return to the forgotten formula: 'Data 1 + Data 2 > Data 1'
  20. Using Language Learning Resources on YouTube
  21. Cognitive Predictors of Child Second Language Comprehension and Syntactic Learning
  22. A Theoretical Dynamical Noninteracting Model for General Manipulation Systems Using Axiomatic Geometric Structures
  23. Teachers’ use of data from digital learning platforms for instructional design
  24. Dynamic environment modelling and prediction for autonomous systems
  25. Machine Learning and Knowledge Discovery in Databases
  26. Multiphase-field modeling of temperature-driven intermetallic compound evolution in an Al-Mg system for application to solid-state joining processes
  27. Guided discovery learning with computer-based simulation games
  28. Modelling biodegradability based on OECD 301D data for the design of mineralising ionic liquids
  29. A longitudinal multilevel CFA-MTMM model for interchangeable and structurally different methods
  30. Quantifying diffuse and point inputs of perfluoroalkyl acids in a nonindustrial river catchment
  31. Is too much help an obstacle? Effects of interactivity and cognitive style on learning with dynamic versus non-dynamic visualizations with narrative explanations
  32. An application of multiple behavior SIA for analyzing data from student exams
  33. How, when and why do negotiators use reference points?