FaST: A linear time stack trace alignment heuristic for crash report deduplication

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

Original languageEnglish
Title of host publicationThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Number of pages12
Place of PublicationNew York
PublisherInstitute of Electrical and Electronics Engineers Inc.
Publication date17.10.2022
Pages549-560
ISBN (print)9781665452106
ISBN (electronic)978-1-4503-9303-4
DOIs
Publication statusPublished - 17.10.2022
Event19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, United States
Duration: 23.05.202224.05.2022
Conference number: 19
https://conf.researchr.org/home/msr-2022

Bibliographical note

Titel der Druckausgabe: 2022 IEEE/ACM 19th International Conference on Mining Software Repositories (MSR 2022)

Funding Information:
We would like to gratefully acknowledge the Natural Sciences and Engineering Research Council of Canada (NSERC), Ericsson, Ciena, and EffciOS for funding this project. Moreover, this research was enabled in part by the support provided by WestGrid (https://www. westgrid.ca/) and Compute Canada (www.computecanada.ca).

Publisher Copyright:
© 2022 ACM.

    Research areas

  • Automatic Crash Reporting, Crash Report Deduplication, Duplicate Crash Report, Duplicate Crash Report Detection, Stack Trace Similarity
  • Business informatics

DOI

Recently viewed

Activities

  1. Plenary lecture entitled: Discrete Lyapunov Controllers for an Actuator in Camless Engines
  2. International Conference on Methods and Models in Automation an Robotics - MMAR 2016
  3. Investigating learner pragmatics at the discourse level: Challenges, complexities and insights
  4. archiDART: an R package for the automated 2D computation of root architectural traits
  5. Placemaking today: integrating place-oriented thinking into cultural policy frameworks
  6. Reflexive Multi-Criteria Evaluation as a tool to integrate Multiple Values into Decision-Making – a Case Study from Germany
  7. Performativity and Authenticity in the Web 2.0-Enhanced Foreign Language Classroom
  8. Performance resource depletion influence on performance: Advancing concepts and findings
  9. Keynote: Routines Research: More Critical?
  10. That is not enough–Or is it? A qualitative investigation of reference points in negotiations
  11. European University Institute
  12. Staging Science: new approaches in science communication, popular and artistic science performance
  13. Teaching Listening Comprehension in the EFL Classroom: Processes, Materials, Tasks and Tests
  14. Artificial Intelligence and Intellectual Property
  15. Governance Pluralism to manage the Complexity of Ecosystem Services Co-production
  16. Urban spaces of possibility and imaginaries of sustainability
  17. Combination of a reduced order state observer and an Extended Kalman Filter for Peltier cells
  18. Comfort and Intervention Behavior of Drivers in Highly Automated Vehicles with Headway Control
  19. Comparative Perspectives on Social Values and Modernization - 2011
  20. Rational Design of Molecules by Life Cycle Engineering.
  21. Where To Start? Exploring 1-Year-Students’ Preconceptions of Sustainable Development

Publications

  1. Performance analysis for loss systems with many subscribers and concurrent services
  2. Volume of Imbalance Container Prediction using Kalman Filter and Long Short-Term Memory
  3. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  4. Text Comprehension as a Mediator in Solving Mathematical Reality-Based Tasks
  5. Eliciting Learner Perceptions of Web 2.0 Tasks through Mixed-Methods Classroom Research
  6. Multilevel bridge governor by using model predictive control in wavelet packets for tracking trajectories
  7. An empirical comparison of different implicit measures to predict consumer choice
  8. Technical concept and evaluation design of the state subsidized project [Level-Q]
  9. Performance of methods to select landscape metrics for modelling species richness
  10. An Adaptive Resonance Regulator for an Actuator using Periodic Signals in Camless Engine Systems
  11. Integration of laboratory experiments into introductory electrical engineering courses
  12. Canopy structure influences arthropod communities within and beyond tree identity effects
  13. Modeling and simulation of the microstructural behaviour in thermal sprayed coatings
  14. The explanatory power of Carnegie Classification in predicting engagement indicators
  15. Test of advanced hyperfine structure theory by precision radio-frequency and laser spectroscopy in molybdenum
  16. Accidental Representation–The Reconfiguration of Representation through Social Media
  17. Non-invariance? An Overstated Problem With Misconceived Causes
  18. archiDART: an R package for the automated computation of plant root architectural traits