FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports in which a significant portion of them are duplicate. Due to this high submission volume, in practice, the crash report deduplication is supported by devising automatic systems whose efficiency is a critical constraint. In this paper, we focus on improving deduplication system throughput by speeding up the stack trace comparison. In contrast to the state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance regarding all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.

OriginalspracheEnglisch
TitelThe 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Anzahl der Seiten12
ErscheinungsortNew York
VerlagInstitute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum17.10.2022
Seiten549-560
ISBN (Print)9781665452106
ISBN (elektronisch)978-1-4503-9303-4
DOIs
PublikationsstatusErschienen - 17.10.2022
Veranstaltung19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, USA / Vereinigte Staaten
Dauer: 23.05.202224.05.2022
Konferenznummer: 19
https://conf.researchr.org/home/msr-2022

Bibliographische Notiz

Publisher Copyright:
© 2022 ACM.

DOI

Zuletzt angesehen

Aktivitäten

  1. Material Migrations I Online Lecture Series
  2. 27th International Conference on System Theory, Control and Computing - ICSTCC 2023
  3. „Regional Conflicts & Contested Spatial Identities in the Digital Sphere”
  4. Transdisciplinary boundary-work: Exploring entities, identities and boundaries
  5. International Workshop - Pragmatic Markers, Discourse Markers and Modal Particles: What do we know and where do we go from here?
  6. The golden age of software architecture better named the middle age of software architecture - Some provocative thoughts
  7. The global classroom: Introduction, presentation and workshops
  8. Bridges or blocks: How objects cross or enact boundaries in interorganizational teams
  9. GET.ON PAPP: Feasibility of a mobile application for panic with and without agoraphobia
  10. Where Did It Come From? Deep Learning for Event Extraction in Art Provenance
  11. Field Experimentation in Governance Research. Early insights from researching the effectiveness of public participation in implementing the EU Floods Directive
  12. Creating Third Spaces for Transdisciplinary Research
  13. Methodology of Scenario Technique in Regional Development Processes
  14. Design based knowledge production in transdisciplinary Sustainability Science
  15. Conference on Transdisciplinary Research and Modeling - 2013
  16. The Water Framework Directive: Policy Implementation Through Multi-Level Governance
  17. Navigating in the Digital Jungle: Articulating Combinatory Affordances of Digital Infrastructures for Collaboration
  18. Organized Creativity - Introduction to the 1st Annual Conference
  19. Predicting Performance And Motivation In Teacher Education – The Role Of Opportunities To Learn, Intrinsic Needs And Perceived Self-efficacy
  20. Blogs in the Foreign Language Classroom

Publikationen

  1. Towards a Bayesian Student Model for Detecting Decimal Misconceptions
  2. What does it mean to be sensitive for the complexity of (problem oriented) teaching?
  3. Assessing Effects Through Semi-Field and Field Toxicity Testing
  4. A new way of assessing the interaction of a metallic phase precursor with a modified oxide support substrate as a source of information for predicting metal dispersion
  5. Computing regression statistics from grouped data
  6. Graphism and Flatness. The Line as Mediator between Time and Space, Intuition and Concept
  7. Stimulating Computing
  8. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  9. Foundations and applications of computer based material flow networks for einvironmental management
  10. A localized boundary element method for the floating body problem
  11. Users’ handedness and performance when controlling integrated input devices
  12. OKBQA framework towards an open collaboration for development of natural language question-answering systems over knowledge bases
  13. Mirrored piezo servo hydraulic actuators for use in camless combustion engines and its Control with mirrored inputs and MPC
  14. Analyzing the Influence of Knowledge Graph Information on Relation Extraction.
  15. Input-Output Linearization of a Thermoelectric Cooler for an Ice Clamping System Using a Dual Extended Kalman Filter
  16. A simple nonlinear PD control for faster and high-precision positioning of servomechanisms with actuator saturation
  17. Life satisfaction in Germany after reunification: Additional insights on the pattern of convergence
  18. Differenz, Differenzierung
  19. Bayesian Analysis of Longitudinal Multitrait
  20. Control system strategy of a modular omnidirectional AGV
  21. Modeling of Logistic Processes in Assembly Areas
  22. PID Controller Application in a Gimbal Construction for Camera Stabilization and Tracking
  23. Active learning for network intrusion detection
  24. Embarrassment as a public vs. private emotion and symbolic coping behaviour
  25. Strategies of postural control in static and in dynamic testing situations
  26. Performance of process-based models for simulation of grain N in crop rotations across Europe
  27. Fuzzy based control of a nonholonomic car-like robot for drive assistant systems