FaST: A linear time stack trace alignment heuristic for crash report deduplication

Publication: Contributions to collected works / Articles in conference proceedings / Research / Peer-reviewed

Authors

In software projects, applications are often monitored by systems that automatically identify crashes, collect their information into reports, and submit them to developers. Especially in popular applications, such systems tend to generate a large number of crash reports, a significant portion of which are duplicates. Because of this high submission volume, crash report deduplication is in practice supported by automatic systems, for which efficiency is a critical constraint. In this paper, we focus on improving the throughput of deduplication systems by speeding up stack trace comparison. In contrast to state-of-the-art techniques, we propose FaST, a novel sequence alignment method that computes the similarity score between two stack traces in linear time. Our method independently aligns identical frames in two stack traces by means of a simple alignment heuristic. We evaluate FaST and five competing methods on four datasets from open-source projects using ranking and binary metrics. Despite its simplicity, FaST consistently achieves state-of-the-art performance on all metrics considered. Moreover, our experiments confirm that FaST is substantially more efficient than methods based on optimal sequence alignment.
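The idea of aligning identical frames independently, without a quadratic dynamic-programming table, can be illustrated with a minimal Python sketch. This is not the FaST algorithm as published: the occurrence-by-occurrence matching rule, the `position_weight` function, its `alpha` parameter, and the min/max scoring are all assumptions made for illustration. The sketch only shows how a similarity score can be computed in a single pass over each trace.

```python
from collections import defaultdict


def position_weight(pos, alpha=1.0):
    """Hypothetical weight: frames nearer the top of the stack count more."""
    return 1.0 / (1.0 + pos) ** alpha


def linear_alignment_similarity(trace_a, trace_b, alpha=1.0):
    """Illustrative similarity in [0, 1] between two stack traces.

    Each trace is a list of frame names, top of the stack first.
    The k-th occurrence of a frame in trace_a is aligned with the
    k-th occurrence of the same frame in trace_b (an assumed rule).
    """
    # Index the positions of every frame in trace_b: one pass over trace_b.
    positions_b = defaultdict(list)
    for j, frame in enumerate(trace_b):
        positions_b[frame].append(j)

    next_unused = defaultdict(int)  # occurrences of each frame already aligned
    matched_b = set()
    score = 0.0  # mass contributed by aligned frames
    total = 0.0  # normalizer covering aligned and unaligned frames

    # One pass over trace_a; each frame is handled independently of the others.
    for i, frame in enumerate(trace_a):
        w_a = position_weight(i, alpha)
        candidates = positions_b.get(frame, [])
        k = next_unused[frame]
        if k < len(candidates):
            j = candidates[k]
            next_unused[frame] = k + 1
            matched_b.add(j)
            w_b = position_weight(j, alpha)
            # Aligned frames far apart in position contribute less.
            score += min(w_a, w_b)
            total += max(w_a, w_b)
        else:
            total += w_a  # frame of trace_a with no counterpart

    # Frames of trace_b that were never aligned also enter the normalizer.
    for j in range(len(trace_b)):
        if j not in matched_b:
            total += position_weight(j, alpha)

    # By convention in this sketch, two empty traces score 0.0.
    return score / total if total else 0.0
```

With hash-based lookups, the expected cost of this sketch grows with the sum of the two trace lengths, whereas optimal sequence alignment fills a table whose size is their product.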

Original language: English
Title: The 2022 Mining Software Repositories Conference : MSR 2022, Proceedings; 18-20 May 2022, Virtual; 23-24 May 2022, Pittsburgh, Pennsylvania
Number of pages: 12
Place of publication: New York
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 17.10.2022
Pages: 549-560
ISBN (print): 9781665452106
ISBN (electronic): 978-1-4503-9303-4
DOIs
Publication status: Published - 17.10.2022
Event: 19th International Conference on Mining Software Repositories - MSR 2022 - Pittsburgh, United States
Duration: 23.05.2022 - 24.05.2022
Conference number: 19
https://conf.researchr.org/home/msr-2022

Bibliographical note

Publisher Copyright:
© 2022 ACM.
