A Soft Alignment Model for Bug Deduplication

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Bug tracking systems (BTS) are widely used in software projects. An important task in such systems consists of identifying duplicate bug reports, i.e., distinct reports related to the same software issue. For several reasons, reporting bugs that have already been reported is quite frequent, making their manual triage impractical in large BTSs. In this paper, we present a novel deep learning network based on soft-attention alignment to improve duplicate bug report detection. For a given pair of possibly duplicate reports, the attention mechanism computes interdependent representations for each report, which is more powerful than previous approaches. We evaluate our model on four well-known datasets derived from BTSs of four popular open-source projects. Our evaluation is based on a ranking-based metric, which is more realistic than decision-making metrics used in many previous works. Achieved results demonstrate that our model outperforms state-of-the-art systems and strong baselines in different scenarios. Finally, an ablation study is performed to confirm that the proposed architecture improves the duplicate bug reports detection.

Original languageEnglish
Title of host publication2020 IEEE/ACM 17th International Conference on Mining Software Repositories : MSR 2020, Proceedings; Seoul, Republic of Korea 29-30 June 2020
Number of pages11
Place of PublicationNew York
PublisherAssociation for Computing Machinery, Inc
Publication date29.06.2020
Pages43-53
ISBN (electronic)978-1-4503-7957-1
DOIs
Publication statusPublished - 29.06.2020
Externally publishedYes
Event17th IEEE/ACM International Conference on Mining Software Repositories, MSR 2020, co-located with the 42nd International Conference on Software Engineering. ICSE 2020 - Virtual, Online, Korea, Republic of
Duration: 29.06.202030.06.2020

    Research areas

  • Attention Mechanism, Bug Tracking Systems, Deep Learning, Duplicate Bug Report Detection
  • Business informatics

DOI

Recently viewed

Publications

  1. Predicate‐based model of problem‐solving for robotic actions planning
  2. Trajectory tracking using MPC and a velocity observer for flat actuator systems in automotive applications
  3. Deciphering movement and stasis
  4. Using density surface models to assess the ecological effectiveness of a protected area network in Tanzania
  5. Biomedical Entity Linking with Triple-aware Pre-Training
  6. DISKNET – A Platform for the Systematic Accumulation of Knowledge in IS Research
  7. The frame of the game
  8. Implementing UNESCO's Convention on Cultural Diversity at the regional level
  9. Defining the notion of mining, extraction and collection
  10. Integrating Common Ground and Informativeness in Pragmatic Word Learning
  11. Current issues in competence modeling and assessment
  12. An Approach for Ex-Post-Facto Analysis of Knowledge Graph-Driven Chatbots – The DBpedia Chatbot
  13. Covert and overt automatic imitation are correlated
  14. Back from the Deep
  15. Material flow analysis for the incremental sheet-bulk gearing by rotating tools
  16. Political discourse in the media
  17. "Doing" Sustainability Assessment in Different Consumption and Production Contexts-Lessons from Case Study Comparison
  18. Zapping-Fernbedienung
  19. From Fleeting Enchantment to Embodied Commitment
  20. Pathways and mechanisms for catalyzing social impact through Orchestration: Insights from an open social innovation project
  21. A New, Rapid, Fully Automated Method for Determination of Fluconazole in Serum by Column-Switching Liquid Chromatography
  22. TextCSN
  23. Land use affects dung beetle communities and their ecosystem service in forests and grasslands
  24. New incremental methods for springback compensation by stress superposition
  25. Existenzgründungen junger Handwerksmeister
  26. Same but different? Measurement invariance of the PIAAC motivation-to-learn scale across key socio-demographic groups
  27. Landscape modification and habitat fragmentation: a synthesis
  28. It is not what it is
  29. Newsfeed clutter as an inhibitor of sensemaking
  30. SMARTPHONE APPS FOR TINNITUS: A REVIEW ON INTERVENTION COMPONENTS AND BEHAVIOR CHANGE TECHNIQUES USED IN TINNITUS APPS
  31. y-Randomization and its variants in QSPR/QSAR
  32. Exports and productivity: A survey of the evidence from firm-level data
  33. Mythos
  34. Exports, R&D and Productivity
  35. Sigrid Kopfermann
  36. Effects of oral corrective feedback on the development of complex morphosyntax
  37. Quality and time-related indicators in inceptive plans
  38. Online to offline social networking
  39. Silver Work
  40. Towards a Real-world Laboratory
  41. Sustainable Statehood: Reflections on Critical (Pre-)Conditions, Requirements and Design Options
  42. Do it again
  43. Prologue: Analyzing the Fine Details of Political Commitment