A Soft Alignment Model for Bug Deduplication

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Bug tracking systems (BTS) are widely used in software projects. An important task in such systems consists of identifying duplicate bug reports, i.e., distinct reports related to the same software issue. For several reasons, reporting bugs that have already been reported is quite frequent, making their manual triage impractical in large BTSs. In this paper, we present a novel deep learning network based on soft-attention alignment to improve duplicate bug report detection. For a given pair of possibly duplicate reports, the attention mechanism computes interdependent representations for each report, which is more powerful than previous approaches. We evaluate our model on four well-known datasets derived from BTSs of four popular open-source projects. Our evaluation is based on a ranking-based metric, which is more realistic than decision-making metrics used in many previous works. Achieved results demonstrate that our model outperforms state-of-the-art systems and strong baselines in different scenarios. Finally, an ablation study is performed to confirm that the proposed architecture improves the duplicate bug reports detection.

Original languageEnglish
Title of host publication2020 IEEE/ACM 17th International Conference on Mining Software Repositories : MSR 2020, Proceedings; Seoul, Republic of Korea 29-30 June 2020
Number of pages11
Place of PublicationNew York
PublisherAssociation for Computing Machinery, Inc
Publication date29.06.2020
Pages43-53
ISBN (electronic)978-1-4503-7957-1
DOIs
Publication statusPublished - 29.06.2020
Externally publishedYes
Event17th IEEE/ACM International Conference on Mining Software Repositories, MSR 2020, co-located with the 42nd International Conference on Software Engineering. ICSE 2020 - Virtual, Online, Korea, Republic of
Duration: 29.06.202030.06.2020

    Research areas

  • Attention Mechanism, Bug Tracking Systems, Deep Learning, Duplicate Bug Report Detection
  • Business informatics

DOI

Recently viewed

Publications

  1. Development and comparison of processing maps of Mg-3Sn-1Ca alloy from data obtained in tension versus compression
  2. Discriminative clustering for market segmentation
  3. Kommentar zu Ute Tellmann
  4. Digital Seriality as Structure and Process
  5. Implementing the Kyoto Protocol without Russia
  6. Online-scheduling using past and real-time data
  7. Topic selection and development in learner-native speaker voice-based telecollaborative discourse
  8. Are Acute Effects of Foam-Rolling Attributed to Dynamic Warm Up Effects? A Comparative Study
  9. Hacking the Classroom
  10. Design of an Information-Based Distributed Production Planning System
  11. How people explain their own and others’ behavior:
  12. Tree diversity and mycorrhizal type co-determine multitrophic ecosystem functions
  13. Using Long-Duration Static Stretch Training to Counteract Strength and Flexibility Deficits in Moderately Trained Participants
  14. Development and Validation of a Us and German Short Version of the Later Life Workplace Index (llwi- S)
  15. Internet and computer based interventions for cannabis use
  16. Lessons from modeling 100% renewable scenarios using GENeSYS-MOD
  17. Res Lunae: Characterizing Diverse Lunar Resource Systems Using the Social-Ecological System Framework
  18. A geometric approach to the decoupling control and to speed up the dynamics of a general rigid body manipulation system
  19. Natural enemy diversity reduces temporal variability in wasp but not bee parasitism
  20. A comprehensive Eulerian modeling framework for airborne mercury species
  21. Continental mapping of forest ecosystem functions reveals a high but unrealised potential for forest multifunctionality.
  22. Building trust