Stopping duplicate bug reports before they start with Continuous Querying for bug reports

10.7287/peerj.preprints.2373v1 ◽

2016 ◽

Author(s):

Abram Hindle

Keyword(s):

Information Retrieval ◽

Software Engineering ◽

Search Engines ◽

Indexing Methods ◽

Bug Reports ◽

Bug Report ◽

Engineering Information ◽

String Search ◽

The Creation ◽

Duplicate Bug Reports

Bug deduplication is a hot topic in software engineering information retrieval research, but it is often not deployed. Typically to de-duplicate bug reports developers rely upon the search capabilities of the bug report software they employ, such as Bugzilla, Jira, or Github Issues. These search capabilities range from simple SQL string search to IR-based word indexing methods employed by search engines. Yet too often these searches do very little to stop the creation of duplicate bug reports. Some bug trackers have more than 10\% of their bug reports marked as duplicate. Perhaps these bug tracker search engines are not enough? In this paper we propose a method of attempting to prevent duplicate bug reports before they start: continuous querying. That is as the bug reporter types in their bug report their text is used to query the bug database to find duplicate or related bug reports. This continuous querying allows the reporter to be alerted to duplicate bug reports as they report the bug, rather than formulating queries to find the duplicate bug report. Thus this work ushers in a new way of evaluating bug report deduplication techniques, as well as a new kind of bug deduplication task. We show that simple IR measures show some promise for addressing this problem but also that further research is needed to refine this novel process that is integrate-able into modern bug report systems.

Download Full-text

Detecting duplicate bug reports with software engineering domain knowledge

Journal of Software Evolution and Process ◽

10.1002/smr.1821 ◽

2016 ◽

Vol 29 (3) ◽

pp. e1821 ◽

Cited By ~ 17

Author(s):

Karan Aggarwal ◽

Finbarr Timbers ◽

Tanner Rutgers ◽

Abram Hindle ◽

Eleni Stroulia ◽

...

Keyword(s):

Software Engineering ◽

Domain Knowledge ◽

Bug Reports ◽

Duplicate Bug Reports

Download Full-text

Detecting duplicate bug reports with software engineering domain knowledge

2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER) ◽

10.1109/saner.2015.7081831 ◽

2015 ◽

Cited By ~ 19

Author(s):

Karan Aggarwal ◽

Tanner Rutgers ◽

Finbarr Timbers ◽

Abram Hindle ◽

Russ Greiner ◽

...

Keyword(s):

Software Engineering ◽

Domain Knowledge ◽

Bug Reports ◽

Duplicate Bug Reports

Download Full-text

Improvement in bug localization based on kernel extreme learning machine

Journal of Communications Technology Electronics and Computer Science ◽

10.22385/jctecs.v5i0.77 ◽

2016 ◽

Vol 5 ◽

pp. 1

Author(s):

Marzie Rahmati ◽

Mohammad Ali Zare Chahooki

Keyword(s):

Machine Learning ◽

Information Retrieval ◽

Extreme Learning Machine ◽

Bug Localization ◽

Learning Methods ◽

Bug Reports ◽

Bug Report ◽

Mozilla Firefox ◽

Learning Machine ◽

Information Retrieval Methods

Bug localization uses bug reports received from users, developers and testers to locate buggy files. Since finding a buggy file among thousands of files is time consuming and tedious for developers, various methods based on information retrieval is suggested to automate this process. In addition to information retrieval methods for error localization, machine learning methods are used too. Machine learning-based approach, improves methods of describing bug report and program code by representing them in feature vectors. Learning hypothesis on Extreme Learning Machine (ELM) has been recently effective in many areas. This paper shows effectiveness of none-linear kernel of ELM in bug localization. Furthermore the effectiveness of Different kernels in ELM compare to other kernel-based learning methods is analyzed. The experimental results for hypothesis evaluation on Mozilla Firefox dataset show effectiveness of Kernel ELM for bug localization in software projects.

Download Full-text

Pengembangan Prototipe Piranti Lunak Sistem Informasi Manajemen Kegiatan Perekayasa Dengan Microsoft Excel

Jurnal ULTIMA InfoSys ◽

10.31937/si.v5i2.265 ◽

2014 ◽

Vol 5 (2) ◽

pp. 54-60

Author(s):

Ivransa Zuhdi Pane

Keyword(s):

Information System ◽

Software Engineering ◽

Management Information ◽

Microsoft Excel ◽

Development Stages ◽

Engineering Activity ◽

Engineering Information ◽

Analysis Design ◽

Index Terms ◽

Further Development

Management of engineering activities based on information systems is expected to increase Engineer’s perfomances in executing the daily tasks. The software of such management information system should be built on the platform which is easy to use and adaptable to the dynamics of engineering activity management in the future. Software engineering, consisting of analysis, design and implementation, was carried out to realize a prototype which is ready to be applied in the further development stages. Index Terms - engineering activity, Engineering, information system, software engineering.

Download Full-text

Systems and software engineering. Information technology project performance benchmarking framework

10.3403/30251290 ◽

2013 ◽

Keyword(s):

Information Technology ◽

Software Engineering ◽

Project Performance ◽

Performance Benchmarking ◽

Technology Project ◽

Information Technology Project ◽

Engineering Information

Download Full-text

Systems and software engineering. Information technology project performance benchmarking framework

10.3403/30281278 ◽

2015 ◽

Keyword(s):

Information Technology ◽

Software Engineering ◽

Project Performance ◽

Performance Benchmarking ◽

Technology Project ◽

Information Technology Project ◽

Engineering Information

Download Full-text

Fast Duplicate Bug Reports Detector Training using Sampling for Dimension Reduction: Using Instance-based Learning for Continous Query in Real-World

2020 11th International Conference on Information and Knowledge Technology (IKT) ◽

10.1109/ikt51791.2020.9345611 ◽

2020 ◽

Author(s):

Behzad Soleimani Neysiani ◽

Saeed Doostali ◽

Seyed Morteza Babamir ◽

Zahra Aminoroaya

Keyword(s):

Dimension Reduction ◽

Real World ◽

Bug Reports ◽

Instance Based Learning ◽

Duplicate Bug Reports

Download Full-text

Augmenting Bug Localization with Part-of-Speech and Invocation

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194017500346 ◽

2017 ◽

Vol 27 (06) ◽

pp. 925-949 ◽

Cited By ~ 5

Author(s):

Yu Zhou ◽

Yanxiang Tong ◽

Taolue Chen ◽

Jin Han

Keyword(s):

Software Maintenance ◽

Large Scale ◽

Bug Localization ◽

Bug Reports ◽

Part Of Speech ◽

Adaptive Technique ◽

Bug Report ◽

Software Maintenance And Evolution ◽

Speech Features ◽

Localization Approach

Bug localization represents one of the most expensive, as well as time-consuming, activities during software maintenance and evolution. To alleviate the workload of developers, numerous methods have been proposed to automate this process and narrow down the scope of reviewing buggy files. In this paper, we present a novel buggy source-file localization approach, using the information from both the bug reports and the source files. We leverage the part-of-speech features of bug reports and the invocation relationship among source files. We also integrate an adaptive technique to further optimize the performance of the approach. The adaptive technique discriminates Top 1 and Top N recommendations for a given bug report and consists of two modules. One module is to maximize the accuracy of the first recommended file, and the other one aims at improving the accuracy of the fixed defect file list. We evaluate our approach on six large-scale open source projects, i.e. ASpectJ, Eclipse, SWT, Zxing, Birt and Tomcat. Compared to the previous work, empirical results show that our approach can improve the overall prediction performance in all of these cases. Particularly, in terms of the Top 1 recommendation accuracy, our approach achieves an enhancement from 22.73% to 39.86% for ASpectJ, from 24.36% to 30.76% for Eclipse, from 31.63% to 46.94% for SWT, from 40% to 55% for ZXing, from 7.97% to 21.99% for Birt, and from 33.37% to 38.90% for Tomcat.

Download Full-text

Mining Bug Report Repositories to Identify Significant Information for Software Bug Fixing

Applied Science and Engineering Progress ◽

10.14416/j.asep.2021.03.005 ◽

2021 ◽

Author(s):

Bancha Luaphol ◽

Jantima Polpinij ◽

Manasawee Kaenampornpan

Keyword(s):

The Other ◽

Problem Domain ◽

Significant Information ◽

Bug Reports ◽

Bug Fixing ◽

Classification Technique ◽

Bug Report ◽

Multiple Issues ◽

Improved Accuracy ◽

Software Bug

Most studies relating to bug reports aims to automatically identify necessary information from bug reports for software bug fixing. Unfortunately, the study of bug reports focuses only on one issue, but more complete and comprehensive software bug fixing would be facilitated by assessing multiple issues concurrently. This becomes a challenge in this study, where it aims to present a method of identifying bug reports at severe level from a bug report repository, together with assembling their related bug reports to visualize the overall picture of a software problem domain. The proposed method is called “mining bug report repositories”. Two techniques of text mining are applied as the main mechanisms in this method. First, classification is applied for identifying severe bug reports, called “bug severity classification”, while “threshold-based similarity analysis” is then applied to assemble bug reports that are related to a bug report at severe level. Our datasets are from three opensource namely SeaMonkey, Firefox, and Core:Layout downloaded from the Bugzilla. Finally, the best models from the proposed method are selected and compared with two baseline methods. For identifying severe bug reports using classification technique, the results show that our method improved accuracy, F1, and AUC scores over the baseline by 11.39, 11.63, and 19% respectively. Meanwhile, for assembling related bug reports using threshold-based similarity technique, the results show that our method improved precision, and likelihood scores over the other baseline by 15.76, and 9.14% respectively. This demonstrate that our proposed method may help increasing chance to fix bugs completely.

Download Full-text