Bug reduction software data technology
ABSTRACT
¢ Software
companies spend over 45 percent of cost in dealing with software bugs. An
inevitable step of fixing bugs is bug triage, which aims to correctly assign a
developer to a new bug.
¢ To
decrease the time cost in manual work, text classification techniques are
applied to conduct automatic bug triage. In this paper, we address the problem
of data reduction for bug triage, i.e., how to reduce the scale and improve the
quality of bug data.
¢ We
combine instance selection with feature selection to simultaneously reduce data
scale on the bug dimension and the word dimension. To determine the order of
applying instance selection and feature selection, we extract attributes from
historical bug data sets and build a predictive model for a new bug data set.
¢ We empirically investigate the performance of
data reduction on totally 600,000 bug reports of two large open source
projects, namely Eclipse and Mozilla.
¢ The results show that our data reduction can
effectively reduce the data scale and improve the accuracy of bug triage. Our
work provides an approach to leveraging techniques on data processing to form
reduced and high-quality bug data in software development and maintenance.
EXISTING SYSTEM
¢ To
investigate the relationships in bug data, Sandusky et al. form a bug report
network to examine the dependency among bug reports.
¢ Besides
studying relationships among bug reports, Hong et al. build a developer social
network to examine the collaboration among developers based on the bug data in
Mozilla project. This developer social network is helpful to understand the
developer community and the project evolution.
¢ By
mapping bug priorities to developers, Xuan et al. identify the developer
prioritization in open source bug repositories. The developer prioritization
can distinguish developers and assist tasks in software maintenance.
¢ To
investigate the quality of bug data, Zimmermann et al. design questionnaires to
developers and users in three open source projects. Based on the analysis of
questionnaires, they characterize what makes a good bug report and train a
classifier to identify whether the quality of a bug report should be
improved.
¢ Duplicate
bug reports weaken the quality of bug data by delaying the cost of handling
bugs. To detect duplicate bug reports, Wang et al. design a natural language
processing approach by matching the execution information.
DISADVANTAGES OF EXISTING SYSTEM
¢ Traditional
software analysis is not completely suitable for the large-scale and complex
data in software repositories.
¢ In
traditional software development, new bugs are manually triaged by an expert
developer, i.e., a human triager. Due to the large number of daily bugs and the
lack of expertise of all the bugs, manual bug triage is expensive in time cost
and low in accuracy.
PROPOSED SYSTEM
¢ In
this paper, we address the problem of data reduction for bug triage, i.e., how
to reduce the bug data to save the labor cost of developers and improve the
quality to facilitate the process of bug triage.
¢ Data
reduction for bug triage aims to build a small-scale and high-quality set of
bug data by removing bug reports and words, which are redundant or
non-informative.
¢ In
our work, we combine existing techniques of instance selection and feature
selection to simultaneously reduce the bug dimension and the word dimension.
The reduced bug data contain fewer bug reports and fewer words than the
original bug data and provide similar information over the original bug data.
We evaluate the reduced bug data according to two criteria: the scale of a data
set and the accuracy of bug triage.
¢ In
this paper, we propose a predictive model to determine the order of applying
instance selection and feature selection. We refer to such determination as
prediction for reduction orders.
¢ Drawn
on the experiences in software metrics,1 we extract the attributes from
historical bug data sets. Then, we train a binary classifier on bug data sets
with extracted attributes and predict the order of applying instance selection
and feature selection for a new bug data set.
ADVANTAGES OF PROPOSED SYSTEM
¢ Experimental
results show that applying the instance selection technique to the data set can
reduce bug reports but the accuracy of bug triage may be decreased.
¢ Applying
the feature selection technique can reduce words in the bug data and the
accuracy can be increased.
¢ Meanwhile,
combining both techniques can increase the accuracy, as well as reduce bug
reports and words.
¢ Based
on the attributes from historical bug data sets, our predictive model can
provide the accuracy of 71.8 percent for predicting the reduction order.
¢ We
present the problem of data reduction for bug triage. This problem aims to
augment the data set of bug triage in two aspects, namely a) to simultaneously
reduce the scales of the bug dimension and the word dimension and b) to improve
the accuracy of bug triage.
¢ We
propose a combination approach to addressing the problem of data reduction.
This can be viewed as an application of instance selection and feature
selection in bug repositories.
¢ We
build a binary classifier to predict the order of applying instance selection
and feature selection. To our knowledge, the order of applying instance
selection and feature selection has not been investigated in related domains.
HARDWARE REQUIREMENTS
¢ System
: Pentium IV 2.4 GHz.
¢ Hard Disk :
40 GB.
¢ Floppy Drive : 1.44 Mb.
¢ Monitor :
15 VGA Colour.
¢ Ram : 512 Mb.
SOFTWARE REQUIREMENTS
Front End: HTML5, CSS3,
Bootstrap
Back End: PHP, MYSQL
Control End: Angular
Java Script
Tool: Android SDK,
Xampp, Eclipse
project-center-salem-erode-namakal-tiruchengode-karur-gandhipuram
project-center-mannargudi-pattukkottai
project-center-ambattur-avadi-ashokpillar-adyar-ekkaduthangal
project-center-bangalore-chennai-trivandrum
project-center-bhubaneswar-belgum-bhopal
project-center-chidambaram-mayiladuthurai-nagapattinam-cuddalore
project-center-coimbatore-chennai-salem-madurai-erode-trichy-tirunelveli-pondicherry
project-center-delhi-mumbai-hyderabad-visakhapatnam
project-center-dharmapuri-hosur-krishnagiri
project-center-dindigul-palani-rasipuram
project-center-tirunelveli-tiruchendur-nagercoil-virudhunagar-rajapalayam
project-center-tnagar-tambaram-nungambakkam-velachery
project-center-trivandrum-ernakulam
project-center-in-chennai
Android Project Titles 2017-2018
Android Project Titles 2017-2018
Comments
Post a Comment