SlideShare a Scribd company logo
1 of 11
Download to read offline
Scientific Journal Impact Factor (SJIF): 1.711
International Journal of Modern Trends in Engineering
and Research
www.ijmter.com
@IJMTER-2014, All rights Reserved 441
e-ISSN: 2349-9745
p-ISSN: 2393-8161
Software Defect Prediction Using Local and Global Analysis
Ms. N. Narmadha1
, Dr. M. Latha2
, Mr. R. Subramanian3
1
Research Scholar
2
M.Sc., M.Phil., Ph.D, Associate Professor, Department of Computer Science.
1,2
Sri Sarada College for Women, Salem, Tamilnadu, India
3
Erode Arts and Science College, Erode
Abstract -The software defect factors are used to measure the quality of the software. The software
effort estimation is used to measure the effort required for the software development process. The defect
factor makes an impact on the software development effort. Software development and cost factors are
also decided with reference to the defect and effort factors. The software defects are predicted with
reference to the module information. Module link information are used in the effort estimation process.
Data mining techniques are used in the software analysis process. Clustering techniques are used
in the property grouping process. Rule mining methods are used to learn rules from clustered data
values. The “WHERE” clustering scheme and “WHICH” rule mining scheme are used in the defect
prediction and effort estimation process. The system uses the module information for the defect
prediction and effort estimation process.
The proposed system is designed to improve the defect prediction and effort estimation process.
The Single Objective Genetic Algorithm (SOGA) is used in the clustering process. The rule learning
operations are carried out sing the Apriori algorithm. The system improves the cluster accuracy levels.
The defect prediction and effort estimation accuracy is also improved by the system. The system is
developed using the Java language and Oracle relation database environment.
I. INTRODUCTION
Software engineering is the technological and managerial discipline with systematic production
and maintenance of software products that are developed and modified on time and within cost
estimates. Successful software maintenance, like all software activities, requires a combination of
managerial skills and technical expertise. The primary goals of software engineering are to improve the
quality of software products and to increase the productivity and job satisfaction of software engineers.
Software engineering is concerned with development and maintenance of technological products,
problem-solving techniques common to all engineering disciplines
Engineering problem-solving techniques provides the basis for project planning, project
management, systematic analysis, methodical design, careful fabrication, extensive validation and
ongoing maintenance activities. Appropriate notations, tools and techniques are applied in each of these
areas. A fundamental principle of software engineering is to design software products that minimize the
intellectual distance between problem and solution; the variety of approaches to software development is
limited on lay by the creativity and ingenuity of the programmer.
The software industry’s inability to provide accurate estimates of development cost, effort and/or
time is well known. This inability is described in reports from project management consultancy
companies, case studies on project failures, articles in the computer press and estimation surveys. The
common belief seems to be that the cost overruns are very large and few software professionals and
researchers react with disbelief when being presented with the inaccuracy values reported in Standish
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 442
Group’s Chaos Report, i.e., an average cost overrun of 89%. There may be several reasons for this
attitude.
It is difficult to get a balanced view on the software industry’s estimation performance without
unbiased information from a representative set of projects and organizations. The surveys presented in
scientific journals and conferences may be a source for such unbiased information. The system
summarizes estimation results from surveys on software estimation. There has not been conducted any
structured review of the estimation surveys with the aim of summarizing the knowledge of software
effort estimation performance. Besides summarizing results from estimation surveys, the main purposes
of the system is to challenge some common estimation beliefs that may be inaccurate, to discuss
methodical aspects related to completion of industrial surveys and to support future surveys on software
effort estimation.
II. RELATED WORK
The main goal of this section is not to provide a generic review of defect prediction and effort
estimation work, but to highlight work that documents contradictory results which make it difficult to
generalize solutions for defect prediction or effort estimation.
2.1. Effort Estimation Process
Conclusion instability in effort estimation may be a fundamental property of the datasets we are
exploring. For example, tests the stability of Boehm’s COCOMO software development effort
estimation model. In that analysis, 20 times, we learned effort = using a
random 2/3rds sample from 93 NASA projects. Only the coefficient on lines of code (loc) was stable.
The observed ranges on the other βi coefficients were very large. In fact, the signs of five coefficients
even changed from positive to negative. This coefficient instability is particularly troubling because we
know of NASA project managers who have made acquisition decisions worth tens of millions of dollars
based on the COCOMO coefficients.
Other papers also report contradictory findings about effort estimation. Jorgensen reviewed 15
studies comparing model-based to expert-based methods. Five of those studies favored expert-based
methods, five found no difference and five favored model-based methods. Kitchenham et al. reviewed
studies that checked if data imported from other organizations were as useful as local data. From a total
of seven studies, three found that models from other organizations were not significantly worse than
those based on local data, while four found that they were significantly different. MacDonell and
Shepperd also performed a review on the value of local versus global effort estimation models through a
replication. From a total of 10 studies, two were found to be inconclusive, three supported global models
and five supported local models. Similarly, Mair and Shepperd compared regression to analogy methods
for effort estimation and found conflicting evidence. From a total of 20 empirical studies, seven favored
regression, four were indifferent and nine favored analogy.
2.2. Defect Prediction Process
In the area of defect prediction, there are also many contradictory findings. For example,
Zimmermann et al. learned defect predictors from 622 pairs of projects hproject1; project2i. In only 4
percent of pairs did the defect predictors learned in project1 work in project [2]. Similar findings
concern the OO metrics as well. For the manager of a software project is particularly troubling. Each
study makes a clear, but usually different, conclusion. Hence, it is difficult for a manager to make a clear
decision about, for example, the merits of a proposed coding standard, where maximum depth of
inheritance is required to be less than some expert specified threshold.
As to the root cause of the instabilities offer the following conjecture. The models learned from
different regions within effort data can have very different properties. If defect data were as varied as
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 443
effort data, then we would naturally expect that different samples of different projects would yield
different models due to dataset shift [4]. If this conjecture is correct, then we would expect that clusters
within the data should produce different models. This paper is a test of that conjecture. In short, we
show that different regions of the data generate different models. Further, the models built from
specialized regions within the dataset perform better than those learned across all data.
III. LOCAL AND GLOBAL ANALYSIS ON SOFTWARE COMPONENTS
Process and product data are used in software engineering (SE) to support a variety of tasks, such
as defect prediction, effort estimation, refactoring of source code, determination of the social networks
of programmers, learning the expected interface between modules and so on [3]. Two questions are at
the center of much of the research in the field: 1) What data are best suited to support specific tasks? and
2) what is the best way to reason about SE data? This paper explores the latter question, in the context
of: software effort reduction: finding rules for reducing a project’s development time;
software defect reduction: finding rules for reducing a project’s defect count.
Our focus in this paper is not on what data are used for building models for defect prediction or
effort estimation, but rather on the source of the data and implicitly, the applicability of the lessons
derived by the models. Software data come from some sources. These data show the defects or effort
associated with examples from that source. How should we reason about these data? On this point, the
literature is contradictory. Some existing work argues that data from multiple sources can generate rules
that apply in any context. We call these global lessons. On the other hand, other papers indicate that the
best lessons are learned from within one source, which implies that these lessons are only useful in their
context [5], [6], [7]. We call these local lessons.
When project managers want to make changes in their projects to minimize the development
effort or the rate of defects, they are faced with two options: 1) make changes based on global lessons
available from existing data, or 2) mine data about the current project and infer local lessons. The
dilemma of the manager is obvious. In the first case, expensive changes may be undertaken without
reaching the desired goal if the global lessons prove to be wrong for this context. In the second case, an
upfront investment is needed to collect and analyze data and to generate the local lessons, which may be
unnecessary if the global lessons apply to this context. This paper addresses this dilemma and reports on
experiments where:
Data from different sources are combined.
Within that combination, automatic tools find clusters of related examples. Note that clusters may
contain examples from multiple sources.
Data mining is then used to learn lessons from the examples in each cluster. The generated lessons
are compared.
Before this experiment, our previous work indicated that local lessons seem to be best for defect
prediction and effort estimation [8]. The surprising result of the experiment presented in this paper is
that the best lessons for a project from one source come from neighboring clusters with data from nearby
sources, but not inside that source. We will call such lessons neighbor lessons.
We conclude that when project managers attempt to draw lessons from some historical data
source, they should: 1) ignore any existing local divisions into multiple sources, 2) cluster across all
available data, then 3) restrict the learning of lessons to the clusters that are nearest to the test data.
While the paper focuses on defect prediction and effort estimation data, we believe that our conclusions
can translate to other types of data, related to different SE tasks. This paper extends a prior publication
[1] in the following way:
That paper only explored four datasets. Here, we explore over twice that number of datasets.
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 444
That prior publication only explored local versus global lessons and found evidence that
supported local lessons over the global ones. In this paper, explore clusters from multiple sources.
IV. PROBLEM STATEMENT
The software analysis system generates lessons learned for defect prediction and effort
estimation. Lessons are global to multiple projects or just local to particular projects. The system
comparatively evaluates local versus global lessons learned for effort estimation and defect prediction.
The system applies automated clustering tools to effort and defect datasets from the PROMISE
repository. Rule learners generated lessons learned from all the data, from local projects, or just from
each cluster. The lessons learned after combining small parts of different data sources were superior to
either generalizations formed over all the data or local lessons formed from particular projects. The
following problems are identified from the software effort estimation and defect prediction methods.
• The clustering system produces low accuracy in partitioning process.
• The rule learning system handles minimum of set of rules only.
• Defect prediction and effort estimation accuracy is low.
• Software module relationships are not fully focused in the defect and effort analysis.
V. GENETIC ALGORITHM BASED SOFTWARE ANALYSIS
The software defect prediction process is designed to identify the defects that are raised in the
software system. The effort estimation mechanism is applied to estimate the effort needed for the system
development process. The main goal of the system is not to provide a generic review of defect prediction
and effort estimation work, but to highlight work that documents contradictory results which make it
difficult to generalize solutions for defect prediction or effort estimation. In the area of defect prediction,
there are also many contradictory findings. Defect predictors from 622 pairs of projects hproject1;
project. In only 4 percent of pairs did the defect predictors learned in project work in project. Similar
findings concern the OO metrics as well. Lists 28 studies that offer contradictory conclusions regarding
the effectiveness of OO metrics for predicting defects.
Conclusion instability in effort estimation may be a fundamental property of the datasets. Only
the coefficient on lines of code was stable. The observed ranges on the other βi coefficients were very
large. In fact, the signs of five coefficients even changed from positive to negative. This coefficient
instability is particularly troubling. NASA project managers who have made acquisition decisions worth
tens of millions of dollars based on the COCOMO coefficients. The system is designed to perform
defect prediction and effort estimation process for the software systems [9]. The clustering and rule
learning methods are used to identify the defects and efforts. The Single Objective Genetic Algorithm
(SOGA) is used for the clustering process. The rules are learned using the Apriori algorithm. The system
improves the accuracy in clustering process. Defect prediction and effort estimation operations are also
improved with internal relationship of the software components.
5.1. Software Module Clustering
Existing approaches to the twin objectives of high cohesion and low coupling have combined
these two objectives into a single-objective function, with all of the drawbacks. Pareto optimality is an
alternative approach to handle multiple objectives that retains the character of the problem as a multi-
objective problem. Using Pareto optimality, it is not possible to measure “how much” better one solution
is than another, merely to determine whether one solution is better than another. In this way, Pareto
optimality combines a set of measurements into a single ordinal scale metric.
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 445
A Pareto optimal search yields a set of solutions that are mutually no dominating and which form
an approximation to the Pareto front. The Pareto front is the set of elements that are not dominated by
any possible element of the solution space. The Pareto front thus denotes the best results achievable; it is
the equivalent of the set of globally optimal points in a single-objective search. As with the single-
objective formulation, it is not possible to guarantee to locate this globally optimal solution set, merely
to attempt to approximate it as closely as possible. Each set of objectives leads to a different multi-
objective formulation of the problem. Two sets of objectives will be considered in the system. They are
the Maximizing Cluster Approach and the Equal-size Cluster Approach.
5.2. Module Clustering with GA
Genetic algorithms use concepts of population and of recombination inspired by Darwinian
Evolution A generic genetic algorithm is presented. An iterative process is executed, initialized by a
randomly chosen population. The iterations are called generations and the members of the population
are called chromosomes because of their analogs in natural evolution. The process terminates when a
population satisfies some predetermined condition.
Set generation number, m: = 0
Choose the initial population of candidate
Solution ,P(0)
Evaluate the fitness for each individual of
P(0),F(Pi(0))
loop
Recombine: P(M): = (P(m))
Mutate : P(m) : = M(P(m))
Evaluate : F(P(m)
Select : P(m+1) : = S(P(m))
m : = m +1
exit when goal or stopping
condition is satisfied
end loop;
A Generic Genetic Algorithm.
VI. LOCAL AND GLOBAL ANALYSIS MODEL FOR SOFTWARE DEFECT PREDICTION
The software defect prediction and effort estimation process is designed to measure the defect
and effort values from the software properties. The clustering and rule learning mechanism is used for
the system. The Single Objective Genetic Algorithm (SOGA) is used for the clustering process. The
Apriori algorithm is used for the learning process. The system divided into four major modules. They
are product data analysis, criteria property extraction, single objective clustering and defect prediction
and effort estimation process. The modules and their link information are extracted from the product
data analysis module. Criteria property extraction module is designed to identify the module property
values. Single objective clustering is performed with only one objective functions. The defect prediction
and effort estimation operations are carried out under the defect and effort estimation module.
6.1. Product Data Analysis
The product information is provided in module dependency graphs. Module link information is
provided under Module Dependency Graph (MDG) files. The MDG files are used as the input for the
system. Modules, links and weight information are extracted from the product information.
6.2. Criteria Property Extraction
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 446
The modules and their links are used for the criteria property extraction process. Communication
bandwidth is estimated with module link information. Each module is analyzed with it’s in and out link
information. The module length and feature location are also identified.
6.3. Single Objective Clustering
The genetic algorithm is used for the clustering process. Single objective clustering model is
used in the system. The objective function uses the module property values. Only one criteria is
considered in the clustering process.
6.4. Defect Prediction and Effort Estimation
The defect prediction and effort estimation module is divided into two sub modules. They are
defect prediction sub module and effort estimation sub module. The software defects are predicted with
reference to the internal relationship values. The effort estimation is carried out with the support of the
software modules and their length values. The system produces the predicted values and actual values
for each modules. The rule learning mechanism is used in the defect prediction and effort estimation
process.
VII. PERFORMANCE ANALYSIS
The software defect prediction and effort estimation system is implemented and tested with a set
of benchmark datasets. The Genetic Algorithm and Variational clustering (VC) schemes are used in the
testing process. The VC scheme is composed with “WHICH” and “WHERE” techniques. Three
performance metrics are used to evaluate the quality of the system. They are entropy, separation index
and prediction accuracy. The entropy and separation index measures are used to evaluate the quality of
the clustering process. Defect prediction and effort estimation process results are analyzed using the
prediction accuracy measure.
7.1. Entropy
Entropy reflects the homogeneity of a set of objects and thus can be used to indicate the
homogeneity of a cluster. This is referred to cluster entropy. Lower cluster entropy indicates more
homogeneous clusters. On the other hand, we can also measure the entropy of a prelabeled class of
objects, which indicates the homogeneity of a class with respect to the generated clusters. The less
fragmented a class across clusters, the higher its entropy and vice versa. This is referred to as class
entropy.
The entropy measurement is used to estimate the quality of the clusters. The cluster entropy and
overall entropy are calculated for different data intervals and MPC based models. The entropy values
for VC Scheme and GA Scheme models are produced in figure 7.1. The comparison result shows that
the GA Scheme outperforms than the VC Scheme. The GA scheme improves the cluster accuracy 5%
than the VC scheme.
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 447
Figure No: 7.1. Entropy Analysis of VC Scheme and GA Scheme
7.2. Separation Index
Separation Index (SI) is another cluster validity measure that utilizes cluster centroids to measure
the distance between clusters, as well as between points in a cluster to their respective cluster centroid. It
is defined as the ratio of average within-cluster variance to the square of the minimum pairwise distance
between clusters:
distD
N
i cixj ij
srNsrD
NC
i cixj ij
N
mxdist
mmdistN
mxdist
SI
c
c
2
min
1
2
2
,1
1
2
.
),(
)},(min
),(
∑ ∑
∑ ∑
= ∈
≤≤
= ∈
=
=
where mi is the centroid of cluster ci and distmin is
the minimum pairwise distance between cluster centroids. Clustering solutions with more compact
clusters and larger separation have lower Separation Index, thus higher values indicate better solutions.
This index is more computationally efficient than other validity indices, such as Dunn’s index, which is
also used to validate clusters that are compact and well separated. In addition, it is less sensitive to noisy
data.
The separation index measure is used to analyze the cluster intervals and transaction interval
values. The separation index analysis is shown in figure 7.2. The analysis result shows that the GA
Scheme improves the cluster quality 20% than the VC Scheme technique.
1.802
1.586
2.017
1.779 1.746
1.941
1.773
2.187 2.102 2.079
0
0.5
1
1.5
2
2.5
20 40 60 80 100
Entropy
Modules
Entropy AnalysisEntropy AnalysisEntropy AnalysisEntropy Analysis
between VC Scheme andbetween VC Scheme andbetween VC Scheme andbetween VC Scheme and
GA SchemeGA SchemeGA SchemeGA Scheme
VC
Scheme
GA
Scheme
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 448
Figure 7.2: Separation Index Analysis between VC Scheme and GA Scheme
7.3. Prediction Accuracy
The prediction accuracy measure is used to evaluate the quality of the rule learning process with
defect prediction and effort estimation process. The defect prediction and effort estimation results are
compared with the predicted values and evaluated values. The prediction accuracy analysis is shown in
figure 7.3. The GA scheme improves the prediction accuracy 20% than the VC scheme. The system
produces better cluster results with effective rule learning results.
Figure 7.3: Prediction Accuracy Analysis between VC Scheme and GA Scheme
VII. CONCLUSION AND FUTURE WORK
The software defect prediction and effort estimation operations are performed in the system with
the support of clustering and rule learning methods. The system has discussed ways to collect training
data for learning lessons from SE projects and to generate rules for reducing the number of defects or the
development effort. At issue here is how the chief information officers should set policies for their
projects. Should they demand that all their projects rigorously apply all the advice from the standard SE
textbooks.
The software defect prediction and effort estimation system is implemented to analyze the
software products to measure the defect and effort factors. The Genetic Algorithm based clustering
0.768
1.059
1.296 1.372
1.483
0.526
0.725
0.921 0.986 1.057
0
0.5
1
1.5
2
20 40 60 80 100
SeparationIndex
Modules
Separation Index Analysis
between VC Scheme and GA
Scheme
VC Scheme
GA Scheme
61.73 64.05 66.75 69.23 71.68
82.47 85.14 87.98 91.23 94.56
0
20
40
60
80
100
20 40 60 80 100
PredictionAccuracy(%)
Modules
Prediction Accuracy
Analysis between VC
Scheme and GA Scheme
VC Scheme
GA Scheme
International Journal of Modern Trends in Engineering and Research (IJMTER)
Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161
@IJMTER-2014, All rights Reserved 449
scheme is used for the data clustering process. The Apriori algorithm is used to carry out the rule
learning process. The system improves the cluster accuracy and prediction accuracy levels. The system
can be enhanced with the following features.
• The system can be enhanced to predict the software risk levels using design information.
• The system can be enhanced with Ant Colony Optimization (ACO) and Particle Swarm
Optimization (PSO) techniques in the prediction process.
• The system can be improved to measure the relationship level between the local and global rules.
• The system can be enhanced to perform the prediction process using the software and source code
analysis mechanism.
REFERENCES
[1] T. Menzies, A. Butcher and D. Cok, “Local vs Global Models for Effort Estimation and Defect Prediction,” Proc. 26th
IEEE/ACM Int’l Conf. Automated Software Eng., 2011.
[2] Martin Shepperd, David Bowes and Tracy Hall, “Researcher Bias: The Use of Machine Learning in Software Defect
Prediction”, IEEE Transactions On Software Engineering, Vol. 40, No. 6, June 2014.
[3] Saehwa Kim, “Synthesizing Multithreaded Code from Real-Time Object-Oriented Models via Schedulability-Aware
Thread Derivation”, IEEE Transactions On Software Engineering, Vol. 40, No. 4, April 2014.
[4] B. Turhan, “On the Dataset Shift Problem in Software Engineering Prediction Models,” Empirical Software Eng., vol. 17,
pp. 62-74, 2012.
[5] D. Posnett, V. Filkov and P. Devanbu, “Ecological Inference in Empirical Software Engineering,” Proc. IEEE/ACM Int’l
Conf. Automated Software Eng., 2011.
[6] N. Bettenburg, M. Nagappan and A.E. Hassan, “Think Locally, Act Globally: Improving Defect and Effort Prediction
Models,” Proc. Ninth IEEE Working Conf. Mining Software Repositories, pp. 60-69, 2012.
[7] Y. Yang, L. Xie, Z. He, Q and R. Valerdi, “Local Bias and Its Impacts on the Performance of Parametric Estimation
Models,” Proc. Seventh Int’l Conf. Predictive Models in Software Eng., 2011.
[8] Tim Menzies, Andrew Butcher, David Cok, Andrian Marcus, Lucas Layman, Forrest Shull, Burak Turhan and Thomas
Zimmermann, “Local versus Global Lessons for Defect Prediction and Effort Estimation”, IEEE Transactions On Software
Engineering, Vol. 39, No. 6, June 2013.
[9] Christian Kastner, Alexander Dreiling and Klaus Ostermann, “Variability Mining: Consistent Semi-automatic Detection
of Product-Line Features”, IEEE Transactions On Software Engineering, Vol. 40, No. 1, January 2014.
Software Defect Prediction Using Local and Global Analysis
Software Defect Prediction Using Local and Global Analysis

More Related Content

What's hot

Development of software defect prediction system using artificial neural network
Development of software defect prediction system using artificial neural networkDevelopment of software defect prediction system using artificial neural network
Development of software defect prediction system using artificial neural network
IJAAS Team
 
Review on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
Review on Algorithmic and Non Algorithmic Software Cost Estimation TechniquesReview on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
Review on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
ijtsrd
 
PM2 ARTICALS
PM2 ARTICALSPM2 ARTICALS
PM2 ARTICALS
ra na
 
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
IJCSES Journal
 
Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986
Editor IJARCET
 
Abstract.doc
Abstract.docAbstract.doc
Abstract.doc
butest
 
Testing and test case generation by using fuzzy logic and n
Testing and test case generation by using fuzzy logic and nTesting and test case generation by using fuzzy logic and n
Testing and test case generation by using fuzzy logic and n
IAEME Publication
 

What's hot (18)

Development of software defect prediction system using artificial neural network
Development of software defect prediction system using artificial neural networkDevelopment of software defect prediction system using artificial neural network
Development of software defect prediction system using artificial neural network
 
Review on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
Review on Algorithmic and Non Algorithmic Software Cost Estimation TechniquesReview on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
Review on Algorithmic and Non Algorithmic Software Cost Estimation Techniques
 
PM2 ARTICALS
PM2 ARTICALSPM2 ARTICALS
PM2 ARTICALS
 
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
STATE-OF-THE-ART IN EMPIRICAL VALIDATION OF SOFTWARE METRICS FOR FAULT PRONEN...
 
The Impact of Software Complexity on Cost and Quality - A Comparative Analysi...
The Impact of Software Complexity on Cost and Quality - A Comparative Analysi...The Impact of Software Complexity on Cost and Quality - A Comparative Analysi...
The Impact of Software Complexity on Cost and Quality - A Comparative Analysi...
 
Kumar2021
Kumar2021Kumar2021
Kumar2021
 
Transitioning IT Projects to Operations Effectively in Public Sector : A Case...
Transitioning IT Projects to Operations Effectively in Public Sector : A Case...Transitioning IT Projects to Operations Effectively in Public Sector : A Case...
Transitioning IT Projects to Operations Effectively in Public Sector : A Case...
 
Fine–grained analysis and profiling of software bugs to facilitate waste iden...
Fine–grained analysis and profiling of software bugs to facilitate waste iden...Fine–grained analysis and profiling of software bugs to facilitate waste iden...
Fine–grained analysis and profiling of software bugs to facilitate waste iden...
 
Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...Comparative Performance Analysis of Machine Learning Techniques for Software ...
Comparative Performance Analysis of Machine Learning Techniques for Software ...
 
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
 
Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986
 
Benchmarking machine learning techniques
Benchmarking machine learning techniquesBenchmarking machine learning techniques
Benchmarking machine learning techniques
 
E018132735
E018132735E018132735
E018132735
 
STATE OF THE ART SURVEY ON DSPL SECURITY CHALLENGES
STATE OF THE ART SURVEY ON DSPL SECURITY CHALLENGESSTATE OF THE ART SURVEY ON DSPL SECURITY CHALLENGES
STATE OF THE ART SURVEY ON DSPL SECURITY CHALLENGES
 
Abstract.doc
Abstract.docAbstract.doc
Abstract.doc
 
Application of Genetic Algorithm in Software Engineering: A Review
Application of Genetic Algorithm in Software Engineering: A ReviewApplication of Genetic Algorithm in Software Engineering: A Review
Application of Genetic Algorithm in Software Engineering: A Review
 
Why is TDD so hard for Data Engineering and Analytics Projects?
Why is TDD so hard for Data Engineering and Analytics Projects?Why is TDD so hard for Data Engineering and Analytics Projects?
Why is TDD so hard for Data Engineering and Analytics Projects?
 
Testing and test case generation by using fuzzy logic and n
Testing and test case generation by using fuzzy logic and nTesting and test case generation by using fuzzy logic and n
Testing and test case generation by using fuzzy logic and n
 

Similar to Software Defect Prediction Using Local and Global Analysis

Argument Papers (5-7 pages in length)1. Do schools perpe.docx
Argument Papers (5-7 pages in length)1. Do schools perpe.docxArgument Papers (5-7 pages in length)1. Do schools perpe.docx
Argument Papers (5-7 pages in length)1. Do schools perpe.docx
fredharris32
 
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICSANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ijcsa
 
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICSANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ijcsa
 
Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986
Editor IJARCET
 
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKSDETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
ijseajournal
 
Efficient Indicators to Evaluate the Status of Software Development Effort Es...
Efficient Indicators to Evaluate the Status of Software Development Effort Es...Efficient Indicators to Evaluate the Status of Software Development Effort Es...
Efficient Indicators to Evaluate the Status of Software Development Effort Es...
IJMIT JOURNAL
 

Similar to Software Defect Prediction Using Local and Global Analysis (20)

A survey of predicting software reliability using machine learning methods
A survey of predicting software reliability using machine learning methodsA survey of predicting software reliability using machine learning methods
A survey of predicting software reliability using machine learning methods
 
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
Using Fuzzy Clustering and Software Metrics to Predict Faults in large Indust...
 
Exploring the Efficiency of the Program using OOAD Metrics
Exploring the Efficiency of the Program using OOAD MetricsExploring the Efficiency of the Program using OOAD Metrics
Exploring the Efficiency of the Program using OOAD Metrics
 
J034057065
J034057065J034057065
J034057065
 
SOFTWARE CODE MAINTAINABILITY: A LITERATURE REVIEW
SOFTWARE CODE MAINTAINABILITY: A LITERATURE REVIEWSOFTWARE CODE MAINTAINABILITY: A LITERATURE REVIEW
SOFTWARE CODE MAINTAINABILITY: A LITERATURE REVIEW
 
A guide to deal with uncertainties in software project management
A guide to deal with uncertainties in software project managementA guide to deal with uncertainties in software project management
A guide to deal with uncertainties in software project management
 
A Complexity Based Regression Test Selection Strategy
A Complexity Based Regression Test Selection StrategyA Complexity Based Regression Test Selection Strategy
A Complexity Based Regression Test Selection Strategy
 
Argument Papers (5-7 pages in length)1. Do schools perpe.docx
Argument Papers (5-7 pages in length)1. Do schools perpe.docxArgument Papers (5-7 pages in length)1. Do schools perpe.docx
Argument Papers (5-7 pages in length)1. Do schools perpe.docx
 
A noble methodology for users’ work
A noble methodology for users’ workA noble methodology for users’ work
A noble methodology for users’ work
 
A Combined Approach of Software Metrics and Software Fault Analysis to Estima...
A Combined Approach of Software Metrics and Software Fault Analysis to Estima...A Combined Approach of Software Metrics and Software Fault Analysis to Estima...
A Combined Approach of Software Metrics and Software Fault Analysis to Estima...
 
ITERATIVE AND INCREMENTAL DEVELOPMENT ANALYSIS STUDY OF VOCATIONAL CAREER INF...
ITERATIVE AND INCREMENTAL DEVELOPMENT ANALYSIS STUDY OF VOCATIONAL CAREER INF...ITERATIVE AND INCREMENTAL DEVELOPMENT ANALYSIS STUDY OF VOCATIONAL CAREER INF...
ITERATIVE AND INCREMENTAL DEVELOPMENT ANALYSIS STUDY OF VOCATIONAL CAREER INF...
 
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICSANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
 
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICSANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
ANALYSIS OF SOFTWARE QUALITY USING SOFTWARE METRICS
 
Defect effort prediction models in software
Defect effort prediction models in softwareDefect effort prediction models in software
Defect effort prediction models in software
 
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
 
Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986Volume 2-issue-6-1983-1986
Volume 2-issue-6-1983-1986
 
Productivity Factors in Software Development for PC Platform
Productivity Factors in Software Development for PC PlatformProductivity Factors in Software Development for PC Platform
Productivity Factors in Software Development for PC Platform
 
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKSDETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
DETERMINING THE RISKY SOFTWARE PROJECTS USING ARTIFICIAL NEURAL NETWORKS
 
Efficient Indicators to Evaluate the Status of Software Development Effort Es...
Efficient Indicators to Evaluate the Status of Software Development Effort Es...Efficient Indicators to Evaluate the Status of Software Development Effort Es...
Efficient Indicators to Evaluate the Status of Software Development Effort Es...
 
A Survey of Software Reliability factor
A Survey of Software Reliability factorA Survey of Software Reliability factor
A Survey of Software Reliability factor
 

More from Editor IJMTER

A NEW DATA ENCODER AND DECODER SCHEME FOR NETWORK ON CHIP
A NEW DATA ENCODER AND DECODER SCHEME FOR  NETWORK ON CHIPA NEW DATA ENCODER AND DECODER SCHEME FOR  NETWORK ON CHIP
A NEW DATA ENCODER AND DECODER SCHEME FOR NETWORK ON CHIP
Editor IJMTER
 
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMESA CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
Editor IJMTER
 
Textual Data Partitioning with Relationship and Discriminative Analysis
Textual Data Partitioning with Relationship and Discriminative AnalysisTextual Data Partitioning with Relationship and Discriminative Analysis
Textual Data Partitioning with Relationship and Discriminative Analysis
Editor IJMTER
 
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICESURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
Editor IJMTER
 
Software Quality Analysis Using Mutation Testing Scheme
Software Quality Analysis Using Mutation Testing SchemeSoftware Quality Analysis Using Mutation Testing Scheme
Software Quality Analysis Using Mutation Testing Scheme
Editor IJMTER
 

More from Editor IJMTER (20)

A NEW DATA ENCODER AND DECODER SCHEME FOR NETWORK ON CHIP
A NEW DATA ENCODER AND DECODER SCHEME FOR  NETWORK ON CHIPA NEW DATA ENCODER AND DECODER SCHEME FOR  NETWORK ON CHIP
A NEW DATA ENCODER AND DECODER SCHEME FOR NETWORK ON CHIP
 
A RESEARCH - DEVELOP AN EFFICIENT ALGORITHM TO RECOGNIZE, SEPARATE AND COUNT ...
A RESEARCH - DEVELOP AN EFFICIENT ALGORITHM TO RECOGNIZE, SEPARATE AND COUNT ...A RESEARCH - DEVELOP AN EFFICIENT ALGORITHM TO RECOGNIZE, SEPARATE AND COUNT ...
A RESEARCH - DEVELOP AN EFFICIENT ALGORITHM TO RECOGNIZE, SEPARATE AND COUNT ...
 
Analysis of VoIP Traffic in WiMAX Environment
Analysis of VoIP Traffic in WiMAX EnvironmentAnalysis of VoIP Traffic in WiMAX Environment
Analysis of VoIP Traffic in WiMAX Environment
 
A Hybrid Cloud Approach for Secure Authorized De-Duplication
A Hybrid Cloud Approach for Secure Authorized De-DuplicationA Hybrid Cloud Approach for Secure Authorized De-Duplication
A Hybrid Cloud Approach for Secure Authorized De-Duplication
 
Aging protocols that could incapacitate the Internet
Aging protocols that could incapacitate the InternetAging protocols that could incapacitate the Internet
Aging protocols that could incapacitate the Internet
 
A Cloud Computing design with Wireless Sensor Networks For Agricultural Appli...
A Cloud Computing design with Wireless Sensor Networks For Agricultural Appli...A Cloud Computing design with Wireless Sensor Networks For Agricultural Appli...
A Cloud Computing design with Wireless Sensor Networks For Agricultural Appli...
 
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMESA CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
A CAR POOLING MODEL WITH CMGV AND CMGNV STOCHASTIC VEHICLE TRAVEL TIMES
 
Sustainable Construction With Foam Concrete As A Green Green Building Material
Sustainable Construction With Foam Concrete As A Green Green Building MaterialSustainable Construction With Foam Concrete As A Green Green Building Material
Sustainable Construction With Foam Concrete As A Green Green Building Material
 
USE OF ICT IN EDUCATION ONLINE COMPUTER BASED TEST
USE OF ICT IN EDUCATION ONLINE COMPUTER BASED TESTUSE OF ICT IN EDUCATION ONLINE COMPUTER BASED TEST
USE OF ICT IN EDUCATION ONLINE COMPUTER BASED TEST
 
Textual Data Partitioning with Relationship and Discriminative Analysis
Textual Data Partitioning with Relationship and Discriminative AnalysisTextual Data Partitioning with Relationship and Discriminative Analysis
Textual Data Partitioning with Relationship and Discriminative Analysis
 
Testing of Matrices Multiplication Methods on Different Processors
Testing of Matrices Multiplication Methods on Different ProcessorsTesting of Matrices Multiplication Methods on Different Processors
Testing of Matrices Multiplication Methods on Different Processors
 
Survey on Malware Detection Techniques
Survey on Malware Detection TechniquesSurvey on Malware Detection Techniques
Survey on Malware Detection Techniques
 
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICESURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
SURVEY OF TRUST BASED BLUETOOTH AUTHENTICATION FOR MOBILE DEVICE
 
SURVEY OF GLAUCOMA DETECTION METHODS
SURVEY OF GLAUCOMA DETECTION METHODSSURVEY OF GLAUCOMA DETECTION METHODS
SURVEY OF GLAUCOMA DETECTION METHODS
 
Survey: Multipath routing for Wireless Sensor Network
Survey: Multipath routing for Wireless Sensor NetworkSurvey: Multipath routing for Wireless Sensor Network
Survey: Multipath routing for Wireless Sensor Network
 
Step up DC-DC Impedance source network based PMDC Motor Drive
Step up DC-DC Impedance source network based PMDC Motor DriveStep up DC-DC Impedance source network based PMDC Motor Drive
Step up DC-DC Impedance source network based PMDC Motor Drive
 
SPIRITUAL PERSPECTIVE OF AUROBINDO GHOSH’S PHILOSOPHY IN TODAY’S EDUCATION
SPIRITUAL PERSPECTIVE OF AUROBINDO GHOSH’S PHILOSOPHY IN TODAY’S EDUCATIONSPIRITUAL PERSPECTIVE OF AUROBINDO GHOSH’S PHILOSOPHY IN TODAY’S EDUCATION
SPIRITUAL PERSPECTIVE OF AUROBINDO GHOSH’S PHILOSOPHY IN TODAY’S EDUCATION
 
Software Quality Analysis Using Mutation Testing Scheme
Software Quality Analysis Using Mutation Testing SchemeSoftware Quality Analysis Using Mutation Testing Scheme
Software Quality Analysis Using Mutation Testing Scheme
 
Single Phase Thirteen-Level Inverter using Seven Switches for Photovoltaic sy...
Single Phase Thirteen-Level Inverter using Seven Switches for Photovoltaic sy...Single Phase Thirteen-Level Inverter using Seven Switches for Photovoltaic sy...
Single Phase Thirteen-Level Inverter using Seven Switches for Photovoltaic sy...
 
SIMULATION AND ANALYSIS OF TIBC FOR INDUCTION MOTOR DRIVE
SIMULATION AND ANALYSIS OF TIBC FOR INDUCTION MOTOR DRIVESIMULATION AND ANALYSIS OF TIBC FOR INDUCTION MOTOR DRIVE
SIMULATION AND ANALYSIS OF TIBC FOR INDUCTION MOTOR DRIVE
 

Recently uploaded

Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 

Recently uploaded (20)

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 

Software Defect Prediction Using Local and Global Analysis

  • 1. Scientific Journal Impact Factor (SJIF): 1.711 International Journal of Modern Trends in Engineering and Research www.ijmter.com @IJMTER-2014, All rights Reserved 441 e-ISSN: 2349-9745 p-ISSN: 2393-8161 Software Defect Prediction Using Local and Global Analysis Ms. N. Narmadha1 , Dr. M. Latha2 , Mr. R. Subramanian3 1 Research Scholar 2 M.Sc., M.Phil., Ph.D, Associate Professor, Department of Computer Science. 1,2 Sri Sarada College for Women, Salem, Tamilnadu, India 3 Erode Arts and Science College, Erode Abstract -The software defect factors are used to measure the quality of the software. The software effort estimation is used to measure the effort required for the software development process. The defect factor makes an impact on the software development effort. Software development and cost factors are also decided with reference to the defect and effort factors. The software defects are predicted with reference to the module information. Module link information are used in the effort estimation process. Data mining techniques are used in the software analysis process. Clustering techniques are used in the property grouping process. Rule mining methods are used to learn rules from clustered data values. The “WHERE” clustering scheme and “WHICH” rule mining scheme are used in the defect prediction and effort estimation process. The system uses the module information for the defect prediction and effort estimation process. The proposed system is designed to improve the defect prediction and effort estimation process. The Single Objective Genetic Algorithm (SOGA) is used in the clustering process. The rule learning operations are carried out sing the Apriori algorithm. The system improves the cluster accuracy levels. The defect prediction and effort estimation accuracy is also improved by the system. The system is developed using the Java language and Oracle relation database environment. I. INTRODUCTION Software engineering is the technological and managerial discipline with systematic production and maintenance of software products that are developed and modified on time and within cost estimates. Successful software maintenance, like all software activities, requires a combination of managerial skills and technical expertise. The primary goals of software engineering are to improve the quality of software products and to increase the productivity and job satisfaction of software engineers. Software engineering is concerned with development and maintenance of technological products, problem-solving techniques common to all engineering disciplines Engineering problem-solving techniques provides the basis for project planning, project management, systematic analysis, methodical design, careful fabrication, extensive validation and ongoing maintenance activities. Appropriate notations, tools and techniques are applied in each of these areas. A fundamental principle of software engineering is to design software products that minimize the intellectual distance between problem and solution; the variety of approaches to software development is limited on lay by the creativity and ingenuity of the programmer. The software industry’s inability to provide accurate estimates of development cost, effort and/or time is well known. This inability is described in reports from project management consultancy companies, case studies on project failures, articles in the computer press and estimation surveys. The common belief seems to be that the cost overruns are very large and few software professionals and researchers react with disbelief when being presented with the inaccuracy values reported in Standish
  • 2. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 442 Group’s Chaos Report, i.e., an average cost overrun of 89%. There may be several reasons for this attitude. It is difficult to get a balanced view on the software industry’s estimation performance without unbiased information from a representative set of projects and organizations. The surveys presented in scientific journals and conferences may be a source for such unbiased information. The system summarizes estimation results from surveys on software estimation. There has not been conducted any structured review of the estimation surveys with the aim of summarizing the knowledge of software effort estimation performance. Besides summarizing results from estimation surveys, the main purposes of the system is to challenge some common estimation beliefs that may be inaccurate, to discuss methodical aspects related to completion of industrial surveys and to support future surveys on software effort estimation. II. RELATED WORK The main goal of this section is not to provide a generic review of defect prediction and effort estimation work, but to highlight work that documents contradictory results which make it difficult to generalize solutions for defect prediction or effort estimation. 2.1. Effort Estimation Process Conclusion instability in effort estimation may be a fundamental property of the datasets we are exploring. For example, tests the stability of Boehm’s COCOMO software development effort estimation model. In that analysis, 20 times, we learned effort = using a random 2/3rds sample from 93 NASA projects. Only the coefficient on lines of code (loc) was stable. The observed ranges on the other βi coefficients were very large. In fact, the signs of five coefficients even changed from positive to negative. This coefficient instability is particularly troubling because we know of NASA project managers who have made acquisition decisions worth tens of millions of dollars based on the COCOMO coefficients. Other papers also report contradictory findings about effort estimation. Jorgensen reviewed 15 studies comparing model-based to expert-based methods. Five of those studies favored expert-based methods, five found no difference and five favored model-based methods. Kitchenham et al. reviewed studies that checked if data imported from other organizations were as useful as local data. From a total of seven studies, three found that models from other organizations were not significantly worse than those based on local data, while four found that they were significantly different. MacDonell and Shepperd also performed a review on the value of local versus global effort estimation models through a replication. From a total of 10 studies, two were found to be inconclusive, three supported global models and five supported local models. Similarly, Mair and Shepperd compared regression to analogy methods for effort estimation and found conflicting evidence. From a total of 20 empirical studies, seven favored regression, four were indifferent and nine favored analogy. 2.2. Defect Prediction Process In the area of defect prediction, there are also many contradictory findings. For example, Zimmermann et al. learned defect predictors from 622 pairs of projects hproject1; project2i. In only 4 percent of pairs did the defect predictors learned in project1 work in project [2]. Similar findings concern the OO metrics as well. For the manager of a software project is particularly troubling. Each study makes a clear, but usually different, conclusion. Hence, it is difficult for a manager to make a clear decision about, for example, the merits of a proposed coding standard, where maximum depth of inheritance is required to be less than some expert specified threshold. As to the root cause of the instabilities offer the following conjecture. The models learned from different regions within effort data can have very different properties. If defect data were as varied as
  • 3. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 443 effort data, then we would naturally expect that different samples of different projects would yield different models due to dataset shift [4]. If this conjecture is correct, then we would expect that clusters within the data should produce different models. This paper is a test of that conjecture. In short, we show that different regions of the data generate different models. Further, the models built from specialized regions within the dataset perform better than those learned across all data. III. LOCAL AND GLOBAL ANALYSIS ON SOFTWARE COMPONENTS Process and product data are used in software engineering (SE) to support a variety of tasks, such as defect prediction, effort estimation, refactoring of source code, determination of the social networks of programmers, learning the expected interface between modules and so on [3]. Two questions are at the center of much of the research in the field: 1) What data are best suited to support specific tasks? and 2) what is the best way to reason about SE data? This paper explores the latter question, in the context of: software effort reduction: finding rules for reducing a project’s development time; software defect reduction: finding rules for reducing a project’s defect count. Our focus in this paper is not on what data are used for building models for defect prediction or effort estimation, but rather on the source of the data and implicitly, the applicability of the lessons derived by the models. Software data come from some sources. These data show the defects or effort associated with examples from that source. How should we reason about these data? On this point, the literature is contradictory. Some existing work argues that data from multiple sources can generate rules that apply in any context. We call these global lessons. On the other hand, other papers indicate that the best lessons are learned from within one source, which implies that these lessons are only useful in their context [5], [6], [7]. We call these local lessons. When project managers want to make changes in their projects to minimize the development effort or the rate of defects, they are faced with two options: 1) make changes based on global lessons available from existing data, or 2) mine data about the current project and infer local lessons. The dilemma of the manager is obvious. In the first case, expensive changes may be undertaken without reaching the desired goal if the global lessons prove to be wrong for this context. In the second case, an upfront investment is needed to collect and analyze data and to generate the local lessons, which may be unnecessary if the global lessons apply to this context. This paper addresses this dilemma and reports on experiments where: Data from different sources are combined. Within that combination, automatic tools find clusters of related examples. Note that clusters may contain examples from multiple sources. Data mining is then used to learn lessons from the examples in each cluster. The generated lessons are compared. Before this experiment, our previous work indicated that local lessons seem to be best for defect prediction and effort estimation [8]. The surprising result of the experiment presented in this paper is that the best lessons for a project from one source come from neighboring clusters with data from nearby sources, but not inside that source. We will call such lessons neighbor lessons. We conclude that when project managers attempt to draw lessons from some historical data source, they should: 1) ignore any existing local divisions into multiple sources, 2) cluster across all available data, then 3) restrict the learning of lessons to the clusters that are nearest to the test data. While the paper focuses on defect prediction and effort estimation data, we believe that our conclusions can translate to other types of data, related to different SE tasks. This paper extends a prior publication [1] in the following way: That paper only explored four datasets. Here, we explore over twice that number of datasets.
  • 4. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 444 That prior publication only explored local versus global lessons and found evidence that supported local lessons over the global ones. In this paper, explore clusters from multiple sources. IV. PROBLEM STATEMENT The software analysis system generates lessons learned for defect prediction and effort estimation. Lessons are global to multiple projects or just local to particular projects. The system comparatively evaluates local versus global lessons learned for effort estimation and defect prediction. The system applies automated clustering tools to effort and defect datasets from the PROMISE repository. Rule learners generated lessons learned from all the data, from local projects, or just from each cluster. The lessons learned after combining small parts of different data sources were superior to either generalizations formed over all the data or local lessons formed from particular projects. The following problems are identified from the software effort estimation and defect prediction methods. • The clustering system produces low accuracy in partitioning process. • The rule learning system handles minimum of set of rules only. • Defect prediction and effort estimation accuracy is low. • Software module relationships are not fully focused in the defect and effort analysis. V. GENETIC ALGORITHM BASED SOFTWARE ANALYSIS The software defect prediction process is designed to identify the defects that are raised in the software system. The effort estimation mechanism is applied to estimate the effort needed for the system development process. The main goal of the system is not to provide a generic review of defect prediction and effort estimation work, but to highlight work that documents contradictory results which make it difficult to generalize solutions for defect prediction or effort estimation. In the area of defect prediction, there are also many contradictory findings. Defect predictors from 622 pairs of projects hproject1; project. In only 4 percent of pairs did the defect predictors learned in project work in project. Similar findings concern the OO metrics as well. Lists 28 studies that offer contradictory conclusions regarding the effectiveness of OO metrics for predicting defects. Conclusion instability in effort estimation may be a fundamental property of the datasets. Only the coefficient on lines of code was stable. The observed ranges on the other βi coefficients were very large. In fact, the signs of five coefficients even changed from positive to negative. This coefficient instability is particularly troubling. NASA project managers who have made acquisition decisions worth tens of millions of dollars based on the COCOMO coefficients. The system is designed to perform defect prediction and effort estimation process for the software systems [9]. The clustering and rule learning methods are used to identify the defects and efforts. The Single Objective Genetic Algorithm (SOGA) is used for the clustering process. The rules are learned using the Apriori algorithm. The system improves the accuracy in clustering process. Defect prediction and effort estimation operations are also improved with internal relationship of the software components. 5.1. Software Module Clustering Existing approaches to the twin objectives of high cohesion and low coupling have combined these two objectives into a single-objective function, with all of the drawbacks. Pareto optimality is an alternative approach to handle multiple objectives that retains the character of the problem as a multi- objective problem. Using Pareto optimality, it is not possible to measure “how much” better one solution is than another, merely to determine whether one solution is better than another. In this way, Pareto optimality combines a set of measurements into a single ordinal scale metric.
  • 5. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 445 A Pareto optimal search yields a set of solutions that are mutually no dominating and which form an approximation to the Pareto front. The Pareto front is the set of elements that are not dominated by any possible element of the solution space. The Pareto front thus denotes the best results achievable; it is the equivalent of the set of globally optimal points in a single-objective search. As with the single- objective formulation, it is not possible to guarantee to locate this globally optimal solution set, merely to attempt to approximate it as closely as possible. Each set of objectives leads to a different multi- objective formulation of the problem. Two sets of objectives will be considered in the system. They are the Maximizing Cluster Approach and the Equal-size Cluster Approach. 5.2. Module Clustering with GA Genetic algorithms use concepts of population and of recombination inspired by Darwinian Evolution A generic genetic algorithm is presented. An iterative process is executed, initialized by a randomly chosen population. The iterations are called generations and the members of the population are called chromosomes because of their analogs in natural evolution. The process terminates when a population satisfies some predetermined condition. Set generation number, m: = 0 Choose the initial population of candidate Solution ,P(0) Evaluate the fitness for each individual of P(0),F(Pi(0)) loop Recombine: P(M): = (P(m)) Mutate : P(m) : = M(P(m)) Evaluate : F(P(m) Select : P(m+1) : = S(P(m)) m : = m +1 exit when goal or stopping condition is satisfied end loop; A Generic Genetic Algorithm. VI. LOCAL AND GLOBAL ANALYSIS MODEL FOR SOFTWARE DEFECT PREDICTION The software defect prediction and effort estimation process is designed to measure the defect and effort values from the software properties. The clustering and rule learning mechanism is used for the system. The Single Objective Genetic Algorithm (SOGA) is used for the clustering process. The Apriori algorithm is used for the learning process. The system divided into four major modules. They are product data analysis, criteria property extraction, single objective clustering and defect prediction and effort estimation process. The modules and their link information are extracted from the product data analysis module. Criteria property extraction module is designed to identify the module property values. Single objective clustering is performed with only one objective functions. The defect prediction and effort estimation operations are carried out under the defect and effort estimation module. 6.1. Product Data Analysis The product information is provided in module dependency graphs. Module link information is provided under Module Dependency Graph (MDG) files. The MDG files are used as the input for the system. Modules, links and weight information are extracted from the product information. 6.2. Criteria Property Extraction
  • 6. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 446 The modules and their links are used for the criteria property extraction process. Communication bandwidth is estimated with module link information. Each module is analyzed with it’s in and out link information. The module length and feature location are also identified. 6.3. Single Objective Clustering The genetic algorithm is used for the clustering process. Single objective clustering model is used in the system. The objective function uses the module property values. Only one criteria is considered in the clustering process. 6.4. Defect Prediction and Effort Estimation The defect prediction and effort estimation module is divided into two sub modules. They are defect prediction sub module and effort estimation sub module. The software defects are predicted with reference to the internal relationship values. The effort estimation is carried out with the support of the software modules and their length values. The system produces the predicted values and actual values for each modules. The rule learning mechanism is used in the defect prediction and effort estimation process. VII. PERFORMANCE ANALYSIS The software defect prediction and effort estimation system is implemented and tested with a set of benchmark datasets. The Genetic Algorithm and Variational clustering (VC) schemes are used in the testing process. The VC scheme is composed with “WHICH” and “WHERE” techniques. Three performance metrics are used to evaluate the quality of the system. They are entropy, separation index and prediction accuracy. The entropy and separation index measures are used to evaluate the quality of the clustering process. Defect prediction and effort estimation process results are analyzed using the prediction accuracy measure. 7.1. Entropy Entropy reflects the homogeneity of a set of objects and thus can be used to indicate the homogeneity of a cluster. This is referred to cluster entropy. Lower cluster entropy indicates more homogeneous clusters. On the other hand, we can also measure the entropy of a prelabeled class of objects, which indicates the homogeneity of a class with respect to the generated clusters. The less fragmented a class across clusters, the higher its entropy and vice versa. This is referred to as class entropy. The entropy measurement is used to estimate the quality of the clusters. The cluster entropy and overall entropy are calculated for different data intervals and MPC based models. The entropy values for VC Scheme and GA Scheme models are produced in figure 7.1. The comparison result shows that the GA Scheme outperforms than the VC Scheme. The GA scheme improves the cluster accuracy 5% than the VC scheme.
  • 7. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 447 Figure No: 7.1. Entropy Analysis of VC Scheme and GA Scheme 7.2. Separation Index Separation Index (SI) is another cluster validity measure that utilizes cluster centroids to measure the distance between clusters, as well as between points in a cluster to their respective cluster centroid. It is defined as the ratio of average within-cluster variance to the square of the minimum pairwise distance between clusters: distD N i cixj ij srNsrD NC i cixj ij N mxdist mmdistN mxdist SI c c 2 min 1 2 2 ,1 1 2 . ),( )},(min ),( ∑ ∑ ∑ ∑ = ∈ ≤≤ = ∈ = = where mi is the centroid of cluster ci and distmin is the minimum pairwise distance between cluster centroids. Clustering solutions with more compact clusters and larger separation have lower Separation Index, thus higher values indicate better solutions. This index is more computationally efficient than other validity indices, such as Dunn’s index, which is also used to validate clusters that are compact and well separated. In addition, it is less sensitive to noisy data. The separation index measure is used to analyze the cluster intervals and transaction interval values. The separation index analysis is shown in figure 7.2. The analysis result shows that the GA Scheme improves the cluster quality 20% than the VC Scheme technique. 1.802 1.586 2.017 1.779 1.746 1.941 1.773 2.187 2.102 2.079 0 0.5 1 1.5 2 2.5 20 40 60 80 100 Entropy Modules Entropy AnalysisEntropy AnalysisEntropy AnalysisEntropy Analysis between VC Scheme andbetween VC Scheme andbetween VC Scheme andbetween VC Scheme and GA SchemeGA SchemeGA SchemeGA Scheme VC Scheme GA Scheme
  • 8. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 448 Figure 7.2: Separation Index Analysis between VC Scheme and GA Scheme 7.3. Prediction Accuracy The prediction accuracy measure is used to evaluate the quality of the rule learning process with defect prediction and effort estimation process. The defect prediction and effort estimation results are compared with the predicted values and evaluated values. The prediction accuracy analysis is shown in figure 7.3. The GA scheme improves the prediction accuracy 20% than the VC scheme. The system produces better cluster results with effective rule learning results. Figure 7.3: Prediction Accuracy Analysis between VC Scheme and GA Scheme VII. CONCLUSION AND FUTURE WORK The software defect prediction and effort estimation operations are performed in the system with the support of clustering and rule learning methods. The system has discussed ways to collect training data for learning lessons from SE projects and to generate rules for reducing the number of defects or the development effort. At issue here is how the chief information officers should set policies for their projects. Should they demand that all their projects rigorously apply all the advice from the standard SE textbooks. The software defect prediction and effort estimation system is implemented to analyze the software products to measure the defect and effort factors. The Genetic Algorithm based clustering 0.768 1.059 1.296 1.372 1.483 0.526 0.725 0.921 0.986 1.057 0 0.5 1 1.5 2 20 40 60 80 100 SeparationIndex Modules Separation Index Analysis between VC Scheme and GA Scheme VC Scheme GA Scheme 61.73 64.05 66.75 69.23 71.68 82.47 85.14 87.98 91.23 94.56 0 20 40 60 80 100 20 40 60 80 100 PredictionAccuracy(%) Modules Prediction Accuracy Analysis between VC Scheme and GA Scheme VC Scheme GA Scheme
  • 9. International Journal of Modern Trends in Engineering and Research (IJMTER) Volume 02, Issue 01, [January - 2015] e-ISSN: 2349-9745, p-ISSN: 2393-8161 @IJMTER-2014, All rights Reserved 449 scheme is used for the data clustering process. The Apriori algorithm is used to carry out the rule learning process. The system improves the cluster accuracy and prediction accuracy levels. The system can be enhanced with the following features. • The system can be enhanced to predict the software risk levels using design information. • The system can be enhanced with Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) techniques in the prediction process. • The system can be improved to measure the relationship level between the local and global rules. • The system can be enhanced to perform the prediction process using the software and source code analysis mechanism. REFERENCES [1] T. Menzies, A. Butcher and D. Cok, “Local vs Global Models for Effort Estimation and Defect Prediction,” Proc. 26th IEEE/ACM Int’l Conf. Automated Software Eng., 2011. [2] Martin Shepperd, David Bowes and Tracy Hall, “Researcher Bias: The Use of Machine Learning in Software Defect Prediction”, IEEE Transactions On Software Engineering, Vol. 40, No. 6, June 2014. [3] Saehwa Kim, “Synthesizing Multithreaded Code from Real-Time Object-Oriented Models via Schedulability-Aware Thread Derivation”, IEEE Transactions On Software Engineering, Vol. 40, No. 4, April 2014. [4] B. Turhan, “On the Dataset Shift Problem in Software Engineering Prediction Models,” Empirical Software Eng., vol. 17, pp. 62-74, 2012. [5] D. Posnett, V. Filkov and P. Devanbu, “Ecological Inference in Empirical Software Engineering,” Proc. IEEE/ACM Int’l Conf. Automated Software Eng., 2011. [6] N. Bettenburg, M. Nagappan and A.E. Hassan, “Think Locally, Act Globally: Improving Defect and Effort Prediction Models,” Proc. Ninth IEEE Working Conf. Mining Software Repositories, pp. 60-69, 2012. [7] Y. Yang, L. Xie, Z. He, Q and R. Valerdi, “Local Bias and Its Impacts on the Performance of Parametric Estimation Models,” Proc. Seventh Int’l Conf. Predictive Models in Software Eng., 2011. [8] Tim Menzies, Andrew Butcher, David Cok, Andrian Marcus, Lucas Layman, Forrest Shull, Burak Turhan and Thomas Zimmermann, “Local versus Global Lessons for Defect Prediction and Effort Estimation”, IEEE Transactions On Software Engineering, Vol. 39, No. 6, June 2013. [9] Christian Kastner, Alexander Dreiling and Klaus Ostermann, “Variability Mining: Consistent Semi-automatic Detection of Product-Line Features”, IEEE Transactions On Software Engineering, Vol. 40, No. 1, January 2014.