Studying the impact of dependency network measures on software quality


ICSM 2010


Thanh H. D. Nguyen, Bram Adams, Ahmed E. Hassan
SAIL, School of Computing, Queen’s University, Kingston, Canada



1. Studying the impact of dependency network measures on software quality. Thanh H. D. Nguyen, Bram Adams, Ahmed E. Hassan. SAIL, School of Computing, Queen’s University, Kingston, Canada

2. Code Quality. Problem: Quality improvement resources are limited. Solution: Bug prediction identifies defect-prone modules.

3. Bug prediction models [diagram: Bug Prediction Model]. High Recall -> We won’t miss a possible bug. High Precision -> We won’t waste effort.
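The two goals on this slide can be made concrete with a small calculation. The sketch below is illustrative only: the module names and the `precision_recall` helper are hypothetical and do not come from the talk.

```python
def precision_recall(flagged, buggy):
    """Return (precision, recall) for a set of flagged modules.

    flagged: modules the model predicts as defect-prone
    buggy:   modules that actually turned out to contain bugs
    """
    true_positives = len(flagged & buggy)
    # Precision: share of flagged modules that really are buggy (little wasted effort).
    precision = true_positives / len(flagged) if flagged else 0.0
    # Recall: share of buggy modules that were flagged (few missed bugs).
    recall = true_positives / len(buggy) if buggy else 0.0
    return precision, recall


# Hypothetical example data, not taken from the study.
flagged = {"parser", "network", "ui", "storage"}
buggy = {"parser", "network", "cache"}

precision, recall = precision_recall(flagged, buggy)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here two of the four flagged modules are buggy, and two of the three buggy modules are flagged, so a reviewer would waste half of the effort and still miss one bug.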

4. Software is more than just size and complexity [diagram: dependency graph of nodes A to G, labelled Node, Local Neighborhood, Global Neighborhood]

5. Software is more than just size and complexity. Node: Traditional Metrics (MET). Local Neighborhood and Global Neighborhood: Social Network Measures (SNA).
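To make the local versus global distinction tangible, here is a minimal sketch over a toy dependency graph. The edges are hypothetical (the slide's actual graph is not recoverable), and the helper names are invented for illustration. The slides do not list which SNA measures the authors computed, so ego-network size and transitive reach are used here only as stand-ins for a local and a global measure.

```python
from collections import deque

# Hypothetical edges: DEPENDS_ON[x] lists the modules that x depends on.
DEPENDS_ON = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D"],
    "D": ["E"],
    "E": [],
    "F": ["A"],
    "G": ["F"],
}


def ego_network(graph, node):
    """Local neighborhood: the node plus everything one dependency away."""
    outgoing = set(graph[node])
    incoming = {src for src, targets in graph.items() if node in targets}
    return outgoing | incoming | {node}


def transitive_reach(graph, node):
    """Global view: every module the node depends on, directly or indirectly."""
    seen = set()
    queue = deque(graph[node])
    while queue:
        current = queue.popleft()
        if current not in seen:
            seen.add(current)
            queue.extend(graph[current])
    return seen


print(sorted(ego_network(DEPENDS_ON, "A")))
print(sorted(transitive_reach(DEPENDS_ON, "A")))
```

A size or complexity metric looks at module A alone, whereas these two functions depend on how A is wired into the rest of the system.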

6. Would SNA improve performance? [diagram: Bug Prediction Model]

7. Would SNA improve performance?

8. Would SNA improve performance?

9. Would SNA improve performance?

10. Would SNA improve performance?

11. Why Eclipse?

12. Would SNA improve performance? [diagram: Bug Prediction Model]

13. Would SNA improve performance? [diagram: Bug Prediction Model]

14. +25% for Recall and Precision

15. Does this generalize?

16. Which metrics provide the improvement? Node: 12 metrics. Local Neighborhood: 11 metrics. Global Neighborhood: 12 metrics. Use hierarchical modeling to find the important group [Cataldo et al. TSE10].

17. Which metrics provide the improvement? Node: 12 metrics, 7%. Local Neighborhood: 11 metrics, +2.7%. Global Neighborhood: 12 metrics, +0.3%.

18. Which metrics provide the improvement? Node: 12 metrics, 7%. Local Neighborhood: 11 metrics, +2.7%. Global Neighborhood: 12 metrics, +0.3%. Local neighbours account for most of the improvement.
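The percentages above read as gains in deviance explained when each metric group is added to the model in turn. A minimal sketch of that bookkeeping follows, assuming a binary buggy/not-buggy outcome and the usual definition of deviance explained as one minus the ratio of model deviance to null deviance. The outcomes and predicted probabilities are made up for illustration, and the names `m1`, `m2`, `m3` are hypothetical labels for the three nested models.

```python
import math


def deviance(actual, predicted_prob):
    """Binomial deviance: -2 times the log-likelihood of the 0/1 outcomes."""
    log_likelihood = sum(
        math.log(p) if y == 1 else math.log(1.0 - p)
        for y, p in zip(actual, predicted_prob)
    )
    return -2.0 * log_likelihood


def deviance_explained(actual, predicted_prob):
    """Fraction of the null (intercept-only) deviance removed by the model."""
    base_rate = sum(actual) / len(actual)
    null_deviance = deviance(actual, [base_rate] * len(actual))
    return 1.0 - deviance(actual, predicted_prob) / null_deviance


# Hypothetical outcomes (1 = buggy) and predictions from three nested models.
actual = [1, 0, 1, 0, 0, 1]
m1 = [0.60, 0.40, 0.60, 0.40, 0.40, 0.60]  # traditional metrics only
m2 = [0.70, 0.30, 0.70, 0.30, 0.30, 0.70]  # + local neighborhood measures
m3 = [0.72, 0.28, 0.72, 0.28, 0.28, 0.72]  # + global neighborhood measures

explained = [deviance_explained(actual, m) for m in (m1, m2, m3)]
gains = [explained[0]] + [b - a for a, b in zip(explained, explained[1:])]
for label, gain in zip(("traditional", "+ local", "+ global"), gains):
    print(f"{label}: {gain:+.1%}")
```

Reporting the increment per group, rather than the total, is what lets the slide attribute most of the improvement to the local measures.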

19. Which local measures have the most impact?

20. Cluster fan-in

21. Cluster fan-in

22. Layer bypass

23. Layer bypass

24. Layer bypass

25. Consider your neighbor connections

26. How well do we perform in practice? ✔ ✗

27. Effort Aware Prediction Models

28. Comparing Performance Using Effort Aware Curves
    [Plot: % bugs caught (0 to 100) against % lines of code reviewed (0 to 100)]
    File   A      B      C
    #bug   0      1      2
    LOC    48     8      44
    ROI    0      0.125  0.045
    Risk   0.78   0.56   0.34

29. Comparing Performance Using Effort Aware Curves (same table and axes, with file A marked on the plot)

30. Comparing Performance Using Effort Aware Curves (same table and axes, with file B marked on the plot)

31. Comparing Performance Using Effort Aware Curves (same table and axes, with file C marked on the plot)

32. Is this a good prediction? (same table and axes)

33. Better prediction means a higher curve
    [Plot: % bugs caught against % lines of code reviewed, with a higher curve labelled Good and a lower curve labelled Bad]
    File   A      B      C
    #bug   0      1      2
    LOC    48     8      44
    ROI    0      0.125  0.045
    Bad    0.78   0.56   0.34
    Good   0.32   0.72   0.55
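The curves on these slides can be reproduced from the table. The sketch below uses the slide's three files and the two risk rows (Bad and Good), and assumes files are reviewed in order of decreasing predicted risk; that ordering rule, the helper names, and the use of area under the curve as a summary are assumptions made for illustration rather than details confirmed by the slides.

```python
# name: (bugs, lines of code), copied from the slide's table.
FILES = {"A": (0, 48), "B": (1, 8), "C": (2, 44)}

# Predicted risk per file for the two rankings shown on the slide.
BAD_RISK = {"A": 0.78, "B": 0.56, "C": 0.34}
GOOD_RISK = {"A": 0.32, "B": 0.72, "C": 0.55}


def effort_aware_curve(files, risk):
    """Return (% LOC reviewed, % bugs caught) points, riskiest file first."""
    total_bugs = sum(bugs for bugs, _ in files.values())
    total_loc = sum(loc for _, loc in files.values())
    points = [(0.0, 0.0)]
    bugs_seen = loc_seen = 0
    for name in sorted(files, key=lambda n: risk[n], reverse=True):
        bugs, loc = files[name]
        bugs_seen += bugs
        loc_seen += loc
        points.append(
            (100.0 * loc_seen / total_loc, 100.0 * bugs_seen / total_bugs)
        )
    return points


def area_under_curve(points):
    """Trapezoidal area; a larger area means bugs are caught with less review."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2.0
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


bad_curve = effort_aware_curve(FILES, BAD_RISK)
good_curve = effort_aware_curve(FILES, GOOD_RISK)
print("bad: ", bad_curve, area_under_curve(bad_curve))
print("good:", good_curve, area_under_curve(good_curve))
```

Under the Bad ranking the reviewer reads the 48 lines of file A first and finds nothing, while the Good ranking starts with file B and catches a third of the bugs after 8% of the code. The Good ranking also matches the order of the ROI row, which equals #bug divided by LOC for each file.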

34. The prediction model helps reduce testing effort
    [Plot: % bugs caught (0 to 100) against % lines of code reviewed (0 to 100); curve labels: Random, File, Package]

36. Class prediction has more potential. Thanh H. D. Nguyen (thanhnguyen@cs.queensu.ca)

37. Deviance explained: +2.7%, +0.3%, +1.9%, +1.1%. Model: Bugginess ~ Traditional metrics + Local + Global

38. Anova on M3
