SZZ Unleashed: An Open Implementation of the SZZ Algorithm

MaLTeSQuE 2019
Aug 27, 2019
Markus Borg
@mrksbrg
mrksbrg.com
RISE Research Institutes of Sweden AB
SZZ Unleashed: An Open Implementation of the SZZ Algorithm
- Featuring Example Usage in a Study of Just-in-Time Bug Prediction for the Jenkins Project

Daniel Hansson
Oscar Svensson
Kristian Berg
https://github.com/wogscpar/SZZUnleashed

Feed ML with SZZ output
SZZ Unleashed is on GitHub

Who is Markus?
• Development engineer, ABB 2007-2010
– Process automation
– Editor and compiler development
• PhD student, Lund University 2010-2015
– Requirements engineering and testing
– Traceability, change impact analysis
• Senior researcher, RISE 2015-

More of Markus
• Adjunct lecturer (20%), Lund University
– Teaching software engineering
• Member of the board (10%), Swedsoft
– Influence decision makers
– Write comment letters
– Facilitate networking

ML is data-hungry
• ML in SE often relies on bug data
• Bug trackers contain info about fixes
• What about when bugs were introduced?
– We need these commits!

Śliwerski, Zimmermann, and Zeller (SZZ)
• A heuristic approach to find bug-introducing commits
• “Few publicly available implementations”
- Rodríguez-Pérez et al. (2018)
• Many homegrown SZZ implementations
• Wasted research effort on commodity development
Rodríguez-Pérez, Robles, and González-Barahona.
Reproducibility and credibility in empirical software engineering: A case study based on a systematic literature review of the use of the SZZ algorithm.
Information and Software Technology, 99, pp.164-176, 2018.

Van der Linden, Lundell, and Marttiin.
Commodification of industrial software: A case for open source.
IEEE Software, 26(4), pp.77-83, 2009.

SZZ in a nutshell
Use closed bug reports to find bug-fixing commits
[Diagram, Phase 2: bug-fixing commits (A) → git blame (B) → bug-introducing commit candidates (C)]
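
One way to picture this first step is a scan of commit messages for the keys of closed bug reports. The sketch below is illustrative only: the `JENKINS-<number>` key pattern, the function name, and the sample commits are assumptions, not taken from SZZ Unleashed.

```python
import re

# Assumption: bug reports are referenced in commit messages by keys such as
# "JENKINS-12345". Adapt the pattern to the tracker of the target project.
ISSUE_KEY = re.compile(r"\bJENKINS-\d+\b")


def find_bug_fixing_commits(commits, closed_bug_keys):
    """Map each commit id to the closed bug reports its message mentions.

    commits: iterable of (commit_id, message) tuples.
    closed_bug_keys: set of issue keys for closed bug reports.
    """
    fixes = {}
    for commit_id, message in commits:
        mentioned = [k for k in ISSUE_KEY.findall(message) if k in closed_bug_keys]
        if mentioned:
            fixes[commit_id] = mentioned
    return fixes


if __name__ == "__main__":
    # Hypothetical data, for illustration only.
    history = [
        ("c1", "[JENKINS-101] Fix NPE in build queue"),
        ("c2", "Refactor logging"),
        ("c3", "JENKINS-999 is still open, add a workaround"),
    ]
    print(find_bug_fixing_commits(history, {"JENKINS-101"}))
```

Commits that mention only open or unknown issues are ignored, so only fixes for closed bug reports move on to the next step.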
SZZ in a nutshell
Find all commits that changed the buggy lines of code
[Diagram: bug-fixing commits (A) → git blame (B) → bug-introducing commit candidates (C)]
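
The blame step can be sketched with plain `git blame --porcelain`, run on the parent of a bug-fixing commit for the lines the fix changed. This is a simplified illustration and not necessarily how SZZ Unleashed traces lines internally; the function names and parameters are hypothetical.

```python
import re
import subprocess

# In porcelain output, each blamed line starts with a header:
# "<40-hex commit id> <original line> <final line> [<lines in group>]".
# Source lines themselves are prefixed with a tab, so they never match.
HEADER = re.compile(r"^([0-9a-f]{40}) (\d+) (\d+)(?: \d+)?$")


def parse_blame_porcelain(text):
    """Map each final line number to the commit that last changed it."""
    last_changed_by = {}
    for line in text.splitlines():
        match = HEADER.match(line)
        if match:
            sha, _orig_line, final_line = match.groups()
            last_changed_by[int(final_line)] = sha
    return last_changed_by


def blame_candidates(repo, fix_commit, path, start, end):
    """Return the commits that last touched lines start..end of path,
    as seen in the parent of the bug-fixing commit."""
    result = subprocess.run(
        ["git", "-C", repo, "blame", "--porcelain",
         "-L", f"{start},{end}", f"{fix_commit}^", "--", path],
        check=True, capture_output=True, text=True,
    )
    return set(parse_blame_porcelain(result.stdout).values())
```

Blaming the parent revision (`fix_commit^`) matters: blaming the fix itself would attribute the changed lines to the fix rather than to the earlier commits.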
SZZ in a nutshell
• too recent?
• partial fix?
• buggy fix?
Bug-introducing commits
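
A minimal sketch of the first check, under the assumption that "too recent" means a candidate committed after the bug was already reported, which therefore cannot have introduced it. The partial-fix and buggy-fix checks are not covered here, and all names and sample data are hypothetical.

```python
from datetime import datetime


def drop_too_recent(candidates, bug_reported_at):
    """Keep only candidates committed before the bug report was created.

    candidates: iterable of (commit_id, committed_at) tuples.
    bug_reported_at: datetime of the bug report's creation.
    """
    return [
        commit_id
        for commit_id, committed_at in candidates
        if committed_at < bug_reported_at
    ]


if __name__ == "__main__":
    # Hypothetical data, for illustration only.
    reported = datetime(2018, 3, 1)
    candidates = [
        ("early", datetime(2017, 12, 24)),
        ("late", datetime(2018, 3, 15)),
    ]
    print(drop_too_recent(candidates, reported))
```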

https://github.com/wogscpar/SZZUnleashed
target project

Output JSON
[["a79fdaa4b34b8f7fddb39bed3eabf4763940d11b",
"26ec7bdf936dfbc3f496b1165cea36488a3a06b2"],
["a79fdaa4b34b8f7fddb39bed3eabf4763940d11b",
"05b46659e451c316fb5f1a5243c49b9a84a50702"],
…
"4e7a43c5863b5e7ad637a5034f75d3c144c45129"],
"b89baa56bf06b2a0f6b67a3e521236e476fe5a9d"],
"05b46659e451c316fb5f1a5243c49b9a84a50702"]]
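
A short sketch of how such output could be turned into labels for ML. It assumes each pair is ordered as [bug-fixing commit, bug-introducing commit]; check this against the SZZ Unleashed documentation before relying on it. The sample reuses the two complete pairs shown above, and the function names are hypothetical.

```python
import json
from collections import defaultdict

SAMPLE = """
[["a79fdaa4b34b8f7fddb39bed3eabf4763940d11b",
  "26ec7bdf936dfbc3f496b1165cea36488a3a06b2"],
 ["a79fdaa4b34b8f7fddb39bed3eabf4763940d11b",
  "05b46659e451c316fb5f1a5243c49b9a84a50702"]]
"""


def introducers_by_fix(pairs):
    """Group bug-introducing commits under the fix that points at them."""
    grouped = defaultdict(set)
    for fix_sha, introducer_sha in pairs:  # assumed pair order
        grouped[fix_sha].add(introducer_sha)
    return dict(grouped)


def label_commits(all_shas, pairs):
    """Label every commit True if SZZ flagged it as bug-introducing."""
    introducers = {introducer for _fix, introducer in pairs}
    return {sha: sha in introducers for sha in all_shas}


pairs = json.loads(SAMPLE)
```

`label_commits` yields the binary target that a commit-level classifier can be trained on.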

Commit Features
• Code churn as defined by Nagappan and Ball (2005)
– Lines of code added / Total lines of code
– Lines of code deleted / Total lines of code
– Files churned / Number of files
– Lines of code in previous version
• Used by Kamei et al. in “A Large-scale Empirical Study of Just-in-Time Quality Assurance”, IEEE Transactions on Software Engineering, 39(6), 2013
– Number of modified subsystems
– Number of modified sub-directories
– Entropy (spreading of changes)
– Purpose of a change (e.g., bug fix)
– Number of previous committers
– Time between committer’s contributions
– Number of unique changes
– Overall experience of committer
– Recent experience of committer
• Coupling measures proposed by D’Ambros et al. (2009)
– Number of highly coupled files
– Number of coupled files for all degrees
– Number of non-modified coupled files
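
Two of these features are easy to illustrate. The sketch below computes a relative churn ratio and the entropy of a change, assuming entropy is the Shannon entropy of how the modified lines are distributed over the touched files (unnormalized). Function names and inputs are hypothetical, and the study's exact definitions may differ.

```python
import math


def relative_churn(lines_changed, total_lines):
    """Lines added (or deleted) relative to the total lines of code."""
    return lines_changed / total_lines if total_lines else 0.0


def change_entropy(modified_lines_per_file):
    """Shannon entropy (base 2) of the spread of a change across files.

    0.0 means the change is concentrated in one file; higher values mean
    the modified lines are spread more evenly over more files.
    """
    total = sum(modified_lines_per_file)
    if total == 0:
        return 0.0
    entropy = 0.0
    for lines in modified_lines_per_file:
        if lines > 0:
            share = lines / total
            entropy -= share * math.log2(share)
    return entropy


if __name__ == "__main__":
    print(relative_churn(50, 1000))       # share of the code base added
    print(change_entropy([10, 10]))       # evenly spread over two files
    print(change_entropy([20]))           # concentrated in a single file
```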

Goal: Just-in-time bug prediction
• Axis interested in commit-level bug prediction
– Highlight commits that need more review
• Proof-of-concept for Jenkins
– Axis is a frequent contributor
– Jenkins is open source

Method
• Jenkins Dataset (~12 years 2006-2018)
– 26,378 commits (3.6% bug-introducing)
• Trained random forest classifier on 16 commit features
RQ1: Effects of oversampling and undersampling?
RQ2: Difference between cross-validation and a time-sensitive evaluation?

Relative Importance of the Features
• Churn
– Lines of code added / Total lines of code: 0.17
– Lines of code deleted / Total lines of code: 0.04
– Files churned / Number of files: 0.08
– Lines of code in previous version: 0.07
• Other features
– Number of modified subsystems: 0.11
– Number of modified sub-directories: 0.09
– Entropy (spreading of changes): 0.16
– Purpose of a change (e.g., bug fix): 0.03
– Number of previous committers: 0.08
– Time between committer’s contributions: 0.04
– Number of unique changes: 0.04
– Overall experience of committer: 0.04
– Recent experience of committer: 0.03
• Coupling
– Number of highly coupled files: 0.00
– Number of coupled files for all degrees: 0.01
– Number of non-modified coupled files: 0.01
1. Churn
2. Size
3. #Committers

Answering the RQs…
RQ1: Effects of oversampling and undersampling?
• Baseline sampling too conservative (<3% recall)
• Oversampling is essential
RQ2: Difference between cross-validation and a time-sensitive evaluation?
• Disregarding time gives overly positive recall (twice as high)
• Go beyond cross-validation
But 10-15% F-score is low…
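
Both answers can be illustrated in a few lines. The sketch below shows a time-ordered train/test split followed by random oversampling (with replacement) of the minority class in the training set only. Random oversampling is an assumption chosen for simplicity; the deck does not state which sampling technique the study used, and the field names are hypothetical.

```python
import random


def time_ordered_split(rows, train_fraction=0.8, time_key="timestamp"):
    """Train on the oldest commits and test on the newest ones."""
    ordered = sorted(rows, key=lambda row: row[time_key])
    cut = int(len(ordered) * train_fraction)
    return ordered[:cut], ordered[cut:]


def random_oversample(rows, label_key="buggy", seed=0):
    """Duplicate random minority-class rows until both classes are equal."""
    rng = random.Random(seed)
    positives = [row for row in rows if row[label_key]]
    negatives = [row for row in rows if not row[label_key]]
    minority, majority = sorted((positives, negatives), key=len)
    if not minority:
        return list(rows)  # nothing to duplicate
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return majority + minority + extra


if __name__ == "__main__":
    # Hypothetical commits: one bug-introducing commit out of ten.
    commits = [{"timestamp": t, "buggy": t == 3} for t in range(10)]
    train, test = time_ordered_split(commits)
    balanced_train = random_oversample(train)
    print(len(train), len(test), len(balanced_train))
```

Oversampling after the split keeps duplicated commits out of the test set, and splitting by time keeps the classifier from being evaluated on commits older than its training data.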

Current focus: SZZ for Faster Automatic Program Repair
[Diagram: binary search over the commits to locate a regression fault]
[Diagram: ML for risk profiling of commits to locate a regression fault]
Complement training data with bug-introducing commits from SZZ
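
For reference, the binary-search baseline can be sketched as below. It assumes the commits are ordered oldest to newest, that the newest commit exhibits the regression, and that every commit after the faulty one also fails. The `is_bad` callback stands in for building and testing a commit and is hypothetical.

```python
def first_bad_commit(commits, is_bad):
    """Return the earliest commit for which is_bad(commit) is True.

    commits: list ordered oldest to newest; the last one is assumed bad.
    is_bad: callable that builds and tests a commit (hypothetical).
    """
    low, high = 0, len(commits) - 1
    while low < high:
        mid = (low + high) // 2
        if is_bad(commits[mid]):
            high = mid      # the fault was introduced at mid or earlier
        else:
            low = mid + 1   # the fault was introduced after mid
    return commits[low]


if __name__ == "__main__":
    # Hypothetical history where the regression enters at "c5".
    history = [f"c{i}" for i in range(8)]
    tested = []

    def is_bad(commit):
        tested.append(commit)
        return int(commit[1:]) >= 5

    print(first_bad_commit(history, is_bad), "after testing", tested)
```

Each test halves the remaining range, so the number of builds grows with the logarithm of the number of commits.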

Feed ML with SZZ output
SZZ Unleashed is on GitHub
markus.borg@ri.se
@mrksbrg
mrksbrg.com
