SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Five Days of
Empirical Software Engineering:
The PASED Experience
Massimiliano Di Penta
Giuliano Antoniol
Daniel M. Germán
Yann-Gaël Guéhéneuc
Bram Adams
Motivation
Empirical background important for graduate students
Courses on statistics insufficient to provide such a background
(Most) University curricula could not afford do have specific
courses
Some exceptions (there are others for sure!):

•

Easterbrook’s CSC2130: Empirical Research Methods in
Software Engineering at the University of Toronto (2009)

•

Herbsleb’s 08-803: Empirical Methods for Socio-Technical
Research at CMU (2010)

•

Dewayne E. Perry course, Univ. of Texas at Austin
So students need that!
General Info
About the School
• Ecole Polytechnique de Montréal, June 2011
• Funded by MITACS
• low fee for students $250 all included
• 44 participants, 9 countries and 25 different
institutions

• More on http://pased.soccerlab.polymtl.ca
Learning Objectives
1. plan and conduct software engineering
experiments with human subjects and collect
related data
2. plan and conduct software engineering studies
involving the mining of data from
(un)structured software repositories
3. build prediction and classification models from
the collected data, and to use these models
Challenges

Choosing
Topics

Combining
theory and
practice
Dealing with
heterogeneous
participants
What Topics?
What Topics?

Planning the
study
What Topics?
Getting the
data

Planning the
study
What Topics?
Getting the
data

Planning the
study

Analyzing
results
Only 5 days....

...that’s too much!
The Approach
• Learn by example and by doing format
• Experiment design principles and

statistics introduced by presenting cases
from studies in literature

• Practical application of theoretical concepts
during labs

• Course material and laboratory packages
available online, including course videos
School Content

PM

Exp.
Design

Text
mining

Keynote

AM

Mining
Software
Archives

Keynote

Keynote

Keynote

Keynote

Hands
on
lab

Hands
on
lab

Hands
on
lab

Hands
on
lab

Hands
on
lab

Statistical Predictor
Analysis
Models
Learning by doing...
Running example I

from the
school

• Use of UML Stereotypes in comprehension and
maintenance tasks

• Filippo Ricca, Massimiliano Di Penta, Marco Torchiano, Paolo Tonella, Mariano

Ceccato: How Developers' Experience and Ability Influence Web Application
Comprehension Tasks Supported by UML Stereotypes: A Series of Four
Experiments. IEEE Trans. Software Eng. 36(1): 96-118 (2010)

• Filippo Ricca, Massimiliano Di Penta, Marco Torchiano, Paolo Tonella, Mariano

Ceccato: The Role of Experience and Ability in Comprehension Tasks Supported by
UML Stereotypes. ICSE 2007: 375-384

• In the following briefly referred as “Conallen”
Experiment Design

from the
school

Group 1
Lab 1

Lab 2

Group 2

Group 3

Group 4

Sys A

Sys A

Sys B

Sys B

Sys B

Sys B

Sys A

Sys A
Data format: example

from the
school
Boxplots: Conallen

from the
school
Paired analysis: example
ID

F.Conallen

F.UML

T20

0.74

0.74

T21

0.74
0.7

0.29

T24

0.88

0.62

T25

0.75

0.8

T26

0.66

0.39

T27

0.35

0.51

T28

0.62

0.59

T29

0.57

0.68

T30

0.73

0.43

T32

0.74

>wilcox.test(F.Conallen,F.UML, paired=TRUE,
alternative="greater")

0.51

T22

from the
school

0.56

Wilcoxon signed rank test with continuity correction

data: F.Conallen and F.UML
V = 138, p-value = 0.04354
alternative hypothesis: true location shift is greater than 0

• Must have data in a paired format
• or you can use a proper R script
• Need to remove subjects that took part to one lab only
• For parametric statistics just replace wilcox.test with t.test
Hands on Labs
•

Mining software repository challenge: extract
interesting facts from git

•

Experiment design: groups working together on
designing a study

•

Data analysis: text mining, analyze working data sets of
previous experiments and build bug predictors

•

Working data sets from previous experiments,
PROMISE data sets

•

Tools: R and Weka
Lab Script Example
#UNPAIRED ANALYSIS
#Analysis of single experiment
#Mann-Whitney
attach(tbn)
wilcox.test(Correct[Fit=="yes"],Correct[Fit=="no"],paired=FALSE,alternative="greater")
attach(ttrento)
wilcox.test(Correct[Fit=="yes"],Correct[Fit=="no"],paired=FALSE,alternative="greater")
attach(tphd)
wilcox.test(Correct[Fit=="yes"],Correct[Fit=="no"],paired=FALSE,alternative="greater")
#All data
attach(t)
wilcox.test(Correct[Fit=="yes"],Correct[Fit=="no"],paired=FALSE,alternative="greater")
#Exercises:
# 1) perform a two-tailed test
# 2) can t-test be applied instead of Wilcoxon test? test for data normality using the Wilk-Shapiro test
# 3) repeat the analysis using the t-test?
# 4) repeat the analysis for the time dependent variable
Calibrating Courses to
Participants’ Profiles
Excellent

Excellent

1

Good

Good

21

Basic
0

10

20

Good

24

10

20

30

Mining Sw repositories

30

12

Basic

25

None

1

20

1

Good

16

Basic

10

Empirical Sw Engineering
Excellent

4

0

2

0

30

Statistical Analyses

None

18

None

1

Excellent

20

Basic

23

None

4

7

0

10

20

30

Machine learning
Feedbacks
Guidelines on
what not to do

Tutorials
on tools

Longer
Labs

How to write
empirical
papers
Acknowledgments
•
•

•
•
•

Lecturers (other than the paper authors):

•
•

Ahmed E. Hassan, Queen’s University, Canada
Andrian Marcus, Wayne State University, USA

Keynote Speakers

•
•
•
•
•

Gail Murphy, University of British Columbia, USA
Prem Devanbu, UC Davis, USA
Alain Picard, Benchmark Consulting Services Inc., Canada
Maria Codipiero, Peter Colligan, Kal Murtaia, SAP, Canada
Marc-André Decoste, Google Montréal, Canada

Student volunteers
The attendees!
MITACS (http://www.mitacs.ca/)
Conclusions
See12.ppt

Contenu connexe

En vedette

En vedette (19)

MSR Asia Summit
MSR Asia SummitMSR Asia Summit
MSR Asia Summit
 
Ppap13b.ppt
Ppap13b.pptPpap13b.ppt
Ppap13b.ppt
 
Ssbse12a.ppt
Ssbse12a.pptSsbse12a.ppt
Ssbse12a.ppt
 
Icsoc12 tooldemo.ppt
Icsoc12 tooldemo.pptIcsoc12 tooldemo.ppt
Icsoc12 tooldemo.ppt
 
Wcre12c.ppt
Wcre12c.pptWcre12c.ppt
Wcre12c.ppt
 
Ppap13a.ppt
Ppap13a.pptPpap13a.ppt
Ppap13a.ppt
 
Wcre13c.pdf
Wcre13c.pdfWcre13c.pdf
Wcre13c.pdf
 
Ssbse12b.ppt
Ssbse12b.pptSsbse12b.ppt
Ssbse12b.ppt
 
Wcre13b.ppt
Wcre13b.pptWcre13b.ppt
Wcre13b.ppt
 
Rsse12.ppt
Rsse12.pptRsse12.ppt
Rsse12.ppt
 
Icsm07 tooldemo.pdf
Icsm07 tooldemo.pdfIcsm07 tooldemo.pdf
Icsm07 tooldemo.pdf
 
Wcre12b.ppt
Wcre12b.pptWcre12b.ppt
Wcre12b.ppt
 
Wcre13a.ppt
Wcre13a.pptWcre13a.ppt
Wcre13a.ppt
 
Mribp13.ppt
Mribp13.pptMribp13.ppt
Mribp13.ppt
 
Software Design Patterns in Theory
Software Design Patterns in TheorySoftware Design Patterns in Theory
Software Design Patterns in Theory
 
Quality and Software Design Patterns
Quality and Software Design PatternsQuality and Software Design Patterns
Quality and Software Design Patterns
 
AsianPLoP'14: How and Why Design Patterns Impact Quality and Future Challenges
AsianPLoP'14: How and Why Design Patterns Impact Quality and Future ChallengesAsianPLoP'14: How and Why Design Patterns Impact Quality and Future Challenges
AsianPLoP'14: How and Why Design Patterns Impact Quality and Future Challenges
 
Software Design Patterns in Practice
Software Design Patterns in PracticeSoftware Design Patterns in Practice
Software Design Patterns in Practice
 
Jcom02.ppt
Jcom02.pptJcom02.ppt
Jcom02.ppt
 

Similaire à See12.ppt

01_introduction.pdfbnmelllleitrthnjjjkkk
01_introduction.pdfbnmelllleitrthnjjjkkk01_introduction.pdfbnmelllleitrthnjjjkkk
01_introduction.pdfbnmelllleitrthnjjjkkk
JesusTekonbo
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
butest
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
SisayNegash4
 

Similaire à See12.ppt (20)

See12.ppt
See12.pptSee12.ppt
See12.ppt
 
ICSE12 SEE.ppt
ICSE12 SEE.pptICSE12 SEE.ppt
ICSE12 SEE.ppt
 
Hayes
HayesHayes
Hayes
 
Digital experiences in technical higher education
Digital experiences in technical higher educationDigital experiences in technical higher education
Digital experiences in technical higher education
 
Understand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analyticsUnderstand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analytics
 
The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...The power of learning analytics for UCL: lessons learned from the Open Univer...
The power of learning analytics for UCL: lessons learned from the Open Univer...
 
Machine learning
Machine learningMachine learning
Machine learning
 
The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator The Anatomy of a 21st Century Educator
The Anatomy of a 21st Century Educator
 
ASEE Paper Using Online Endless Quizzes as Graded Homework.pdf
ASEE Paper  Using Online Endless Quizzes as Graded Homework.pdfASEE Paper  Using Online Endless Quizzes as Graded Homework.pdf
ASEE Paper Using Online Endless Quizzes as Graded Homework.pdf
 
01_introduction.pdfbnmelllleitrthnjjjkkk
01_introduction.pdfbnmelllleitrthnjjjkkk01_introduction.pdfbnmelllleitrthnjjjkkk
01_introduction.pdfbnmelllleitrthnjjjkkk
 
INTRO TO ML.pptx
INTRO TO ML.pptxINTRO TO ML.pptx
INTRO TO ML.pptx
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
 
Machine Learning GDSC DCE Darbhanga.pptx
Machine Learning GDSC DCE Darbhanga.pptxMachine Learning GDSC DCE Darbhanga.pptx
Machine Learning GDSC DCE Darbhanga.pptx
 
The students intrinsic and extrinsic motivations
The students intrinsic and extrinsic motivationsThe students intrinsic and extrinsic motivations
The students intrinsic and extrinsic motivations
 
.doc
.doc.doc
.doc
 
.doc
.doc.doc
.doc
 
Keynote H818 The Power of (In)formal learning: a learning analytics approach
Keynote H818 The Power of (In)formal learning: a learning analytics approachKeynote H818 The Power of (In)formal learning: a learning analytics approach
Keynote H818 The Power of (In)formal learning: a learning analytics approach
 
Heidelberg presentation
Heidelberg presentationHeidelberg presentation
Heidelberg presentation
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
 

Plus de Ptidej Team

Plus de Ptidej Team (20)

From IoT to Software Miniaturisation
From IoT to Software MiniaturisationFrom IoT to Software Miniaturisation
From IoT to Software Miniaturisation
 
Presentation
PresentationPresentation
Presentation
 
Presentation
PresentationPresentation
Presentation
 
Presentation
PresentationPresentation
Presentation
 
Presentation by Lionel Briand
Presentation by Lionel BriandPresentation by Lionel Briand
Presentation by Lionel Briand
 
Manel Abdellatif
Manel AbdellatifManel Abdellatif
Manel Abdellatif
 
Azadeh Kermansaravi
Azadeh KermansaraviAzadeh Kermansaravi
Azadeh Kermansaravi
 
Mouna Abidi
Mouna AbidiMouna Abidi
Mouna Abidi
 
CSED - Manel Grichi
CSED - Manel GrichiCSED - Manel Grichi
CSED - Manel Grichi
 
Cristiano Politowski
Cristiano PolitowskiCristiano Politowski
Cristiano Politowski
 
Will io t trigger the next software crisis
Will io t trigger the next software crisisWill io t trigger the next software crisis
Will io t trigger the next software crisis
 
MIPA
MIPAMIPA
MIPA
 
Thesis+of+laleh+eshkevari.ppt
Thesis+of+laleh+eshkevari.pptThesis+of+laleh+eshkevari.ppt
Thesis+of+laleh+eshkevari.ppt
 
Thesis+of+nesrine+abdelkafi.ppt
Thesis+of+nesrine+abdelkafi.pptThesis+of+nesrine+abdelkafi.ppt
Thesis+of+nesrine+abdelkafi.ppt
 
Medicine15.ppt
Medicine15.pptMedicine15.ppt
Medicine15.ppt
 
Qrs17b.ppt
Qrs17b.pptQrs17b.ppt
Qrs17b.ppt
 
Icpc11c.ppt
Icpc11c.pptIcpc11c.ppt
Icpc11c.ppt
 
Icsme16.ppt
Icsme16.pptIcsme16.ppt
Icsme16.ppt
 
Msr17a.ppt
Msr17a.pptMsr17a.ppt
Msr17a.ppt
 
Icsoc15.ppt
Icsoc15.pptIcsoc15.ppt
Icsoc15.ppt
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

See12.ppt