SlideShare une entreprise Scribd logo
1  sur  19
Q: HOW TO DO
BETTER
EXPERIMENTS IN
SE?
TIM@MENZIES.US
WVU, SEPT 2013
FROM TURKISH TOASTERS
TO NASA SPACE SHIPS
2
Turhan, ESEj’09
Q: WHY IS THIS
AN URGENT
QUESTION?
4
WHAT’S AT
STAKE?
• “Transfer” is a core
scientific issue
• Lack of transfer of causal
effects is the
scandal of SE
• Replication is
Empirical SE is rare
• Conclusion instability
• It all depends.
• The full stop
syndrome
• The result?
• A funding crisis
5
THE BAD NEWS
7
WAR STORIES
(DEFECT PREDICTION)
Menzies:TSE’13
MANUAL TRANSFER (WAR STORIES)
• Kitchenham, Mendes et al, TSE 2007: for = against
• Zimmermann FSE, 2009: cross works in 4/600 times
8
THE GOOD NEWS
BETWEEN TURKISH TOASTERS
AND NASA SPACE SHIPS
10
Turhan, ESEj’09
Q: HOW TO TRANSFER
LESSONS LEARNED?
Ignore most of the data
• relevancy filtering: Turhan ESEj’09; Peters TSE’13
• variance filtering: Kocaguneli TSE’12,TSE’13
• performance similarities: He ESEM’13
Contort the data
• spectral learning (working in PCA
space or some other rotation)
Menzies, TSE’13; Nam, ICSE’13
Buildi a bickering committee
• Ensembles Minku, PROMISE’12
11
BTW, SOMETIMES, TRANFER
BETTER THAN LOCAL
12/1/2011
12
Minku:PROMISE’12 Nam:ICSE’13
Peters:TSE’13
THERE IS HOPE
• We’ve been looking in the wrong direction
• SE project data = surface features of an underlying effect
• Go beneath the surface
14
REFLECT LESS
ON RAW DIMENSIONS
12/1/2011
15
WHAT’S CHANGED?
Mark of the old novice:
• Mostly manual analysis
• Obsesses on all the raw data
• Shares “the” model (the only, the single)
• E.g. “Depth of inheritance
is “the” most important
predictor for defects.”
Mark of the new expert:
• Manual and automatic analysis
• Combinations of Human + AI:
• Each offering input and insights to the other
• Filters most of the data, transforms the rest
• Shares analysis methods
• Cost effective methods for generating local lessons
12/1/2011
16
Most
probably
wrong
NOT EXTERNAL VALIDITY
BUT “META-EXTERNAL VALIDITY”
No pair
programming,
CMM5, agile
programming,
etc etc
But
conclusion
stability,
generality
10/11/2013
17
With new data mining technologies, true picture
emerges, where we can see what is going on
12/1/2011
18
SO THERE IS HOPE
19

Contenu connexe

Plus de CS, NcState

Lexisnexis june9
Lexisnexis june9Lexisnexis june9
Lexisnexis june9
CS, NcState
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
CS, NcState
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
CS, NcState
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
CS, NcState
 

Plus de CS, NcState (20)

Future se oct15
Future se oct15Future se oct15
Future se oct15
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software Engineering
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
 
Lexisnexis june9
Lexisnexis june9Lexisnexis june9
Lexisnexis june9
 
Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).
 
Icse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceIcse15 Tech-briefing Data Science
Icse15 Tech-briefing Data Science
 
Kits to Find the Bits that Fits
Kits to Find  the Bits that Fits Kits to Find  the Bits that Fits
Kits to Find the Bits that Fits
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
 
Automated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSUAutomated Software Enging, Fall 2015, NCSU
Automated Software Enging, Fall 2015, NCSU
 
Requirements Engineering
Requirements EngineeringRequirements Engineering
Requirements Engineering
 
172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia
 
Automated Software Engineering
Automated Software EngineeringAutomated Software Engineering
Automated Software Engineering
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
Tim Menzies, directions in Data Science
Tim Menzies, directions in Data ScienceTim Menzies, directions in Data Science
Tim Menzies, directions in Data Science
 
Goldrush
GoldrushGoldrush
Goldrush
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Know thy tools
Know thy toolsKnow thy tools
Know thy tools
 
The Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataThe Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software Data
 
What Metrics Matter?
What Metrics Matter? What Metrics Matter?
What Metrics Matter?
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Dernier (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 

How to do better experiments in SE

  • 1. Q: HOW TO DO BETTER EXPERIMENTS IN SE? TIM@MENZIES.US WVU, SEPT 2013
  • 2. FROM TURKISH TOASTERS TO NASA SPACE SHIPS 2 Turhan, ESEj’09
  • 3. Q: WHY IS THIS AN URGENT QUESTION?
  • 4. 4
  • 5. WHAT’S AT STAKE? • “Transfer” is a core scientific issue • Lack of transfer of causal effects is the scandal of SE • Replication is Empirical SE is rare • Conclusion instability • It all depends. • The full stop syndrome • The result? • A funding crisis 5
  • 8. MANUAL TRANSFER (WAR STORIES) • Kitchenham, Mendes et al, TSE 2007: for = against • Zimmermann FSE, 2009: cross works in 4/600 times 8
  • 10. BETWEEN TURKISH TOASTERS AND NASA SPACE SHIPS 10 Turhan, ESEj’09
  • 11. Q: HOW TO TRANSFER LESSONS LEARNED? Ignore most of the data • relevancy filtering: Turhan ESEj’09; Peters TSE’13 • variance filtering: Kocaguneli TSE’12,TSE’13 • performance similarities: He ESEM’13 Contort the data • spectral learning (working in PCA space or some other rotation) Menzies, TSE’13; Nam, ICSE’13 Buildi a bickering committee • Ensembles Minku, PROMISE’12 11
  • 12. BTW, SOMETIMES, TRANFER BETTER THAN LOCAL 12/1/2011 12 Minku:PROMISE’12 Nam:ICSE’13 Peters:TSE’13
  • 13.
  • 14. THERE IS HOPE • We’ve been looking in the wrong direction • SE project data = surface features of an underlying effect • Go beneath the surface 14
  • 15. REFLECT LESS ON RAW DIMENSIONS 12/1/2011 15
  • 16. WHAT’S CHANGED? Mark of the old novice: • Mostly manual analysis • Obsesses on all the raw data • Shares “the” model (the only, the single) • E.g. “Depth of inheritance is “the” most important predictor for defects.” Mark of the new expert: • Manual and automatic analysis • Combinations of Human + AI: • Each offering input and insights to the other • Filters most of the data, transforms the rest • Shares analysis methods • Cost effective methods for generating local lessons 12/1/2011 16 Most probably wrong
  • 17. NOT EXTERNAL VALIDITY BUT “META-EXTERNAL VALIDITY” No pair programming, CMM5, agile programming, etc etc But conclusion stability, generality 10/11/2013 17
  • 18. With new data mining technologies, true picture emerges, where we can see what is going on 12/1/2011 18 SO THERE IS HOPE
  • 19. 19