SlideShare une entreprise Scribd logo
1  sur  12
Comments, panel session: ICSE’14:
“After the gold rush”
tim.menzies@gmail.com
Dagstuhl workshop on SA
June 23, 2014
Organized by…. Da bird Da Men Da Mann 1
Welcome to
the Wild West
• Gold rush:
– To extract nuggets of insight.
• Too many inexperienced
“cowboys”:
– No use of best or safe practices.
• “Goldfish bowl” panel:
– Best practices, what not to do
2
Street light effect
• Moral #1:
– look at the real data, not
just the conveniently
available data
• “Before we collect the
data, need to redefine the
right data to collect.”
• “Garbage in, garbage out”
• “Before analyzing
terabytes of data, reflect
some on user goals.”
“I’m looking for my keys.”
3
Can we reason about data, without
detailed background knowledge?
• Traditional view:
– GQM
– Traditional science:
• Define data to collected
• Collect
• Infer
• “Newer” view:
– Operational science (Mockus, keynote, MSR, 2014)
• Find data
• Reason about it
• Prone to “streetlight effect”
4
Empiricists : Rationalists
• Observation : use of background knowledge
– Mockus : Basili
– Norvig : Chomsky
– Locke : Leibnitz
– Aristotle : Plato
– Newton : Descartes
• Ike said:
– “I make no hypotheses.”
– “I feign no hypotheses.”
5
Working in the light
• Moral #2:
– “before we rush to the new,
lets reflect on what we can
learn from what we can see
right now. “
• “Too fast, too early to
expect actionable data for
some specific area. “
• “Building models on what
we have before pushing
ahead.”
• “Need to invest more in
data science
infrastructure”
“I’m exploring the data
without pre-conceived biases.”
6
Full disclosure: Menzies is not rational
(he’s an empiricist)
Accidental
discoveries
• America
• Penicillin
• Anesthesia
• Big bang radiation
• Internal pacemakers
• Microwave ovens
• X-rays
• Safety glass
• Plastics (non-conductive heat
resistant polymer)
• Vulcanised rubber
• Viagra
Wikipedia lists of human
cognitive biases
• 100+ entries
– The way we routinely get it
wrong, every day.
7
All (current) conclusions
are wrong
Prone to revisions
• By subsequent analysis
• Any current models is
– Wrong, but useful
• Timm’s Law:
– Less conclusions, More
conversations
To find better conclusions …
• Just keep looking
• For a community to find
better conclusions
– Discuss more, share
more
8
Between Turkish Toasters
AND NASA Space Ships
9
Raw dimensions less informative than
underlying dimensions
10
Q: How to TRANSFER
Lessons Learned?
• Ignore most of the data
• relevancy filtering: Turhan ESEj’09; Peters TSE’13
• variance filtering: Kocaguneli TSE’12,TSE’13
• performance similarities: He ESEM’13
• Contort the data
• spectral learning (working in PCA
space or some other rotation)
Menzies, TSE’13; Nam, ICSE’13
• Build a bickering committee
• Ensembles Minku, PROMISE’12
11
A little bit of this,
a little bit of that
• Moral #3:
– “My Father’s house has
many rooms.”
• “False dichotomy between
these two approaches.
Really need to bridge
these two modes.“
• “Not the scientific method
but scientific methods
(plural).”
“Aha! Bus tracks! I can follow these to the bus stop
and not drive home drunk.”
12

Contenu connexe

Similaire à Goldrush

Laws and limits of data science 11 10-14
Laws and limits of data science 11 10-14Laws and limits of data science 11 10-14
Laws and limits of data science 11 10-14Michael Brodie
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxHASHEMHASH
 
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...Where is the Share Button? Intellectual Property Issues Surrounding Data for ...
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...Rebekah Cummings
 
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...Seattle Interactive Conference
 
The science and art of methodology
The science and art of methodologyThe science and art of methodology
The science and art of methodologyJosh LaMar
 
Data science and good questions eric kostello
Data science and good questions eric kostelloData science and good questions eric kostello
Data science and good questions eric kostelloData Con LA
 
Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014MedicReS
 
Look Around: Question Answering, Serendipity, and the Research Process of Sch...
Look Around: Question Answering, Serendipity, and the Research Process of Sch...Look Around: Question Answering, Serendipity, and the Research Process of Sch...
Look Around: Question Answering, Serendipity, and the Research Process of Sch...KimberleyMartin
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentVaticle
 
Privacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebPrivacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebMatthew Russell
 
NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015Jackie Wirz, PhD
 
Citizen Scientists: Design and Practice
Citizen Scientists: Design and PracticeCitizen Scientists: Design and Practice
Citizen Scientists: Design and PracticeAlice Sheppard
 
Bias and the Data Lifecycle
Bias and the Data LifecycleBias and the Data Lifecycle
Bias and the Data LifecycleRichard Ferrers
 
A Brief Tutorial On Data Mining-20140701
A Brief Tutorial On Data Mining-20140701A Brief Tutorial On Data Mining-20140701
A Brief Tutorial On Data Mining-20140701Xiaming Chen
 
المحاضرة الثانية - سلسلة اساسيات البحث العلمي
المحاضرة الثانية - سلسلة اساسيات البحث العلميالمحاضرة الثانية - سلسلة اساسيات البحث العلمي
المحاضرة الثانية - سلسلة اساسيات البحث العلميEgypt Scholars Inc.
 
BSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychology
BSides Augusta 2015 - Building a Better Analyst Using Cognitive PsychologyBSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychology
BSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychologychrissanders88
 

Similaire à Goldrush (20)

Laws and limits of data science 11 10-14
Laws and limits of data science 11 10-14Laws and limits of data science 11 10-14
Laws and limits of data science 11 10-14
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...Where is the Share Button? Intellectual Property Issues Surrounding Data for ...
Where is the Share Button? Intellectual Property Issues Surrounding Data for ...
 
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...
Communicating Clickable Complexities: From Nuclei to AI by Jenny Burns & Rach...
 
The science and art of methodology
The science and art of methodologyThe science and art of methodology
The science and art of methodology
 
Data science and good questions eric kostello
Data science and good questions eric kostelloData science and good questions eric kostello
Data science and good questions eric kostello
 
Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014
 
Look Around: Question Answering, Serendipity, and the Research Process of Sch...
Look Around: Question Answering, Serendipity, and the Research Process of Sch...Look Around: Question Answering, Serendipity, and the Research Process of Sch...
Look Around: Question Answering, Serendipity, and the Research Process of Sch...
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug Development
 
OpenML 2014
OpenML 2014OpenML 2014
OpenML 2014
 
Privacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebPrivacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social Web
 
NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015
 
Citizen Scientists: Design and Practice
Citizen Scientists: Design and PracticeCitizen Scientists: Design and Practice
Citizen Scientists: Design and Practice
 
Bias and the Data Lifecycle
Bias and the Data LifecycleBias and the Data Lifecycle
Bias and the Data Lifecycle
 
A Brief Tutorial On Data Mining-20140701
A Brief Tutorial On Data Mining-20140701A Brief Tutorial On Data Mining-20140701
A Brief Tutorial On Data Mining-20140701
 
World ctc2013scoopitcytomics
World ctc2013scoopitcytomicsWorld ctc2013scoopitcytomics
World ctc2013scoopitcytomics
 
Welcome to Earth Science
Welcome to Earth ScienceWelcome to Earth Science
Welcome to Earth Science
 
المحاضرة الثانية - سلسلة اساسيات البحث العلمي
المحاضرة الثانية - سلسلة اساسيات البحث العلميالمحاضرة الثانية - سلسلة اساسيات البحث العلمي
المحاضرة الثانية - سلسلة اساسيات البحث العلمي
 
BSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychology
BSides Augusta 2015 - Building a Better Analyst Using Cognitive PsychologyBSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychology
BSides Augusta 2015 - Building a Better Analyst Using Cognitive Psychology
 
Big data and the question of objectivity
Big data and  the question of objectivityBig data and  the question of objectivity
Big data and the question of objectivity
 

Plus de CS, NcState

Talks2015 novdec
Talks2015 novdecTalks2015 novdec
Talks2015 novdecCS, NcState
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringCS, NcState
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest linkCS, NcState
 
Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).CS, NcState
 
Icse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceIcse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceCS, NcState
 
Kits to Find the Bits that Fits
Kits to Find  the Bits that Fits Kits to Find  the Bits that Fits
Kits to Find the Bits that Fits CS, NcState
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab templateCS, NcState
 
Requirements Engineering
Requirements EngineeringRequirements Engineering
Requirements EngineeringCS, NcState
 
172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginiaCS, NcState
 
Automated Software Engineering
Automated Software EngineeringAutomated Software Engineering
Automated Software EngineeringCS, NcState
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)CS, NcState
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1CS, NcState
 
What Metrics Matter?
What Metrics Matter? What Metrics Matter?
What Metrics Matter? CS, NcState
 
In the age of Big Data, what role for Software Engineers?
In the age of Big Data, what role for Software Engineers?In the age of Big Data, what role for Software Engineers?
In the age of Big Data, what role for Software Engineers?CS, NcState
 
Sayyad slides ase13_v4
Sayyad slides ase13_v4Sayyad slides ase13_v4
Sayyad slides ase13_v4CS, NcState
 
Warning: don't do CS
Warning: don't do CSWarning: don't do CS
Warning: don't do CSCS, NcState
 
How to do better experiments in SE
How to do better experiments in SEHow to do better experiments in SE
How to do better experiments in SECS, NcState
 

Plus de CS, NcState (20)

Talks2015 novdec
Talks2015 novdecTalks2015 novdec
Talks2015 novdec
 
Future se oct15
Future se oct15Future se oct15
Future se oct15
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software Engineering
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).Welcome to ICSE NIER’15 (new ideas and emerging results).
Welcome to ICSE NIER’15 (new ideas and emerging results).
 
Icse15 Tech-briefing Data Science
Icse15 Tech-briefing Data ScienceIcse15 Tech-briefing Data Science
Icse15 Tech-briefing Data Science
 
Kits to Find the Bits that Fits
Kits to Find  the Bits that Fits Kits to Find  the Bits that Fits
Kits to Find the Bits that Fits
 
Ai4se lab template
Ai4se lab templateAi4se lab template
Ai4se lab template
 
Requirements Engineering
Requirements EngineeringRequirements Engineering
Requirements Engineering
 
172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia172529main ken and_tim_software_assurance_research_at_west_virginia
172529main ken and_tim_software_assurance_research_at_west_virginia
 
Automated Software Engineering
Automated Software EngineeringAutomated Software Engineering
Automated Software Engineering
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Know thy tools
Know thy toolsKnow thy tools
Know thy tools
 
What Metrics Matter?
What Metrics Matter? What Metrics Matter?
What Metrics Matter?
 
In the age of Big Data, what role for Software Engineers?
In the age of Big Data, what role for Software Engineers?In the age of Big Data, what role for Software Engineers?
In the age of Big Data, what role for Software Engineers?
 
Sayyad slides ase13_v4
Sayyad slides ase13_v4Sayyad slides ase13_v4
Sayyad slides ase13_v4
 
Ase2013
Ase2013Ase2013
Ase2013
 
Warning: don't do CS
Warning: don't do CSWarning: don't do CS
Warning: don't do CS
 
How to do better experiments in SE
How to do better experiments in SEHow to do better experiments in SE
How to do better experiments in SE
 

Dernier

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 

Dernier (20)

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Goldrush

  • 1. Comments, panel session: ICSE’14: “After the gold rush” tim.menzies@gmail.com Dagstuhl workshop on SA June 23, 2014 Organized by…. Da bird Da Men Da Mann 1
  • 2. Welcome to the Wild West • Gold rush: – To extract nuggets of insight. • Too many inexperienced “cowboys”: – No use of best or safe practices. • “Goldfish bowl” panel: – Best practices, what not to do 2
  • 3. Street light effect • Moral #1: – look at the real data, not just the conveniently available data • “Before we collect the data, need to redefine the right data to collect.” • “Garbage in, garbage out” • “Before analyzing terabytes of data, reflect some on user goals.” “I’m looking for my keys.” 3
  • 4. Can we reason about data, without detailed background knowledge? • Traditional view: – GQM – Traditional science: • Define data to collected • Collect • Infer • “Newer” view: – Operational science (Mockus, keynote, MSR, 2014) • Find data • Reason about it • Prone to “streetlight effect” 4
  • 5. Empiricists : Rationalists • Observation : use of background knowledge – Mockus : Basili – Norvig : Chomsky – Locke : Leibnitz – Aristotle : Plato – Newton : Descartes • Ike said: – “I make no hypotheses.” – “I feign no hypotheses.” 5
  • 6. Working in the light • Moral #2: – “before we rush to the new, lets reflect on what we can learn from what we can see right now. “ • “Too fast, too early to expect actionable data for some specific area. “ • “Building models on what we have before pushing ahead.” • “Need to invest more in data science infrastructure” “I’m exploring the data without pre-conceived biases.” 6
  • 7. Full disclosure: Menzies is not rational (he’s an empiricist) Accidental discoveries • America • Penicillin • Anesthesia • Big bang radiation • Internal pacemakers • Microwave ovens • X-rays • Safety glass • Plastics (non-conductive heat resistant polymer) • Vulcanised rubber • Viagra Wikipedia lists of human cognitive biases • 100+ entries – The way we routinely get it wrong, every day. 7
  • 8. All (current) conclusions are wrong Prone to revisions • By subsequent analysis • Any current models is – Wrong, but useful • Timm’s Law: – Less conclusions, More conversations To find better conclusions … • Just keep looking • For a community to find better conclusions – Discuss more, share more 8
  • 9. Between Turkish Toasters AND NASA Space Ships 9
  • 10. Raw dimensions less informative than underlying dimensions 10
  • 11. Q: How to TRANSFER Lessons Learned? • Ignore most of the data • relevancy filtering: Turhan ESEj’09; Peters TSE’13 • variance filtering: Kocaguneli TSE’12,TSE’13 • performance similarities: He ESEM’13 • Contort the data • spectral learning (working in PCA space or some other rotation) Menzies, TSE’13; Nam, ICSE’13 • Build a bickering committee • Ensembles Minku, PROMISE’12 11
  • 12. A little bit of this, a little bit of that • Moral #3: – “My Father’s house has many rooms.” • “False dichotomy between these two approaches. Really need to bridge these two modes.“ • “Not the scientific method but scientific methods (plural).” “Aha! Bus tracks! I can follow these to the bus stop and not drive home drunk.” 12