Community-Assisted Software Engineering Decision Making

•

0 j'aime•382 vues

gregoryg

Formation Technologie

AI in SE: A Success Story
Large, active field, with:
● Growing research community
● Numerous conferences and workshops,
such as MSR, PROMISE, RAISE
● Large data repositories
● History of collaboration between industry
and academia
2

We're already good at drawing useful
conclusions. We expect further algorithmic
improvements.
But...
We need to improve our data!
3

Problem 1:
We don't know what data we need.
Trying to solve complex problems. Make
guesses, then collect data.
Results in missing attributes, added noise.
4

Problem 2:
The data we have is often weak.
Solution quality depends on data quality.
Some commonly-used data sets infamous for
missing values, unhelpful attributes, poor
recording standards.
5

We should improve data standards, but..
We need to use the data we have.
Synergy of human feedback and AI to turn
static data models into dynamic models.
Bring a Wikipedia model to data sets.
6

Enhanced Feedback Loop
8
Recommendation:
MC/DC
Helpful?
Yes
New Values for
Existing Attributes:
Num. Boolean
Expressions: 219
Num. Numeric
Calculations: 73
New Attributes to
Collect (and Values):
Ratio of Boolean to
Numeric Calculations:
3:1
Data to Delete:
Projects 1, 3, 7

Why should we enhance our data?
These dynamic data models allow:
● Low start-up costs.
● Build body of evidence over time.
● Address data quality issues.
● Human-in-the-loop feedback.
9

Challenge 1:
How do we collect feedback?
10

Challenge 2:
How do we use feedback?
Fundamental trade-off between human curation
and automated AI learning.
When should attributes be filtered? Un-updated
data phased out? New data added?
11

Challenge 3:
Motivating Users
How do we motivate users to:
● Provide feedback.
● Add new data.
● Update old data.
12

Motivation requires:
1. Incentive.
2. Ease of use/contribution.
3. Utility from and trust in the model.
13

We propose feedback-driven dynamic
data models maintained by a synergy of
user-feedback and automated AI techniques.
We propose that dynamic data will allow for
low start-up costs, a stronger body of
evidence over time, and adaptations to
changing industrial conditions.
14

For discussion...
1. Is this even a good idea?
2. What can we do to solve data quality
issues? (other than just the idea suggested
here)
3. What kind of data would benefit from
dynamic adaptation?
4. How do we motivate users to provide
feedback, new data, and update old data?
15

Contenu connexe

Tendances

20151016 Data Science For Project ManagersTze-Yiu Yong

Introduction to data scienceKoo Ping Shung

1440 track 2 boire_using our laptopRising Media, Inc.

Future of datasciencejyostnanareshit

IT & Innovation - short summaryPerry Nouwens

Ml in a day v 1.1CCG

Vikrant data scientistVikrant Narayan

Managing Data Science | Lessons from the Field Domino Data Lab

CRISP-DM: a data science project methodologySergey Shelpuk

What Is Data Science? | Introduction to Data Science | Data Science For Begin...Simplilearn

Evaluation of big data analysisΚαρολίνα Κάτι

Data quality management BasicKhaled Mosharraf

5 ways to get more from data scienceTyrone Systems

Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Edureka!

Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!

Simplify your analytics strategyTavva G N R S N Prudhvith

PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014Daniel Westzaan

eResearch AU 2015, intro slidesTomasz Bednarz

What is data science ?Bohitesh Misra, PMP

Tendances (19)

20151016 Data Science For Project Managers

Introduction to data science

1440 track 2 boire_using our laptop

Future of datascience

IT & Innovation - short summary

Ml in a day v 1.1

Vikrant data scientist

Managing Data Science | Lessons from the Field

CRISP-DM: a data science project methodology

What Is Data Science? | Introduction to Data Science | Data Science For Begin...

Evaluation of big data analysis

Data quality management Basic

5 ways to get more from data science

Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...

Data Science Tutorial | Introduction To Data Science | Data Science Training ...

Simplify your analytics strategy

PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014

eResearch AU 2015, intro slides

What is data science ?

En vedette

The Robust Optimization of Non-Linear Requirements Modelsgregoryg

Cukic Promise08 V3gregoryg

Unit 6Azhar Shaik

14 software technical_metricsUniversity of Computer Science and Technology

Software MetricsMassimo Felici

Software Metricsswatisinghal

Software design metricsPrasad Narasimhan

En vedette (7)

The Robust Optimization of Non-Linear Requirements Models

Cukic Promise08 V3

Unit 6

14 software technical_metrics

Software Metrics

Software design metrics

Similaire à Community-Assisted Software Engineering Decision Making

Data Science for Business Managers - An intro to ROI for predictive analyticsAkin Osman Kazakci

Better Living Through Analytics - Strategies for Data DecisionsProduct School

The Analytics and Data Science LandscapePhilip Bourne

Introduction to Data ScienceSrishti44

Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Sahilakhurana

Big Data & Business Analytics: Understanding the MarketspaceBala Iyer

Implementing Data Mesh WP LTIMindtree White Papershashanksalunkhe12

Top Rated Dissertation Data Analysis Services | PhD AssistancePHDAssistance2

Real-Time Data Analytics ExamplesPHDAssistance2

Building the Analytics CapabilityBala Iyer

big data analytics pgpmx2015Sanmeet Dhokay

Why Everything You Know About bigdata Is A LieSunil Ranka

IRJET- A Survey on Mining of Tweeter Data for Predicting User BehaviorIRJET Journal

Data Science course in Hyderabad .rajasrichalamala3zen

data science course in Hyderabad data science course in Hyderabadakhilamadupativibhin

data science course training in Hyderabadmadhupriya3zen

data science.pptxshaikruhiarsha3zenco

best data science course institutes in Hyderabadrajasrichalamala3zen

Similaire à Community-Assisted Software Engineering Decision Making (20)

Data Science for Business Managers - An intro to ROI for predictive analytics

Better Living Through Analytics - Strategies for Data Decisions

The Analytics and Data Science Landscape

Introduction to Data Science

Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...

Big Data & Business Analytics: Understanding the Marketspace

Implementing Data Mesh WP LTIMindtree White Paper

Top Rated Dissertation Data Analysis Services | PhD Assistance

Real-Time Data Analytics Examples

Building the Analytics Capability

big data analytics pgpmx2015

Why Everything You Know About bigdata Is A Lie

IRJET- A Survey on Mining of Tweeter Data for Predicting User Behavior

Data Science course in Hyderabad .

data science course in Hyderabad data science course in Hyderabad

data science course training in Hyderabad

data science.pptx

best data science course institutes in Hyderabad

Plus de gregoryg

Finding Robust Solutions to Requirements Modelsgregoryg

Distributed Decision Tree Inductiongregoryg

Irrf Presentationgregoryg

Optimizing Requirements Decisions with KEYSgregoryg

Confidence in Software Cost Estimation Results based on MMRE and PREDgregoryg

Promise08 Wrapupgregoryg

Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...gregoryg

Software Defect Repair Times: A Multiplicative Modelgregoryg

Complementing Approaches in ERP Effort Estimation Practice: an Industrial Studygregoryg

Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...gregoryg

Implications of Ceiling Effects in Defect Predictorsgregoryg

Practical use of defect detection and predictiongregoryg

Risk And Relevance 20080414pptgregoryg

Organizations Use Datagregoryg

Boetticher Presentation Promise 2008v2gregoryg

Elane - Promise08gregoryg

Risk And Relevance 20080414pptgregoryg

Introduction Promise 2008 V3gregoryg

Plus de gregoryg (18)

Finding Robust Solutions to Requirements Models

Distributed Decision Tree Induction

Irrf Presentation

Optimizing Requirements Decisions with KEYS

Confidence in Software Cost Estimation Results based on MMRE and PRED

Promise08 Wrapup

Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...

Software Defect Repair Times: A Multiplicative Model

Complementing Approaches in ERP Effort Estimation Practice: an Industrial Study

Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...

Implications of Ceiling Effects in Defect Predictors

Practical use of defect detection and prediction

Risk And Relevance 20080414ppt

Organizations Use Data

Boetticher Presentation Promise 2008v2

Elane - Promise08

Risk And Relevance 20080414ppt

Introduction Promise 2008 V3

Dernier

Sports & Fitness Value Added Course FY..Disha Kariya

Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732

Advance Mobile Application Development class 07Dr. Mazin Mohamed alkathiri

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31

APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management

Advanced Views - Calendar View in Odoo 17Celine George

CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2

Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre

The byproduct of sericulture in different industries.pptxShobhayan Kirtania

Software Engineering Methodologies (overview)eniolaolutunde

1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh

Interactive Powerpoint_How to Master effective communicationnomboosow

Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha

Arihant handbook biology for class 11 .pdfchloefrazer622

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"National Information Standards Organization (NISO)

Activity 01 - Artificial Culture (1).pdfciinovamais

Nutritional Needs Presentation - HLTH 104misteraugie

microwave assisted reaction. General introductionMaksud Ahmed

Mattingly "AI & Prompt Design: The Basics of Prompt Design"National Information Standards Organization (NISO)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching

Dernier (20)

Sports & Fitness Value Added Course FY..

Separation of Lanthanides/ Lanthanides and Actinides

Advance Mobile Application Development class 07

Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...

APM Welcome, APM North West Network Conference, Synergies Across Sectors

Advanced Views - Calendar View in Odoo 17

CARE OF CHILD IN INCUBATOR..........pptx

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx

The byproduct of sericulture in different industries.pptx

Software Engineering Methodologies (overview)

1029-Danh muc Sach Giao Khoa khoi 6.pdf

Interactive Powerpoint_How to Master effective communication

Call Girls in Dwarka Mor Delhi Contact Us 9654467111

Arihant handbook biology for class 11 .pdf

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"

Activity 01 - Artificial Culture (1).pdf

Nutritional Needs Presentation - HLTH 104

microwave assisted reaction. General introduction

Mattingly "AI & Prompt Design: The Basics of Prompt Design"

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...

Community-Assisted Software Engineering Decision Making

1. Community-Assisted Software Engineering Decision Making Gregory Gay and Mats Heimdahl University of Minnesota

2. AI in SE: A Success Story Large, active field, with: ● Growing research community ● Numerous conferences and workshops, such as MSR, PROMISE, RAISE ● Large data repositories ● History of collaboration between industry and academia 2

3. We're already good at drawing useful conclusions. We expect further algorithmic improvements. But... We need to improve our data! 3

4. Problem 1: We don't know what data we need. Trying to solve complex problems. Make guesses, then collect data. Results in missing attributes, added noise. 4

5. Problem 2: The data we have is often weak. Solution quality depends on data quality. Some commonly-used data sets infamous for missing values, unhelpful attributes, poor recording standards. 5

6. We should improve data standards, but.. We need to use the data we have. Synergy of human feedback and AI to turn static data models into dynamic models. Bring a Wikipedia model to data sets. 6

7. Inspiration: Recommender Systems 7

8. Enhanced Feedback Loop 8 Recommendation: MC/DC Helpful? Yes New Values for Existing Attributes: Num. Boolean Expressions: 219 Num. Numeric Calculations: 73 New Attributes to Collect (and Values): Ratio of Boolean to Numeric Calculations: 3:1 Data to Delete: Projects 1, 3, 7

9. Why should we enhance our data? These dynamic data models allow: ● Low start-up costs. ● Build body of evidence over time. ● Address data quality issues. ● Human-in-the-loop feedback. 9

10. Challenge 1: How do we collect feedback? 10

11. Challenge 2: How do we use feedback? Fundamental trade-off between human curation and automated AI learning. When should attributes be filtered? Un-updated data phased out? New data added? 11

12. Challenge 3: Motivating Users How do we motivate users to: ● Provide feedback. ● Add new data. ● Update old data. 12

13. Motivation requires: 1. Incentive. 2. Ease of use/contribution. 3. Utility from and trust in the model. 13

14. We propose feedback-driven dynamic data models maintained by a synergy of user-feedback and automated AI techniques. We propose that dynamic data will allow for low start-up costs, a stronger body of evidence over time, and adaptations to changing industrial conditions. 14

15. For discussion... 1. Is this even a good idea? 2. What can we do to solve data quality issues? (other than just the idea suggested here) 3. What kind of data would benefit from dynamic adaptation? 4. How do we motivate users to provide feedback, new data, and update old data? 15

Community-Assisted Software Engineering Decision Making

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (19)

En vedette

En vedette (7)

Similaire à Community-Assisted Software Engineering Decision Making

Similaire à Community-Assisted Software Engineering Decision Making (20)

Plus de gregoryg

Plus de gregoryg (18)

Dernier

Dernier (20)

Community-Assisted Software Engineering Decision Making