SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
AI engineering
New challenges in system and software
architecting and managing lifecycle for AI-
based systems
ECSA 2020 14-18 Sept, L’Aquila
Ivica Crnkovic, ivica.crnkovic@chalmers.se
9/16/20
Chalmers 3
Types of Machine Learning
SUPERVISED UNSUPERVISED REINFORCEMENT
Task Driven
predict next value,
classification
Data Driven
Identify clusters
Learn From
Mistakes
Trial and Error
4
95% of ML
9/16/20
Chalmers 6
AI as effective as medical specialists at diagnosing diseas
AI in transportation
• Self-driving vehicles, Traffic management, delay predictions…
AI and NLP (Natural Language processes)
Recognition
Traffic Medicine Robotics
Finance
Health Space
AI success Stories
virtually no area that does not (plan to) use AI
9/16/20
AI failures 2018
https://medium.com/syncedreview/2018-in-review-10-ai-failures-c18faadf5983
2020-09-16
Chalmers University of
Technology
8
https://www.reuters.com/article/us-amazon-com-jobs-automation-insight/amazon-scraps-secret-ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G
Amazon AI recruiting
tool is gender biased
BiggestAI failures 2018AI failures 2018
2020-09-16
Chalmers University of
Technology
9
AI failures 2018AI failures 2018
AI World Cup 2018 - predictions almost all wrong J
2020-09-16 10
Why these
failures?
What are the
challenges inAI
development?
9/16/20 11
• Non technical character
• Who is owner of data?
• What are the ethical aspects of using data?
• What can we allow a machine to decide?
• How do interpret the results from AI models?
New questions in AI Development
• Technical nature
• How to efficiently collect, store, process, analyse, and present data?
• How to efficiently build the AI-based systems?
• How to ensure enough resources (computation, storage, timing)
• How to ensure dependability/trustworthy of such systems?
• What system and software architecture are required for AI-systems?
• WHAT KIND OF SOFTWARE ENGINEERING SUPPORT IS NEEDED?
9/16/20 12
ML Development cycle
assemble
datasets
create
models
train and
evaluate deployment
Life cycle
ML Code –
Create model
Train model
Machine Learning: The High-Interest Credit Card of Technical Debt, D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young
9/16/20
AI-System - system architecture (example)
https://www.youtube.com/watch?v=XEHG2oYdsiU
Collecting data
Analyzing Data
Controlling system
Sending data
Updating system
System evolution
Example: vehicular system (a Cyber-physical system)
Architecture of a car control system
Complex services
– distributed components
Sensor 1
Sensor 2
Sensor 3
Sensor ..
Network
resources
++++++++++
++++++++++
++++++++++
Sensor ..
Execution
resources
Node 1
Node 2
Node 3
Node …
Node …
Actuator 1
Actuator 2
Actuator 3
Actuator …
Actuator …
ECU ECU ECU
C1 C2
Shared resources
Challenges:
• Real-time requirements
• Shared resources
• Resource constraints
Software architecture
9/16/20 17
Platform
Middleware
Subsystem A
HW
Subsystem B Subsystem C
C1
C2
C3
C4
C3
Subsystem and components
Component-based and service-based approach
- Components
- Encapsulation of data
- Encapsulation of functionality
- Dependency between components defined and controlled
MP
9/16/20
18
AI-System - system architecture (example)
Platform
Middleware
Subsystem A
HW
Subsystem B Subsystem C
C1
C2
C3
C4
C3
Subsystem and components
Components – black boxes
- Components
- Encapsulation of AI-based functionality
- Dependency between components defined and controlled
- What about AI code and data?
MP
9/16/20
AI-System - system architecture (example)
https://www.youtube.com/watch?v=XEHG2oYdsiU
Collecting more data
Analyzing more data
More computation
Control system
Further data
processing
Sending data
For diagnostic
And for ML
Updating system
AI-based vehicular system System evolutionCollecting data
System training
- large amounts of data processing
- large computational requirements
With AI the existing architecture is not feasible
CPU GPU
FPGA
EDGE computing
Cloud
computing
Challenges
- New computation models
- Heterogenous platform
- Edge sensor vs. edge centralized level vs. cloud computing
- Development process
- separation of training process from operational process
- New monitoring
- New quality attributes and their metrics
- Migration process from the existing architecture
CPU GPU
FPGA
Cloud
computing
EDGE computing
9/16/20 22
AI- based components – fundamental problems
C1
controlled
Data – used for ML
Controlled?
• The results depend not only
on the algorithms and controlled data
but also on uncontrolled/unknown data
• The AI-based functions are not continuous:
small change of data can cause big changes
• Due to data impact on the model
many new challenges appear
2020-09-16 23
1. Entanglement (Data fusion)
2. Data dependencies
• Unstable data dependencies
• Underutilized Data Dependencies
3. Hidden Feedback Loops
4. Undeclared Consumers
5. Correction Cascades
Examples of Data-related challenges
DATA
DATA
DATA
AI-component
AI-component
DATA
DATA
DATA
AI-component
AI-component
Not needed
DATA
DATA
AI-component
DATA
DATA
AI-component
C1
AI-component
C2DATA
DATA
DATA
AI-component
AI-component
9/16/20 Chalmers 31
• Heterogeneity of data (different formats, accuracy, semantics) and use of
standard ML functions require a lot Glue code
• 95% of code in AI-based systems is a glue-code (empirical data)
• Requires
• Frequent refactoring of code
• Re-implementing AI models
• Pipeline Jungles
• ML-friendly format data become a jungle of scrapes, joins, and sampling steps,
intermediate files
• Requires – a close team work of data and domain engineers
System-design anti-patterns (1)
9/16/20 Chalmers 32
Dead Experimental Code paths
• AI solution requires a lot of experimentation
• A lot of code that will not be used later
• Problems
• Dead code
• Version management – how to preserve useful configuration
branches, and remove unnecessary
System-design anti-patterns (2)
9/16/20 Chalmers 33
Managing Changes in the External World
System
Cloud Data
Local Data
Local Data
Local Data
Historical data
Used in ML
Continuous change of data
Challenge:
models and system behaviour
dependent of data
Examples:
- Threshold changes
- Correlation between data
Requirements: Continuous monitoring of data and system. Continuous test.
35
Study : Overview of industrial ML systems
Case studies
A. Project EST: Real Estate Valuation
B. Project OIL: Predicting Oil and Gas Recovery Potential
C. Project RET: Predicting User Retention
D. Project WEA: Weather Forecasting
E. Project CCF: Credit Card Fraud Detection
F. Project BOT: Poker Bot Identification
G. Project REC: Media Recommendations
Software Engineering Challenges of Deep Learning, Euromicro SEAA 2018, A. Arpteg, B. Brinne, l. Crnkovic-Friis, J. Bosch
Which challenges are the most difficult
in development of AI-based systems?
2020-09-16 Chalmers University of Technology 37
Study: Engineering challenges of DL
■Dependency
Management
◆Glue code and supporting systems
●Unintentional feedback loops
●Monitoring and logging
●Troubleshooting
●Testing ●Limited transparency
●Experimental management
■ Organisational challenges
● Production/operation challenges
◆ Development challenges
◆Effort estimation
■Cultural differences
Hard to solve
Easier to solve
Lower business impact High business impact
●Resource Limitations
◆Privacy and Safety/security
Experiment
prototyping
Non-critical
deployment
Critical deployment Cascading deployment
Assemble
dataset
Issues with problem
formulation and
specifying desired
outcome
Data silos, scarcity of
labelled data, imbalanced
training set
Limitations in techniques for
gathering training data from
large-scale, non-stationary
data streams
Complex and effects of data
dependencies
Create
model
Use of non-
representative dataset,
data drifts
No critical analysis of
training data
Difficulties in building highly
scalable ML pipeline
Entanglements causing
difficulties in isolating
improvements
Train and
evaluate
model
Lack of
well-established ground
truth
No evaluation of models
with business-centric
measures
Difficulties in reproducing
models, results and debugging
DL models
Need of techniques for sliced
analysis in final model
Deploy
model No deployment
mechanism
Training- serving
skew
Adhering to stringent serving
requirements e.g., of latency,
throughput
Hidden feedback-loops and
undeclared consumers of the
models
LE Lwakatare, A Raj, J Bosch, HH Olsson, I Crnkovic, A taxonomy of software engineering challenges for machine learning systems:
An empirical investigation - International Conference on Agile Software …, 2019
The challenges in evolution of development and use ML components
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
Data quality
management
Design
methods and
processes
Model
performance
Deployment
and
compliance
Data versioning &
dependency
mgmt
DataOps/DevOps Automated data
labeling
Federated
learning
infrastructure
Reuse of pre-
developed ML
models
Manage multiple
models
Deployment
infrastructure
Quality attributes A/B testing of
models
Storage and
computing
infrastructure
Integration of
models and SW
components
Monitoring &
logging
Strategic focus Infrastucture Development Process
AI Engineering challenges
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
Data quality
management
Design
methods and
processes
Model
performance
Deployment
and
compliance
Federated collection &
storage of data
Federated ML models
Heterogenous HW
platforms
Transfer learning
Strategic focus Cyber-physical
systems
Safety-critical
systems
Autonomously
Improving systems
Data trail
Explainable models
reproducibility
Safety validation
Data generation for ML
Automated
experimentation
Exploration vs.
exploratiom
Online evaluation
Domain-specific AI Engineering challenges
J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
43
DevOps
ML Workflow
Example – Process integration
Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
44
Conceptual ML Workflow and DevOps Process Integration
Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
9/16/20 Chalmers 46
Excitement of a researcher
ML/DL introduces many new challenges for software
• On addition to software evolution there is data evolution
• Context change – in behaviour and in data
• New (distributed) system configuration
• Continuous integration, continuous deployment
• Continuous training
Many companies will not be able to cope with new types of complexity
Many opportunities for researchers!
Conclusion
2020 09-16-ai-engineering challanges

Contenu connexe

Tendances

Resume_20160610_MarcusGohm
Resume_20160610_MarcusGohmResume_20160610_MarcusGohm
Resume_20160610_MarcusGohm
Marcus Gohm
 

Tendances (11)

Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...Emerging engineering issues for building large scale AI systems By Srinivas P...
Emerging engineering issues for building large scale AI systems By Srinivas P...
 
Data science for business decisions (public)
Data science for business decisions (public)Data science for business decisions (public)
Data science for business decisions (public)
 
MedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care PathwaysMedPath Designer: A Process-based Modeling Language for Designing Care Pathways
MedPath Designer: A Process-based Modeling Language for Designing Care Pathways
 
Innoslate for Academia
Innoslate for AcademiaInnoslate for Academia
Innoslate for Academia
 
Semantic Web for Advanced Engineering
Semantic Web for Advanced EngineeringSemantic Web for Advanced Engineering
Semantic Web for Advanced Engineering
 
Semantic Web Technologies for Intelligent Engineering Applications
Semantic Web Technologies for  Intelligent Engineering ApplicationsSemantic Web Technologies for  Intelligent Engineering Applications
Semantic Web Technologies for Intelligent Engineering Applications
 
Resume_20160610_MarcusGohm
Resume_20160610_MarcusGohmResume_20160610_MarcusGohm
Resume_20160610_MarcusGohm
 
OA centre of excellence
OA centre of excellenceOA centre of excellence
OA centre of excellence
 
ERP vs SAP (Difference)
ERP vs SAP (Difference)ERP vs SAP (Difference)
ERP vs SAP (Difference)
 
Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1Csun pse-006-presentation-2013 v2.1
Csun pse-006-presentation-2013 v2.1
 
Introduction to Computer Engineering
Introduction to Computer EngineeringIntroduction to Computer Engineering
Introduction to Computer Engineering
 

Similaire à 2020 09-16-ai-engineering challanges

Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
MLconf
 

Similaire à 2020 09-16-ai-engineering challanges (20)

INCOSE IS 2019: AI and Systems Engineering
INCOSE IS 2019: AI and Systems EngineeringINCOSE IS 2019: AI and Systems Engineering
INCOSE IS 2019: AI and Systems Engineering
 
SESE 2021: Where Systems Engineering meets AI/ML
SESE 2021: Where Systems Engineering meets AI/MLSESE 2021: Where Systems Engineering meets AI/ML
SESE 2021: Where Systems Engineering meets AI/ML
 
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...AI4SE: Challenges and opportunities in the integration of Systems Engineering...
AI4SE: Challenges and opportunities in the integration of Systems Engineering...
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Production machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scaleProduction machine learning: Managing models, workflows and risk at scale
Production machine learning: Managing models, workflows and risk at scale
 
Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...
 
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
Data Con LA 2022 - Building Field-level Lineage from Scratch for Modern Data ...
 
From Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems ArchitecturesFrom Model-based to Model and Simulation-based Systems Architectures
From Model-based to Model and Simulation-based Systems Architectures
 
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
Project FMEA for Recognizing Difficulties in Machine Learning Application Sys...
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
 
An Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data ManagementAn Integrated Simulation Tool Framework for Process Data Management
An Integrated Simulation Tool Framework for Process Data Management
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
Shiva Amiri, Chief Product Officer, RTDS Inc. at MLconf SEA - 5/01/15
 
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
Is that a Time Machine? Some Design Patterns for Real World Machine Learning ...
 
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdfSLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
SLIDES_Electrification__AI_and_the_Future_of_Engineering_Education.pdf.pdf
 
Practical machine learning
Practical machine learningPractical machine learning
Practical machine learning
 

Plus de Ivica Crnkovic

Software Assurance: What Should We Do next? - Software Design for Reliability
Software Assurance: What Should We Do next?  - Software Design for ReliabilitySoftware Assurance: What Should We Do next?  - Software Design for Reliability
Software Assurance: What Should We Do next? - Software Design for Reliability
Ivica Crnkovic
 
Teaching in multicultural classromre
Teaching in multicultural  classromreTeaching in multicultural  classromre
Teaching in multicultural classromre
Ivica Crnkovic
 
The challenges and opportunities in open source reuse
The challenges and opportunities in open source reuseThe challenges and opportunities in open source reuse
The challenges and opportunities in open source reuse
Ivica Crnkovic
 
Resilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolutionResilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolution
Ivica Crnkovic
 
Empirical se 2013-01-17
Empirical se 2013-01-17Empirical se 2013-01-17
Empirical se 2013-01-17
Ivica Crnkovic
 
Crnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptxCrnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptx
Ivica Crnkovic
 
Ten Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering EducationTen Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering Education
Ivica Crnkovic
 

Plus de Ivica Crnkovic (16)

ICSE 2018 opening session
ICSE 2018 opening sessionICSE 2018 opening session
ICSE 2018 opening session
 
Beyond digitalisation 2016-06-07
Beyond digitalisation  2016-06-07Beyond digitalisation  2016-06-07
Beyond digitalisation 2016-06-07
 
ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20ICSE2018 presentation 2016-05-20
ICSE2018 presentation 2016-05-20
 
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
Component-Based and Model-Driven Engineering: what is the difference? A CBSE ...
 
European Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementEuropean Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 Announcement
 
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
Rapid Continuous Software Engineering - Meeting the challenges of modern sof...
 
Software Assurance: What Should We Do next? - Software Design for Reliability
Software Assurance: What Should We Do next?  - Software Design for ReliabilitySoftware Assurance: What Should We Do next?  - Software Design for Reliability
Software Assurance: What Should We Do next? - Software Design for Reliability
 
Sa past-future
Sa past-futureSa past-future
Sa past-future
 
A classification framework for component models
A classification framework for component modelsA classification framework for component models
A classification framework for component models
 
Teaching in multicultural classromre
Teaching in multicultural  classromreTeaching in multicultural  classromre
Teaching in multicultural classromre
 
The challenges and opportunities in open source reuse
The challenges and opportunities in open source reuseThe challenges and opportunities in open source reuse
The challenges and opportunities in open source reuse
 
Resilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolutionResilient systems - predicatbility ane evolution
Resilient systems - predicatbility ane evolution
 
Empirical se 2013-01-17
Empirical se 2013-01-17Empirical se 2013-01-17
Empirical se 2013-01-17
 
SPL in Clouds
SPL in CloudsSPL in Clouds
SPL in Clouds
 
Crnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptxCrnkovic cbse-impact.pptx
Crnkovic cbse-impact.pptx
 
Ten Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering EducationTen Tips to Succeed in Global Software Engineering Education
Ten Tips to Succeed in Global Software Engineering Education
 

Dernier

%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
masabamasaba
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
 

Dernier (20)

%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 

2020 09-16-ai-engineering challanges

  • 1. AI engineering New challenges in system and software architecting and managing lifecycle for AI- based systems ECSA 2020 14-18 Sept, L’Aquila Ivica Crnkovic, ivica.crnkovic@chalmers.se
  • 2.
  • 4. Types of Machine Learning SUPERVISED UNSUPERVISED REINFORCEMENT Task Driven predict next value, classification Data Driven Identify clusters Learn From Mistakes Trial and Error 4 95% of ML
  • 5. 9/16/20 Chalmers 6 AI as effective as medical specialists at diagnosing diseas AI in transportation • Self-driving vehicles, Traffic management, delay predictions… AI and NLP (Natural Language processes) Recognition Traffic Medicine Robotics Finance Health Space AI success Stories virtually no area that does not (plan to) use AI
  • 8. 2020-09-16 Chalmers University of Technology 9 AI failures 2018AI failures 2018 AI World Cup 2018 - predictions almost all wrong J
  • 9. 2020-09-16 10 Why these failures? What are the challenges inAI development?
  • 10. 9/16/20 11 • Non technical character • Who is owner of data? • What are the ethical aspects of using data? • What can we allow a machine to decide? • How do interpret the results from AI models? New questions in AI Development • Technical nature • How to efficiently collect, store, process, analyse, and present data? • How to efficiently build the AI-based systems? • How to ensure enough resources (computation, storage, timing) • How to ensure dependability/trustworthy of such systems? • What system and software architecture are required for AI-systems? • WHAT KIND OF SOFTWARE ENGINEERING SUPPORT IS NEEDED?
  • 11. 9/16/20 12 ML Development cycle assemble datasets create models train and evaluate deployment
  • 12. Life cycle ML Code – Create model Train model Machine Learning: The High-Interest Credit Card of Technical Debt, D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young
  • 13. 9/16/20 AI-System - system architecture (example) https://www.youtube.com/watch?v=XEHG2oYdsiU Collecting data Analyzing Data Controlling system Sending data Updating system System evolution Example: vehicular system (a Cyber-physical system)
  • 14. Architecture of a car control system
  • 15. Complex services – distributed components Sensor 1 Sensor 2 Sensor 3 Sensor .. Network resources ++++++++++ ++++++++++ ++++++++++ Sensor .. Execution resources Node 1 Node 2 Node 3 Node … Node … Actuator 1 Actuator 2 Actuator 3 Actuator … Actuator … ECU ECU ECU C1 C2 Shared resources Challenges: • Real-time requirements • Shared resources • Resource constraints
  • 16. Software architecture 9/16/20 17 Platform Middleware Subsystem A HW Subsystem B Subsystem C C1 C2 C3 C4 C3 Subsystem and components Component-based and service-based approach - Components - Encapsulation of data - Encapsulation of functionality - Dependency between components defined and controlled MP
  • 17. 9/16/20 18 AI-System - system architecture (example) Platform Middleware Subsystem A HW Subsystem B Subsystem C C1 C2 C3 C4 C3 Subsystem and components Components – black boxes - Components - Encapsulation of AI-based functionality - Dependency between components defined and controlled - What about AI code and data? MP
  • 18. 9/16/20 AI-System - system architecture (example) https://www.youtube.com/watch?v=XEHG2oYdsiU Collecting more data Analyzing more data More computation Control system Further data processing Sending data For diagnostic And for ML Updating system AI-based vehicular system System evolutionCollecting data System training - large amounts of data processing - large computational requirements
  • 19. With AI the existing architecture is not feasible CPU GPU FPGA EDGE computing Cloud computing
  • 20. Challenges - New computation models - Heterogenous platform - Edge sensor vs. edge centralized level vs. cloud computing - Development process - separation of training process from operational process - New monitoring - New quality attributes and their metrics - Migration process from the existing architecture CPU GPU FPGA Cloud computing EDGE computing
  • 21. 9/16/20 22 AI- based components – fundamental problems C1 controlled Data – used for ML Controlled? • The results depend not only on the algorithms and controlled data but also on uncontrolled/unknown data • The AI-based functions are not continuous: small change of data can cause big changes • Due to data impact on the model many new challenges appear
  • 22. 2020-09-16 23 1. Entanglement (Data fusion) 2. Data dependencies • Unstable data dependencies • Underutilized Data Dependencies 3. Hidden Feedback Loops 4. Undeclared Consumers 5. Correction Cascades Examples of Data-related challenges DATA DATA DATA AI-component AI-component DATA DATA DATA AI-component AI-component Not needed DATA DATA AI-component DATA DATA AI-component C1 AI-component C2DATA DATA DATA AI-component AI-component
  • 23. 9/16/20 Chalmers 31 • Heterogeneity of data (different formats, accuracy, semantics) and use of standard ML functions require a lot Glue code • 95% of code in AI-based systems is a glue-code (empirical data) • Requires • Frequent refactoring of code • Re-implementing AI models • Pipeline Jungles • ML-friendly format data become a jungle of scrapes, joins, and sampling steps, intermediate files • Requires – a close team work of data and domain engineers System-design anti-patterns (1)
  • 24. 9/16/20 Chalmers 32 Dead Experimental Code paths • AI solution requires a lot of experimentation • A lot of code that will not be used later • Problems • Dead code • Version management – how to preserve useful configuration branches, and remove unnecessary System-design anti-patterns (2)
  • 25. 9/16/20 Chalmers 33 Managing Changes in the External World System Cloud Data Local Data Local Data Local Data Historical data Used in ML Continuous change of data Challenge: models and system behaviour dependent of data Examples: - Threshold changes - Correlation between data Requirements: Continuous monitoring of data and system. Continuous test.
  • 26. 35 Study : Overview of industrial ML systems Case studies A. Project EST: Real Estate Valuation B. Project OIL: Predicting Oil and Gas Recovery Potential C. Project RET: Predicting User Retention D. Project WEA: Weather Forecasting E. Project CCF: Credit Card Fraud Detection F. Project BOT: Poker Bot Identification G. Project REC: Media Recommendations Software Engineering Challenges of Deep Learning, Euromicro SEAA 2018, A. Arpteg, B. Brinne, l. Crnkovic-Friis, J. Bosch Which challenges are the most difficult in development of AI-based systems?
  • 27. 2020-09-16 Chalmers University of Technology 37 Study: Engineering challenges of DL ■Dependency Management ◆Glue code and supporting systems ●Unintentional feedback loops ●Monitoring and logging ●Troubleshooting ●Testing ●Limited transparency ●Experimental management ■ Organisational challenges ● Production/operation challenges ◆ Development challenges ◆Effort estimation ■Cultural differences Hard to solve Easier to solve Lower business impact High business impact ●Resource Limitations ◆Privacy and Safety/security
  • 28. Experiment prototyping Non-critical deployment Critical deployment Cascading deployment Assemble dataset Issues with problem formulation and specifying desired outcome Data silos, scarcity of labelled data, imbalanced training set Limitations in techniques for gathering training data from large-scale, non-stationary data streams Complex and effects of data dependencies Create model Use of non- representative dataset, data drifts No critical analysis of training data Difficulties in building highly scalable ML pipeline Entanglements causing difficulties in isolating improvements Train and evaluate model Lack of well-established ground truth No evaluation of models with business-centric measures Difficulties in reproducing models, results and debugging DL models Need of techniques for sliced analysis in final model Deploy model No deployment mechanism Training- serving skew Adhering to stringent serving requirements e.g., of latency, throughput Hidden feedback-loops and undeclared consumers of the models LE Lwakatare, A Raj, J Bosch, HH Olsson, I Crnkovic, A taxonomy of software engineering challenges for machine learning systems: An empirical investigation - International Conference on Agile Software …, 2019 The challenges in evolution of development and use ML components
  • 29. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint)
  • 30. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda (arXiv:2001.07522, 2020 preprint) Data quality management Design methods and processes Model performance Deployment and compliance Data versioning & dependency mgmt DataOps/DevOps Automated data labeling Federated learning infrastructure Reuse of pre- developed ML models Manage multiple models Deployment infrastructure Quality attributes A/B testing of models Storage and computing infrastructure Integration of models and SW components Monitoring & logging Strategic focus Infrastucture Development Process AI Engineering challenges
  • 31. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint) Data quality management Design methods and processes Model performance Deployment and compliance Federated collection & storage of data Federated ML models Heterogenous HW platforms Transfer learning Strategic focus Cyber-physical systems Safety-critical systems Autonomously Improving systems Data trail Explainable models reproducibility Safety validation Data generation for ML Automated experimentation Exploration vs. exploratiom Online evaluation Domain-specific AI Engineering challenges
  • 32. J Bosch, H Holmström Olsson, I Crnkovic: Engineering AI Systems A Research Agenda, arXiv:2001.07522, 2020 preprint)
  • 33. 43 DevOps ML Workflow Example – Process integration Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
  • 34. 44 Conceptual ML Workflow and DevOps Process Integration Lwakatare, Crnkovic & Bosch. DevOps for AI – Challenges in Development of AI-enabled applications, SoftCom 2020
  • 35. 9/16/20 Chalmers 46 Excitement of a researcher ML/DL introduces many new challenges for software • On addition to software evolution there is data evolution • Context change – in behaviour and in data • New (distributed) system configuration • Continuous integration, continuous deployment • Continuous training Many companies will not be able to cope with new types of complexity Many opportunities for researchers! Conclusion