SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Ikhlaq Sidhu
Chief Scientist & Founding Director, Sutardja Center for Entrepreneurship & Technology
IEOR Emerging Area Professor Award, UC Berkeley
About Me:
Data-X: A Framework for Rapid Impact
in Digital Transformation
Data X
Ikhlaq Sidhu
Chief Scientist & Founding Director
Sutardja Center for Entrepreneurship & Technology
Industrial Engineering & Operations Research
IEOR Emerging Area Professor Award
UC Berkeley
Ikhlaq Sidhu, UC Berkeley
q Chief Scientist and Founder Sutardja Center
q Professor in IEOR at UC Berkeley
q Created many Berkeley programs
q Developed Data-X
q Advisor to many firms and executives
q Granted over 60 US Patents
q Invented technologies used at Skype, HP, US
Robotics, IBM, and licensed to many others …
q Awarded 3Com’s “Inventor of the Year”
q HP – Laser Printer Design
q Venture Advisor at Onset Ventures, X-Fund
q Numerous Advisory Boards and non-profits
All degrees: Electrical Engineering and
Computer Science (EECS), BS to Ph.D.
One of my newest courses at Berkeley:
IEOR 135 Applied Data Science with
Venture Applications
Based on the Data-X Project Framework
• Detection of fake news
• Prediction of long-term energy prices
to solve Wall Street problem
• Prediction applications stock market,
sports betting, and more
• AI for crime detection, traffic guidance,
medical diagnostics, etc.
• A version of Zillow that is recalculated
with the effects of AirBnB income
and many more…
IEOR 135 Applied Data Science with Venture Applications
Sample Data-X Projects
We are in a new phase of evolution due to data, AI, crypto-
systems, blockchain, algorithms -> Data-X
Drivers of Data-X
It is a significant problem to our national agenda if students can’t
participate, build, and harness these types of technologies
New technologies on the horizon * World is changing * Next Industrial Revolution
National and Global SecurityNational Competitiveness
The result of skill and behavior mismatch:
Our model has
adapted: Business
training is not the
only key element
I’ve seen many
technical projects with
smart people go off
track
Why we can’t deliver:
• Theoretical understanding without a practical
understanding of implementation
• Narrow focus: silos of disconnected expertise not
leading to any useful work product or innovation
• Over-design: way too complex
• Not even sure what to create. Wanting
implementation specs that no one has.
• Expensive cost over-runs on development, sometimes
even trying to create something that already exists
• Disconnected from technical reality
• People not on the same page (misaligned), cannot
work with each other, team breakdown.
Data-X
Framework
Innovation
Leadership
Culture of Innovation:
Behaviors and Mindsets
Story
Adaptation
Ecosystem,
Stakeholders
Operational
& Financial
System
Architecture
Open Source
Tools
Components
Minimal
Implementation
Working
Model
Innovation in
Algorithms
At Berkeley, we have
results:
People in our
programs can build
amazing, working
projects in 3 months
with a relatively little
background in ML, AI,
and other data
technologies.
Applicable to all categories of digital transformation
Students/ technical staff
Leaders/
Entrepreneurs
A Solution for Rapid Implementation
DATA-X
PROJECT
EXAMPLES
Deep Dave
David Lin
Sharon Ng
Vanessa Salas
Alexandre Vincent
Airfare Data Scraper
14
Final Product
Safest Path Suggestion
• GREEN: SAFEST PATH
• RED: SHORTEST PATH
Downtown Berkeley to Cal Memorial Stadium
Watch live demo here: https://stayfe.herokuapp.com/
CartilageX:
Automated anomaly
detection in knee MRIs
Iriondo C, Jain D, Muhamedrahimov R, Papanikolaou V, Trotskovsky K, Sun L
Commercialization of RecycleAI
1
Image taken of waste
object and input into
model
2
Model classifies
waste object
Our
Project
3
Object sorted to its
appropriate destination
- Bin Sorter
- Robots
- Conveyor Belts
Prediction of Bitcoin
Prices
Aashray Yadav
Nicolas Sarquis
Bhavya Vashisht
Sai Kannan Sampath
Mubarak Abdul Kader
UC Berkeley | Data-X
Berkeley
Innovation
Index My Dinh
Jessica Gu
Aaron Lu
Dayou Wang
Yan Zeng
Yujun Zou
What happens is we don’t teach courses in this manner?
1. Deep technical students learn many disconnected theories and skills,
but they cannot deliver implementations
2. And they work in teams which cannot deliver innovation
within companies, government, and research instiutions
Open Source
CS Tools
Math
Quantitative
Data-X
Project: Real
Applications
with Code
Samples
What is in this course
Holistic Perspective: Industry, Social Applications, Customer Driven
What is in this class?
Common Open
Source CS Tools:
• Numpy, SciPy
• Pandas
• TensorFlow, Sklearn
• SQL to Pandas
• NLP / NLTK
• Matplotlib
Quantitative
• Prediction: Regression
• ML Classification: Logistic,
SVM.. Trees, Forests,
Bagging, Boosting,..
• Entropy / Information
Topics
• Deep Learning examples,
including CCNs
• Correlations
• Markov Processes
• LTI Systems: Fourier, Filters
where applicable
• Control Models where
applicable
Building Block Code
Samples
• Webscraping
• Stock market live download,
simple trading
• Convolutional Neural
Networks
• Next Word Predictor, Spell
Checking
• Recommendation
• Web Crawler
• Chatbot, E-mail
• Social net interfaces
including twitter
This class will help you combine math and data concepts
The course updates with new tools to stay current. You may learn and use tools not presented in the class project.
Often: Working Code First
Fill In Theory After
What is actually in this class?
Common Open
Source CS Tools:
• Numpy, SciPy
• Pandas
• TensorFlow, Sklearn
• SQL to Pandas
• NLP / NLTK
• Matplotlib
Quantitative
• Prediction: Regression
• ML Classification: Logistic,
SVM.. Trees, Forests,
Bagging, Boosting,..
• Entropy / Information
Topics
• Deep Learning examples,
including CCNs
• Correlations
• Markov Processes
• LTI Systems: Fourier, Filters
where applicable
• Control Models where
applicable
Building Block Code
Samples
• Webscraping
• Stock market live download,
simple trading
• Convolutional Neural
Networks
• Next Word Predictor, Spell
Checking
• Recommendation
• Web Crawler
• Chatbot, E-mail
• Social net interfaces
including twitter
Often: Working Code First
Fill In Theory After
• The ML stack use most commonly used in creating ML/AI/Data
applications
• Application and systems viewpoint of data and ML
• Implementation, architecture, and relevant process to build anything
• Statistical, rule based, and hybrid decision systems
• Connection with relevant mathematical foundations (entropy, correlation,
spectral, LTI, basic prediction, classification)
• Practical insight into advanced techniques and tools: (eg. CNNs, NLP,
scraping, recurrent networks, etc.)
• System modeling for data applications
Many Course Resources Are Already Available at data-x.blog
For those who want to help students or technical experts learn these skills
We can help in other ways as well
OUR APPROACH
Make the Tools Use the Tools
(Optimally)
Architect the System Why and how
you build
Most CS Sutardja CenterThis Course
Where we focus:
Propose
Low Tech
Solution (1)
Brainstorm
Challenge
and Validate (4)
Demo
or Die
(1)
Execute * Iterate
BMoE Reflections
Agile Sprint (8)
Insightful Story Solution
How the Data-X Course Works:
Team: typically 5 students, with available advisor network
The Data-X System View
Web Scrape
Possible Input Code Blocks
Download
Crawl
…
Stream or Poll
Social Net / IoT
Application with Automated
Decisions
Algorithm Options w/ Tables/Matrix
Prediction / Classification
Test, train, split
Keep state
Pandas: Short Term Storage
Long Term Storage: SQL and File
Formats (JSON, CSV, Excel)
Web
Possible Output Code Blocks
Email
Control
Decision
…
Chatbot
Feedback from
External System (World)
Pre-
process
Natural
Language,
State
Features
Blockchain (public ledger or cryptolock)APIs, Services APIs, Services
Our model has
adapted: Business
training is not the
only key element
Observation: student projects
and professional projects that
do well require a different
understanding. We created a
model and framework
to provide these:
Data-X Model Layers
1. Tools: using vs making.
Learn to use and understand state of the art tools
and technical approaches
2. Theory:
Understanding the theory and frameworks behind
the tools using first principals
3. Projects:
Story first, second is development agility and
stakeholders acquisition
4. Project Viewpoints:
5 Viewpoints integrated into the teaching model
5. Behaviors and Mindsets:
6 Behaviors and mindsets tuned for innovation
Our model has
adapted: Business
training is not the
only key element
Notes: #
* combinations of on-line active
systems, API economy, powerful
open source tools, live systems that
must run and be current all the time,
cloud infrastructure compute and
storage blocks, ..
Data-X Model Layers
5 Project Viewpoints:
a) Customer touchpoints
b) Systems and architecture
c) Risk mitigation,
d) Agile increments,
e) Swim-lanes and team dynamics
6 Necessary Behaviors and Mindsets:
a) The target is moving,
b) Tools are powerful - use them
c) The system is the whole world *
d) There is no greenfield - connect to the existing
structure, means know the existing structure
e) You can’t know it all before you start
f) Develop insight, use technical/theoretical
analogies, first principals, but don’t just plug and
play
Project Types
Business or Consumer
Use Case
Social Impact Its Just Cool
(or improve part of a data pipeline
or work towards a research result)
End of Section

Contenu connexe

Tendances

Tendances (20)

Berkeley Method of Innovation Leadership
Berkeley Method of Innovation LeadershipBerkeley Method of Innovation Leadership
Berkeley Method of Innovation Leadership
 
Ikhlaq Sidhu's GVL 2017 Presentation
Ikhlaq Sidhu's GVL 2017 PresentationIkhlaq Sidhu's GVL 2017 Presentation
Ikhlaq Sidhu's GVL 2017 Presentation
 
Newton Innovator Lecture Series Introduction
Newton Innovator Lecture Series IntroductionNewton Innovator Lecture Series Introduction
Newton Innovator Lecture Series Introduction
 
Entrepreneurship for Larger Organizations, IEEE, TEMS, Sidhu
Entrepreneurship for Larger Organizations, IEEE, TEMS, SidhuEntrepreneurship for Larger Organizations, IEEE, TEMS, Sidhu
Entrepreneurship for Larger Organizations, IEEE, TEMS, Sidhu
 
Denmark Keynote: Universities, New Ventures, and Culture
Denmark Keynote: Universities, New Ventures, and CultureDenmark Keynote: Universities, New Ventures, and Culture
Denmark Keynote: Universities, New Ventures, and Culture
 
BMoE4: Mindset and Culture
BMoE4: Mindset and CultureBMoE4: Mindset and Culture
BMoE4: Mindset and Culture
 
BMoE2: Berkeley Method for Execution
BMoE2: Berkeley Method for ExecutionBMoE2: Berkeley Method for Execution
BMoE2: Berkeley Method for Execution
 
Data at Scale and AI for Business, Government, and Society
Data at Scale and AI for Business, Government, and SocietyData at Scale and AI for Business, Government, and Society
Data at Scale and AI for Business, Government, and Society
 
BMoE1: Berkeley Method Opportunity Identification
BMoE1: Berkeley Method Opportunity IdentificationBMoE1: Berkeley Method Opportunity Identification
BMoE1: Berkeley Method Opportunity Identification
 
Data-X-Sparse-v2
Data-X-Sparse-v2Data-X-Sparse-v2
Data-X-Sparse-v2
 
Best Practices in Product Management
Best Practices in Product ManagementBest Practices in Product Management
Best Practices in Product Management
 
Sidhu Philippines Inclusive Innovation with AI and Data
Sidhu Philippines Inclusive Innovation with AI and DataSidhu Philippines Inclusive Innovation with AI and Data
Sidhu Philippines Inclusive Innovation with AI and Data
 
Venture Development: Concept to Execution
Venture Development: Concept to ExecutionVenture Development: Concept to Execution
Venture Development: Concept to Execution
 
The day the robots stole your job adapting hr functions post automation
The day the robots stole your job   adapting hr functions post automationThe day the robots stole your job   adapting hr functions post automation
The day the robots stole your job adapting hr functions post automation
 
University Of California Berkeley Tenderich
University Of California Berkeley TenderichUniversity Of California Berkeley Tenderich
University Of California Berkeley Tenderich
 
Hybrid Business Incubation Models (NBIA 2015)
Hybrid Business Incubation Models (NBIA 2015)Hybrid Business Incubation Models (NBIA 2015)
Hybrid Business Incubation Models (NBIA 2015)
 
What lean startup model tells us about tech transfer?
What lean startup model tells us about tech transfer?What lean startup model tells us about tech transfer?
What lean startup model tells us about tech transfer?
 
Funding for Innovation in ICT 21 May 2019
Funding for Innovation in ICT 21 May 2019Funding for Innovation in ICT 21 May 2019
Funding for Innovation in ICT 21 May 2019
 
The Startup Factories
The Startup FactoriesThe Startup Factories
The Startup Factories
 
The entrepreneur engineer
The entrepreneur engineerThe entrepreneur engineer
The entrepreneur engineer
 

Similaire à Data-X-v3.1

Similaire à Data-X-v3.1 (20)

ai_ml aicet internship report ppt 1.pptx
ai_ml aicet internship report ppt 1.pptxai_ml aicet internship report ppt 1.pptx
ai_ml aicet internship report ppt 1.pptx
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 

Dernier

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
JohnnyPlasten
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
shivangimorya083
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
shambhavirathore45
 

Dernier (20)

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 

Data-X-v3.1

  • 1. Ikhlaq Sidhu Chief Scientist & Founding Director, Sutardja Center for Entrepreneurship & Technology IEOR Emerging Area Professor Award, UC Berkeley About Me: Data-X: A Framework for Rapid Impact in Digital Transformation Data X
  • 2. Ikhlaq Sidhu Chief Scientist & Founding Director Sutardja Center for Entrepreneurship & Technology Industrial Engineering & Operations Research IEOR Emerging Area Professor Award UC Berkeley Ikhlaq Sidhu, UC Berkeley q Chief Scientist and Founder Sutardja Center q Professor in IEOR at UC Berkeley q Created many Berkeley programs q Developed Data-X q Advisor to many firms and executives q Granted over 60 US Patents q Invented technologies used at Skype, HP, US Robotics, IBM, and licensed to many others … q Awarded 3Com’s “Inventor of the Year” q HP – Laser Printer Design q Venture Advisor at Onset Ventures, X-Fund q Numerous Advisory Boards and non-profits All degrees: Electrical Engineering and Computer Science (EECS), BS to Ph.D.
  • 3. One of my newest courses at Berkeley: IEOR 135 Applied Data Science with Venture Applications Based on the Data-X Project Framework
  • 4. • Detection of fake news • Prediction of long-term energy prices to solve Wall Street problem • Prediction applications stock market, sports betting, and more • AI for crime detection, traffic guidance, medical diagnostics, etc. • A version of Zillow that is recalculated with the effects of AirBnB income and many more… IEOR 135 Applied Data Science with Venture Applications Sample Data-X Projects
  • 5. We are in a new phase of evolution due to data, AI, crypto- systems, blockchain, algorithms -> Data-X Drivers of Data-X
  • 6. It is a significant problem to our national agenda if students can’t participate, build, and harness these types of technologies New technologies on the horizon * World is changing * Next Industrial Revolution National and Global SecurityNational Competitiveness The result of skill and behavior mismatch:
  • 7. Our model has adapted: Business training is not the only key element I’ve seen many technical projects with smart people go off track Why we can’t deliver: • Theoretical understanding without a practical understanding of implementation • Narrow focus: silos of disconnected expertise not leading to any useful work product or innovation • Over-design: way too complex • Not even sure what to create. Wanting implementation specs that no one has. • Expensive cost over-runs on development, sometimes even trying to create something that already exists • Disconnected from technical reality • People not on the same page (misaligned), cannot work with each other, team breakdown.
  • 8. Data-X Framework Innovation Leadership Culture of Innovation: Behaviors and Mindsets Story Adaptation Ecosystem, Stakeholders Operational & Financial System Architecture Open Source Tools Components Minimal Implementation Working Model Innovation in Algorithms At Berkeley, we have results: People in our programs can build amazing, working projects in 3 months with a relatively little background in ML, AI, and other data technologies. Applicable to all categories of digital transformation Students/ technical staff Leaders/ Entrepreneurs A Solution for Rapid Implementation
  • 9. DATA-X PROJECT EXAMPLES Deep Dave David Lin Sharon Ng Vanessa Salas Alexandre Vincent Airfare Data Scraper 14 Final Product Safest Path Suggestion • GREEN: SAFEST PATH • RED: SHORTEST PATH Downtown Berkeley to Cal Memorial Stadium Watch live demo here: https://stayfe.herokuapp.com/ CartilageX: Automated anomaly detection in knee MRIs Iriondo C, Jain D, Muhamedrahimov R, Papanikolaou V, Trotskovsky K, Sun L Commercialization of RecycleAI 1 Image taken of waste object and input into model 2 Model classifies waste object Our Project 3 Object sorted to its appropriate destination - Bin Sorter - Robots - Conveyor Belts Prediction of Bitcoin Prices Aashray Yadav Nicolas Sarquis Bhavya Vashisht Sai Kannan Sampath Mubarak Abdul Kader UC Berkeley | Data-X Berkeley Innovation Index My Dinh Jessica Gu Aaron Lu Dayou Wang Yan Zeng Yujun Zou
  • 10. What happens is we don’t teach courses in this manner? 1. Deep technical students learn many disconnected theories and skills, but they cannot deliver implementations 2. And they work in teams which cannot deliver innovation within companies, government, and research instiutions
  • 11. Open Source CS Tools Math Quantitative Data-X Project: Real Applications with Code Samples What is in this course Holistic Perspective: Industry, Social Applications, Customer Driven
  • 12. What is in this class? Common Open Source CS Tools: • Numpy, SciPy • Pandas • TensorFlow, Sklearn • SQL to Pandas • NLP / NLTK • Matplotlib Quantitative • Prediction: Regression • ML Classification: Logistic, SVM.. Trees, Forests, Bagging, Boosting,.. • Entropy / Information Topics • Deep Learning examples, including CCNs • Correlations • Markov Processes • LTI Systems: Fourier, Filters where applicable • Control Models where applicable Building Block Code Samples • Webscraping • Stock market live download, simple trading • Convolutional Neural Networks • Next Word Predictor, Spell Checking • Recommendation • Web Crawler • Chatbot, E-mail • Social net interfaces including twitter This class will help you combine math and data concepts The course updates with new tools to stay current. You may learn and use tools not presented in the class project. Often: Working Code First Fill In Theory After
  • 13. What is actually in this class? Common Open Source CS Tools: • Numpy, SciPy • Pandas • TensorFlow, Sklearn • SQL to Pandas • NLP / NLTK • Matplotlib Quantitative • Prediction: Regression • ML Classification: Logistic, SVM.. Trees, Forests, Bagging, Boosting,.. • Entropy / Information Topics • Deep Learning examples, including CCNs • Correlations • Markov Processes • LTI Systems: Fourier, Filters where applicable • Control Models where applicable Building Block Code Samples • Webscraping • Stock market live download, simple trading • Convolutional Neural Networks • Next Word Predictor, Spell Checking • Recommendation • Web Crawler • Chatbot, E-mail • Social net interfaces including twitter Often: Working Code First Fill In Theory After • The ML stack use most commonly used in creating ML/AI/Data applications • Application and systems viewpoint of data and ML • Implementation, architecture, and relevant process to build anything • Statistical, rule based, and hybrid decision systems • Connection with relevant mathematical foundations (entropy, correlation, spectral, LTI, basic prediction, classification) • Practical insight into advanced techniques and tools: (eg. CNNs, NLP, scraping, recurrent networks, etc.) • System modeling for data applications
  • 14. Many Course Resources Are Already Available at data-x.blog For those who want to help students or technical experts learn these skills We can help in other ways as well
  • 16. Make the Tools Use the Tools (Optimally) Architect the System Why and how you build Most CS Sutardja CenterThis Course Where we focus:
  • 17. Propose Low Tech Solution (1) Brainstorm Challenge and Validate (4) Demo or Die (1) Execute * Iterate BMoE Reflections Agile Sprint (8) Insightful Story Solution How the Data-X Course Works: Team: typically 5 students, with available advisor network
  • 18. The Data-X System View Web Scrape Possible Input Code Blocks Download Crawl … Stream or Poll Social Net / IoT Application with Automated Decisions Algorithm Options w/ Tables/Matrix Prediction / Classification Test, train, split Keep state Pandas: Short Term Storage Long Term Storage: SQL and File Formats (JSON, CSV, Excel) Web Possible Output Code Blocks Email Control Decision … Chatbot Feedback from External System (World) Pre- process Natural Language, State Features Blockchain (public ledger or cryptolock)APIs, Services APIs, Services
  • 19. Our model has adapted: Business training is not the only key element Observation: student projects and professional projects that do well require a different understanding. We created a model and framework to provide these: Data-X Model Layers 1. Tools: using vs making. Learn to use and understand state of the art tools and technical approaches 2. Theory: Understanding the theory and frameworks behind the tools using first principals 3. Projects: Story first, second is development agility and stakeholders acquisition 4. Project Viewpoints: 5 Viewpoints integrated into the teaching model 5. Behaviors and Mindsets: 6 Behaviors and mindsets tuned for innovation
  • 20. Our model has adapted: Business training is not the only key element Notes: # * combinations of on-line active systems, API economy, powerful open source tools, live systems that must run and be current all the time, cloud infrastructure compute and storage blocks, .. Data-X Model Layers 5 Project Viewpoints: a) Customer touchpoints b) Systems and architecture c) Risk mitigation, d) Agile increments, e) Swim-lanes and team dynamics 6 Necessary Behaviors and Mindsets: a) The target is moving, b) Tools are powerful - use them c) The system is the whole world * d) There is no greenfield - connect to the existing structure, means know the existing structure e) You can’t know it all before you start f) Develop insight, use technical/theoretical analogies, first principals, but don’t just plug and play
  • 21. Project Types Business or Consumer Use Case Social Impact Its Just Cool (or improve part of a data pipeline or work towards a research result)