SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Copyright © 2017 Anki, Inc. 1
Andrew Stein
May 2017
Making Cozmo See
Copyright © 2017 Anki, Inc. 2
Introducing Anki & Cozmo
• About Anki [http://www.anki.com]
• Consumer Robotics & AI Company, currently focused
on entertainment
• Two Products: Overdrive and Cozmo
• About Cozmo [http://www.anki.com/en-us/cozmo]
• App-driven, autonomous robot character
• Launched Fall 2016, $179
• Full-featured SDK (low- and high-level API)
Copyright © 2017 Anki, Inc. 3
• Primary Goal: Bring Cozmo to Life
• All about personality, via animation
• Interactive, aware
About Cozmo
Copyright © 2017 Anki, Inc. 4
Cozmo’s Personality
Copyright © 2017 Anki, Inc. 5
• Outputs:
• 4 Mechanical DoF: Head, Lift, Treaded Wheels
• Speaker
• OLED face
• Sensors:
• Camera (VGA, 15fps)
• Cliff sensor
• Encoders and IMU
• Battery Life: 1-2 hours
• 3 Processors: comms, sensors/motor control, supervisor
Cozmo’s Core Hardware Capabilities
Copyright © 2017 Anki, Inc. 6
Simple Linear Flow
Script Animation Library
I wish!
Copyright © 2017 Anki, Inc. 7
Cozmo Flow
Reaction Triggers
Game
Logic
World Model
Path Planning
Sensors
Emotional
State
Controlled
Movement
Animation Library
Behaviors
Copyright © 2017 Anki, Inc. 8
• Cozmo moves
• Motion blur
• Must not drive off cliffs
• Odometry not reliable (treads!)
• Cozmo gets moved
• “Robot Kidnapping” not just
academic with six-year-olds!
• How to maintain and
communicate world state?
• Cozmo interacts: Faces, cubes, charger, etc.
Not a Static Surveillance Camera
Copyright © 2017 Anki, Inc. 9
Vision Capabilities
Copyright © 2017 Anki, Inc. 10
• Detect/track/recognize: cubes, charger, faces, motion, etc.
• Robust enrollment for face recognition
• Make eye contact
• Estimate 3D pose of objects
• Maintain map of the world (even if picked up)
• Rolling shutter correction
• Flexible scheduling according to needs and computational budget
Vision Capabilities
Copyright © 2017 Anki, Inc. 11
• Store up to 10 identities across play sessions, enrolled via app activity
• Remember 3D poses for eye contact
Face Detection, Tracking, & Recognition
Copyright © 2017 Anki, Inc. 12
Cat & Dog Face Detection
Copyright © 2017 Anki, Inc. 13
Markers: Cubes, Charger, & Custom Objects (SDK)
Custom Markers for SDK
Copyright © 2017 Anki, Inc. 15
Markers: Design vs. Vision
Vs.
Copyright © 2017 Anki, Inc. 16
Detecting “Interesting Stuff”
Copyright © 2017 Anki, Inc. 17
System Architecture
Copyright © 2017 Anki, Inc. 18
• Use Smart Device for heavy
computation
• Reduce robot cost
• Better development environment
System Architecture
UI + Game Logic
Robotics
Engine
(Vision / AI)
Hardware Drivers
Low-Level
Controllers
Comms
Python
SDK
CLAD via Wifi CLAD via USB
CLAD: Custom
Cross-Platform Messaging
(similar to protobufs)
CLAD (Local)
Copyright © 2017 Anki, Inc. 19
• Animations don’t play instantaneously (must buffer audio)
• Face Enrollment in “Meet Cozmo” activity (limited visual feedback)
• State Mirroring (from engine to Robot / Game / SDK)
• Latency for vision and control
Architecture Complications
Copyright © 2017 Anki, Inc. 21
Under the Hood of the Full System
Copyright © 2017 Anki, Inc. 22
Under the Hood (Manual Drive)
Copyright © 2017 Anki, Inc. 23
Under the Hood (Autonomous)
Copyright © 2017 Anki, Inc. 24
Computer Vision in a Product at Scale
Copyright © 2017 Anki, Inc. 25
Computer Vision is Not the Product
Realistic
Expectations
Product
Requirements
Computer Vision
Copyright © 2017 Anki, Inc. 26
Computer Vision at Scale: Calibration
Manipulating cubes requires high precision
• Camera variability (lens placement)
• Mechanical tolerances
Must calibrate each camera at factory
• Verify camera placement inside head
• Estimate intrinsic parameters
Copyright © 2017 Anki, Inc. 27
• Cannot control
lighting in
people’s homes
• Cube lights
complicate local
binarization
Computer Vision at Scale: Illumination
Copyright © 2017 Anki, Inc. 28
• Detecting and responding to failures
is as important as being successful
Failure Detection
Copyright © 2017 Anki, Inc. 30
• Cozmo: interactive, complex, full of personality
• Computer vision is only a piece of the product,
does not drive all decisions
• Computer vision at scale, in a consumer product,
comes with special challenges and considerations
• Setting expectations for what is (not) possible with
vision is important in a world of sci-fi TV & movies
• Cozmo is a fun little robot to develop for
Conclusions
Copyright © 2017 Anki, Inc. 31
What Cozmo is All About
Copyright © 2017 Anki, Inc. 32
Join us!
We’re
hiring!

Contenu connexe

Plus de Edge AI and Vision Alliance

“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...Edge AI and Vision Alliance
 
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...Edge AI and Vision Alliance
 
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...Edge AI and Vision Alliance
 
“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap
“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap
“Practical Approaches to DNN Quantization,” a Presentation from Magic LeapEdge AI and Vision Alliance
 
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ..."Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...Edge AI and Vision Alliance
 
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...Edge AI and Vision Alliance
 
“A Survey of Model Compression Methods,” a Presentation from Instrumental
“A Survey of Model Compression Methods,” a Presentation from Instrumental“A Survey of Model Compression Methods,” a Presentation from Instrumental
“A Survey of Model Compression Methods,” a Presentation from InstrumentalEdge AI and Vision Alliance
 
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AIEdge AI and Vision Alliance
 

Plus de Edge AI and Vision Alliance (20)

“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
“Tracking and Fusing Diverse Risk Factors to Drive a SAFER Future,” a Present...
 
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...
“MIPI CSI-2 Image Sensor Interface Standard Features Enable Efficient Embedde...
 
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...
“Introduction to the CSI-2 Image Sensor Interface Standard,” a Presentation f...
 
“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap
“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap
“Practical Approaches to DNN Quantization,” a Presentation from Magic Leap
 
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ..."Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...
"Optimizing Image Quality and Stereo Depth at the Edge," a Presentation from ...
 
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...
“Using a Collaborative Network of Distributed Cameras for Object Tracking,” a...
 
“A Survey of Model Compression Methods,” a Presentation from Instrumental
“A Survey of Model Compression Methods,” a Presentation from Instrumental“A Survey of Model Compression Methods,” a Presentation from Instrumental
“A Survey of Model Compression Methods,” a Presentation from Instrumental
 
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI
“Reinventing Smart Cities with Computer Vision,” a Presentation from Hayden AI
 

Dernier

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Dernier (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

"Making Cozmo See," a Presentation from Anki

  • 1. Copyright © 2017 Anki, Inc. 1 Andrew Stein May 2017 Making Cozmo See
  • 2. Copyright © 2017 Anki, Inc. 2 Introducing Anki & Cozmo • About Anki [http://www.anki.com] • Consumer Robotics & AI Company, currently focused on entertainment • Two Products: Overdrive and Cozmo • About Cozmo [http://www.anki.com/en-us/cozmo] • App-driven, autonomous robot character • Launched Fall 2016, $179 • Full-featured SDK (low- and high-level API)
  • 3. Copyright © 2017 Anki, Inc. 3 • Primary Goal: Bring Cozmo to Life • All about personality, via animation • Interactive, aware About Cozmo
  • 4. Copyright © 2017 Anki, Inc. 4 Cozmo’s Personality
  • 5. Copyright © 2017 Anki, Inc. 5 • Outputs: • 4 Mechanical DoF: Head, Lift, Treaded Wheels • Speaker • OLED face • Sensors: • Camera (VGA, 15fps) • Cliff sensor • Encoders and IMU • Battery Life: 1-2 hours • 3 Processors: comms, sensors/motor control, supervisor Cozmo’s Core Hardware Capabilities
  • 6. Copyright © 2017 Anki, Inc. 6 Simple Linear Flow Script Animation Library I wish!
  • 7. Copyright © 2017 Anki, Inc. 7 Cozmo Flow Reaction Triggers Game Logic World Model Path Planning Sensors Emotional State Controlled Movement Animation Library Behaviors
  • 8. Copyright © 2017 Anki, Inc. 8 • Cozmo moves • Motion blur • Must not drive off cliffs • Odometry not reliable (treads!) • Cozmo gets moved • “Robot Kidnapping” not just academic with six-year-olds! • How to maintain and communicate world state? • Cozmo interacts: Faces, cubes, charger, etc. Not a Static Surveillance Camera
  • 9. Copyright © 2017 Anki, Inc. 9 Vision Capabilities
  • 10. Copyright © 2017 Anki, Inc. 10 • Detect/track/recognize: cubes, charger, faces, motion, etc. • Robust enrollment for face recognition • Make eye contact • Estimate 3D pose of objects • Maintain map of the world (even if picked up) • Rolling shutter correction • Flexible scheduling according to needs and computational budget Vision Capabilities
  • 11. Copyright © 2017 Anki, Inc. 11 • Store up to 10 identities across play sessions, enrolled via app activity • Remember 3D poses for eye contact Face Detection, Tracking, & Recognition
  • 12. Copyright © 2017 Anki, Inc. 12 Cat & Dog Face Detection
  • 13. Copyright © 2017 Anki, Inc. 13 Markers: Cubes, Charger, & Custom Objects (SDK) Custom Markers for SDK
  • 14. Copyright © 2017 Anki, Inc. 15 Markers: Design vs. Vision Vs.
  • 15. Copyright © 2017 Anki, Inc. 16 Detecting “Interesting Stuff”
  • 16. Copyright © 2017 Anki, Inc. 17 System Architecture
  • 17. Copyright © 2017 Anki, Inc. 18 • Use Smart Device for heavy computation • Reduce robot cost • Better development environment System Architecture UI + Game Logic Robotics Engine (Vision / AI) Hardware Drivers Low-Level Controllers Comms Python SDK CLAD via Wifi CLAD via USB CLAD: Custom Cross-Platform Messaging (similar to protobufs) CLAD (Local)
  • 18. Copyright © 2017 Anki, Inc. 19 • Animations don’t play instantaneously (must buffer audio) • Face Enrollment in “Meet Cozmo” activity (limited visual feedback) • State Mirroring (from engine to Robot / Game / SDK) • Latency for vision and control Architecture Complications
  • 19. Copyright © 2017 Anki, Inc. 21 Under the Hood of the Full System
  • 20. Copyright © 2017 Anki, Inc. 22 Under the Hood (Manual Drive)
  • 21. Copyright © 2017 Anki, Inc. 23 Under the Hood (Autonomous)
  • 22. Copyright © 2017 Anki, Inc. 24 Computer Vision in a Product at Scale
  • 23. Copyright © 2017 Anki, Inc. 25 Computer Vision is Not the Product Realistic Expectations Product Requirements Computer Vision
  • 24. Copyright © 2017 Anki, Inc. 26 Computer Vision at Scale: Calibration Manipulating cubes requires high precision • Camera variability (lens placement) • Mechanical tolerances Must calibrate each camera at factory • Verify camera placement inside head • Estimate intrinsic parameters
  • 25. Copyright © 2017 Anki, Inc. 27 • Cannot control lighting in people’s homes • Cube lights complicate local binarization Computer Vision at Scale: Illumination
  • 26. Copyright © 2017 Anki, Inc. 28 • Detecting and responding to failures is as important as being successful Failure Detection
  • 27. Copyright © 2017 Anki, Inc. 30 • Cozmo: interactive, complex, full of personality • Computer vision is only a piece of the product, does not drive all decisions • Computer vision at scale, in a consumer product, comes with special challenges and considerations • Setting expectations for what is (not) possible with vision is important in a world of sci-fi TV & movies • Cozmo is a fun little robot to develop for Conclusions
  • 28. Copyright © 2017 Anki, Inc. 31 What Cozmo is All About
  • 29. Copyright © 2017 Anki, Inc. 32 Join us! We’re hiring!