A New AI Platform Architecture for the Smart Toys of the Future
Gabriel Costache
Senior R&D Director
XPERI
• 40+ offices worldwide; headquarters in San Jose, CA
• $1.5B+ market cap; public company, trading under XPER
• 1,600+ employees worldwide
• 1,500+ engineers
• 11,000+ patent assets
• 100B+ devices worldwide empowered by technologies delivered via Xperi brands
Ideal Smart Toy
• Safe
• Secure
• Private
• Enhances child development
• Uses natural interaction
• Monitors child cognitive load
• Develops with the child
• Long battery life
• Re-usable
© 2022 XPERI
Smart Toy Examples
Privacy Issues
Smart Toy Challenges
• Data privacy
• Safety
• Battery life
• Fast response
• AI technologies for children
• Data bias in AI
• Natural interaction with children
• Multimodal: audio, imaging, sensing
DAVID – Data-center Audio/Video Intelligence on Device
DTIF (Disruptive Technologies Innovation Fund)
D.A.V.I.D
DAVID will develop a “privacy by design” AI platform capable of multi-modal, ultra-low-power, “data center” level processing of audio and vision data on-device, without the need to transmit any personal data to the cloud.
What DAVID will deliver to the smart toy market:
• A platform for a wide range of learning and interactive applications in the toy market
• A smart, trusted proof-of-concept toy that uses this platform to help children learn and develop, built on XPERI imaging technology, the Perceive® Ergo® chip and SoapBox Labs speech technology, in collaboration with the National University of Ireland, Galway.
• Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities.
AI Technologies to be Considered
All-in-one Chip/Platform Designed for Privacy
Multi-modal Platform Communication: Speech, Expressions, Emotions, Gesture, Context and others
Perception:
• Imaging/Vision
- Face Analytics
- Body Analytics
- Hand Analytics
- Video Compression
- Thermal Imaging
• Audio
- Wake Words / VAD
- Speech2Text / ASR
- Voice Analytics / Biometrics
• Sensing
Interaction:
• Visual
• Audio
- Text2Speech
- Sound Generation
Others:
• Language Models / Conversational Models
• Multi Modal Intent
• Cognitive and Behaviour Analysis
• Personalization
• Interactive Games
Perceive® Ergo® AI Processor
Source: A Reuther et al. MIT Lincoln Laboratory Supercomputing Center-arXiv:2009.00993
Ergo*
*Note: Ergo uses a proprietary representation. Ergo is not INT8.
DAVID Platform Design
DAVID Platform Specifications
• Interfaces:
- I2S (Tx, Rx), I2C (Tx, Rx) (HUB and Ergo)
- MIPI and Parallel (Ergo)
- SPI and QSPI (HUB and Ergo)
- GPIO (HUB and Ergo)
- FTDI (JTAG, UART) (HUB)
- WiFi/BT (HUB)
- USB OTG (HUB)
• Computation Units:
- 3 x Ergo (55 TOPS/Watt + Arc CPUDSP)
- HUB STM32 MCU (Arm M7)
- ESP32 (2x Xtensa LX6)
• Memory:
- 16MB QSPI Flash (Ergo)
- 128MB QSPI Flash + 32MB SRAM (HUB)
- 448 KB ROM + 520 KB SRAM (ESP32)
- SDCard (HUB)
DAVID Toy PoC
[Figure: toy proof-of-concept with labeled components: microphones, camera, thermal, LCDs, PIR, speaker, contacts, wireless charging, boards, battery & sensors]
Current Ergo Vision Application
[Diagram: vision pipeline running on ERGO]
• Blocks: Face, Body & Hand Detection; Face Alignment; Facial Analytics FR CNN; Body Analytics; Hand Analytics; Video Encoder
• Facial analytics outputs: Facial Landmarks, Face Orientation, Face Expression, Face Embedding FR
• Body analytics output: Body Landmarks/Skeleton
• Hand analytics output: Hand Gestures
• Video encoder output: Encoded stream
• Data labels on the arrows: x, y, w, h, confidence, trackID; x1,y1, x2,y2, ….; Tx, Ty, Rot, Scale; x, y, w, h
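The labels in the pipeline diagram hint at the per-frame metadata the vision application produces. Below is a minimal Python sketch of how a host application might hold that metadata. The class names, field types, grouping, and the `keep_confident` helper are assumptions for illustration only; the slides give just the labels (x, y, w, h, confidence, trackID; x1,y1, x2,y2; Tx, Ty, Rot, Scale), not the actual Ergo output format.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Detection:
    """One detected face, body, or hand (field names follow the diagram labels)."""
    x: float
    y: float
    w: float
    h: float
    confidence: float
    track_id: int  # "trackID" in the diagram


@dataclass
class Alignment:
    """Alignment parameters labeled Tx, Ty, Rot, Scale in the diagram."""
    tx: float
    ty: float
    rot: float
    scale: float


@dataclass
class FaceResult:
    """Hypothetical container tying analytics outputs to one detection."""
    detection: Detection
    alignment: Optional[Alignment] = None
    # Landmark points (x1, y1), (x2, y2), ... as labeled in the diagram.
    landmarks: List[Tuple[float, float]] = field(default_factory=list)


def keep_confident(detections: List[Detection], threshold: float) -> List[Detection]:
    """Drop detections below a confidence threshold (threshold is an assumed parameter)."""
    return [d for d in detections if d.confidence >= threshold]


if __name__ == "__main__":
    dets = [
        Detection(10, 20, 64, 64, 0.92, track_id=1),
        Detection(200, 40, 32, 32, 0.35, track_id=2),
    ]
    strong = keep_confident(dets, threshold=0.5)
    print([d.track_id for d in strong])
```

Keeping the detection record separate from the optional analytics results means downstream stages can attach alignment or landmarks only to the detections they actually process.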
Example Ergo Application
• Frame rate 30 fps
• Resolution 320x320
• Power ~100 mW
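To put these figures in perspective, the short Python sketch below works out the average energy per frame and a rough battery runtime. The function names are made up for this example, and the 3.7 Wh battery (about 1000 mAh at 3.7 V) is a hypothetical value, not something from the slides. The ~100 mW covers only the example application as reported here; the rest of the toy (hub MCU, radio, displays, speaker) would draw more, so the runtime is an optimistic upper bound.

```python
def energy_per_frame_mj(power_mw: float, fps: float) -> float:
    """Average energy per processed frame in millijoules (mW / fps = mJ per frame)."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return power_mw / fps


def runtime_hours(battery_wh: float, power_mw: float) -> float:
    """Hours a battery of the given capacity could sustain a constant load."""
    if power_mw <= 0:
        raise ValueError("power_mw must be positive")
    return battery_wh / (power_mw / 1000.0)


if __name__ == "__main__":
    # Figures from the slide: ~100 mW at 30 fps.
    print(round(energy_per_frame_mj(100, 30), 2), "mJ per frame")
    # Hypothetical 3.7 Wh battery; an assumption, not from the slides.
    print(round(runtime_hours(3.7, 100), 1), "hours")
```

At the stated operating point this comes to roughly 3.3 mJ per 320x320 frame.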
Video Encoding
Fully neural video encoder (Ergo) and decoder (generic)
• Trained end-to-end
• Custom stream – data privacy
• Extra security can be added
• Y only currently, but can easily be extended to color
• Enabler of other image enhancement technologies: colorization, super resolution
• Can enable smart monitoring
[Diagram labels: Camera; MIPI/Parallel; ERGO Video Encoder; Stream Packing; Hub; Streaming App; Video Decoder (ONNX, TFLite, NNAPI); Mobile App; Decoded Frame]
Speech/Audio Neural Synthesis
• Current Ergo board 3 application: Text2Speech -> spectrogram generation + vocoder
• Focus on comprehension, less on naturalness
• Next focus: voice adaptation, voice cloning
• Extend to sound/music generation
DAVID Partners
• SoapBox Labs: powers magical and joyful experiences for kids using speech technology that is engaging, fun, and frictionless.
• NUIG C3I - Center for Computational, Cognitive & Connected Imaging
Conclusions
• Smart Toy requirements:
- Privacy
- Battery life
- Multimodal interaction
• Platform requirements:
- Dedicated NN unit with very high OPs/W
- Communication unit
- Multiple sensor support
- Generic processing unit
• DAVID platform and toy PoC
- Available Q3/Q4 2022 for selected partners
Resources
• Xperi – www.Xperi.com
• Perceive, Ergo – www.perceive.io
• SoapBox Labs – www.soapboxlabs.com
• C3I, National University of Ireland, Galway - www.nuigalway.ie/c3i
• Disruptive Technologies Innovation Fund – DTIF
• STMicroelectronics STM32 MCU
• Espressif Systems ESP32

Contenu connexe

Similaire à “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05Gordon Kraft
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntelAPAC
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntelAPAC
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistData Con LA
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011Michael Heydt
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2Gordon Kraft
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideIsaac Rodriguez
 
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la ActualidadLaurence HR
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesDevGAMM Conference
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIntel IT Center
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Frank Carey
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioBill Wong
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung binusgamelab
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTJim McKeeth
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingDesign World
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development CompanyRuchir Kakkad
 

Similaire à “A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi (20)

AiLIbrary White paper05
AiLIbrary White paper05AiLIbrary White paper05
AiLIbrary White paper05
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration CenterIntel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
 
The AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves BergquistThe AI Takeover in Hollywood by Yves Bergquist
The AI Takeover in Hollywood by Yves Bergquist
 
google glass
google glassgoogle glass
google glass
 
NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011NUX Presentation from TechMixer Birmingham 2011
NUX Presentation from TechMixer Birmingham 2011
 
AiLibrary Whitepaper 2
AiLibrary Whitepaper 2AiLibrary Whitepaper 2
AiLibrary Whitepaper 2
 
HPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program GuideHPE Discover 2017 - Internet of Things Program Guide
HPE Discover 2017 - Internet of Things Program Guide
 
Hololens
HololensHololens
Hololens
 
Taller IoT en la Actualidad
Taller IoT en la ActualidadTaller IoT en la Actualidad
Taller IoT en la Actualidad
 
Unity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobilesUnity: What does it take to port a browser title to mobiles
Unity: What does it take to port a browser title to mobiles
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel:  Creating Smart Spaces with All-in-OnesIT@Intel:  Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
IT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-OnesIT@Intel: Creating Smart Spaces with All-in-Ones
IT@Intel: Creating Smart Spaces with All-in-Ones
 
Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013Google glass and the wearable revolution - NYCCamp 2013
Google glass and the wearable revolution - NYCCamp 2013
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
 
Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung Realsense only STAGE 01 - Firstman Marpaung
Realsense only STAGE 01 - Firstman Marpaung
 
The Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoTThe Internet of Things and You - A Developers Guide to IoT
The Internet of Things and You - A Developers Guide to IoT
 
Telepresence Cisco
Telepresence CiscoTelepresence Cisco
Telepresence Cisco
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensing
 
Ai Development Company
Ai Development CompanyAi Development Company
Ai Development Company
 

Plus de Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

Plus de Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Dernier

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
“A New AI Platform Architecture for the Smart Toys of the Future,” a Presentation from Xperi

  • 1. A New AI Platform Architecture for the Smart Toys of the Future. Gabriel Costache, Senior R&D Director, XPERI
  • 2. 40+ offices worldwide, headquarters in San Jose, CA; $1.5B+ market cap (public company, trading under XPER); 1,600+ employees worldwide; 1,500+ engineers; 11,000+ patent assets; 100B+ devices worldwide empowered by technologies delivered via Xperi brands
  • 3. Ideal Smart Toy: safe; secure; private; enhances child development; uses natural interaction; monitors child cognitive load; develops with the child; long battery life; re-usable. © 2022 XPERI
  • 6. Smart Toy Challenges: data privacy; safety; battery life; fast response; AI technologies for children; data bias in AI; natural interaction with children; multimodal (audio, imaging, sensing). DTIF (Disruptive Technologies Innovation Fund): D.A.V.I.D
  • 7. DAVID: Data-center Audio/Video Intelligence on Device. DAVID will develop a “privacy by design” AI platform, capable of multi-modal, ultra-low-power, “data center” level processing of audio and vision data on-device, without the need to transmit any personal data to the cloud. What DAVID will deliver to the smart toy market:
      • A platform for a wide range of learning and interactive applications in the toy market
      • A smart, trusted proof-of-concept toy using this platform that helps children learn and develop, using XPERI imaging technology, the Perceive® Ergo® chip and SoapBox Labs speech technology capabilities, in collaboration with the National University of Ireland, Galway
      • Cloud-free capabilities to ensure privacy and wonderfully immersive user experiences for children of all abilities
      • All-in-one chip/platform, designed for privacy
      • Multi-modal platform communication: speech, expressions, emotions, gesture, context and others
  • 8. AI Technologies to be Considered
      • Perception
        - Imaging/Vision: Face Analytics, Body Analytics, Hand Analytics, Video Compression, Thermal Imaging
        - Audio: Wake Words / VAD, Speech2Text / ASR, Voice Analytics / Biometrics
        - Sensing
      • Interaction
        - Visual
        - Audio: Text2Speech, Sound Generation
      • Others: Language Models / Conversational Models, Multi Modal Intent, Cognitive and Behaviour Analysis, Personalization, Interactive Games
  • 9. Perceive® Ergo® AI Processor. [Figure: chart with Ergo* marked. Source: A. Reuther et al., MIT Lincoln Laboratory Supercomputing Center, arXiv:2009.00993.] *Note: Ergo uses a proprietary representation. Ergo is not INT8.
  • 11. DAVID Platform Specifications
      • Interfaces:
        - I2S (Tx, Rx), I2C (Tx, Rx) (HUB and Ergo)
        - MIPI and Parallel (Ergo)
        - SPI & QSPI (HUB and Ergo)
        - GPIO (HUB and Ergo)
        - FTDI (JTAG, UART) (HUB)
        - WiFi/BT (HUB)
        - USB OTG (HUB)
      • Computation Units:
        - 3 x Ergo (55 TOPS/Watt + Arc CPU/DSP)
        - HUB STM32 MCU (Arm M7)
        - ESP32 (2x Xtensa LX6)
      • Memory:
        - 16MB QSPI Flash (Ergo)
        - 128MB QSPI Flash + 32MB SRAM (HUB)
        - 448 KB ROM + 520 KB SRAM (ESP32)
        - SDCard (HUB)
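
To put the 55 TOPS/Watt figure in context, the sketch below converts an efficiency figure into an upper bound on throughput for a given power budget. Only the 55 TOPS/Watt value comes from the slide; the function names and the power budgets in the loop are illustrative assumptions, and real throughput depends on the workload and on how the quoted efficiency was measured.

```python
# Sketch only: relates an efficiency figure (TOPS/W) to a power budget.
# The 55 TOPS/W value is from the slide; everything else is hypothetical.

ERGO_TOPS_PER_WATT = 55.0


def peak_tops(efficiency_tops_per_watt: float, power_watts: float) -> float:
    """Upper bound on throughput if the chip ran at its quoted efficiency."""
    return efficiency_tops_per_watt * power_watts


def ops_per_frame(tops: float, fps: float) -> float:
    """Operations available per frame at a given frame rate."""
    return tops * 1e12 / fps


if __name__ == "__main__":
    for milliwatts in (50, 100, 200):  # hypothetical power budgets
        tops = peak_tops(ERGO_TOPS_PER_WATT, milliwatts / 1000)
        print(f"{milliwatts} mW -> up to {tops:.2f} TOPS")
```

The result is a ceiling, not a prediction: it assumes the whole budget is spent in the neural unit at its best-case efficiency.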
  • 12. DAVID Toy PoC. [Figure: toy with labeled components: microphones, camera, thermal, LCDs, PIR, speaker, contacts, wireless charging, boards, battery & sensors]
  • 13. Current Ergo Vision Application (example Ergo application; diagram blocks numbered 1 to 6)
      • Frame rate: 30 fps; resolution: 320x320; power: ~100 mW
      • Face, Body & Hand Detection: x, y, w, h, confidence, trackID
      • Facial Analytics: Facial Landmarks (x1,y1, x2,y2 ….), Face Orientation, Face Expression, Face Embedding
      • Face Alignment: Tx, Ty, Rot, Scale
      • FR CNN (FR)
      • Body Analytics: Body Landmarks/Skeleton
      • Hand Analytics: Hand Gestures
      • Video Encoder: Encoded stream
      • Other diagram labels: ERGO; x, y, w, h
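
The detection output and the operating point on this slide can be made concrete with a short sketch. The field names mirror the slide (x, y, w, h, confidence, trackID), but the class, the helper functions, and the assumption that coordinates are in pixels of the 320x320 frame are hypothetical; the slide does not specify the actual data format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One detector output record; field names follow the slide."""
    x: float
    y: float
    w: float
    h: float
    confidence: float
    track_id: int

    def area_fraction(self, frame_w: int = 320, frame_h: int = 320) -> float:
        """Share of the frame covered by the box (pixel units assumed)."""
        return (self.w * self.h) / (frame_w * frame_h)


def parse_detection(values) -> Detection:
    """Build a Detection from a flat 6-element sequence (hypothetical layout)."""
    x, y, w, h, conf, track_id = values
    return Detection(float(x), float(y), float(w), float(h),
                     float(conf), int(track_id))


def energy_per_frame_mj(power_mw: float, fps: float) -> float:
    """mW divided by frames per second gives millijoules per frame."""
    return power_mw / fps


if __name__ == "__main__":
    det = parse_detection([120, 96, 80, 80, 0.93, 4])
    print(det, det.area_fraction())
    print(energy_per_frame_mj(100, 30))  # slide figures: ~100 mW at 30 fps
```

With the slide's figures, the energy cost works out to roughly 3.3 mJ per processed frame.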
  • 14. Video Encoding: fully neural video encoder (Ergo) and decoder (generic)
      • Trained end-to-end
      • Custom stream: data privacy
      • Extra security can be added
      • Currently Y (luma) only, but can easily be extended to color
      • Enabler of other image enhancement technologies: colorization, super resolution
      • Can enable smart monitoring
      • Diagram labels: Camera, MIPI/Parallel, ERGO Video Encoder, Stream Packing, Hub, Streaming App, Video Decoder (ONNX, TFLite, NNAPI), Mobile App, Decoded Frame
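
The slide names a "Stream Packing" step between the encoder and the streaming app but does not describe its format. The sketch below shows one generic way such a step could work: splitting an encoded frame into fixed-size chunks with a small header, then reassembling them on the receiving side. The header layout, chunk size, and function names are all hypothetical and are not taken from the DAVID platform.

```python
import struct

# Hypothetical header: frame id (u32), chunk index (u16), chunk count (u16).
HEADER = struct.Struct("<IHH")


def pack_frame(frame_id: int, payload: bytes, max_chunk: int = 256) -> list[bytes]:
    """Split one encoded frame into header-prefixed packets."""
    chunks = [payload[i:i + max_chunk]
              for i in range(0, len(payload), max_chunk)] or [b""]
    return [HEADER.pack(frame_id, idx, len(chunks)) + chunk
            for idx, chunk in enumerate(chunks)]


def unpack_frame(packets: list[bytes]) -> tuple[int, bytes]:
    """Reassemble a frame from its packets, in any arrival order."""
    parts = {}
    frame_id = total = None
    for pkt in packets:
        fid, idx, count = HEADER.unpack_from(pkt)
        if frame_id is None:
            frame_id, total = fid, count
        elif (fid, count) != (frame_id, total):
            raise ValueError("packets belong to different frames")
        parts[idx] = pkt[HEADER.size:]
    if frame_id is None or len(parts) != total:
        raise ValueError("missing packets")
    return frame_id, b"".join(parts[i] for i in range(total))


if __name__ == "__main__":
    encoded = bytes(range(256)) * 3 + b"xyz"  # stand-in for encoder output
    packets = pack_frame(7, encoded)
    print(len(packets), unpack_frame(packets[::-1]) == (7, encoded))
```

Because the payload is the output of a learned encoder rather than a standard codec, a receiver without the matching decoder model cannot reconstruct frames, which is the privacy property the "custom stream" bullet points to.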
  • 15. Speech/Audio Neural Synthesis
      • Current Ergo board 3 application: Text2Speech -> spectrogram generation + vocoder
      • Focus on comprehension, less on naturalness
      • Next focus: voice adaptation, voice cloning
      • Extend to sound/music generation
  • 16. DAVID Partners: [partner logo] “powers magical and joyful experiences for kids using speech technology that is engaging, fun, and frictionless”; NUIG C3I - Center for Computational, Cognitive & Connected Imaging
  • 17. Conclusions
      • Smart Toy requirements:
        - Privacy
        - Battery life
        - Multimodal interaction
      • Platform requirements:
        - Dedicated NN unit with very high OPs/W
        - Communication unit
        - Multiple sensor support
        - Generic processing unit
      • DAVID platform and toy PoC:
        - Available Q3/Q4 2022 for selected partners
  • 19. Resources
      • Xperi: www.Xperi.com
      • Perceive, Ergo: www.perceive.io
      • SoapBox Labs: www.soapboxlabs.com
      • C3I, National University of Ireland, Galway: www.nuigalway.ie/c3i
      • Disruptive Technologies Innovation Fund (DTIF)
      • STMicroelectronics STM32 MCU
      • Espressif Systems ESP32