© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:Invent
Designing Far-Field Voice Solutions with Intel
November 30, 2017

Meet the team!
• Chris Keyser, Principal Solutions Architect, Alexa Voice Service
• Sanjay Devireddy, Software Development Engineer, Alexa Voice Service
• Gagan Luthra, Sr. Product Marketing Manager, Alexa Voice Service
• Fletcher Nelson, Solutions Architect, Alexa Voice Service
• Navneet Malpani, Principal Solutions Architect, Alexa Voice Service

The evolution of voice interaction
• Single microphone
• Always on
• Push to talk
• Built-in speaker
• Voice activated
• Noise cancellation
• 360-degree mic coverage
• Increased detection range

Amazon Echo: The first Alexa-enabled device
“Amazon’s Echo might be the most important product in years”
“Amazon Echo proves the viability of voice-based home computing.”
“Amazon Alexa just conquered CES 2017. The world is next.”
“Alexa is clearly the star of IFA 2017”

Hi, Alexa… Hello!

Always getting smarter
• Smart home control
• Timers and alarms
• Cloud-based intelligence
• Shopping
• Music
• News and information
• Custom skills

Amazon’s Alexa-enabled products

Vision: Alexa anywhere
• Home audio
• Thermostats
• Automobiles
• Maker boards
• Wearables
• Small appliances
• Smartphones
• Large appliances
• Home security
• TV/video

Alexa-enabled products

Simplifying voice interaction
• Directives: Instructions sent from AVS to the client
• Events: Information sent from the client to AVS
• Voice hardware: Audio front end
• Device software: AVS Device SDK
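
The event and directive exchange can be sketched in a few lines of Python. This is a simplified illustration, not the AVS Device SDK: the field names follow the publicly documented AVS message envelope as best understood here, the context block and the audio attachment are omitted, and the handler table is a hypothetical placeholder. Check the current AVS API reference before relying on any field.

```python
import json
import uuid


def build_recognize_event(dialog_request_id: str) -> dict:
    """Build the JSON part of a SpeechRecognizer.Recognize event (client to AVS)."""
    return {
        "event": {
            "header": {
                "namespace": "SpeechRecognizer",
                "name": "Recognize",
                "messageId": str(uuid.uuid4()),
                "dialogRequestId": dialog_request_id,
            },
            "payload": {
                "profile": "FAR_FIELD",  # assumed profile for a far-field product
                "format": "AUDIO_L16_RATE_16000_CHANNELS_1",
            },
        }
    }


def handle_directive(message: dict) -> str:
    """Route a directive (AVS to client) by namespace and name.

    The returned strings stand in for real handlers; they are hypothetical.
    """
    header = message["directive"]["header"]
    handlers = {
        ("SpeechSynthesizer", "Speak"): "play synthesized speech",
        ("AudioPlayer", "Play"): "start streaming audio",
    }
    return handlers.get((header["namespace"], header["name"]), "unsupported directive")


event = build_recognize_event("dialog-1")
print(json.dumps(event, indent=2))
```

Keeping the routing in a lookup table means a new directive type only needs a new table entry, which mirrors how the client reacts to instructions rather than deciding behavior itself.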

“Alexa, Play Music”
[Diagram: request flow between the device and the Alexa Voice Service]
• Device: speech capture via microphone, wake word engine, playback through speaker
• Device to AVS: “Recognize” event carrying the user’s utterance
• Alexa Voice Service: speech platform (ASR, NLU, TTS), skills, and music service, linked by recognition result, intent, and Text/SSML
• AVS to device: Alexa voice response and “Play” directive with a streaming URL

How do you build Alexa into your product?

Continuum of reference designs
• Development kit: Intel Speech Enabling Developer Kit
• AVS-enabled module: systems integrator, AVS module
• Form factor reference design: FFRD
• White box design: smart speaker puck
• End device: your product

Hardware architecture – What’s in an AVS client?
[Diagram: hardware blocks of an AVS client]
• Microphone array feeding four ADCs
• DSP running the audio algorithms
• SoC running the audio drivers, WWE, and AVS client, with Wi-Fi/Ethernet
• DAC and amp for audio output

Interaction models – introducing terms
• Touch activated
• Hands-free voice activated
• Far-field voice activated

Choosing the right audio front end
• Number of microphones
• Microphone arrangement
• Audio algorithms

Audio algorithms
Wake word engine (WWE): Identifies the “Alexa” wake word to start sending speech to the cloud
• Model size: Bigger is usually better, but requires more memory on the client
• Cloud-based verification: AVS double-checks your client’s WWE

Audio algorithms
Noise reduction: Remove background noise for clear speech capture.
[Chart: response accuracy versus increasing SNR, with curves for far-field and hands-free]
SNR = signal-to-noise ratio
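
SNR is usually quoted in decibels as ten times the base-10 logarithm of the ratio of speech power to noise power. A minimal sketch, assuming each signal is available as a list of samples (the sample values below are invented for illustration):

```python
import math


def power(samples):
    """Mean power of a block of samples (mean of squared amplitudes)."""
    return sum(s * s for s in samples) / len(samples)


def snr_db(speech, noise):
    """SNR in decibels: 10 * log10(P_speech / P_noise)."""
    return 10.0 * math.log10(power(speech) / power(noise))


speech = [0.5, -0.5, 0.5, -0.5]      # mean power 0.25
noise = [0.05, -0.05, 0.05, -0.05]   # mean power 0.0025
print(round(snr_db(speech, noise), 1))  # 20.0
```

A speech amplitude ten times the noise amplitude gives a power ratio of 100, which is 20 dB.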

Audio algorithms
Problem: Audio feedback makes it difficult to pick out speech.
[Diagram: Voice A is picked up by Voice B’s mic]
[Diagram: music playing on the device; the speaker output is picked up by the on-device microphone together with the voice of the user]

Audio algorithms
Acoustic Echo Cancellation (AEC): Use audio feedback to cancel echo and improve SNR.
[Diagram: music playing on the device; the on-device microphone captures both the speaker output and the voice of the user; AEC runs among the audio algorithms on the DSP]
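
One common way to build an echo canceller is a normalized least-mean-squares (NLMS) adaptive filter: it learns the speaker-to-microphone path from the playback signal and subtracts the predicted echo from the microphone signal. The sketch below illustrates that general technique only. It is not the algorithm used in any particular dev kit, and the tap count, step size, and synthetic signals are assumptions chosen for the demo.

```python
import math
import random


def nlms_echo_cancel(mic, ref, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptive estimate of the echo of `ref` from `mic`."""
    w = [0.0] * taps   # estimated echo path (FIR coefficients)
    x = [0.0] * taps   # most recent reference samples, newest first
    out = []
    for d, r in zip(mic, ref):
        x = [r] + x[:-1]
        echo_est = sum(wi * xi for wi, xi in zip(w, x))
        e = d - echo_est                      # residual after echo removal
        norm = sum(xi * xi for xi in x) + eps
        w = [wi + mu * e * xi / norm for wi, xi in zip(w, x)]
        out.append(e)
    return out


# Demo with synthetic signals (all values are made up for illustration).
random.seed(0)
ref = [random.uniform(-1.0, 1.0) for _ in range(2000)]   # playback sent to the speaker
echo_path = [0.6, 0.3, 0.1]                              # toy speaker-to-mic response
echo = [sum(h * ref[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
        for n in range(len(ref))]
voice = [0.1 * math.sin(2.0 * math.pi * 0.01 * n) for n in range(len(ref))]
mic = [e + v for e, v in zip(echo, voice)]
cleaned = nlms_echo_cancel(mic, ref)   # approaches `voice` as the filter adapts
```

Normalizing the update by the reference energy keeps the adaptation stable when playback volume changes, which matters for a device that plays music at varying levels.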

Audio algorithms
Beamforming: Locate the source of speech and pick it out of background noise.

Audio algorithms
Beamforming: Orientation matters!
[Diagram: array orientations at 15° and 90°]
• Circular arrays work best at an angle of less than 15°, such as on a table.
• Wall or ceiling mounting may negatively affect beamforming.
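
The simplest beamformer is delay-and-sum: compute when a wavefront from the chosen direction reaches each microphone, shift the channels so those arrivals line up, and average. The sketch below shows that textbook technique for a circular array lying flat, with a far-field source in the same plane. It is not any vendor's implementation, and the speed of sound and the array parameters in the usage line are assumptions.

```python
import math


def arrival_delays(n_mics, radius_m, azimuth_deg, fs_hz, c=343.0):
    """Arrival time of a far-field plane wave at each mic, in samples,
    relative to the array center (negative means it arrives earlier)."""
    theta = math.radians(azimuth_deg)
    delays = []
    for m in range(n_mics):
        phi = 2.0 * math.pi * m / n_mics   # mic position angle on the circle
        delays.append(-radius_m * math.cos(phi - theta) / c * fs_hz)
    return delays


def delay_and_sum(channels, delays):
    """Align channels using integer-rounded delays, then average them."""
    latest = max(delays)
    shifts = [round(latest - d) for d in delays]   # extra delay per channel
    n = len(channels[0])
    out = []
    for t in range(n):
        acc = 0.0
        for ch, s in zip(channels, shifts):
            if 0 <= t - s < n:
                acc += ch[t - s]
        out.append(acc / len(channels))
    return out


# Hypothetical 8-mic array, 4 cm radius, 16 kHz sampling, source at 30 degrees.
print(arrival_delays(8, 0.04, 30.0, 16000))
```

Speech from the steered direction adds coherently while sound from other directions adds out of step, which is why tilting or wall-mounting an array designed for a tabletop changes the delays the beamformer expects.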

Microphone arrangement
• Linear array for wall-mounted
• Circular array for tabletop

Number of microphones
• One mic for touch activated
• Two or more mics for hands-free and far-field

Number of microphones
[Chart: response accuracy versus increasing SNR for far-field 7-mic, far-field 4-mic, and hands-free 2-mic kits]
• Two-mic dev kits perform comparably.
• More mics are usually better, but result in diminishing returns.
• In high-SNR environments, fewer mics work adequately.
• As SNR decreases, even seven-mic dev kits eventually lose accuracy.
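
The diminishing returns can be illustrated with the textbook array-gain figure for a delay-and-sum array: if the speech adds coherently across N mics and the noise at each mic is uncorrelated and equally strong, the SNR improves by 10·log10(N) dB. This is an idealized model stated as an assumption, not a measurement from any dev kit, and real rooms fall short of it.

```python
import math


def ideal_array_gain_db(n_mics):
    """Idealized SNR gain of an N-mic delay-and-sum array, in dB."""
    return 10.0 * math.log10(n_mics)


for n in (1, 2, 4, 7, 8):
    print(n, round(ideal_array_gain_db(n), 1))
# 1 0.0
# 2 3.0
# 4 6.0
# 7 8.5
# 8 9.0
```

Going from one mic to two buys about 3 dB in this model, while going from seven to eight buys about 0.6 dB.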

Development kits for AVS
• Integrate high-performance voice capture
• Dev kits include mic arrays, DSPs, SoCs
• Supported by leading chipset providers

AVS Development Kits (hands-free and far-field)
• Intel: 8-mic circular array
• XMOS: 4-mic linear array
• Amazon: 7-mic circular array
• Synaptics: 4-mic square array
• Cirrus Logic: 2-mic linear array
• Microsemi: 2-mic linear array
• Conexant: 2-mic linear array

Intel Speech Enabling Developer Kit
Nithyananda Jeganathan (Siva)
Product Manager, Smart Home Group
Intel Corporation
Intel helpers for the session: Sathish Kuttan, Roberta Zouain, Ashwin Patti, Alex Z, Yen-ning Chang, Sarah Bienvenue, Josh Cobb

What’s included in the Intel Dev Kit?
• Intel dual DSP with inference engine board
• AVS sample client
• 8-mic array board
• Raspberry Pi™ connector cable
• Audio pre-processing algorithms, WWE, and software drivers
• Documentation, collateral, and software tools

Why use the Intel developer kit?
• The dev kit comes with premium far-field audio certification from Amazon
• Provides a solid foundation to build new experiences
• Accelerates new product development by leveraging Intel’s expertise
• Intel will assist with audio tuning and optimization for the product

Intel Developer Kit: How It Works
[Diagram: mics and speakers connect to the dual DSP with inference engine (Intel Audio IP firmware) on the Intel Speech Enabling Developer Kit; the DSP connects over SPI to the Raspberry Pi, which runs the audio driver, AVS SDK, and AVS client; the AVS client talks to the AVS cloud (Alexa Voice Service, speech recognition algorithms, cloud apps, cloud skills). Legend: SW, FW, HW. Numbers 1 to 7 mark the steps below.]
1. The user utters the “Alexa” keyword, and the mic array picks up the utterance.
2. Intel’s far-field audio IP performs beamforming and noise cancellation, detects the “Alexa” keyword, and sends an event signal to the AVS SDK.
3. The keyword engine in the AVS SDK receives the event and notifies the AVS client to open the channel to the driver.
4. Audio data from the driver is sent to the cloud by the AVS client.
5. Speech recognition AI algorithms convert the audio to text and pass information to the relevant apps for processing.
6. Cloud apps return a response that is sent back to the AVS client for playback.
7. The AVS client plays the response from the cloud over the speaker.
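
The seven steps above can be read as a small state machine on the client. The sketch below is a hypothetical simplification for illustration: the state and event names are invented here and do not come from the AVS SDK or the Intel firmware.

```python
from enum import Enum, auto


class State(Enum):
    IDLE = auto()        # DSP listens for the wake word
    STREAMING = auto()   # AVS client streams audio to the cloud
    PLAYING = auto()     # response plays back over the speaker


# (state, event) -> next state; event names are hypothetical labels.
TRANSITIONS = {
    (State.IDLE, "wake_word_detected"): State.STREAMING,     # steps 1 to 3
    (State.STREAMING, "response_received"): State.PLAYING,   # steps 4 to 6
    (State.PLAYING, "playback_finished"): State.IDLE,        # step 7
}


def step(state, event):
    """Return the next state; events that do not apply leave the state unchanged."""
    return TRANSITIONS.get((state, event), state)


def run(events, state=State.IDLE):
    """Feed a sequence of events through the machine and record each state."""
    trace = [state]
    for event in events:
        state = step(state, event)
        trace.append(state)
    return trace


print(run(["wake_word_detected", "response_received", "playback_finished"]))
```

Ignoring events that do not apply in the current state means, for example, that a stray cloud response while idle does not start playback.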

Intel Design Optimization – Far Field on a Chip
[Diagram: AVS client hardware blocks: DSP with audio algorithms, DAC and amp, SoC with audio drivers, AVS client, and Wi-Fi/Ethernet; the WWE runs on the DSP]
• Wake word engine on the DSP for low-power product design
• Unique to the Intel developer kit
• Significant platform power savings can be realized
• Low-power Wake on Voice (WoV) feature has been enabled on the dual DSP

Intel Design Optimization – Beamforming
• The 8-mic circular array geometry allows for precise beamforming from all directions
• Ambient noise suppression outside of the signal beam
[Diagram: ambient noise, the “Alexa” wake word, and the utterance “What time is it in Vegas?”]

Intel Design Optimization – Acoustic Echo Cancellation (AEC)
• Processes a multichannel input signal and provides multichannel output to the algorithms
• AEC automatically adapts to room reverberation and playback signal changes
• Intel algorithms perform linear operations that do not introduce distortion
• Highly optimized for Intel Wake on Voice and Amazon Alexa performance

WORKSHOP ACTIVITY:
Build the Sample App on your Intel Speech Enabling Dev Kit
Open your hardware box and build your devkit!
Follow the instructions in your lab manual at:
https://avs-dvk-workshop.github.io/

Thank you!
Designing Far-Field Voice Solutions with Intel
Visit the Alexa developer portal for more information:
Documentation, tutorials, webinars, and more
https://developer.amazon.com/alexa
AVS: Alexa Voice Service
https://developer.amazon.com/avs
ASK: Create custom skills for Alexa
https://developer.amazon.com/ask
Smart home:
https://developer.amazon.com/alexa/smart-home

Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Designing Far-Field Speech Processing Systems with Intel and Amazon Alexa Voice Service - ALX305 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:Invent: Designing Far-Field Voice Solutions with Intel. November 30, 2017
  • 2. Meet the team! Chris Keyser, Principal Solutions Architect, Alexa Voice Service; Sanjay Devireddy, Software Development Engineer, Alexa Voice Service; Gagan Luthra, Sr. Product Marketing Manager, Alexa Voice Service; Fletcher Nelson, Solutions Architect, Alexa Voice Service; Navneet Malpani, Principal Solutions Architect, Alexa Voice Service
  • 3. The evolution of voice interaction
  • 4. The evolution of voice interaction • Single microphone • Always on
  • 5. The evolution of voice interaction • Single microphone • Always on • Push to talk • Built-in speaker
  • 6. The evolution of voice interaction • Single microphone • Always on • Push to talk • Built-in speaker • Voice activated • Noise cancellation
  • 7. The evolution of voice interaction • Single microphone • Always on • Push to talk • Built-in speaker • Voice activated • Noise cancellation • 360-degree mic coverage • Increased detection range
  • 8. Amazon Echo: The first Alexa-enabled device. “Amazon’s Echo might be the most important product in years” “Amazon Echo proves the viability of voice-based home computing.” “Amazon Alexa just conquered CES 2017. The world is next.” “Alexa is clearly the star of IFA 2017”
  • 9. Hi, Alexa… Hello!
  • 10. Smart home control • Timers and alarms • Cloud-based intelligence • Shopping • Music • News and information • Custom skills • Always getting smarter
  • 11. Amazon’s Alexa-enabled products
  • 12. Vision: Alexa anywhere • Home audio • Thermostats • Automobiles • Maker boards • Wearables • Small appliances • Smartphones • Large appliances • Home security • TV/video
  • 13. Alexa-enabled products
  • 14. Simplifying voice interaction • Directives: instructions sent from AVS to client • Events: information sent from client to AVS • Voice hardware: audio front end • Device software: AVS Device SDK
  • 15. Diagram: how “Alexa, Play Music” is handled. Labels: User’s Utterance; Speech Capture via Microphone; Wake Word Engine; “Recognize” Event; Alexa Voice Service; Speech Platform (ASR, NLU, TTS); Recognition Result; Intent; Skills; Music Service; Streaming URL; Text/SSML; Alexa Voice Response; “Play” Directive; Play back through Speaker
  • 16. How do you build Alexa into your product?
  • 17. Continuum of reference designs (diagram labels): White box design; End device; Smart speaker puck; Your product
  • 18. Continuum of reference designs (diagram labels): Form factor reference design; White box design; End device; FFRD; Smart speaker puck; Your product
  • 19. Continuum of reference designs (diagram labels): AVS-enabled module; Form factor reference design; White box design; End device; FFRD; Systems integrator AVS module; Smart speaker puck; Your product
  • 20. Continuum of reference designs (diagram labels): Development kit; AVS-enabled module; Form factor reference design; White box design; End device; FFRD; Systems integrator AVS module; Intel Voice Enabling Developer Kit; Smart speaker puck; Your product
  • 21. Hardware architecture – What’s in an AVS client? Diagram labels: Microphone array; ADC (×4)
  • 22. Hardware architecture – What’s in an AVS client? Diagram labels: Microphone array; ADC (×4); DSP; Audio algorithms; DAC; Amp
  • 23. Hardware architecture – What’s in an AVS client? Diagram labels: Microphone array; ADC (×4); DSP; Audio algorithms; DAC; Amp; SoC; Audio drivers; WWE; AVS client; Wi-Fi/Ethernet
  • 24. Interaction models – introducing terms: Touch activated
  • 25. Interaction models – introducing terms: Hands-free voice activated
  • 26. Interaction models – introducing terms: Far-field voice activated
  • 27. Choosing the right audio front end • Number of microphones • Microphone arrangement • Audio algorithms
  • 28. Audio algorithms. Wake word engine (WWE): Identifies the “Alexa” wake word to start sending speech to the cloud. • Model size: Bigger is usually better, but requires more memory on the client • Cloud-based verification: AVS double-checks your client’s WWE
  • 29. Audio algorithms. Noise reduction: Remove background noise for clear speech capture. SNR = signal-to-noise ratio. Chart: response accuracy vs. increasing SNR, for far-field and hands-free.
  • 30. Audio algorithms. Problem: Audio feedback makes it difficult to pick out speech. Diagram labels: Voice A; Voice B; Voice A picked up by Voice B’s mic
  • 31. Audio algorithms. Problem: Audio feedback makes it difficult to pick out speech. Diagram labels: Music playing on device; Speaker output; Speaker output picked up by device microphone; Microphone on device; Voice of user
  • 32. Audio algorithms. Acoustic Echo Cancellation (AEC): Use audio feedback to cancel echo and improve SNR. Diagram labels: Music playing on device; Speaker output; Voice of user; Microphone on device; DSP; Audio algorithms; AEC
  • 33. Audio algorithms. Beamforming: Locate the source of speech and pick it out of background noise. Diagram labels: First; Last
  • 34. Audio algorithms. Beamforming: Locate the source of speech and pick it out of background noise. Diagram labels: First; Last
  • 35. Audio algorithms. Beamforming: Orientation matters! Diagram angles: 90°, 90°, 15° • Circular arrays work best at an angle of less than 15°, such as on a table. • Wall or ceiling mounting may negatively affect beamforming.
  • 36. Microphone arrangement • Linear array for wall-mounted • Circular array for tabletop
  • 37. Number of microphones • One mic for touch activated • Two or more mics for hands-free and far-field
  • 38. Number of microphones. Chart: response accuracy vs. increasing SNR, with curves for far-field (7 mic), far-field (4 mic), and hands-free (2 mic). • Two-mic dev kits perform comparably. • More mics are usually better, but with diminishing returns. • In high-SNR environments, fewer mics work adequately. • As SNR decreases, even seven-mic dev kits eventually lose accuracy.
  • 39. Development kits for AVS • Integrate high-performance voice capture • Dev kits include mic arrays, DSPs, and SoCs • Supported by leading chipset providers
  • 40. AVS Development Kits (grouped on the slide as hands-free and far-field): Intel, 8-mic circular array; XMOS, 4-mic linear array; Amazon, 7-mic circular array; Synaptics, 4-mic square array; Cirrus Logic, 2-mic linear array; Microsemi, 2-mic linear array; Conexant, 2-mic linear array
  • 41. Intel Speech Enabling Developer Kit. Nithyananda Jeganathan (Siva), Product Manager, Smart Home Group, Intel Corporation. Intel helpers for the session: Sathish Kuttan, Roberta Zouain, Ashwin Patti, Alex Z, Yen-ning Chang, Sarah Bienvenue, Josh Cobb
  • 42. What’s included in the Intel Dev Kit? • Intel Dual DSP w/ inference engine board • AVS Sample Client • 8-Mic Array Board • Raspberry Pi™ connector cable • Audio Pre-Processing Algorithms, WWE, and Software Drivers • Documentation, Collaterals, and Software Tools
  • 43. Why use Intel developer kit? • Dev kit comes with premium far-field audio certification from Amazon • Provides a solid foundation to build new experiences • Accelerate new product development by leveraging Intel’s expertise • Intel to assist with audio tuning and optimization for the product
  • 44. Intel Developer Kit: How It Works. Diagram labels: Intel Speech Enabling Developer Kit (Mics; Speakers; Dual DSP w/ inference engine; Intel Audio IP FW); SPI; Raspberry Pi (Audio driver; AVS SDK; AVS Client); AVS Cloud (Alexa voice services; Speech recognition algos; Cloud apps; Cloud skills); Legend: SW, FW, HW. Steps: 1. User utters the “Alexa” keyword, and the mic array picks up the utterance. 2. Intel’s far-field audio IP performs beamforming and noise cancellation, detects the “Alexa” keyword, and sends an event signal to the AVS SDK. 3. The keyword engine in the AVS SDK receives the event and notifies the AVS client to open the channel to the driver. 4. Audio data from the driver is sent to the cloud by the AVS client. 5. Speech recognition AI algorithms convert audio to text and pass information to relevant apps for processing. 6. Cloud apps return a response that is sent back to the AVS client for playback. 7. The AVS client plays the response from the cloud over the speaker.
  • 45. Intel Design Optimization – Far Field on a Chip. Diagram labels: DSP; Audio Algorithms; WWE; DAC; Amp; SoC; Audio Drivers; AVS Client; Wi-Fi/Ethernet • Wake Word Engine on DSP for low-power product design • Unique to Intel developer kit • Significant platform power savings can be realized • Low-power Wake on Voice (WoV) feature has been enabled on the Dual DSP
  • 46. Intel Design Optimization – Beamforming • 8-mic circular array geometry allows for precise beamforming from all directions • Ambient noise suppression outside of the signal beam. Diagram labels: Ambient noise; “Alexa” wake word; “What time is it in Vegas?”
  • 47. Intel Design Optimization – Acoustic Echo Cancellation (AEC) • Processes multichannel input signal and provides multichannel output to algorithms • AEC automatically adapts to room reverberation and playback signal changes • Intel algorithms perform linear operations that do not introduce distortion • Highly optimized for Intel Wake on Voice and Amazon Alexa performance
  • 48. Workshop activity: Build the Sample App on your Intel Speech Enabling Dev Kit. Open your hardware box and build your dev kit! Follow the instructions in your lab manual at: https://avs-dvk-workshop.github.io/
  • 49. Thank you! Designing Far-Field Voice Solutions with Intel. Visit the Alexa developer portal for more information: documentation, tutorials, webinars, and more: https://developer.amazon.com/alexa AVS: Alexa Voice Service https://developer.amazon.com/avs ASK: Create custom skills for Alexa https://developer.amazon.com/ask Smart home: https://developer.amazon.com/alexa/smart-home
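
The slides on noise reduction, beamforming, and number of microphones all lean on signal-to-noise ratio (SNR) without showing how it is computed or why more microphones help. The following is a minimal sketch of those two ideas, not the algorithm used by Intel or Amazon: it computes SNR in decibels and applies textbook delay-and-sum beamforming. Everything in it is an assumption made for illustration: a sine wave stands in for speech, noise is independent Gaussian noise per microphone, the per-microphone arrival delays are whole samples and known in advance, and the names `snr_db`, `delay_and_sum`, and `delayed` are hypothetical.

```python
import math
import random


def snr_db(signal, noise):
    """Signal-to-noise ratio in decibels: 10 * log10(P_signal / P_noise)."""
    p_signal = sum(s * s for s in signal) / len(signal)
    p_noise = sum(x * x for x in noise) / len(noise)
    return 10.0 * math.log10(p_signal / p_noise)


def delay_and_sum(channels, delays):
    """Undo each microphone's arrival delay (in samples), then average."""
    length = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[i + d] for ch, d in zip(channels, delays)) / len(channels)
        for i in range(length)
    ]


def delayed(samples, d):
    """Return the samples shifted later in time by d samples (zero padded)."""
    return [0.0] * d + samples[: len(samples) - d]


# Illustrative setup (all values are assumptions): 4 mics, the talker's
# wavefront reaches mic k two samples later than mic k - 1.
random.seed(0)
N_MICS, N = 4, 2000
delays = [2 * k for k in range(N_MICS)]
total = N + max(delays)

clean = [math.sin(2 * math.pi * 0.01 * i) for i in range(total)]
speech = [delayed(clean, d) for d in delays]
noise = [[random.gauss(0.0, 0.5) for _ in range(total)] for _ in range(N_MICS)]

# Beamforming is linear, so speech and noise can be processed separately
# to measure the SNR before and after.
snr_one_mic = snr_db(speech[0][:N], noise[0][:N])
snr_beamformed = snr_db(delay_and_sum(speech, delays),
                        delay_and_sum(noise, delays))

print(f"single mic: {snr_one_mic:.1f} dB, beamformed: {snr_beamformed:.1f} dB")
```

Under these assumptions the aligned speech adds coherently while the uncorrelated noise partly averages out, so the gain grows roughly as 10·log10(number of mics) dB. That is consistent with the slide's point that more microphones help but with diminishing returns: doubling the mic count adds only about 3 dB each time.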