1. RealSpeaker - audio-visual enhancement to - RealSpeaker audio-visual enhancement to speech recognition system
PROBLEM
Inability to Suppress Ambient
Noise (audio is not reliable
source of information)
High Cost of Voice Recognition
Applications (Nuance licenses
average costs from $100 to
$1000)
Issues with Accuracy (with
accuracy 60-70% their paintfull
to use)
Low Level of Security in
Speaker Verification
Users must speak in unnatural
fashion using fragmented
speech (the problem with
usability)
EXISTING ALTERNATIVES
Keyboard typing
DragonDictation for PC or Mac
(very expensive costs)
Google Speech recognition
(free using by default on
Android OS - only for short voice
commands)
Windows speech recognition
(free using by default on
Windows OS - the problem with
accuracy - only for short voice
commands)
Siri voice assistent on iOS (free
using by default on Iphones -
only for short voice commands)
SOLUTION
RealSpeaker uses additional
video information, which allows
to improve voice recognition
accuracy by at least 20-30 per
cent.
More safety because
RealSpeaker have function of
audio-video verification
speaker's speech from the
overall speech flow
RealSpeaker cheaper than
Nuance. Licences costs from
$25 to $90
RealSpeaker have functions of
voice editing and sending -
more usability than Nuance
UNIQUE VALUE
PROPOSITION
The average typing speed is 33
words a minute. By the means of
RealSpeaker can record over
100 words a minute
We have paid users, working
prototype and good traction
Multilanguage - (over 13
languages supported today)
Enter text of any length with the
voice and video without
keyboard at any text editor or
website (Notes, Facebook,
Skype, Evernote, E-mail,
Microsoft Office etc.)
HIGH-LEVEL CONCEPT
RealSpeaker - audio-visual
enhancement to speech
recognition systems
OpenCV - video processing
library originally developed by
Intel
Nuance - speech recognition
company
Google Voice Search - speech
recognition engine
CMU Sphinx - speech
recognition system with Open
Source code
GoogleGlass SDK - video
processing library
Kinect SDK - video processing
library
UNFAIR ADVANTAGE
US Patent 13/942,689: “System
of video enhancement for audio
speech recognition solutions to
improve the accuracy of audio
speech recognition due to the
analysis of speaker lip
movements”
Our team is supported by such
organizations and institutes as
Microsoft Seed Fund, Skolkovo,
Startobaza, Kazan IT-Park. We
have NDA with Samsung, LG,
Toyota, Itouchu.
We exists almost 2 years and
have a team of 10 people - our
working place is based in Kazan
(Russia) - cheap place with
good professional
Our technology can be integrate
at any electronic devices
We have own database - video
how its work -
http://youtu.be/TQaVWTqGCjs
CUSTOMER SEGMENTS
Adults group: disablity people
Professional segment: SEO,
journalists, writers, bloggers,
students, teachers, coachers,
mentors, research specialists,
focus groups
Active segment: businessmen,
teenagers, geeks
According to TechNavio
currently, only 15-20% of the
speech recognition market
potential is used - need only to
create customer product with
high accuracy speech
recognition
EARLY ADOPTERS
Bloggers, journalists, robotic
geeks, journalists - our first
testers
We have released beta version
of our product for Windows OS,
which is currently in use by
about 50k users, out of which 2k
users are paid users
KEY METRICS
Funding: - Current Burn Rate -
$1M - Seeds Round: $0,5M in
2012-2013 o $0,3M – in 2012 o
$0,2M – in 2013 - Seeking 1st
Round: $1,0M
about 50k users, out of which 2k
users are paid users (December
2013)
CHANNELS
Viral channel
www.realspeaker.net (prizes on
our site)
Social servises: YouTube,
Facebook or others
Free Torrents (spread trial
version of RealSpeaker)
Software Vendors (MailRu
Group , Digital River)
2. Lean Canvas is adapted from The Business Model Canvas (BusinessModelGeneration.com) and is licensed under the Creative Commons Attribution-Share Alike 3.0 Un-ported License.
COST STRUCTURE
Cost per license in time:
$25 for 3 months
$30 for 6 months
$37 for 1 year
$90 unlimited version
Integration at any service - royalty from sales
REVENUE STREAMS
Revenue: - 2012: $0,025M - 2013: $0,1M - 2014: $0,5M - 2015: $15M - 2016: $100M
B2C segment
Business Model: Try & Buy Free version – can recognize speech to text for 3 days; 4 %
conversion. 50 k free users, 2 k paid users in December of 2013
B2B
We have NDA with Samsung, LG, Toyota, Itouchu.