The document describes how WebRTC technology can be used to enable real-time video calls between doctors and patients. It outlines a solution built using Twilio, IBM Watson, and other Bluemix services. The solution allows a patient to initiate a video call with their doctor, with the patient's audio being transcribed in real-time using Watson Speech to Text. The transcription is stored and analyzed for personality insights. It is also sent back to the doctor. The solution was built in 3 days, but faced some challenges around ordering of calls, browser compatibility, scaling websockets, and dropping connections to Watson.
9. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
10. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
11. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
5. Doctor and
patient video
chat
12. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
5. Doctor and
patient video
chat
13. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
5. Doctor and
patient video
chat
14. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
5. Doctor and
patient video
chat
15. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
Cloudant NoSQL DB
9. Transcribed audio is
stored
5. Doctor and
patient video
chat
16. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
Cloudant NoSQL DB
9. Transcribed audio is
stored
5. Doctor and
patient video
chat
10. Determine personality of
the patient
Watson Personality
Insights
17. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
Cloudant NoSQL DB
9. Transcribed audio is
stored
5. Doctor and
patient video
chat
11. Transcribed audio is
sent back to the doctor
in real-time
10. Determine personality of
the patient
Watson Personality
Insights
18. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
Cloudant NoSQL DB
9. Transcribed audio is
stored
5. Doctor and
patient video
chat
11. Transcribed audio is
sent back to the doctor
in real-time
10. Determine personality of
the patient
Watson Personality
Insights
19. Web Page
Patient
1. Click to call
Doctor
Twilio Video Chat
2. Initiate
WebRTC call
Doctor
3. Doctor
comes online
for
appointment
4. Call
established
Node.Js app in Bluemix
6. Patient audio stream
sent over web sockets
to node backend
Watson Speech
to Text
7. Patient audio stream
sent over web sockets
to IBM Watson
8. Audio transcribed in
real time
Cloudant NoSQL DB
9. Transcribed audio is
stored
5. Doctor and
patient video
chat
11. Transcribed audio is
sent back to the doctor
in real-time
10. Determine personality of
the patient
Watson Personality
Insights
26. Record the local stream
// Slides 26–27: capture the caller's local media stream, tap its audio, and
// open a Socket.IO connection to the Node.js backend. The stream is routed
// through a GainNode into a ScriptProcessorNode so every raw sample buffer
// can be intercepted (see _onaudioprocess) before being streamed to Watson.
// NOTE(review): scraped slide code — relies on globals declared elsewhere
// (blah, audioContext, microphone, mySocket, connected, bufferSize,
// inputChannels, outputChannels, onstart, showResult, io).
// createScriptProcessor is deprecated in modern browsers (AudioWorklet is
// the replacement) — confirm against the full source before reusing.
function startRecording(myStream) {
// Global reference to the stream (name retained from the original demo).
blah = myStream;
audioContext = new AudioContext();
var gain = audioContext.createGain();
var audioInput = audioContext.createMediaStreamSource(myStream);
audioInput.connect(gain);
// The ScriptProcessorNode fires onaudioprocess for each buffer of samples.
microphone = audioContext.createScriptProcessor(bufferSize, inputChannels,
outputChannels);
microphone.onaudioprocess = _onaudioprocess.bind(this);
gain.connect(microphone);
// Must be connected to the destination or the processor never runs.
microphone.connect(audioContext.destination);
27. Open a socket to our Node.Js app
mySocket = io.connect();
mySocket.on('connect', function() {
console.log('socket.onconnect()');
connected = true;
// Signal that audio streaming can begin.
onstart();
});
// Transcription results come back from the Node app on 'message' events.
mySocket.on('message', function(msg){
console.log(msg);
showResult(msg, "local");
});
}
28. Convert the audio to PCM
// Per-buffer audio callback for the ScriptProcessorNode: copies channel 0
// of the captured buffer into a fresh Float32Array, encodes it via
// _exportDataBuffer, and hands the result to onAudio for transmission.
function _onaudioprocess(data) {
  var samples = data.inputBuffer.getChannelData(0);
  var snapshot = new Float32Array(samples);
  onAudio(_exportDataBuffer(snapshot));
}
29. Emit a web socket message
// Forward one encoded PCM chunk to the Node backend over Socket.IO,
// tagged with the capture sample rate so the server can build the
// 'audio/l16; rate=N' content type for Watson.
// BUG FIX: the original read `microphone.sampleRate`, but a
// ScriptProcessorNode has no `sampleRate` property, so `rate` was always
// undefined and the server silently fell back to 48000 Hz. The sample rate
// lives on the AudioContext (BaseAudioContext.sampleRate).
function onAudio(data) {
  if (mySocket.connected) {
    mySocket.emit('message', { audio: data, rate: audioContext.sampleRate });
  }
}
30. Initiate a websocket connection from Node to IBM Watson
// Node-side handler for audio messages arriving from the browser.
// First message: open the Watson Speech to Text session. Disconnect flag:
// close the upstream request. Otherwise: forward the raw audio bytes.
// NOTE(review): `session`, `speechToText`, and `observe_results` are
// defined elsewhere in the full source.
socket.on('message', function(data) {
  if (!session.open) {
    session.open = true;
    var payload = {
      session_id: session.session_id,
      cookie_session: session.cookie_session,
      content_type: 'audio/l16; rate=' + (data.rate || 48000),
      continuous: true,
      interim_results: true
    };
    // Open a streaming connection to IBM Watson.
    session.req = speechToText.recognizeLive(payload, observe_results(socket, true));
    // Listen for interim/final results on the same session.
    speechToText.observeResult(payload, observe_results(socket, false));
    // BUG FIX: the original dropped the very first audio buffer — the
    // message that triggered session setup was never written to Watson.
    if (data.audio) {
      session.req.write(data.audio);
    }
  } else if (data.disconnect) {
    session.req.end();
  } else {
    session.req.write(data.audio);
  }
});
31. Send transcribed text from Watson to web client
// Factory producing the callback handed to the Watson speech client.
// `recognize_end` selects between the end-of-recognition callback (closes
// the client socket when recognition finishes) and the streaming-results
// callback (relays each transcript chunk to the browser as it arrives).
var observe_results = function(socket, recognize_end) {
  var session = sessions[socket.id];
  return function(err, chunk) {
    if (err) {
      // Surface the failure to the client, then tear the session down.
      console.log(log(socket.id), 'error:', err);
      socket.emit('onerror', {
        error: err
      });
      session.req.end();
      socket.disconnect();
      return;
    }
    var hasTranscript = Boolean(chunk && chunk.results && chunk.results.length > 0);
    if (hasTranscript && !recognize_end) {
      // Relay the transcript chunk to the web client in real time.
      socket.emit('message', chunk);
    }
    if (recognize_end) {
      socket.disconnect();
    }
  };
};
32. Receive the transcribed text from our Node app
// Slide 32: render transcription results received from the Node app into
// the transcript panel for the current patient.
// NOTE(review): relies on jQuery ($) and the global `patientId`, and
// assumes the ".transcript .<streamLocation>.text" container already holds
// at least one <p> — otherwise children().last() is an empty jQuery set and
// the first interim transcript is silently dropped. Confirm against the
// page markup.
function showResult(data, streamLocation) {
var textElement = $("#collapse-" + patientId + " .transcript ." + streamLocation + ".text");
textElement.show();
textElement.parent().show();
// Only act when the payload actually carries transcripts.
if (data.results && data.results.length > 0) {
// A single-result payload is an interim or final transcript update.
if (data.results.length === 1 ) {
var paragraph = textElement.children().last(),
text = data.results[0].alternatives[0].transcript || '';
// Capitalize the first word.
text = text.charAt(0).toUpperCase() + text.substring(1);
// On a final result, terminate the sentence and append a fresh empty <p>;
// the finalized text lands in the previous paragraph while the new one
// collects the next interim update.
if (data.results[0].final){
text = text.trim() + '.';
$('<p></p>').appendTo(textElement);
}
paragraph.text(text);
}
}
}