SlideShare a Scribd company logo
1 of 17
Download to read offline
Human-AI communication for human-human communication:


Applying interpretable unsupervised anomaly detection to executive coaching
(equal contribution)
CHAI Workshop @ IJCAI '22


July 24, 2022
Riku Arakawa†


Carnegie Mellon University, USA
Hiromu Yakura†


University of Tsukuba, Japan
Background: Deep-learning-based human behavior analysis
Advancement in human behavior
analysis techniques:


・Facial expression recognition [1]


・Posture estimation [2]
[1] I. Çugu, et al., 2017. MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Frontal Face Images. arXiv.


[2] S.-E. Wei, et al., 2016. Convolutional Pose Machines. IEEE CVPR.
It is expected that we can analyze
and support human communication
by applying these techniques.
2
Background: A tool for helping public speaking with feedback
[3] M. I. Tanveer, et al., 2015. A Real-Time In-Situ Intelligent Interface to Help People With Public Speaking. ACM IUI.


[4] I. Damian, et al., 2015. Measuring the impact of multimodal behavioural feedback loops on social interactions.. ACM ICMI.
Speech-feature-based feedback [3]
Show feedback such as “louder”


and “faster” on a Google Glass


based on speech speed or volume.
Posture-based feedback [4]
Alert a speaker when they cross


their arm for a long time


based on posture estimation.
Our perspective: Limitation of heuristic approach
Human-to-human communication is very contextual:
[5] J. Navarro and M. Karlins, 2008. What Every BODY Is Saying: An Ex-FBI Agent’s Guide to Speed Reading People. HarperCollins, New York.


[6] R Friedman and A. J. Elliot, 2008. The effect of arm crossing on persistence and performance. Europ. J. Soc. Psych.
Heuristic approach
Unsupervised approach


w/o rules or training data
4
Defensive attitude [5] Deeply thinking [6]
Thus, we need a new framework of human-AI communication:
Supervised approach w/


training data of numerous classes
Research object: Executive coaching
• It consists of one-on-one conversation, in
which coaches are required to observe the
nonverbal behavior of coachees [7].


• The importance of observing nonverbal
behavior is emphasized in terms of reading
the nuance of what the coachee said [8].
But, notifying the detection of specific postures (e.g., crossing arms)


or emotions (e.g., confusing) without context was not appreciated.
[7] E. Cox, et al., 2009. The Complete Handbook of Coaching. SAGE Publications, Los Angeles.


[8] D. B. Drake, 2009. Narrative coaching. In The Complete Hand- book of Coaching. SAGE Publications, Los Angeles. 5
We hypothesized that AI can help novice coaches in the observation process.
Key idea: Separating observation and judgement
Coaches ignored the outputs
once the outputs contradicted


their observation or intuition.
They found it difficult to rely on
outputs based on simplified classes
that are indifferent to subtle context.
Human


Pros: Good at understanding context


Cons: Difficult to keep stable perspective


due to their skills or mental load




AIs


Pros: Stable performance


Cons: Not good at dealing with context
Separation of observation and
judgment would be an alternative


way of human-AI communication.
This guided us to reframe the way of


human-AI communication:
6
REsCUE: Real-time feedback using anomaly detection
1. Extract posture and gaze
information of the coachee.


2. Calculate outlierness score using
anomaly detection algorithm.


3. Notify the coach in real-time with
an interpretive visualization.
We developed a supporting system that observes


the nonverbal behavior of coachees using unsupervised anomaly detection.
It detects informative cues of the behavior and notifies the coach in real-time.
Detailed workflow
7
• The GMM gradually adapts to newly obtained nonverbal behavior data.








• When the trend of the input data suddenly changes,


it is detected by the spike of negative log-likelihood.
REsCUE: How anomaly detection algorithm works
[61] Kenji Yamanishi, et al. 2004. On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms. Data Mining and Knowledge Discovery.
We use an algorithm based on a time-adaptive gaussian mixture model [9].
Time series behavior data of


the coachee taken from webcam:
The parameters of


GMM (e.g., mean and cov)
are updated with


a forgetting rate r.
REsCUE: Visualization based on GMM
The GMM allows us to provide interpretative visualization.
In GMM, each component fits


the past representative states.
Most anomalous frames can be


specified by sorting with the likelihoods.
Just by arranging these frames, the coach can compare them
and understand the change easily even during the session.
9
REsCIE: Detection results
10
These behaviors were detected without
any rules or heuristics and regarded as
informative by professional coaches.
The algorithm sometimes detected


apparent behavioral changes.


(e.g., taking a personal organizer out of a bag)
The visualization allows the coach to


interpret why the scene is detected,


which avoids destroying their trust.
Now, REsCUE is practically deployed
as a supporting system.
Lens of Parasuraman’s framework of automation
11
The design of our approach can be explained using Parasuraman's framework.
Information


acquisition
10: the computer decides everything,


acts autonomously, ignoring the human
1: the computer offers no assistance;


human must take all decisions and actions
Information


analysis
Decision & action


selection
Action


implementation
Realm of automation
human performance


automation reliability


cost of consequences
Trade-off between
Lens of Parasuraman’s framework of automation
12
The design of our approach can be explained using Parasuraman's framework.
Information


acquisition
Information


analysis
Decision & action


selection
Action


implementation
Realm of automation
10: the computer decides everything,


acts autonomously, ignoring the human
1: the computer offers no assistance;


human must take all decisions and actions
human performance


automation reliability


cost of consequences
Trade-off between
Lens of Parasuraman’s framework of automation
13
The design of our approach can be explained using Parasuraman's framework.
Information


acquisition
Information


analysis
Decision & action


selection
Action


implementation
Realm of automation
Low human performance:


• Dependency on the skills


or mental load


High automation reliability:


• No dependency on


heuristics or training data


Low cost of consequence:


• Interpretable visualization to


discern uninformative cues
This characteristic plot


of our approach came from ...
observation
Lens of Parasuraman’s framework of automation
14
The design of our approach can be explained using Parasuraman's framework.
Information


acquisition
Information


analysis
Decision & action


selection
Action


implementation
Realm of automation
High human performance:


• Good at dealing with context


Low automation reliability:


• Automatic interpretation can


be insensitive to subtle context


High cost of consequence:


• Risk of asking irrelevant questions


that disturbs the session
This characteristic plot


of our approach came from ...
interpretation
Application: Supporting skill transfer
The informativeness of the detected cues depends on the coach's skill:
15
Skillful coach gains information
from trifling behaviors.
Novice coach often disregards


such behaviors.
The difference in how each coach interprets the cues


reveals the difference in their skills.
This can be utilized for skill transfer of coaches by helping novice coaches to


learn how skillful coaches gain information from various behaviors.
Application: Supporting skill transfer
16
Annotation phase:
They classify whether each
detected cues is informative or not.
Skillful coach
Novice coach
Discussion phase:
Through the discussion about the discrepancies,


the novice coach can learn the way of interpretation.
The transparency of the results and the design of
allowing open-ended interpretation enable this tool.
Conclusion & On-going work
• We introduced a new framework of human-AI communication that is based on


the unsupervised anomaly detection algorithm.


• Its design of separating observation and interpretation enables human-AI
collaboration in highly contextual situations, such as executive coaching.


• Its interpretable visualization enabled by GMM provides transparency in


its detection results, which helps maintain trust with humans.
We remark that REsCUE does not require any prior
knowledge or rules and can be used in various domains.
Now, we are working on applying this
to analyzing sales communication
17
Read our


paper!

More Related Content

Similar to Human-AI communication for human-human communication / CHAI Workshop @ IJCAI '22

IODA - The Promise & Perils of Narrative Research
IODA - The Promise & Perils of Narrative ResearchIODA - The Promise & Perils of Narrative Research
IODA - The Promise & Perils of Narrative Research
Chris Fletcher
 
Personality Traits and Visualization Survey by Christy Case
Personality Traits and Visualization Survey by Christy CasePersonality Traits and Visualization Survey by Christy Case
Personality Traits and Visualization Survey by Christy Case
Christy C Langdon
 

Similar to Human-AI communication for human-human communication / CHAI Workshop @ IJCAI '22 (20)

For graduate students
For graduate studentsFor graduate students
For graduate students
 
Interpretable machine learning : Methods for understanding complex models
Interpretable machine learning : Methods for understanding complex modelsInterpretable machine learning : Methods for understanding complex models
Interpretable machine learning : Methods for understanding complex models
 
State representation learning for control: an overview
State representation learning for control: an overview State representation learning for control: an overview
State representation learning for control: an overview
 
Presentation Brain Perform
Presentation Brain PerformPresentation Brain Perform
Presentation Brain Perform
 
Virtual Characters Pres Ispi Masie
Virtual Characters Pres Ispi MasieVirtual Characters Pres Ispi Masie
Virtual Characters Pres Ispi Masie
 
ELearning Design and Rollout
ELearning Design and RolloutELearning Design and Rollout
ELearning Design and Rollout
 
Real time facial expression analysis using pca
Real time facial expression analysis using pcaReal time facial expression analysis using pca
Real time facial expression analysis using pca
 
Shanghai deep learning meetup 4
Shanghai deep learning meetup 4Shanghai deep learning meetup 4
Shanghai deep learning meetup 4
 
Who C.A.R.E.S.!? How Learning Got Its Bad Rap & What We Can Do About it.mp4
Who C.A.R.E.S.!? How Learning Got Its Bad Rap & What We Can Do About it.mp4Who C.A.R.E.S.!? How Learning Got Its Bad Rap & What We Can Do About it.mp4
Who C.A.R.E.S.!? How Learning Got Its Bad Rap & What We Can Do About it.mp4
 
Visual Saliency Model Using Sift and Comparison of Learning Approaches
Visual Saliency Model Using Sift and Comparison of Learning ApproachesVisual Saliency Model Using Sift and Comparison of Learning Approaches
Visual Saliency Model Using Sift and Comparison of Learning Approaches
 
Review On In-Context Leaning.pptx
Review On In-Context Leaning.pptxReview On In-Context Leaning.pptx
Review On In-Context Leaning.pptx
 
AI_Unit-4_Learning.pptx
AI_Unit-4_Learning.pptxAI_Unit-4_Learning.pptx
AI_Unit-4_Learning.pptx
 
Maximum Likelihood Estimation
Maximum Likelihood EstimationMaximum Likelihood Estimation
Maximum Likelihood Estimation
 
Effectiveness of Organizational Training
Effectiveness of Organizational TrainingEffectiveness of Organizational Training
Effectiveness of Organizational Training
 
Chapter 6 - Learning data and analytics course
Chapter 6 - Learning data and analytics courseChapter 6 - Learning data and analytics course
Chapter 6 - Learning data and analytics course
 
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptxRahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
Rahul_Kirtoniya_11800121032_CSE_Machine_Learning.pptx
 
IODA - The Promise & Perils of Narrative Research
IODA - The Promise & Perils of Narrative ResearchIODA - The Promise & Perils of Narrative Research
IODA - The Promise & Perils of Narrative Research
 
Km knowledge application.11
Km  knowledge application.11Km  knowledge application.11
Km knowledge application.11
 
ML-Chapter_one.pptx
ML-Chapter_one.pptxML-Chapter_one.pptx
ML-Chapter_one.pptx
 
Personality Traits and Visualization Survey by Christy Case
Personality Traits and Visualization Survey by Christy CasePersonality Traits and Visualization Survey by Christy Case
Personality Traits and Visualization Survey by Christy Case
 

More from Hiromu Yakura

CTFはとんでもないものを 盗んでいきました。私の時間です…
CTFはとんでもないものを 盗んでいきました。私の時間です…CTFはとんでもないものを 盗んでいきました。私の時間です…
CTFはとんでもないものを 盗んでいきました。私の時間です…
Hiromu Yakura
 

More from Hiromu Yakura (20)

How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021
 
機械学習を"良く"使うには
機械学習を"良く"使うには機械学習を"良く"使うには
機械学習を"良く"使うには
 
自己紹介@名状しがたいお茶会
自己紹介@名状しがたいお茶会自己紹介@名状しがたいお茶会
自己紹介@名状しがたいお茶会
 
Python 3のWebシステムでDDDに入門してみた
Python 3のWebシステムでDDDに入門してみたPython 3のWebシステムでDDDに入門してみた
Python 3のWebシステムでDDDに入門してみた
 
Robust Audio Adversarial Example for a Physical Attack
Robust Audio Adversarial Example for a Physical AttackRobust Audio Adversarial Example for a Physical Attack
Robust Audio Adversarial Example for a Physical Attack
 
セキュリティ・キャンプのススメ
セキュリティ・キャンプのススメセキュリティ・キャンプのススメ
セキュリティ・キャンプのススメ
 
新しいライブ映像再生システムを研究してみた
新しいライブ映像再生システムを研究してみた新しいライブ映像再生システムを研究してみた
新しいライブ映像再生システムを研究してみた
 
GAME ON 特別イベント 「ゲームってなんでプログラミング?」
GAME ON 特別イベント 「ゲームってなんでプログラミング?」GAME ON 特別イベント 「ゲームってなんでプログラミング?」
GAME ON 特別イベント 「ゲームってなんでプログラミング?」
 
Arduinoでプログラミングに触れてみよう 続編
Arduinoでプログラミングに触れてみよう 続編Arduinoでプログラミングに触れてみよう 続編
Arduinoでプログラミングに触れてみよう 続編
 
Arduinoでプログラミングに触れてみよう
Arduinoでプログラミングに触れてみようArduinoでプログラミングに触れてみよう
Arduinoでプログラミングに触れてみよう
 
機械学習コン講評
機械学習コン講評機械学習コン講評
機械学習コン講評
 
NPCA夏合宿 2014 講義資料
NPCA夏合宿 2014 講義資料NPCA夏合宿 2014 講義資料
NPCA夏合宿 2014 講義資料
 
イタリアでパスタを食べた話
イタリアでパスタを食べた話イタリアでパスタを食べた話
イタリアでパスタを食べた話
 
Why don't you learn programming?
Why don't you learn programming?Why don't you learn programming?
Why don't you learn programming?
 
CTF, What's in it for me?
CTF, What's in it for me?CTF, What's in it for me?
CTF, What's in it for me?
 
Arrow Judge
Arrow JudgeArrow Judge
Arrow Judge
 
CTFとは
CTFとはCTFとは
CTFとは
 
CTFはとんでもないものを 盗んでいきました。私の時間です…
CTFはとんでもないものを 盗んでいきました。私の時間です…CTFはとんでもないものを 盗んでいきました。私の時間です…
CTFはとんでもないものを 盗んでいきました。私の時間です…
 
ぃぬ速@LKML
ぃぬ速@LKMLぃぬ速@LKML
ぃぬ速@LKML
 
プラトン
プラトンプラトン
プラトン
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Human-AI communication for human-human communication / CHAI Workshop @ IJCAI '22

  • 1. Human-AI communication for human-human communication: 
 Applying interpretable unsupervised anomaly detection to executive coaching (equal contribution) CHAI Workshop @ IJCAI '22 July 24, 2022 Riku Arakawa† Carnegie Mellon University, USA Hiromu Yakura† University of Tsukuba, Japan
  • 2. Background: Deep-learning-based human behavior analysis Advancement in human behavior analysis techniques: ・Facial expression recognition [1] ・Posture estimation [2] [1] I. Çugu, et al., 2017. MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Frontal Face Images. arXiv. [2] S.-E. Wei, et al., 2016. Convolutional Pose Machines. IEEE CVPR. It is expected that we can analyze and support human communication by applying these techniques. 2
  • 3. Background: A tool for helping public speaking with feedback [3] M. I. Tanveer, et al., 2015. A Real-Time In-Situ Intelligent Interface to Help People With Public Speaking. ACM IUI. [4] I. Damian, et al., 2015. Measuring the impact of multimodal behavioural feedback loops on social interactions.. ACM ICMI. Speech-feature-based feedback [3] Show feedback such as “louder” 
 and “faster” on a Google Glass 
 based on speech speed or volume. Posture-based feedback [4] Alert a speaker when they cross 
 their arm for a long time 
 based on posture estimation.
  • 4. Our perspective: Limitation of heuristic approach Human-to-human communication is very contextual: [5] J. Navarro and M. Karlins, 2008. What Every BODY Is Saying: An Ex-FBI Agent’s Guide to Speed Reading People. HarperCollins, New York. [6] R Friedman and A. J. Elliot, 2008. The effect of arm crossing on persistence and performance. Europ. J. Soc. Psych. Heuristic approach Unsupervised approach w/o rules or training data 4 Defensive attitude [5] Deeply thinking [6] Thus, we need a new framework of human-AI communication: Supervised approach w/ 
 training data of numerous classes
  • 5. Research object: Executive coaching • It consists of one-on-one conversation, in which coaches are required to observe the nonverbal behavior of coachees [7]. • The importance of observing nonverbal behavior is emphasized in terms of reading the nuance of what the coachee said [8]. But, notifying the detection of specific postures (e.g., crossing arms) 
 or emotions (e.g., confusing) without context was not appreciated. [7] E. Cox, et al., 2009. The Complete Handbook of Coaching. SAGE Publications, Los Angeles. [8] D. B. Drake, 2009. Narrative coaching. In The Complete Hand- book of Coaching. SAGE Publications, Los Angeles. 5 We hypothesized that AI can help novice coaches in the observation process.
  • 6. Key idea: Separating observation and judgement Coaches ignored the outputs once the outputs contradicted 
 their observation or intuition. They found it difficult to rely on outputs based on simplified classes that are indifferent to subtle context. Human 
 Pros: Good at understanding context 
 Cons: Difficult to keep stable perspective 
 due to their skills or mental load 
 
 AIs 
 Pros: Stable performance 
 Cons: Not good at dealing with context Separation of observation and judgment would be an alternative 
 way of human-AI communication. This guided us to reframe the way of 
 human-AI communication: 6
  • 7. REsCUE: Real-time feedback using anomaly detection 1. Extract posture and gaze information of the coachee. 2. Calculate outlierness score using anomaly detection algorithm. 3. Notify the coach in real-time with an interpretive visualization. We developed a supporting system that observes 
 the nonverbal behavior of coachees using unsupervised anomaly detection. It detects informative cues of the behavior and notifies the coach in real-time. Detailed workflow 7
  • 8. • The GMM gradually adapts to newly obtained nonverbal behavior data. 
 




 • When the trend of the input data suddenly changes, 
 it is detected by the spike of negative log-likelihood. REsCUE: How anomaly detection algorithm works [61] Kenji Yamanishi, et al. 2004. On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms. Data Mining and Knowledge Discovery. We use an algorithm based on a time-adaptive gaussian mixture model [9]. Time series behavior data of 
 the coachee taken from webcam: The parameters of 
 GMM (e.g., mean and cov) are updated with 
 a forgetting rate r.
  • 9. REsCUE: Visualization based on GMM The GMM allows us to provide interpretative visualization. In GMM, each component fits 
 the past representative states. Most anomalous frames can be 
 specified by sorting with the likelihoods. Just by arranging these frames, the coach can compare them and understand the change easily even during the session. 9
  • 10. REsCIE: Detection results 10 These behaviors were detected without any rules or heuristics and regarded as informative by professional coaches. The algorithm sometimes detected 
 apparent behavioral changes. 
 (e.g., taking a personal organizer out of a bag) The visualization allows the coach to 
 interpret why the scene is detected, 
 which avoids destroying their trust. Now, REsCUE is practically deployed as a supporting system.
  • 11. Lens of Parasuraman’s framework of automation 11 The design of our approach can be explained using Parasuraman's framework. Information 
 acquisition 10: the computer decides everything, 
 acts autonomously, ignoring the human 1: the computer offers no assistance; 
 human must take all decisions and actions Information 
 analysis Decision & action 
 selection Action 
 implementation Realm of automation human performance 
 automation reliability 
 cost of consequences Trade-off between
  • 12. Lens of Parasuraman’s framework of automation 12 The design of our approach can be explained using Parasuraman's framework. Information 
 acquisition Information 
 analysis Decision & action 
 selection Action 
 implementation Realm of automation 10: the computer decides everything, 
 acts autonomously, ignoring the human 1: the computer offers no assistance; 
 human must take all decisions and actions human performance 
 automation reliability 
 cost of consequences Trade-off between
  • 13. Lens of Parasuraman’s framework of automation 13 The design of our approach can be explained using Parasuraman's framework. Information 
 acquisition Information 
 analysis Decision & action 
 selection Action 
 implementation Realm of automation Low human performance: • Dependency on the skills 
 or mental load High automation reliability: • No dependency on 
 heuristics or training data Low cost of consequence: • Interpretable visualization to 
 discern uninformative cues This characteristic plot 
 of our approach came from ... observation
  • 14. Lens of Parasuraman’s framework of automation 14 The design of our approach can be explained using Parasuraman's framework. Information 
 acquisition Information 
 analysis Decision & action 
 selection Action 
 implementation Realm of automation High human performance: • Good at dealing with context Low automation reliability: • Automatic interpretation can 
 be insensitive to subtle context High cost of consequence: • Risk of asking irrelevant questions 
 that disturbs the session This characteristic plot 
 of our approach came from ... interpretation
  • 15. Application: Supporting skill transfer The informativeness of the detected cues depends on the coach's skill: 15 Skillful coach gains information from trifling behaviors. Novice coach often disregards 
 such behaviors. The difference in how each coach interprets the cues 
 reveals the difference in their skills. This can be utilized for skill transfer of coaches by helping novice coaches to learn how skillful coaches gain information from various behaviors.
  • 16. Application: Supporting skill transfer 16 Annotation phase: They classify whether each detected cues is informative or not. Skillful coach Novice coach Discussion phase: Through the discussion about the discrepancies, 
 the novice coach can learn the way of interpretation. The transparency of the results and the design of allowing open-ended interpretation enable this tool.
  • 17. Conclusion & On-going work • We introduced a new framework of human-AI communication that is based on 
 the unsupervised anomaly detection algorithm. • Its design of separating observation and interpretation enables human-AI collaboration in highly contextual situations, such as executive coaching. • Its interpretable visualization enabled by GMM provides transparency in 
 its detection results, which helps maintain trust with humans. We remark that REsCUE does not require any prior knowledge or rules and can be used in various domains. Now, we are working on applying this to analyzing sales communication 17 Read our 
 paper!