Geographically distributed collaborative teams often rely on synchronous text-based online communication for accomplishing tasks and maintaining social contact. This technology leaves a trace that can help researchers understand affect expression and dynamics in distributed groups. Although manual labeling of affect in chat logs has shed light on complex group communication phenomena, scaling this process to larger data sets through automation is difficult. We present a pipeline of natural language processing and machine learning techniques that can be used to build automated classifiers of affect in chat logs. Interpreting affect as a dynamic, contextualized process, we explain our development and application of this method to four years of chat logs from a longitudinal study of a multicultural distributed scientific collaboration. With ground truth generated through manual labeling of affect over a subset of the chat logs, our approach can successfully identify many commonly occurring types of affect.
The full paper: http://dx.doi.org/10.1145/2441776.2441813
Statistical Affect Detection in Collaborative Chat
1. Statistical Affect Detection in Collaborative Chat
CSCW 2013: Mining Social Media Data, Feb. 23
Michael Brooks, Katie Kuksenok, Megan K. Torkildson, Daniel Perry, John J. Robinson, Taylor Jackson Scott, Ona Anicello, Ariana Zukowski, Paul Harris, Cecilia R. Aragon
Scientific Collaboration & Creativity Lab
3. June, 2007
6:07:57 Ray cool, it worked [amusement, relief]
6:08:04 Matt woot [excitement, joy]
6:08:07 Ray awesome, I don't think he needs that long of a sleep after turning it off [acceptance, no affect]
6:08:47 We enhanced eready to detect the sticking [no affect]
6:08:58 Matt good job [supportive, acceptance]
6:09:21 seems it did well there [happiness, no affect]
6:09:26 Ray yeah, pretty cool huh? [interest, agreement, happiness]
6:09:43 Matt helps keep me from having to stopaic and restart [no affect]
6:09:55 Ray indeed, that was the point [agreement]
Scientific Collaboration & Creativity Lab 2/27/2013 3
4. Nearby Supernova Factory
• 30 astrophysicists
• US / France
• Daily remote operation of
telescope
• Rely on chat to communicate
7. SNfactory Chat Logs
• Four years of logs - 449,684 messages
• Manual coding for affective expressions
– 27,344 chat messages coded
– 1-5 coders per message
– 30 affect codes
– Multiple codes allowed
Scott et al. SIGDOC 2012. Adapting Grounded Theory to Construct a Taxonomy of Affect in Collaborative Online Chat.
10. Linguistic Inquiry and Word Count (LIWC)
• Detects words for Positive / Negative Emotions
"I wish every day could be sunny and warm. Rain makes me angry." → Positive: 15%, Negative: 8%, …
11. June, 2005
11:44:08 Gabri ok that's better [relief, serenity]
11:44:17 Marcel GREAT ! [excitement, happiness, relief, joy]
11:44:17 Gabri let's start aic and see [anticipation, no affect]
11:44:23 Marcel yes ... [no affect]
11:44:31 Derek Great what? [confusion]
11:44:32 Gabri can you do that? [interest, no affect]
11:44:50 derek.. it seems that now the focus is ok [no affect]
11:45:04 and we can finally start observing [no affect]
11:45:23 Derek Oh good! [happiness, relief, joy]
11:45:48 I have been waiting for this moment, because I want to leave the room and get my midnight snack. ;) [amusement]
11:46:54 Gabri go... [amusement, no affect]
11:47:02 and enjoy your snack [amusement, no affect]
11:47:13 Derek HEhe. [amusement]
11:47:18 I will bring it back here of course. [amusement]
12. The telescope is stuck! >:( [frustration]
The telescope is stuuuuuuuuuck... [annoyance]
The telescope is stuck?? [confusion]
15. Emoticons
Naomi: I think we'd better stopaic... :( [sadness]
Matt: today was a gym + laundry day :) [amusement, happiness]
Marcel: and she can't teach over an ssh-channel ;-) [amusement]
16. Word Sets
Swear Words
Ray: why the **** doesn't stop_script ******* STOP THE ******* SCRIPT [rage]
Matt: ******* ******* ******* I think I broke it [frustration, anger, apprehension, embarrassment]
Negations
Paul: but I wouldn't hazzard a guess [apprehension]
Ray: cannot talk to camera [frustration, no-affect]
17. Character Features
Letter Repetition
Ray: noooooooooooooooo, it must be stopped [annoyance, anger, fear]
Marcel: AAaah too late, they will find meeee [amusement]
Punctuation
Rick: looks like something bad happened here... [apprehension]
Rene: 1 month before max??!? [surprise, confusion, considering]
Capitalization
Marcel: ON TARGET ! [relief, joy]
Paul: we must set-up adopt an EXPLODING STAR [amusement, no-affect]
18. Feature Value
Alice: ok, so where was the ******* SN on the image?

Feature               Value
"ok"                  1
"telescope"           0
"where"               1
"SN"                  1
"image"               1
question marks        1
swears                1
emoticon :)           0
1st person pronouns   0
capitals              2
repetition            0
punctuation           1
length                45
…
19. Feature importance: Confusion

Top features:
???? length, # question marks, "understand", "confus_", "why", "what", "nothing", "wrong", msg. length, "thought"

Messages labeled Confusion:
Ben: ??? - the answer is likely found in the otsim code
Marcel: well ... I'm not so sure ...
Gary: Why do we care at all then?
Ray: ummm I mean how does it get to the header
20. Feature importance: Apprehension

Top features:
"bad", "something", "problem", "we", "seem", "too", msg. length, "not", # 3rd sg. pronouns, # swearing

Messages labeled Apprehension:
Pascal: the problem is than the automated detection will not work ... too much galaxy
Ray: But now bad stuff in window
Ben: pascal, we had a problem with do_fchart
Gabriel: So something is completely wrong
21. Feature importance: Amusement

Top features:
emoticon ";)", emoticon ":)", laughter, emoticon ";-)", "fun", laughter length, "p", # people names, "sleep", "of"

Messages labeled Amusement:
Kevin: hehe
Ray: hahahaah
Stef: lol ok derek :)
Ray: He never sleeps -- you know that.
Pascal: but I think it could be interesting for Extreeeeeeeeeeme photometry study ;-)
22. Specialized Features
• Count words based on the data
• Medium-specific features
– Emoticons, punctuation…
• Context-specific features
– People names, jargon…
• Affect-specific features
– Swearing vs. emoticons
23. September, 2006
5:17:48 Marcel ok, so let's cycle the stuff
5:18:04 Rick ok…
5:18:40 Marcel damn mouse cutandpast
5:19:03 Ray off 1 right? then on 1?
5:19:32 Marcel have you telnet sdsugreen ??
5:19:58 Ray director on lbl2 looks dead
5:20:34 Marcel ok, one thind at a time. have you cycled the baytech on sdsugreen ?
5:20:36 Ray what is best way to revive it
5:20:39 baytech
5:20:40 yes
5:20:46 not sdsu
5:21:08 go ahead and do it I am not evneon this **** shift...grrr
5:21:22 Marcel ok, maybe we have to kill director and restart it mkanually
5:21:32 Ray yeah but that's tricky; all these damn arguments
5:23:53 Rick emile, I have no idea what's going on here
5:23:57 only that it is bad
26. Support Vector Machine
• Accurate
• Fast
• Transparent
[Figure: training messages plotted by # "ok" vs. # swear words, with points where "frustration" applies separated from points where it does not]
29. Interpretability
• How is the classifier making decisions?
• What features are important in the model?
[Figure: the # "ok" vs. # swear words plot again, with the learned line separating messages where "frustration" applies from those where it does not]
33. Sequential Modeling
5:19:58 Ray director on lbl2 looks dead
5:20:34 Marcel ok, one thind at a time. have you cycled the baytech on sdsugreen ?
5:20:36 Ray what is best way to revive it
5:20:39 baytech
5:20:40 yes
5:20:46 not sdsu
5:21:08 go ahead and do it I am not evneon this **** shift...grrr
5:21:22 Marcel ok, maybe we have to kill director and restart it mkanually
5:21:32 Ray yeah but that's tricky; all these damn arguments
5:23:53 Rick emile, I have no idea what's going on here
5:23:57 only that it is bad
35. Affect in Twitter
[Chart: number of tweets over time (EST) on 2/3/2013, split into positive, negative, and neutral, with events marked: kickoff, halftime, game resumes, blackout, game resumes, game over]
36. Classify…
• Positive/negative/neutral sentiment
• Highly granular emotions
• Anything else you can label

In…
• Longer, formal documents (blog posts, reviews)
• Individual sentences
• Instant messages
• Tweets
• Anything else you can put in CSV

github.com/etcgroup/aloe
Download it, use it, & tell us what you think!
Michael Brooks, mjbrooks@uw.edu
http://depts.washington.edu/sccl
Editor's notes
Researchers working with social media have more data available than ever before. There is great potential for new insights, but the data sets are very large and complex. How can we help people understand data sets collected from social media and other online communication? Our research group is studying how a combination of visualization and machine learning can be integrated into a qualitative research workflow to help researchers dig into these new data sources in a rich but also scalable way.
In this paper, we focus on a large collection of chat logs from scientists working together on a specific project. Our group is doing ongoing qualitative research to understand how, when, and why the scientists express emotion, or affect, and how affect relates to creativity and problem solving in this data set. The data set is too large to code manually ourselves, and privacy concerns and the specialized domain knowledge required prevent us from using something like Mechanical Turk. In this talk, I will present some of the issues we have explored around using machine learning to automatically label the data, in support of scalable rich analysis. I will focus on the importance of developing a diverse, specialized feature set and the use of interpretable classification algorithms.
I’ll start by giving a bit of background about the data…
Ray and Matt are discussing a new program that Ray created to automatically un-stick the telescope, saving the scientists a lot of time. Many lines have multiple types of affect, while some lines have no affect.
Most affect codes are very rare. Reliability ranges from 0.4 to 0.8.
Before I go on… LIWC is a popular text analysis tool that can be used for finding emotions or sentiment in text. LIWC processes blocks of text, counting words that belong to specific sets of dictionary words previously determined to have particular meanings. This is called a lexicon-based approach. The words sunny and warm are part of LIWC's Positive Affect lexicon, while angry is part of its Negative Affect lexicon, so LIWC would report that this text has two positive words and one negative word.
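The lexicon-based counting described in this note can be sketched in a few lines. The word lists below are illustrative stand-ins, not LIWC's actual dictionaries:

```python
# Minimal sketch of a LIWC-style lexicon count.
# These word sets are illustrative placeholders, not LIWC's dictionaries.
POSITIVE_WORDS = {"sunny", "warm", "good", "great", "happy"}
NEGATIVE_WORDS = {"angry", "bad", "sad", "stuck"}

def lexicon_percentages(text):
    """Percentage of tokens that fall in each lexicon."""
    tokens = text.lower().replace(".", " ").split()
    pos = sum(1 for t in tokens if t in POSITIVE_WORDS)
    neg = sum(1 for t in tokens if t in NEGATIVE_WORDS)
    return {"positive": 100 * pos / len(tokens),
            "negative": 100 * neg / len(tokens)}

lexicon_percentages("I wish every day could be sunny and warm. Rain makes me angry.")
```

On the slide's example sentence this yields roughly 15% positive and 8% negative tokens, matching the figures shown above.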
For data sets like ours, we believe that this kind of approach is not appropriate. While LIWC's validity has been carefully studied for very narrow domains of English writing, informal online communications such as chat messages and tweets use a lot of domain-specific vocabulary and non-standard textual cues to communicate affect, almost becoming another language entirely. The medium and the context of communication are often critical to correctly understanding emotional content.
Let me illustrate this with a quick example. This is a chat message rewritten three ways. LIWC is not built to recognize expressions such as emoticons or intentionally misspelled words, and punctuation cues are not taken into account. Furthermore, in general English a word like stuck may not have strong emotional connotations, but in our data set it is used when scientists are struggling with telescope problems, which makes it quite an effective way to recognize frustration, for example. LIWC and other tools that use standard English lexicons will miss these signals. So if we aren't going to use a predefined, validated lexicon of affect-laden words, what will we use to recognize affect?
We based our features on a combination of previous literature and our knowledge of this chat data set we were working with.
We look at all of the words that occur anywhere in the training data and select the most common 400-600 of those. Each becomes a feature that our classifiers can use to recognize affect. The words do not come from a predefined list, but from the data itself. This helps us pick up on jargon and other unconventional word usage.
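Selecting the most frequent words from the training data itself, rather than from a predefined list, can be sketched like this (the sample messages are invented):

```python
from collections import Counter

def top_word_features(messages, k=500):
    """Select the k most frequent tokens across the training messages;
    each selected word becomes one count feature."""
    counts = Counter(tok for msg in messages for tok in msg.lower().split())
    return [word for word, _ in counts.most_common(k)]

def word_counts(message, vocab):
    """Feature vector: how often each vocabulary word occurs in a message."""
    tokens = message.lower().split()
    return [tokens.count(w) for w in vocab]

# Invented training messages; jargon like "telescope" surfaces naturally.
vocab = top_word_features(["the telescope is stuck",
                           "the focus is ok",
                           "is the telescope ok?"], k=4)
```

Because the vocabulary is built from the corpus, domain jargon and unconventional spellings become features automatically.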
Using a list of over 2000 punctuation patterns recognized as emoticons, we also add the most frequently occurring emoticons to the feature set.
In addition to these corpus-based features, there are several specific types of words that we look for. For example, we have a feature for the number of swear words in the message, or the number of negation words.
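A word-set count feature of this kind is straightforward; the negation list below is an illustrative placeholder, not the study's actual list:

```python
# Sketch of a word-set count feature (here: negations).
# The set is an illustrative placeholder, not the study's actual list.
NEGATION_WORDS = {"not", "no", "never", "cannot", "can't", "don't", "wouldn't"}

def negation_count(message):
    """Number of negation words appearing in the message."""
    return sum(1 for tok in message.lower().split() if tok in NEGATION_WORDS)

negation_count("cannot talk to camera")  # counts "cannot"
```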
We look at character-level features like the number of repeated consecutive letters, sequences of exclamation points, or the number of capital letters. These are used extensively in chat messages and other informal online communication to signal emotion, mood, or affect.
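These character-level cues can be extracted with a few regular expressions; this is a sketch of the idea, not the study's exact feature definitions:

```python
import re

def char_features(message):
    """Character-level affect cues: repeated letters, punctuation, capitals.
    A sketch, not the study's exact feature definitions."""
    return {
        # runs of 3+ of the same letter, e.g. "noooooo"
        "letter_repetitions": len(re.findall(r"([a-zA-Z])\1{2,}", message)),
        # runs of 2+ exclamation points
        "exclamation_runs": len(re.findall(r"!{2,}", message)),
        "question_marks": message.count("?"),
        "capitals": sum(1 for c in message if c.isupper()),
    }

char_features("noooooooooooooooo, it must be stopped")
```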
Here’s an example to illustrate how this works. On the right is a subset of the features that we extract from the message. In reality the list is about 800 features long.
I’m going to skip ahead for a moment to some results. Once we train and evaluate classifiers for the affect codes that we want to automatically label, one thing we can do is look at which of those 800+ features were actually important. This example shows the top 10 most highly weighted features for the classifier trained to recognize confusion. On the right are a few example messages that our coders labeled with confusion. Clearly, the presence of question marks and certain key words (understand, why, what…) are useful for knowing when someone is confused.
Compare that to the top features for Apprehension. A different set of key words has risen to the top, in addition to the number of third-person singular pronouns and swear words. The examples on the right can help you see how those words are used and why they might be associated with apprehension.
And for amusement, emoticons and laughter expressions were the most useful features. Note that the presence of names of specific scientists was also an important factor in labeling for amusement.
The conclusion we want to stress is that for communication that resembles chat, specialized features are critical for recognizing a wide range of affect codes. Features that were intimately based on the data (word counts and emoticons), and features specific to the communication medium (emoticons and punctuation), were highly utilized. And the usefulness of each feature varied greatly from one type of affect to another.
Now, I’ll explain in more detail how those features are used in classification, and why we strongly recommend using interpretable, transparent classification algorithms for automated or partially automated coding as part of qualitative research. As I’ve said, we focused only on the 13 most frequently used types of affect. We created one binary classifier for each affect code.
This means that the problem facing the classifier is the following: Given Ray’s message “what is the best way to revive it”, does the code frustration apply?
We compared the performance of a wide variety of classification algorithms, a few of which are shown here. We selected a linear support vector machine because it had very promising performance characteristics, but also because it is fast to train and use, and provides a level of transparency into its inner workings not afforded by many other algorithms.
I’ll explain a little about how linear SVMs are used to classify text. Let’s say that you have only two features, the number of “ok”s and the number of swear words. The messages in your training data can each be plotted in this 2D space. In this example there is a pretty clear separation between those that were manually labeled with the frustration code and those which were not. When you train an SVM classifier on this data, it finds a line, such as this one, that best separates the frustrated messages from the non-frustrated messages (according to a particular definition of “best separates”).
Then, given a new unlabeled message with few swear words and a medium number of “ok”s, the classifier can label it as non-frustrated because it falls on that side of the line.
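The two-feature setup described in these notes can be sketched in code. A basic perceptron stands in here for the linear SVM (it also learns a separating line, though not the max-margin one), and the training data is invented for illustration:

```python
# Toy sketch: each message reduced to (# "ok", # swear words), with a
# learned line separating "frustration" from "not frustration".
# A simple perceptron stands in for the linear SVM; data is invented.

def train_linear(X, y, epochs=20, lr=1.0):
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in zip(X, y):
            pred = 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0
            err = target - pred  # -1, 0, or +1
            w[0] += lr * err * x[0]
            w[1] += lr * err * x[1]
            b += lr * err
    return w, b

#    (# "ok", # swears)                1 = "frustration" applies
X = [[3, 0], [2, 0], [4, 1], [0, 3], [1, 4], [0, 5]]
y = [0, 0, 0, 1, 1, 1]
w, b = train_linear(X, y)

def applies(x):
    """Label a new message by which side of the line it falls on."""
    return w[0] * x[0] + w[1] * x[1] + b > 0

applies([2, 1])  # few swears, a medium number of "ok"s → False
```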
This chart shows precision and recall from 10-fold cross-validation for each of our 13 affect codes, using balanced data. Precision is the percentage of the messages that the classifier labeled as positive which were truly supposed to be positive. Recall is the percentage of all truly positive messages that the classifier successfully labeled as positive. So, performance is between 60 and 80% for most codes, with a high of 93% for interest. But how can we know if these classifiers are actually useful for automatically coding chat messages for our research?
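The two metrics defined in this note compute directly from true vs. predicted labels (the example labels below are invented):

```python
def precision_recall(true, pred):
    """Precision: of the messages labeled positive, how many truly were.
    Recall: of the truly positive messages, how many were found."""
    tp = sum(t == 1 and p == 1 for t, p in zip(true, pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(true, pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(true, pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

# Invented example: 10 messages, 4 of 5 positives found, 1 false alarm.
true = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
pred = [1, 1, 1, 1, 0, 1, 0, 0, 0, 0]
precision_recall(true, pred)  # → (0.8, 0.8)
```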
Now, this is what I meant when I said the SVM is relatively transparent or interpretable. Suppose we learned the following model from the data. From this, we can see that swear words have more predictive power for frustration, while the number of “ok”s hardly makes any difference. In other words, by looking at the slope of the line, we can find out which features were the most important.
This is exactly where the tables from earlier came from. Examination of the SVM feature weights gives us a very easy way to gain a measure of insight into how and why the classifier behaves the way it does, which can help us understand how useful it might be for automatic coding.
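Ranking features by the magnitude of their learned weights, as in the feature-importance tables, is a one-liner once the model is trained. The weights here are invented for illustration, not taken from the study's classifiers:

```python
# Sketch: rank features by absolute weight in a trained linear model.
# These weights are invented for illustration.
weights = {'# swear words': 1.8, '"ok"': -0.1, '# question marks': 0.4}
ranked = sorted(weights, key=lambda f: abs(weights[f]), reverse=True)
ranked  # most predictive feature first
```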
And in general, we believe that for this kind of application, understanding how and why the classifier does or doesn’t work may be far more important than optimizing specific classification performance metrics (like precision, recall, accuracy, or F1 score).
Sequential modeling approaches, such as hidden Markov models, can take contextual information into account more directly. Context is clearly important to understanding the emotion communicated in chat messages; looking at messages in isolation can only get you so far.
Further, we are studying how visual analytics and interactive machine learning can be combined to create powerful tools for analyzing large social communication data sets.
Finally, we are extending this work by developing new features and algorithms for processing tweets, where data set sizes easily extend into the millions of messages, and different signals are used to communicate affect.
We have published the code from this study on GitHub, as a Java program called ALOE. ALOE uses the Weka machine learning library, and can easily be extended and used for affect classification and other text classification work. We invite you to try it out and let us know what you think. Questions?