Roles and Words in a Massive NSSI-Related Interaction Network
Dmitry Zinoviev
Mathematics and Computer Science Department
Suffolk University, Boston MA
Presented at SunBelt 2019, Montreal CA
June 2019 INSNA SunBelt 2
What Is NSSI?
● Non-suicidal self-injury (NSSI), such as self-cutting or
self-burning, is the deliberate destruction of one’s
body tissue in the absence of suicidal intent.
● Approximately one in five adolescents and one in
four young adults in the USA have engaged in NSSI
(“self-cutters,” “self-burners”)
Where to Study Self-Harmers?
● Off-line: expensive, invasive
● On-line: cheap, noninvasive, in a naturally occurring
setting
– On LiveJournal:
● a blogging social networking site
● share skills and practices (especially concealment), ask
for help
Research Question
● Do NSSI-related topic starters and followers on
LiveJournal use different vocabulary, and if so, how do
they differ?
Research Strategy
● Build an interaction network of LiveJournal users
● Identify topics of discourse (ToDs)
● Find and explain the relationships between the
network attributes (such as centralities) and ToDs
Dataset
● ~140 NSSI-related thematic communities
● 15,678 active users
● 63,000 original posts
● 169,000 follow-up comments
● Posted in 2001–2012
Interaction Network Construction
● Interaction = response (comment) to the original post
or a comment
● A responds to B → edge from A to B
● Number of responses → weight of the edge
● Directed, weighted network
● 18 major network communities through Louvain
community detection
● Newman modularity 0.73
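The construction rules above can be sketched in a few lines of plain Python. This is only an illustration under assumptions: the input format (one `(responder, author)` pair per comment), the function names, and the toy user names are hypothetical and not taken from the slides. Community detection is not shown.

```python
from collections import Counter

def build_interaction_network(responses):
    """Return {(responder, author): weight} for a directed, weighted network.

    `responses` is assumed to hold one (responder, author) pair per comment.
    """
    edges = Counter()
    for responder, author in responses:
        # A responds to B -> edge from A to B; repeated responses add weight.
        edges[(responder, author)] += 1
    return edges

def weighted_degrees(edges):
    """Sum edge weights into weighted in-degree and out-degree per user."""
    in_deg, out_deg = Counter(), Counter()
    for (src, dst), weight in edges.items():
        out_deg[src] += weight
        in_deg[dst] += weight
    return in_deg, out_deg

# Toy data with made-up user names.
responses = [("ann", "bob"), ("ann", "bob"), ("cat", "bob"), ("bob", "ann")]
edges = build_interaction_network(responses)
in_deg, out_deg = weighted_degrees(edges)
```

Here `edges[("ann", "bob")]` is 2 because ann responded to bob twice.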
Network at a Glance
Node attributes
represent users’ roles
Interpretation of Attributes (I)
● In-Degree Centrality
– Author of requests for help or advice (topic starter), or
controversial statements
● Out-Degree Centrality
– Responder, advice-giver
● Closeness Centrality
– First responder (author of the first, or other lower-rank,
comment)
● Betweenness Centrality
– Mediator/broker
Interpretation of Attributes (II)
● Eigenvector Centrality
– “Important” member (in the most general meaning of the
term)
● Clustering Coefficient
– Participant in active multi-party discussions
We are not sure
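As a rough illustration of how some of these attributes could be computed, the sketch below derives in-degree centrality, out-degree centrality, and the local clustering coefficient from an edge dictionary. It rests on assumptions: the slides do not say which software was used or whether edge weights entered the centralities, so this version ignores weights and computes clustering on the undirected projection. Closeness, betweenness, and eigenvector centrality are left out for brevity. All names and data are hypothetical.

```python
from itertools import combinations

def degree_centralities(edges):
    """Unweighted in/out-degree centrality, normalized by (n - 1)."""
    nodes = {node for edge in edges for node in edge}
    in_count = {node: 0 for node in nodes}
    out_count = {node: 0 for node in nodes}
    for src, dst in edges:
        if src != dst:  # self-loops do not count toward degree here
            out_count[src] += 1
            in_count[dst] += 1
    denom = max(len(nodes) - 1, 1)
    in_c = {node: count / denom for node, count in in_count.items()}
    out_c = {node: count / denom for node, count in out_count.items()}
    return in_c, out_c

def clustering_coefficients(edges):
    """Local clustering coefficient on the undirected projection."""
    neighbors = {}
    for src, dst in edges:
        if src != dst:
            neighbors.setdefault(src, set()).add(dst)
            neighbors.setdefault(dst, set()).add(src)
    result = {}
    for node, nbrs in neighbors.items():
        k = len(nbrs)
        if k < 2:
            result[node] = 0.0
            continue
        # Count pairs of neighbors that are themselves connected.
        links = sum(1 for a, b in combinations(nbrs, 2) if b in neighbors[a])
        result[node] = 2 * links / (k * (k - 1))
    return result

# Toy edge dictionary: {(responder, author): number of responses}.
toy_edges = {("ann", "bob"): 2, ("cat", "bob"): 1, ("bob", "ann"): 1,
             ("cat", "ann"): 1, ("dan", "bob"): 1}
in_c, out_c = degree_centralities(toy_edges)
clustering = clustering_coefficients(toy_edges)
```

In this toy network bob receives responses from every other user, so his in-degree centrality is 1.0, which matches the "topic starter" reading above.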
Identify ToDs (I)
● Build a semantic network (a network of words). For
each post and comment:
– Remove frequent words (stop words)
– Lemmatize the remaining words
– Represent lemmas as network nodes
– Connect two words with an undirected edge if the lemmas
are at most five words apart in the text. The size of the
window is chosen to ensure that the resulting network is
neither too dense nor too sparse
– The number of co-occurrences is the edge weight
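The windowed co-occurrence step can be sketched as follows. The sketch assumes the input is already a list of lemmas with stop words removed, and that "at most five words apart" means a position difference of at most five in that filtered list; the slides do not settle whether distance is measured before or after stop-word removal. The function name and sample lemmas are hypothetical.

```python
from collections import Counter

def cooccurrence_edges(lemmas, max_distance=5):
    """Return {(lemma_a, lemma_b): weight} for an undirected semantic network.

    Two lemmas are linked when their positions differ by at most
    `max_distance`; the weight is the number of such co-occurrences.
    """
    edges = Counter()
    for i, left in enumerate(lemmas):
        for right in lemmas[i + 1 : i + 1 + max_distance]:
            if left != right:  # skip self-loops
                # Sort the pair so (a, b) and (b, a) share one undirected edge.
                edges[tuple(sorted((left, right)))] += 1
    return edges

lemmas = ["need", "help", "stop", "cut", "talk", "friend", "help"]
semantic_edges = cooccurrence_edges(lemmas)
```

In this example "need" and the second "help" are six positions apart, so only the first "help" contributes to the ("help", "need") edge.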
Semantic Network at a Glance
Identify ToDs (II)
● 11 major network communities through Louvain
community detection
● Newman modularity 0.37
● A community ↔ a collection of words that are
frequently used together ↔ a topic of discourse
● 11 major topics of discourse
● Each user has a vector of 11 topic memberships $TD_{ij}$ ($\sum_i TD_{ij} = 1$)
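One way a membership vector that sums to 1 could be obtained is shown below. This is an assumption, not the author's stated method: the slides give only the normalization constraint, so the sketch simply takes the share of a user's lemmas that fall into each topic. The function name, the lemma-to-topic mapping, and the sample data are hypothetical.

```python
from collections import Counter

def topic_memberships(user_lemmas, lemma_topic, topics):
    """Return {topic: share} for one user; shares sum to 1 when any lemma matches.

    `lemma_topic` maps a lemma to the topic (semantic community) it belongs to.
    Lemmas outside every topic are ignored.
    """
    counts = Counter(lemma_topic[l] for l in user_lemmas if l in lemma_topic)
    total = sum(counts.values())
    if total == 0:
        # No known lemma: the sum-to-one constraint cannot be met.
        return {topic: 0.0 for topic in topics}
    return {topic: counts[topic] / total for topic in topics}

lemma_topic = {"razor": "tools", "blade": "tools", "scar": "scar", "help": "help"}
topics = ["help", "scar", "tools"]
memberships = topic_memberships(
    ["razor", "blade", "scar", "help", "unknown"], lemma_topic, topics
)
```

Here two of the four recognized lemmas belong to "tools", so that membership is 0.5.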
Name ToDs
● Extracted semantic network communities (ToDs) do
not have names
● Name after the most frequent lemmas (e.g., “sad” →
the “sad” topic)
● Name via Amazon Mechanical Turk (* denotes “magic”
numbers)
– Select 25* most frequent words
– Submit to 25* AMT workers and ask them to come up with a
one- or two-word name
– Accept the majority vote, if any
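The "accept the majority vote, if any" rule might look like the sketch below. It assumes "majority" means strictly more than half of the submitted names after trimming whitespace and lowercasing; the slides do not say whether a plurality would also be accepted. The function name and sample answers are hypothetical.

```python
from collections import Counter

def majority_name(proposals):
    """Return the name proposed by more than half of the workers, else None."""
    if not proposals:
        return None
    normalized = [p.strip().lower() for p in proposals]
    name, votes = Counter(normalized).most_common(1)[0]
    return name if votes > len(normalized) / 2 else None

winner = majority_name(["Sad", "sad", "sadness", " sad "])
no_winner = majority_name(["anger", "sadness"])
```

With these inputs `winner` is "sad" (three of four votes), while `no_winner` is `None` because neither name exceeds half.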
ToDs: Names and Top 7 Words
help: Help, need, talk, stop, love, never, right
lifestyle: Back, keep, away, around, put, stay, mind
friend: Tell, friend, way, find, best, ask, mom
sad: Sad, upset, depress, angry, depressed, pathetic
time: Day, start, year, long, last, month, first
scar: Scar, look, arm, leave, blood, enough, alone
hate: Bad, life, hurt, hate, fuck, pain, feeling
rules: Post, little, community, new, write, name, read
ana: Yes, eat, disorder, depression, trust, etc., mental
s.i.: Self, si, listen, sit, room, suicide, change
tools: Use, razor, blade, word, cutter, usually, knife
Logit Regression
● Independent variables X:
– Six network centralities and the clustering coefficient (they
define the role of the user in the network)
● Dependent variables Y:
– Membership in each of the 11 topics of discourse (they
define the user's language use)
– Binarized (above/below the median)
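The binarize-then-regress setup can be sketched in plain Python. Several things here are assumptions: values strictly above the median are coded 1 (the slides do not say how ties are handled), the model is fitted by simple batch gradient ascent rather than whatever solver the author used, and the data are made up. The sketch yields coefficients only; p-values such as those reported on the next slide would need a statistics package.

```python
import math
from statistics import median

def binarize_at_median(values):
    """Code each value as 1 if strictly above the median, else 0."""
    m = median(values)
    return [1 if v > m else 0 for v in values]

def fit_logit(X, y, lr=0.1, epochs=2000):
    """Fit a logistic regression by batch gradient ascent on the log-likelihood.

    Returns [intercept, coef_1, ..., coef_k].
    """
    n, k = len(X), len(X[0])
    beta = [0.0] * (k + 1)
    for _ in range(epochs):
        grad = [0.0] * (k + 1)
        for row, target in zip(X, y):
            z = beta[0] + sum(b * x for b, x in zip(beta[1:], row))
            p = 1.0 / (1.0 + math.exp(-z))
            err = target - p
            grad[0] += err
            for j, x in enumerate(row):
                grad[j + 1] += err * x
        beta = [b + lr * g / n for b, g in zip(beta, grad)]
    return beta

# Made-up values: one centrality (X) and one topic membership (Y) per user.
out_degree = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
membership = [0.05, 0.20, 0.10, 0.15, 0.30, 0.12, 0.40, 0.35]

y = binarize_at_median(membership)
beta = fit_logit([[v] for v in out_degree], y)
```

A positive `beta[1]` would correspond to a blue arrow in the results figure and a negative one to a red arrow, subject to the significance test that this sketch omits.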
Significant Results
● Yellow nodes represent
independent variables
– Betweenness centrality is
not significantly related to
any Y
● Cyan nodes represent
dependent variables
● Arrows represent
statistically significant
(p≤0.05) relations between
independent and dependent
variables
● Thicker arrows represent
smaller p-values
● Blue arrows represent
positive coefficients
● Red arrows represent
negative coefficients
Interpretation
● Topic starters are not associated with “rules” and
“help”
● Responders are not associated with “tools,” visual
manifestations of NSSI (“scars”), and “time”
● First responders are not associated with “help,”
“friend,” and “time” – not a good sign!
● Intensive multi-party discussions are related to “rules”
● Influence of propensity for brokerage is not significant
● The negative effect of eigenvector centrality on “scars”
needs further research
Conclusion & Acknowledgment
● The structural roles and semantic preferences in an
NSSI interaction network are related.
● Topic starters and especially first responders
concentrate on negativity.
● Later responders concentrate on positivity.
● The author is grateful to the two anonymous reviewers
for their comments and inspiration
More NSSI Research from SU
● D Zinoviev, “Non-suicidal self-injury–related interests
in blogging social networks,” poster presented at
SunBelt, 2018
● D Zinoviev, D Stefanescu, G Fireman, L Swenson,
“Semantic networks of interests in online non-suicidal
self-injury communities,” Digital Health, 1, 2016
