SlideShare une entreprise Scribd logo
Introduction to language
technology on social media
Ali Hürriyetoğlu
May 9, 2023
Utrecht University
What will we learn?
• What is social media?
• What is language technology?
• What do you do on social media?
• How does language technology help you on social media?
• How does language technology harm you on social media?
Outline
• Social media (SM)
• Language Technology (LT)
• Privacy
• Ethics
• Limitations
• Do it yourself!
Social media
• User-generated content
• Official writing rules do not apply
• Any character can be used
• Network of users, hashtags, etc.
• Real-time communication
• Two-way
• Global reach
• How many social media platforms do you use?
• Any niche social media platform you may suggest us?
• How many hours a day do you spend on social media?
Language technology
https://www.linkedin.com/pulse/information-extraction-natural-language-processing-shubham-shankar/
Language technology - TC
• Text classification
• Language identification
• Spam detection
• Sentiment analysis, etc.
https://doi.org/10.1016/j.neucom.2018.07.044
Language technology - IE
• Information extraction
• Named Entity recognition
• Event extraction
• Semantic role labeling, etc.
https://techblog.smc.it/en/2020-12-11/nlp-ner
Language technology - LG
• Language generation
• Machine translation
• Text summarization
• Text simplification
• Question answering
https://devopedia.org/text-summarization
Demo time
• Please go and try
• https://demo.allennlp.org/reading-comprehension/bidaf-elmo
• Pick and try a demo from https://allenai.org/demos
• https://huggingface.co/facebook/bart-large-mnli
• Pick and try a model from https://huggingface.co (use the search box on top-
left)
• What do you observe?
• Pros
• Cons
You must know this!
• Language technology shapes your
worldview! It determines
• The content you see
• The follow recommendations you get
• How people find you
• The job recommendations you see
• How recruiters find you
• …
• Be aware of it!
https://ourworldindata.org/artificial-intelligence
Language technology – User level tasks
• User-based analyses
• Ban content/accounts if they are offensive, fake, misinformation, etc.
• Detect bots
• Detect trust level: should I trust this user?
• Recommend relevant users
Language technology – Post level tasks
• Post/message level analyses
• Convert text to standard language
• Hide a comment if it does not contribute to the discussion
• Show relevant posts
• Show similar questions on Stackoverflow
• Detect hate speech, cyberbullying, and threats
• Detect fake or deceptive posts
• Detect sarcasm
• Detect fiction
• Summarize a set of posts
• Simplify or paraphrase a post for specific groups
• Translate between languages
• Detect trends
Language technology - Domains
• Humanitarian purposes
• What do people need?
• Identify relevant information in the scope of a disaster
• Health
• Supporting mental health, Suicide prevention
• Early detection of epidemics/pandemics
• Criminal
• Open-source intelligence, OSINT
• Improving public safety
• Politic behavior
• What do people want?
• Voting preference detection
• Consumer behavior
• What do people buy?
• What do people think about a product?
• What is the effect of a marketing campaign?
• Economics, finances
• Predicting stock market trends
• Linguistics
• Identify endangered languages
• Is the content appropriate for kids? Is a simplification or summarization needed and at what level?
• How else would you like to analyze text on social media?
Privacy
• Full anonymization is almost impossible
• Your content is utilized (or not) for creating
language technology, do you consent?
• Pros?
• Cons?
https://ai.googleblog.com/2020/12/privacy-considerations-in-
large.html
Ethics
• The development of social media
platforms and many language
technologies are mainly revenue-
oriented
• Companies do (may) not care how they
affect you.
• Check https://unqover.apps.allenai.org
• You may be in a filter bubble.
Limitations
• Although the text is everywhere, technologies utilized for social
media are increasingly multimodal: network, image, image+text, etc.
• The performance of language technology tools may significantly
decrease on a new text
• Meaning determined by the time, user, etc.
• LT may not work well on content from and on minority groups
Do it yourself
• Spacy: https://spacy.io
• Transformers: https://huggingface.co/docs/transformers/index
• Scikit-learn: https://scikit-
learn.org/stable/tutorial/text_analytics/working_with_text_data.html
• NLTK: https://www.nltk.org
• Corpus tools: https://corpus.tools
• Finnish tools: https://www.kielipankki.fi/tools/
• What is social media?
• What is language technology?
• What do you do on social media?
• How does language technology help you on social media?
• How does language technology harm you on social media?
Further …
• Zaghouani, W., & City, E. Language Technologies for Social Media.
INTEGRATING ICTIN SOCIETY, 11. URL:
http://infoz.ffzg.hr/infuture/2017/images/papers/1-
02_Zaghouani,_Language_Technologies_for_Social_Media.pdf
• https://ourworldindata.org/rise-of-social-media
• Tools and Demos
• Sense clustering over time: https://www.inf.uni-
hamburg.de/en/inst/ab/lt/resources/demos/scot.html
• Various demos: https://www.inf.uni-
hamburg.de/en/inst/ab/lt/resources/demos.html
• Thanks for your time!
• What questions do you have?
• Contact: ali.hurriyetoglu@gmail.com
@hurrial

Contenu connexe

Similaire à applications-of-lang-tech.pdf

Scale2014
Scale2014Scale2014
Scale2014
shaunagm
 
Beyond clicking
Beyond clickingBeyond clicking
Beyond clicking
Wu Heping
 
Social media for small NGOs
Social media for small NGOsSocial media for small NGOs
Social media for small NGOs
Amy Coulterman
 
How NGOs can use Social Media
How NGOs can use Social MediaHow NGOs can use Social Media
How NGOs can use Social Media
Farra Trompeter, Big Duck
 
Guide to Digital and Communication Accessibility in Higher Education
Guide to Digital and Communication Accessibility in Higher EducationGuide to Digital and Communication Accessibility in Higher Education
Guide to Digital and Communication Accessibility in Higher Education
3Play Media
 
Using Social Media to Engage Professional Alumni
Using Social Media to Engage Professional AlumniUsing Social Media to Engage Professional Alumni
Using Social Media to Engage Professional Alumni
Farra Trompeter, Big Duck
 
ChatGPT_Webinar_Slides.pptx
ChatGPT_Webinar_Slides.pptxChatGPT_Webinar_Slides.pptx
ChatGPT_Webinar_Slides.pptx
ssuser0e7b94
 
AALL Webinar: Technology Tools for Law Librarians
AALL Webinar:  Technology Tools for Law LibrariansAALL Webinar:  Technology Tools for Law Librarians
AALL Webinar: Technology Tools for Law Librarians
Lisa Smith-Butler
 
Ico guest lecture 2015 'Social media'
Ico guest lecture 2015 'Social media'Ico guest lecture 2015 'Social media'
Ico guest lecture 2015 'Social media'
University of Utrecht
 
Health information professionals and Artificial Intelligence
Health information professionals and Artificial IntelligenceHealth information professionals and Artificial Intelligence
Health information professionals and Artificial Intelligence
coxamcoxam
 
Integrating Ipads into the Classroom: Secondary Schools
Integrating Ipads into the Classroom: Secondary SchoolsIntegrating Ipads into the Classroom: Secondary Schools
Integrating Ipads into the Classroom: Secondary Schools
Spectronics
 
Soccnx10 Man versus Machine – A Story About Embracing Innovation
Soccnx10 Man versus Machine – A Story About Embracing Innovation Soccnx10 Man versus Machine – A Story About Embracing Innovation
Soccnx10 Man versus Machine – A Story About Embracing Innovation
Femke Goedhart
 
Social Media for Researchers
Social Media for ResearchersSocial Media for Researchers
Social Media for Researchers
Richard Hall
 
OSINT - Open Soure Intelligence - Webinar on CyberSecurity
OSINT - Open Soure Intelligence - Webinar on CyberSecurityOSINT - Open Soure Intelligence - Webinar on CyberSecurity
OSINT - Open Soure Intelligence - Webinar on CyberSecurity
Mohammed Adam
 
Artificial Intelligence Tools for Students with Learning Disabilities
Artificial Intelligence Tools for Students with Learning DisabilitiesArtificial Intelligence Tools for Students with Learning Disabilities
Artificial Intelligence Tools for Students with Learning Disabilities
John Rochford
 
Social engineering
Social engineeringSocial engineering
Social engineering
Robert Hood
 
01-Introduction to HCI.pptx
01-Introduction to HCI.pptx01-Introduction to HCI.pptx
01-Introduction to HCI.pptx
Le Hung
 
Content strategy in social media platforms
Content strategy in social media platformsContent strategy in social media platforms
Content strategy in social media platforms
Hossein sharafi
 
#csudocfest Bonsai Networking (for PhD students)
#csudocfest Bonsai Networking (for PhD students)#csudocfest Bonsai Networking (for PhD students)
#csudocfest Bonsai Networking (for PhD students)
Joyce Seitzinger
 
Thoughts on Open Accessibility
Thoughts on Open AccessibilityThoughts on Open Accessibility
Thoughts on Open Accessibility
colinbdclark
 

Similaire à applications-of-lang-tech.pdf (20)

Scale2014
Scale2014Scale2014
Scale2014
 
Beyond clicking
Beyond clickingBeyond clicking
Beyond clicking
 
Social media for small NGOs
Social media for small NGOsSocial media for small NGOs
Social media for small NGOs
 
How NGOs can use Social Media
How NGOs can use Social MediaHow NGOs can use Social Media
How NGOs can use Social Media
 
Guide to Digital and Communication Accessibility in Higher Education
Guide to Digital and Communication Accessibility in Higher EducationGuide to Digital and Communication Accessibility in Higher Education
Guide to Digital and Communication Accessibility in Higher Education
 
Using Social Media to Engage Professional Alumni
Using Social Media to Engage Professional AlumniUsing Social Media to Engage Professional Alumni
Using Social Media to Engage Professional Alumni
 
ChatGPT_Webinar_Slides.pptx
ChatGPT_Webinar_Slides.pptxChatGPT_Webinar_Slides.pptx
ChatGPT_Webinar_Slides.pptx
 
AALL Webinar: Technology Tools for Law Librarians
AALL Webinar:  Technology Tools for Law LibrariansAALL Webinar:  Technology Tools for Law Librarians
AALL Webinar: Technology Tools for Law Librarians
 
Ico guest lecture 2015 'Social media'
Ico guest lecture 2015 'Social media'Ico guest lecture 2015 'Social media'
Ico guest lecture 2015 'Social media'
 
Health information professionals and Artificial Intelligence
Health information professionals and Artificial IntelligenceHealth information professionals and Artificial Intelligence
Health information professionals and Artificial Intelligence
 
Integrating Ipads into the Classroom: Secondary Schools
Integrating Ipads into the Classroom: Secondary SchoolsIntegrating Ipads into the Classroom: Secondary Schools
Integrating Ipads into the Classroom: Secondary Schools
 
Soccnx10 Man versus Machine – A Story About Embracing Innovation
Soccnx10 Man versus Machine – A Story About Embracing Innovation Soccnx10 Man versus Machine – A Story About Embracing Innovation
Soccnx10 Man versus Machine – A Story About Embracing Innovation
 
Social Media for Researchers
Social Media for ResearchersSocial Media for Researchers
Social Media for Researchers
 
OSINT - Open Soure Intelligence - Webinar on CyberSecurity
OSINT - Open Soure Intelligence - Webinar on CyberSecurityOSINT - Open Soure Intelligence - Webinar on CyberSecurity
OSINT - Open Soure Intelligence - Webinar on CyberSecurity
 
Artificial Intelligence Tools for Students with Learning Disabilities
Artificial Intelligence Tools for Students with Learning DisabilitiesArtificial Intelligence Tools for Students with Learning Disabilities
Artificial Intelligence Tools for Students with Learning Disabilities
 
Social engineering
Social engineeringSocial engineering
Social engineering
 
01-Introduction to HCI.pptx
01-Introduction to HCI.pptx01-Introduction to HCI.pptx
01-Introduction to HCI.pptx
 
Content strategy in social media platforms
Content strategy in social media platformsContent strategy in social media platforms
Content strategy in social media platforms
 
#csudocfest Bonsai Networking (for PhD students)
#csudocfest Bonsai Networking (for PhD students)#csudocfest Bonsai Networking (for PhD students)
#csudocfest Bonsai Networking (for PhD students)
 
Thoughts on Open Accessibility
Thoughts on Open AccessibilityThoughts on Open Accessibility
Thoughts on Open Accessibility
 

Dernier

Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
frank0071
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Selcen Ozturkcan
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
PirithiRaju
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
PirithiRaju
 
Modelo de slide quimica para powerpoint
Modelo  de slide quimica para powerpointModelo  de slide quimica para powerpoint
Modelo de slide quimica para powerpoint
Karen593256
 

Dernier (20)

Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
 
Modelo de slide quimica para powerpoint
Modelo  de slide quimica para powerpointModelo  de slide quimica para powerpoint
Modelo de slide quimica para powerpoint
 

applications-of-lang-tech.pdf

  • 1. Introduction to language technology on social media Ali Hürriyetoğlu May 9, 2023 Utrecht University
  • 2. What will we learn? • What is social media? • What is language technology? • What do you do on social media? • How does language technology help you on social media? • How does language technology harm you on social media?
  • 3. Outline • Social media (SM) • Language Technology (LT) • Privacy • Ethics • Limitations • Do it yourself!
  • 4. Social media • User-generated content • Official writing rules do not apply • Any character can be used • Network of users, hashtags, etc. • Real-time communication • Two-way • Global reach
  • 5. • How many social media platforms do you use? • Any niche social media platform you may suggest us? • How many hours a day do you spend on social media?
  • 6.
  • 7.
  • 9. Language technology - TC • Text classification • Language identification • Spam detection • Sentiment analysis, etc. https://doi.org/10.1016/j.neucom.2018.07.044
  • 10. Language technology - IE • Information extraction • Named Entity recognition • Event extraction • Semantic role labeling, etc. https://techblog.smc.it/en/2020-12-11/nlp-ner
  • 11. Language technology - LG • Language generation • Machine translation • Text summarization • Text simplification • Question answering https://devopedia.org/text-summarization
  • 12. Demo time • Please go and try • https://demo.allennlp.org/reading-comprehension/bidaf-elmo • Pick and try a demo from https://allenai.org/demos • https://huggingface.co/facebook/bart-large-mnli • Pick and try a model from https://huggingface.co (use the search box on top- left) • What do you observe? • Pros • Cons
  • 13. You must know this! • Language technology shapes your worldview! It determines • The content you see • The follow recommendations you get • How people find you • The job recommendations you see • How recruiters find you • … • Be aware of it! https://ourworldindata.org/artificial-intelligence
  • 14. Language technology – User level tasks • User-based analyses • Ban content/accounts if they are offensive, fake, misinformation, etc. • Detect bots • Detect trust level: should I trust this user? • Recommend relevant users
  • 15. Language technology – Post level tasks • Post/message level analyses • Convert text to standard language • Hide a comment if it does not contribute to the discussion • Show relevant posts • Show similar questions on Stackoverflow • Detect hate speech, cyberbullying, and threats • Detect fake or deceptive posts • Detect sarcasm • Detect fiction • Summarize a set of posts • Simplify or paraphrase a post for specific groups • Translate between languages • Detect trends
  • 16. Language technology - Domains • Humanitarian purposes • What do people need? • Identify relevant information in the scope of a disaster • Health • Supporting mental health, Suicide prevention • Early detection of epidemics/pandemics • Criminal • Open-source intelligence, OSINT • Improving public safety • Politic behavior • What do people want? • Voting preference detection • Consumer behavior • What do people buy? • What do people think about a product? • What is the effect of a marketing campaign? • Economics, finances • Predicting stock market trends • Linguistics • Identify endangered languages • Is the content appropriate for kids? Is a simplification or summarization needed and at what level?
  • 17. • How else would you like to analyze text on social media?
  • 18. Privacy • Full anonymization is almost impossible • Your content is utilized (or not) for creating language technology, do you consent? • Pros? • Cons? https://ai.googleblog.com/2020/12/privacy-considerations-in- large.html
  • 19. Ethics • The development of social media platforms and many language technologies are mainly revenue- oriented • Companies do (may) not care how they affect you. • Check https://unqover.apps.allenai.org • You may be in a filter bubble.
  • 20. Limitations • Although the text is everywhere, technologies utilized for social media are increasingly multimodal: network, image, image+text, etc. • The performance of language technology tools may significantly decrease on a new text • Meaning determined by the time, user, etc. • LT may not work well on content from and on minority groups
  • 21. Do it yourself • Spacy: https://spacy.io • Transformers: https://huggingface.co/docs/transformers/index • Scikit-learn: https://scikit- learn.org/stable/tutorial/text_analytics/working_with_text_data.html • NLTK: https://www.nltk.org • Corpus tools: https://corpus.tools • Finnish tools: https://www.kielipankki.fi/tools/
  • 22. • What is social media? • What is language technology? • What do you do on social media? • How does language technology help you on social media? • How does language technology harm you on social media?
  • 23. Further … • Zaghouani, W., & City, E. Language Technologies for Social Media. INTEGRATING ICTIN SOCIETY, 11. URL: http://infoz.ffzg.hr/infuture/2017/images/papers/1- 02_Zaghouani,_Language_Technologies_for_Social_Media.pdf • https://ourworldindata.org/rise-of-social-media • Tools and Demos • Sense clustering over time: https://www.inf.uni- hamburg.de/en/inst/ab/lt/resources/demos/scot.html • Various demos: https://www.inf.uni- hamburg.de/en/inst/ab/lt/resources/demos.html
  • 24. • Thanks for your time! • What questions do you have? • Contact: ali.hurriyetoglu@gmail.com @hurrial