Jahna Otterbacher – Describing him, describing her: Linguistic biases in crowdsourced metadata for images of people (Talk at the 2nd Annual KNOWeSCAPE Scientific Meeting, http://knowescape.org/knowescape2014-2/)
4. Linguistic biases in metadata?
• The manner in which we use language plays a key role in the transmission of social stereotypes [Maass et al., 1989; Rubin et al., 2013]
• Linguistic bias: a systematic asymmetry in the way one uses language, as a function of the social group of the person(s) being described [Beukeboom, 2013]
• RQ: Do we observe linguistic biases in image labels with respect to gender?
  – Use of adjectives [Fiedler & Semin, 1988]
  – Use of strongly subjective adjectives [Wilson et al., 2005]
  – Use of labels that describe:
    • Physical appearance
    • Disposition or character
    • Occupation
5. Linguistic biases
Abstract / positive: "He is intelligent, successful, helpful."
Concrete / neutral: "She is studying, listening, thinking."
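The contrast above follows the Linguistic Category Model [Fiedler & Semin, 1988]: adjectives are the most abstract category (they imply stable, generalizable traits), while action verbs are the most concrete (they describe a single observable behavior). A minimal sketch of that distinction — the word-to-category mapping is a hand-written toy for this example, not output of a real tagger:

```python
# Toy illustration of the abstraction contrast in the Linguistic
# Category Model: adjectives (abstract traits) vs. action verbs
# (concrete behaviors). The mapping is hand-written for this example.
CATEGORY = {
    "intelligent": "adjective",   # abstract: implies a stable trait
    "successful": "adjective",
    "helpful": "adjective",
    "studying": "action verb",    # concrete: one observable act
    "listening": "action verb",
    "thinking": "action verb",
}

def abstraction(word):
    """Classify a description as abstract (trait) or concrete (behavior)."""
    return "abstract" if CATEGORY.get(word) == "adjective" else "concrete"

print(abstraction("intelligent"))  # abstract
print(abstraction("studying"))     # concrete
```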
6. Analysis
STEP 1: Find images of men and women
  – ESP Game Dataset (100k images)
  – LIWC categories: "humans, friends, family"
  – Use hyponyms of "man"/"male" and "woman"/"female" to label gender
STEP 2: Find labels that are adjectives
  – Part-of-speech tagging with CLAWS C5
  – Manual error analysis: adjectives (1.45%)
  – Finding: women are more often described with adjectives
STEP 3: Find subjective adjectives
  – Subjectivity Lexicon (Wilson et al., 2005)
  – Finding: women are more often described with subjective adjectives
STEP 4: Manual analysis
  – Identify images with labels concerning 6 occupations
  – For each label/image: does it describe appearance, disposition, or occupation?
  – Finding: women are associated with more labels concerning appearance, and fewer concerning occupation
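The first three steps of the pipeline can be sketched in a few lines of Python. This is a toy reconstruction, not the author's code: the small word sets below are hand-written stand-ins for the WordNet hyponyms, the CLAWS C5 part-of-speech tagger, and the Wilson et al. (2005) subjectivity lexicon used in the actual study.

```python
# Toy sketch of pipeline steps 1-3. The word sets are illustrative
# stand-ins for WordNet hyponyms (step 1), a POS tagger (step 2),
# and the Wilson et al. (2005) subjectivity lexicon (step 3).
MALE_TERMS = {"man", "male", "guy", "gentleman", "boy"}
FEMALE_TERMS = {"woman", "female", "lady", "girl"}
ADJECTIVES = {"happy", "sexy", "ugly", "sad", "cute", "tall"}
STRONGLY_SUBJECTIVE = {"happy", "sexy", "ugly", "sad", "cute"}

def analyze(labels):
    """Return (gender, strongly subjective adjectives) for one image's labels."""
    words = {w.lower() for w in labels}
    # Step 1: label gender only when the evidence is unambiguous.
    if words & MALE_TERMS and not words & FEMALE_TERMS:
        gender = "man"
    elif words & FEMALE_TERMS and not words & MALE_TERMS:
        gender = "woman"
    else:
        gender = None  # mixed or no person terms: excluded
    # Step 2: keep only labels that are adjectives.
    adjs = words & ADJECTIVES
    # Step 3: keep only strongly subjective adjectives.
    return gender, adjs & STRONGLY_SUBJECTIVE

print(analyze(["woman", "sexy", "beach", "tall"]))  # ('woman', {'sexy'})
```

In the real study, step 1 uses all WordNet hyponyms of "man"/"male" and "woman"/"female" rather than a fixed list, and step 2 tags each label with CLAWS C5 instead of checking set membership.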
7. Strongly subjective adjectives
Men (N = 18,916) | Women (N = 14,628)
Happy (385)      | Sexy (2,425)
Ugly (225)       | Happy (549)
Sad (201)        | Ugly (254)
Angry (132)      | Sad (241)
Drunk (124)      | Cute (117)
Scary (107)      | Beautiful (84)
Funny (103)      | Fun (67)
Cute (88)        | Drunk (58)
Mad (74)         | Scary (51)
Fun (63)         | Little (40)
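Raw counts are hard to compare across groups of different sizes; normalizing by the number of images per group makes the asymmetry explicit. A back-of-the-envelope calculation from the counts above:

```python
# Per-image rates for top strongly subjective adjectives, computed
# from the counts in the table above.
men_n, women_n = 18916, 14628

men = {"happy": 385, "ugly": 225, "sad": 201}
women = {"sexy": 2425, "happy": 549, "ugly": 254}

def rate(count, n):
    """Occurrences per 100 images, rounded to one decimal."""
    return round(100 * count / n, 1)

print(rate(women["sexy"], women_n))  # "sexy" per 100 images of women: 16.6
print(rate(men["happy"], men_n))     # "happy" per 100 images of men: 2.0
```

So "sexy" appears roughly 16.6 times per 100 images of women, far above the most frequent strongly subjective adjective for men ("happy", about 2.0 per 100 images).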
8. Implications & future work
• Exposing these biases raises issues for
  – Designers of systems
  – Those who train algorithms
• Future work: a controlled experiment varying
  – Stimulus
  – Social cues
and measuring the output (linguistic biases)