Lab Presentation: A Framework for Understanding Unintended Consequences of Machine Learning
1. A Framework for Understanding
Unintended consequences of
Machine Learning
Authors: Harini Suresh (MIT), John V. Guttag (MIT)
Presented by: Chenguang Xu “Shine”
2. The Problem with Biased Data
• Many unwanted consequences of ML algorithms arise in
some way from biased data.
• Here, bias refers to an unintended or potentially harmful
property of the data.
• Data is not a neutral given: it is shaped by many factors
and is the product of a generation process.
3. An Illustrative Scenario
• A lack of data on women was fixed by introducing
more data.
• Even so, the use of a proxy label (human assessment of
quality) rather than the true label (actual qualification)
allowed the model to discriminate by gender.
5. Historical Bias
• It is a fundamental, structural issue with the very first
step of the data generation process: it can exist even with
perfect sampling and feature selection, because the world
the data reflects already encodes the bias.
6. Representation Bias
• It arises when defining and sampling from a population.
• It can arise for several reasons:
• The sampling methods only reach a portion of the
population.
• The population of interest has changed or is distinct
from the population used during model training.
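The first cause above can be sketched in a few lines. This is a toy illustration with invented numbers and group labels, not data from the paper: a sampling channel that reaches only one group yields a training set whose statistics differ from the population's.

```python
# Hypothetical population: half group A (mean outcome 0.3),
# half group B (mean outcome 0.7). All values are made up.
population = [("A", 0.3)] * 500 + [("B", 0.7)] * 500

# A sampling method that only reaches group A (e.g., a survey channel
# that group B rarely uses) produces a skewed training sample.
sample = [score for group, score in population if group == "A"]

pop_mean = sum(score for _, score in population) / len(population)
sample_mean = sum(sample) / len(sample)

print(pop_mean, sample_mean)  # roughly 0.5 vs 0.3
```

The second cause (distribution shift) has the same shape: the population the model sees at deployment differs from the one `sample` was drawn from.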
7. Representation Bias (cont.)
Shankar, Shreya, et al. "No classification without representation: Assessing geodiversity issues in open
data sets for the developing world." arXiv preprint arXiv:1711.08536 (2017).
8. Representation Bias (cont.)
Photos of bridegrooms from
different countries aligned by the
log-likelihood that the classifier
trained on Open Images assigns to
the bridegroom class.
9. Measurement Bias
• It arises in the next step, when choosing and measuring the
particular features and labels of interest.
• It can arise in several ways:
• The granularity of data varies across groups.
• The quality of data varies across groups.
• The defined classification task is an oversimplification.
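The varying-quality case can be made concrete with a toy sketch (every rate below is invented for illustration): a proxy variable, such as recorded arrests standing in for actual offenses, diverges across groups when measurement intensity differs, even if the underlying behavior is identical.

```python
# Identical underlying behavior in both groups (hypothetical numbers).
true_offense_rate = {"A": 0.10, "B": 0.10}

# Measurement intensity differs: a larger fraction of group B's
# offenses gets recorded (e.g., heavier policing).
recording_rate = {"A": 0.3, "B": 0.9}

# The measured proxy diverges even though the ground truth is equal.
measured = {g: true_offense_rate[g] * recording_rate[g]
            for g in true_offense_rate}

print(measured)  # group B appears ~3x riskier despite identical behavior
```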
10. Aggregation Bias
• It arises when a one-size-fits-all model is used for groups
with different conditional distributions.
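A minimal sketch of this effect, with hand-picked toy numbers: when two groups have different outcome distributions, a single pooled predictor is wrong for both, while per-group predictors fit each exactly.

```python
# Hypothetical outcomes for two groups with different distributions.
group_a = [1.0, 1.0, 1.0, 1.0]   # outcomes cluster at 1.0
group_b = [3.0, 3.0, 3.0, 3.0]   # outcomes cluster at 3.0

def mse(pred, ys):
    """Mean squared error of a constant prediction."""
    return sum((pred - y) ** 2 for y in ys) / len(ys)

# One-size-fits-all model: predict the pooled mean for everyone.
pooled_mean = sum(group_a + group_b) / (len(group_a) + len(group_b))  # 2.0

pooled_err = (mse(pooled_mean, group_a) + mse(pooled_mean, group_b)) / 2
per_group_err = (mse(1.0, group_a) + mse(3.0, group_b)) / 2

print(pooled_err, per_group_err)  # 1.0 0.0
```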
11. Evaluation Bias
• It occurs when the evaluation and/or benchmark data for
an algorithm doesn’t represent the target population.
Buolamwini, Joy, and Timnit Gebru. "Gender shades: Intersectional accuracy disparities in
commercial gender classification." Conference on Fairness, Accountability and Transparency.
2018.
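In the spirit of the Gender Shades finding, a toy sketch (the counts below are invented, not the paper's results) shows how a benchmark that under-represents one group can report strong aggregate accuracy while hiding a large per-group gap.

```python
# Hypothetical benchmark results: 90 examples from group A,
# only 10 from group B. (group, correct?) pairs, counts invented.
results = ([("A", True)] * 88 + [("A", False)] * 2
           + [("B", True)] * 5 + [("B", False)] * 5)

def group_accuracy(results, group):
    """Accuracy restricted to one group's examples."""
    hits = [ok for g, ok in results if g == group]
    return sum(hits) / len(hits)

overall = sum(ok for _, ok in results) / len(results)
acc_a = group_accuracy(results, "A")
acc_b = group_accuracy(results, "B")

print(overall, acc_a, acc_b)  # aggregate looks strong; group B sits at 0.5
```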
12. Formalizations and Mitigations
• The data generation and ML pipeline can be viewed as a
series of mapping functions.
Mitigating Aggregation Bias:
• adjust the model g
• change r or t to transform the data
Mitigating Evaluation Bias:
• redefine the metric k
• adjust the benchmark data X̂, Ŷ
Mitigating Representation Bias:
• improve the sampling function s
Mitigating Measurement and Historical Bias:
• adjusting s alone will likely be ineffective
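The pipeline-as-mappings view on this slide can be sketched in code. This is a toy stand-in, not the paper's formalism verbatim: the function names (s, r, t, g, k) follow the slide, but every body below is an invented placeholder.

```python
# Toy pipeline: world -> sample (s) -> features (r) / labels (t)
# -> model (g) -> metric (k) on a benchmark (X_hat, Y_hat).
world = list(range(100))          # underlying population (placeholder)

def s(world):                     # sampling; improving s targets representation bias
    return world[:50]

def r(individual):                # feature measurement; changing r transforms the data
    return individual / 100.0

def t(individual):                # label measurement (possibly only a proxy)
    return individual >= 50

def g(X, Y):                      # learning; adjusting g targets aggregation bias
    return lambda x: x >= 0.5     # toy model that ignores its training data

def k(model, X_hat, Y_hat):       # evaluation; redefining k or the benchmark
    hits = [model(x) == y for x, y in zip(X_hat, Y_hat)]
    return sum(hits) / len(hits)  # targets evaluation bias

sample = s(world)
X, Y = [r(i) for i in sample], [t(i) for i in sample]
model = g(X, Y)

# A benchmark drawn from only one end of the population still scores
# perfectly here, illustrating how a skewed benchmark can hide problems.
bench = world[50:]
X_hat, Y_hat = [r(i) for i in bench], [t(i) for i in bench]
print(k(model, X_hat, Y_hat))  # 1.0 on this skewed benchmark
```

Each mitigation on the slide corresponds to replacing one of these functions while leaving the rest of the pipeline fixed.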