This talk explains why machine learning algorithms are prone to bias, shows concrete examples, and examines regulatory, conceptual, and technical means to address these issues.
8. The Raw Ingredients
Deep understanding of a business problem
Data, data, data
Algorithmic capability
9. Data Product Lifecycle
Design: What problem are we solving?
Data Exploration: What does the data look like?
Data Engineering
Data Processing: Is our data ready for use?
Model Prototyping: Will this work? Should we pivot?
Production: Can we harden and scale the model?
Maintenance: How to update as data changes?
13. 70% of machine learning products use supervised learning
https://www.sas.com/en_ca/insights/analytics/machine-learning.html
14. Supervised Learning
Find a proxy (P) for something hard to know (C)
Find a function that captures the correlation between P and C
Use this function to make guesses about C
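The proxy-and-target framing above can be sketched with a tiny supervised learner. Here is a minimal, illustrative example with hypothetical data: the proxy P is a single numeric feature, the hard-to-know quantity C is the label, and the learned "function" is a least-squares line.

```python
# Minimal supervised-learning sketch (illustrative, hypothetical data):
# P = an observable proxy feature, C = the hard-to-know target.
# We fit a line C ~ a*P + b from labeled examples, then "guess" C for new P.

def fit_line(xs, ys):
    """Ordinary least squares for a single 1-D feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    a = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
        sum((x - mean_x) ** 2 for x in xs)
    b = mean_y - a * mean_x
    return a, b

# Labeled training data: (proxy P, known outcome C)
xs = [1, 2, 3, 4, 5]
ys = [40, 50, 60, 70, 80]

a, b = fit_line(xs, ys)
predict = lambda x: a * x + b
print(predict(6))  # extrapolate to an unseen proxy value -> 90.0
```

The bias risk discussed in this talk enters exactly here: if the proxy P correlates with group membership, the learned function inherits that correlation.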
24. Classical Statistics & ML
Higher unconscious bias in feature selection
Higher explainability in the model
Deep Learning
Lower unconscious bias in feature selection
Lower explainability in the model
30. • Systems use algorithms to identify negative sentiment
• They perform better with strident, unambiguous expressions of emotion
• Men are more likely to use those expressions
• Men attract disproportionate attention from brands
https://blog.dominodatalab.com/video-how-machine-learning-amplifies-societal-privilege/
31. Bluntness and bias
• Precision vs. recall trade-off
• Marketing wants high precision
• Which implies low recall
• We’re better at identifying extremes
• They’re likely in a particular group
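The precision/recall trade-off on this slide can be made concrete with a small, hypothetical example: a blunt classifier that only flags the most strident cases scores perfect precision but misses half the real positives.

```python
# Precision/recall sketch on hypothetical sentiment labels.
# "Positive" here means "flagged as negative sentiment" by the system.

def precision_recall(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

# A blunt, high-precision classifier: it flags only the two most
# strident examples, missing the quieter true positives entirely.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]   # actually negative sentiment
y_pred = [1, 1, 0, 0, 0, 0, 0, 0]   # flags only the loudest cases

p, r = precision_recall(y_true, y_pred)
print(p, r)  # 1.0 0.5 -> perfect precision, poor recall
```

If the "quiet" true positives belong disproportionately to one group, optimizing for precision silently under-serves that group.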
34. Inherent Bias in Word Embeddings
Bolukbasi, Chang, Zou, Saligrama, Kalai, 2016
Man : King :: Woman : Queen
Man : Computer Programmer :: Woman : Homemaker
Black Male : Assaulted :: White Male : Entitled To
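The analogies above come from vector arithmetic over word embeddings. A toy sketch of that mechanism, using hand-made 3-D vectors rather than real embeddings, looks like this:

```python
# Toy illustration of the analogy arithmetic behind the Bolukbasi et al.
# findings: vec("king") - vec("man") + vec("woman") lands near vec("queen").
# These 3-D vectors are hand-made stand-ins, not real word embeddings.

import math

vocab = {
    "man":   [1.0, 0.0, 0.2],
    "woman": [-1.0, 0.0, 0.2],
    "king":  [1.0, 1.0, 0.8],
    "queen": [-1.0, 1.0, 0.8],
    "apple": [0.0, -1.0, 0.0],
}

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def analogy(a, b, c):
    """Vocab word closest to vec(b) - vec(a) + vec(c), excluding inputs."""
    target = [vb - va + vc for va, vb, vc in
              zip(vocab[a], vocab[b], vocab[c])]
    candidates = {w: v for w, v in vocab.items() if w not in (a, b, c)}
    return max(candidates, key=lambda w: cosine(candidates[w], target))

print(analogy("man", "king", "woman"))  # -> queen
```

Real embeddings trained on web text answer the same arithmetic with "homemaker" and "entitled to": the bias lives in the geometry the corpus induces, not in the arithmetic.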
57. Fair representations: treat similar individuals similarly
http://proceedings.mlr.press/v28/zemel13.html
“We formulate fairness as an optimization
problem of finding an intermediate representation
of the data that best encodes the data (i.e., preserving
as much information about the individual’s attributes
as possible), while simultaneously obfuscates aspects of
it, removing any information about membership with
respect to the protected subgroup.”
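Zemel et al. learn that intermediate representation via an optimization over prototypes; as a much simpler stand-in for the "obfuscate group membership" idea, the sketch below removes the direction separating two groups' feature means. This is an illustration of the goal, not the paper's actual algorithm, and the data is hypothetical.

```python
# Simplified fair-representation sketch: project out the direction along
# which two groups' feature means differ, so that direction no longer
# reveals group membership. NOT the Zemel et al. algorithm, just the idea.

def mean(rows):
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def project_out(x, d):
    """Remove the component of x along direction d."""
    dd = sum(di * di for di in d)
    coeff = sum(xi * di for xi, di in zip(x, d)) / dd
    return [xi - coeff * di for xi, di in zip(x, d)]

group_a = [[2.0, 1.0], [3.0, 1.2]]   # hypothetical features, group A
group_b = [[0.0, 1.1], [1.0, 0.9]]   # hypothetical features, group B

# Direction along which the two groups differ on average.
ma, mb = mean(group_a), mean(group_b)
d = [a - b for a, b in zip(ma, mb)]

# After projection the group means coincide: membership is obfuscated
# along d, while the remaining within-group variation is preserved.
fair_a = [project_out(x, d) for x in group_a]
fair_b = [project_out(x, d) for x in group_b]
print(mean(fair_a), mean(fair_b))
```

The paper's formulation trades this obfuscation off against preserving individual attribute information; the projection here makes the trade-off visible in two dimensions.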
60. LOVE PEOPLE
Find opportunities to maximize mutual lifetime value
Respect the principles of contextual integrity
Protect individual and corporate data using differential privacy
Consider the goals of the people affected by the systems we build
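The differential-privacy point above can be grounded with the standard Laplace mechanism: add noise calibrated to a query's sensitivity so that any one individual's presence barely shifts the published answer. This is a textbook sketch with hypothetical parameters, not a production implementation.

```python
# Differential-privacy sketch: the Laplace mechanism adds calibrated
# noise to a count query. A count has sensitivity 1 (one person changes
# it by at most 1), so noise with scale 1/epsilon gives epsilon-DP.

import math
import random

def laplace_noise(scale):
    """Sample Laplace(0, scale) via inverse transform sampling."""
    u = random.random() - 0.5
    sign = 1 if u >= 0 else -1
    return -scale * sign * math.log(1 - 2 * abs(u))

def private_count(records, predicate, epsilon):
    """Count matching records with epsilon-differential privacy."""
    true_count = sum(1 for r in records if predicate(r))
    return true_count + laplace_noise(1.0 / epsilon)

# Hypothetical data: publish roughly how many people are 40 or older,
# without letting the output pin down any one individual.
ages = [23, 35, 41, 29, 52, 60, 18]
noisy = private_count(ages, lambda a: a >= 40, epsilon=0.5)
print(round(noisy, 2))  # close to the true count of 3, plus noise
```

Smaller epsilon means more noise and stronger protection; the choice of epsilon is a policy decision, which is why it belongs on a principles slide and not only in the code.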