SlideShare une entreprise Scribd logo
1  sur  10
Télécharger pour lire hors ligne
FairTest:  
Discovering  unwarranted  associa4ons  
in  data-­‐driven  applica4ons
Florian	
  Tramèr1,	
  	
  	
  Vaggelis	
  Atlidakis2,	
  	
  	
  Roxana	
  Geambasu2,	
  	
  	
  Daniel	
  Hsu2,	
  
Jean-­‐Pierre	
  Hubaux1,	
  	
  	
  Mathias	
  Humbert3,	
  	
  	
  Ari	
  Juels4,	
  	
  	
  Huang	
  Lin1	
  
	
  
1École	
  Polytechnique	
  Fédérale	
  de	
  Lausanne,	
  	
  	
  2Columbia	
  University,	
  	
  	
  	
  
3Saarland	
  University,	
  	
  	
  4Cornell	
  Tech	
  
	
  
1	
  
“Unfair”  associa4ons  +  consequences
 2	
  
“Unfair”  associa4ons  +  consequences
 3	
  
These	
  are	
  so#ware	
  bugs:	
  need	
  to	
  ac#vely	
  test	
  for	
  them	
  
and	
  fix	
  them	
  (i.e.,	
  debug)	
  in	
  data-­‐driven	
  applicaQons…	
  
just	
  as	
  with	
  func#onality,	
  performance,	
  and	
  reliability	
  bugs.	
  
Limits  of  preventa4ve  measures
What	
  doesn’t	
  work:	
  
•  Hide	
  protected	
  aSributes	
  from	
  data-­‐driven	
  applicaQon.	
  
•  Aim	
  for	
  staQsQcal	
  parity	
  w.r.t.	
  protected	
  classes	
  and	
  service	
  output.	
  
Foremost	
  challenge	
  is	
  to	
  even	
  detect	
  these	
  unwarranted	
  
associaQons.	
  
4	
  
FairTest:  a  tes4ng  suite  for  data-­‐driven  apps
race,	
  gender,	
  …	
  
zip	
  code,	
  job,	
  …	
  
qualificaQons,	
  …	
  
Protected	
  vars.	
  
Context	
  vars.	
   FairTest	
  
AssociaQon	
  bug	
  
report	
  for	
  developer	
  
Explanatory	
  vars.	
  
Data-­‐driven	
  
applicaQon	
  
User	
  inputs	
   ApplicaQon	
  outputs	
  
•  Finds	
  context-­‐specific	
  associaQons	
  between	
  protected	
  variables	
  and	
  
applicaQon	
  outputs	
  
•  Bug	
  report	
  ranks	
  findings	
  by	
  assoc.	
  strength	
  and	
  affected	
  pop.	
  size	
  
	
   locaQon,	
  click,	
  …	
   prices,	
  tags,	
  …	
  
5	
  
A  data-­‐driven  approach
Core	
  of	
  FairTest	
  is	
  based	
  on	
  staQsQcal	
  machine	
  learning	
  
FairTest	
  
Find	
  context-­‐specific	
  associaQons	
  
StaQsQcally	
  validate	
  associaQons	
  
Sta4s4cal	
  machine	
  learning	
  internals:	
  
•  top-­‐down	
  spaQal	
  parQQoning	
  algorithm	
  
•  confidence	
  intervals	
  for	
  assoc.	
  metrics	
  
•  …	
  
Training	
  data	
  
Test	
  data	
  
Ideally	
  sampled	
  from	
  
relevant	
  user	
  populaQon	
  
This is a confidential draft. Please do not redistribute. 7 CONCLUSION
Report of associations of O=Labels on Si=Race:
Global Population of size 1,324
* Labels associated with Race=Black:
Label Black White DIFF p-value
Cart 4% 0% [0.0137,0.0652] 3.31e-05
Drum 4% 0% [0.0095,0.0604] 3.83e-04
Helmet 8% 3% [0.0096,0.0888] 2.34e-03
Cattle 2% 0% [0.0037,0.0432] 4.73e-03
* Labels associated with Race=White:
Label Black White DIFF p-value
Face Powder 1% 10% [-0.1339,-0.0525] 5.60e-12
Maillot 4% 15% [-0.1590,-0.0575] 3.46e-10
Person 96% 99% [-0.0563,-0.0042] 6.06e-03
Lipstick 1% 4% [-0.0622,-0.0034] 1.03e-02
Fig. 9: Racial Label Associations in the Image Tagger. Shows par-
tial report of a Discovery (top k=35); the four most strongly associated
labels (for the binary difference metric DIFF) are shown for each race.
Fig.9 shows part of FairTest’s report. It lists the labels
most disparately applied to images of black people (first
table) and white people (second table); we show only
4 (of 35) labels per race. A developer could inspect
all top k labels and judge which ones deserve further
scrutiny. In Fig.9, the ‘cattle’ label might draw attention
due to its potentially negative connotation; upon inspec-
tion, we find that none of the tagged images depict farm
animals. Moreover, black people receive the ‘person’
tag less often, thus the model seems less accurate at de-
tecting them. Further work is needed to understand these
Report of associations of O=Income on Si=Race:
Global Population of size 24,421
p-value = 1.39e-53 ; NMI = [0.0063, 0.0139]
Income Asian Black ... White Total
<=50K 556(73%) 2061(88%) 15647(75%) 18640 (76%)
>50K 206(27%) 287(12%) 5238(25%) 5781 (24%)
Total 762 (3%) 2348(10%) ... 20885(86%) 24421(100%)
1. Subpopulation of size 341
Context = Age <= 42, Hours <= 55, Job: Fed-gov
p-value = 3.24e-03 ; NMI = [0.0085, 0.1310]
Income Asian Black ... White Total
<=50K 10(71%) 62(91%) 153(63%) 239 (70%)
>50K 4(29%) 6 (9%) 91(37%) 102 (30%)
Total 14 (4%) 68(20%) ... 244(72%) 341(100%)
2. Subpopulation of size 14,477
Context = Age <= 42, Hours <= 55
p-value = 7.50e-31 ; NMI = [0.0070, 0.0187]
Income Asian Black ... White Total
<=50K 362(79%) 1408(93%) 10113(83%) 12157 (84%)
>50K 97(21%) 101 (7%) 2098(17%) 2320 (16%)
Total 459 (3%) 1509(10%) ... 12211(84%) 14477(100%)
Report of associations of O=Income on Si=Gender:
Global Population of size 24,421
p-value = 1.44e-178 ; NMI = [0.0381, 0.0540]
Income Female Male Total
<=50K 7218(89%) 11422(70%) 18640 (76%)
>50K 876(11%) 4905(30%) 5781 (24%)
Total 8094(33%) 16327(67%) 24421(100%)
1. Subpopulation of size 1,371
Context = 9 <= Education <= 11, Age >= 47
p-value = 2.23e-35 ; NMI = [0.0529, 0.1442]
Income Female Male Total
<=50K 423(88%) 497(56%) 920 (67%)
Data	
  
6	
  
Example:  healthcare  applica4on
Predictor	
  of	
  whether	
  pa4ent	
  will	
  visit	
  hospital	
  again	
  in	
  next	
  year	
  
(from	
  winner	
  of	
  2012	
  Heritage	
  Health	
  Prize	
  CompeQQon)	
  
⇒	
  FairTest’s	
  finding:	
  strong	
  associaQons	
  between	
  age	
  and	
  predicQon	
  
error	
  rate	
  
AssociaQon	
  may	
  translate	
  to	
  quanQfiable	
  harms	
  
(e.g.,	
  if	
  app	
  is	
  used	
  to	
  adjust	
  insurance	
  premiums)!	
  
Hospital	
  
re-­‐admission	
  
predictor	
  
age,	
  gender,	
  
#	
  emergencies,	
  …	
  
Will	
  paQent	
  be	
  
re-­‐admiSed?	
  
This is a confidential draft. Please do no
Report of associations of O=Abs. Error
Global Population of size 28,930
p-value = 3.30e-179 ; CORR = [0.2057, 0
1. Subpopulation of size 6,252
Context = Urgent Care Treatments >= 1
p-value = 1.85e-141 ; CORR = [0.2724, 0
7	
  
Debugging  with  FairTest
Are	
  there	
  confounding	
  factors?	
  	
  
	
  Do	
  associaQons	
  disappear	
  aher	
  condiQoning?	
  
	
  ⇒	
  AdapQve	
  Data	
  Analysis!	
  
	
  
Example:	
  the	
  healthcare	
  applicaQon	
  (again)	
  
•  EsQmate	
  predicQon	
  target	
  variance	
  
•  Does	
  this	
  explain	
  the	
  predictor’s	
  behavior?	
  
•  Yes,	
  parQally	
  
	
  
	
  
	
  
FairTest	
  helps	
  developers	
  understand	
  &	
  evaluate	
  
potenQal	
  associaQon	
  bugs.	
  
of size 6,252
Care Treatments >= 1
141 ; CORR = [0.2724, 0.3492]
e for Health Predictions. Shows the global
population with highest effect size (correlation).
correlation between age and prediction error,
1 + number of visits). For each age-decade, we
ots (box from the 1st to 3rd quantile with a line
skers at 1.5 interquantile-ranges). The straight
* Low Confidence: Population of size 14,48
p-value = 2.27e-128 ; CORR = [0.1722, 0.
* High Confidence: Population of size 14,4
p-value = 2.44e-13 ; CORR = [0.0377, 0.0
Fig. 8: Error Profile for Health Predictions usin
confidence as an explanatory attribute. Shows correla
prediction error and user age, broken down by prediction
High	
  confidence	
  in	
  predic#on	
  
8	
  
Closing  remarks
•  Other	
  applica4ons	
  studied	
  using	
  FairTest	
  (hSp://arxiv.org/abs/1510.02377):	
  
•  Image	
  tagger	
  based	
  on	
  deep	
  learning	
  (on	
  ImageNet	
  data)	
  
•  Simple	
  movie	
  recommender	
  system	
  (on	
  MovieLens	
  data)	
  
•  SimulaQon	
  of	
  Staple’s	
  pricing	
  system	
  
•  Other	
  features	
  in	
  FairTest:	
  
•  Exploratory	
  studies	
  (e.g.,	
  find	
  image	
  tags	
  with	
  offensive	
  associaQons)	
  
•  AdapQve	
  data	
  analysis	
  (preliminary)	
  –	
  i.e.,	
  staQsQcal	
  validity	
  with	
  data	
  re-­‐use	
  
•  IntegraQon	
  with	
  SciPy	
  library	
  
Developers	
  need	
  beDer	
  sta4s4cal	
  training	
  and	
  tools	
  
to	
  make	
  beDer	
  sta4s4cal	
  decisions	
  and	
  applica4ons.	
  
9	
  
Example:  Berkeley  graduate  admissions
Admission	
  into	
  UC	
  Berkeley	
  graduate	
  programs	
  
(Bickel,	
  Hammel,	
  and	
  O’Connell,	
  1975)	
  
Bickel	
  et	
  al’s	
  (and	
  also	
  FairTest’s)	
  findings:	
  gender	
  bias	
  in	
  admissions	
  
at	
  university	
  level,	
  but	
  mostly	
  gone	
  aher	
  condiQoning	
  on	
  department	
  
FairTest	
  helps	
  developers	
  understand	
  &	
  evaluate	
  
potenQal	
  associaQon	
  bugs.	
  
Graduate	
  
admissions	
  
commiSees	
  
age,	
  gender,	
  GPA,	
  …	
   Admit	
  applicant?	
  
10	
  

Contenu connexe

En vedette

Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016MLconf
 
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016MLconf
 
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016Tom Peters, Software Engineer, Ufora at MLconf ATL 2016
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016MLconf
 
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...MLconf
 
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016MLconf
 
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...MLconf
 
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16MLconf
 
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016MLconf
 
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016MLconf
 
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...MLconf
 
Jason Baldridge, Associate Professor of Computational Linguistics, University...
Jason Baldridge, Associate Professor of Computational Linguistics, University...Jason Baldridge, Associate Professor of Computational Linguistics, University...
Jason Baldridge, Associate Professor of Computational Linguistics, University...MLconf
 
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16MLconf
 
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016MLconf
 
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...MLconf
 
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16MLconf
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016MLconf
 
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016MLconf
 
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16MLconf
 
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16MLconf
 
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016MLconf
 

En vedette (20)

Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
 
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016
Ryan Curtin, Principal Research Scientist, Symantec at MLconf ATL 2016
 
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016Tom Peters, Software Engineer, Ufora at MLconf ATL 2016
Tom Peters, Software Engineer, Ufora at MLconf ATL 2016
 
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
Le Song, Assistant Professor, College of Computing, Georgia Institute of Tech...
 
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
 
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund Univ...
 
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
Jake Mannix, Lead Data Engineer, Lucidworks at MLconf SEA - 5/20/16
 
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016
Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf ATL 2016
 
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016
Teresa Larsen, Founder & Director, ScientificLiteracy.org at MLconf ATL 2016
 
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...
Beverly Wright, Executive Director, Business Analytics Center, Georgia Instit...
 
Jason Baldridge, Associate Professor of Computational Linguistics, University...
Jason Baldridge, Associate Professor of Computational Linguistics, University...Jason Baldridge, Associate Professor of Computational Linguistics, University...
Jason Baldridge, Associate Professor of Computational Linguistics, University...
 
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16
Ted Willke, Sr Principal Engineer, Intel at MLconf SEA - 5/20/16
 
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016
Tanvi Motwani, Lead Data Scientist, Guided Search at A9.com at MLconf ATL 2016
 
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...
Funda Gunes, Senior Research Statistician Developer & Patrick Koch, Principal...
 
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16
Igor Markov, Software Engineer, Google at MLconf SEA - 5/20/16
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
 
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016
Michael Galvin, Sr. Data Scientist, Metis at MLconf ATL 2016
 
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16
Dr. Erin LeDell, Machine Learning Scientist, H2O.ai at MLconf SEA - 5/20/16
 
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16
Ewa Dominowska, Engineering Manager, Facebook at MLconf SEA - 5/20/16
 
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
Brian Lucena, Senior Data Scientist, Metis at MLconf SF 2016
 

Similaire à Florian Tramèr, Researcher, EPFL at MLconf SEA - 5/20/16

COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...
COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...
COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...Amit Tyagi
 
Predictive model for falls poster v3
Predictive model for falls poster v3Predictive model for falls poster v3
Predictive model for falls poster v3Marmi Le
 
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...resyst
 
statistic project on Hero motocorp
statistic project on Hero motocorpstatistic project on Hero motocorp
statistic project on Hero motocorpYug Bokadia
 
IRJET- Diagnosis of Breast Cancer using Decision Tree Models and SVM
IRJET-  	  Diagnosis of Breast Cancer using Decision Tree Models and SVMIRJET-  	  Diagnosis of Breast Cancer using Decision Tree Models and SVM
IRJET- Diagnosis of Breast Cancer using Decision Tree Models and SVMIRJET Journal
 
Data Analysis and Interpretation final.docx
Data Analysis and Interpretation final.docxData Analysis and Interpretation final.docx
Data Analysis and Interpretation final.docxAzraAhmed10
 
19 9742 the application paper id 0016(edit ty)
19 9742 the application paper id 0016(edit ty)19 9742 the application paper id 0016(edit ty)
19 9742 the application paper id 0016(edit ty)IAESIJEECS
 
Measuresof spread
Measuresof spreadMeasuresof spread
Measuresof spreaddeepneuron
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Luis Fernandez Luque
 
Deep learning platform
Deep learning platformDeep learning platform
Deep learning platformBhagvanK1
 
Final Year IEEE Project 2013-2014 - Digital Image Processing Project Title a...
Final Year IEEE Project 2013-2014  - Digital Image Processing Project Title a...Final Year IEEE Project 2013-2014  - Digital Image Processing Project Title a...
Final Year IEEE Project 2013-2014 - Digital Image Processing Project Title a...elysiumtechnologies
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
Engineering Statistics
Engineering Statistics Engineering Statistics
Engineering Statistics Bahzad5
 

Similaire à Florian Tramèr, Researcher, EPFL at MLconf SEA - 5/20/16 (20)

APO The Philippines Health System Review (Health in Transition)
APO The Philippines Health System Review (Health in Transition)APO The Philippines Health System Review (Health in Transition)
APO The Philippines Health System Review (Health in Transition)
 
COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...
COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...
COMPARE THE LEVEL OF SECURITY RISK BETWEEN IT USER/EMPLOYEE & NON-IT USER/EMP...
 
DescriptiveStatistics.pdf
DescriptiveStatistics.pdfDescriptiveStatistics.pdf
DescriptiveStatistics.pdf
 
Predictive model for falls poster v3
Predictive model for falls poster v3Predictive model for falls poster v3
Predictive model for falls poster v3
 
AJMS_395_22.pdf
AJMS_395_22.pdfAJMS_395_22.pdf
AJMS_395_22.pdf
 
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...
Pay-for-Performance and Distributional Effects in Tanzania: A Supply-side Ass...
 
statistic project on Hero motocorp
statistic project on Hero motocorpstatistic project on Hero motocorp
statistic project on Hero motocorp
 
IRJET- Diagnosis of Breast Cancer using Decision Tree Models and SVM
IRJET-  	  Diagnosis of Breast Cancer using Decision Tree Models and SVMIRJET-  	  Diagnosis of Breast Cancer using Decision Tree Models and SVM
IRJET- Diagnosis of Breast Cancer using Decision Tree Models and SVM
 
Data Analysis and Interpretation final.docx
Data Analysis and Interpretation final.docxData Analysis and Interpretation final.docx
Data Analysis and Interpretation final.docx
 
19 9742 the application paper id 0016(edit ty)
19 9742 the application paper id 0016(edit ty)19 9742 the application paper id 0016(edit ty)
19 9742 the application paper id 0016(edit ty)
 
Measuresof spread
Measuresof spreadMeasuresof spread
Measuresof spread
 
Gender Earnings Differentials Across Earnings Quantiles: Evidence from the li...
Gender Earnings Differentials Across Earnings Quantiles: Evidence from the li...Gender Earnings Differentials Across Earnings Quantiles: Evidence from the li...
Gender Earnings Differentials Across Earnings Quantiles: Evidence from the li...
 
Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums. Online Diabetes: Inferring Community Structure in Healthcare Forums.
Online Diabetes: Inferring Community Structure in Healthcare Forums.
 
Deep learning platform
Deep learning platformDeep learning platform
Deep learning platform
 
Implementation of ICF
Implementation of ICF Implementation of ICF
Implementation of ICF
 
Final Year IEEE Project 2013-2014 - Digital Image Processing Project Title a...
Final Year IEEE Project 2013-2014  - Digital Image Processing Project Title a...Final Year IEEE Project 2013-2014  - Digital Image Processing Project Title a...
Final Year IEEE Project 2013-2014 - Digital Image Processing Project Title a...
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Engineering Statistics
Engineering Statistics Engineering Statistics
Engineering Statistics
 
50120140507010
5012014050701050120140507010
50120140507010
 
50120140507010
5012014050701050120140507010
50120140507010
 

Plus de MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

Plus de MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Dernier

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

Florian Tramèr, Researcher, EPFL at MLconf SEA - 5/20/16

  • 1. FairTest:   Discovering  unwarranted  associa4ons   in  data-­‐driven  applica4ons Florian  Tramèr1,      Vaggelis  Atlidakis2,      Roxana  Geambasu2,      Daniel  Hsu2,   Jean-­‐Pierre  Hubaux1,      Mathias  Humbert3,      Ari  Juels4,      Huang  Lin1     1École  Polytechnique  Fédérale  de  Lausanne,      2Columbia  University,         3Saarland  University,      4Cornell  Tech     1  
  • 2. “Unfair”  associa4ons  +  consequences 2  
  • 3. “Unfair”  associa4ons  +  consequences 3   These  are  so#ware  bugs:  need  to  ac#vely  test  for  them   and  fix  them  (i.e.,  debug)  in  data-­‐driven  applicaQons…   just  as  with  func#onality,  performance,  and  reliability  bugs.  
  • 4. Limits  of  preventa4ve  measures What  doesn’t  work:   •  Hide  protected  aSributes  from  data-­‐driven  applicaQon.   •  Aim  for  staQsQcal  parity  w.r.t.  protected  classes  and  service  output.   Foremost  challenge  is  to  even  detect  these  unwarranted   associaQons.   4  
  • 5. FairTest:  a  tes4ng  suite  for  data-­‐driven  apps race,  gender,  …   zip  code,  job,  …   qualificaQons,  …   Protected  vars.   Context  vars.   FairTest   AssociaQon  bug   report  for  developer   Explanatory  vars.   Data-­‐driven   applicaQon   User  inputs   ApplicaQon  outputs   •  Finds  context-­‐specific  associaQons  between  protected  variables  and   applicaQon  outputs   •  Bug  report  ranks  findings  by  assoc.  strength  and  affected  pop.  size     locaQon,  click,  …   prices,  tags,  …   5  
  • 6. A  data-­‐driven  approach Core  of  FairTest  is  based  on  staQsQcal  machine  learning   FairTest   Find  context-­‐specific  associaQons   StaQsQcally  validate  associaQons   Sta4s4cal  machine  learning  internals:   •  top-­‐down  spaQal  parQQoning  algorithm   •  confidence  intervals  for  assoc.  metrics   •  …   Training  data   Test  data   Ideally  sampled  from   relevant  user  populaQon   This is a confidential draft. Please do not redistribute. 7 CONCLUSION Report of associations of O=Labels on Si=Race: Global Population of size 1,324 * Labels associated with Race=Black: Label Black White DIFF p-value Cart 4% 0% [0.0137,0.0652] 3.31e-05 Drum 4% 0% [0.0095,0.0604] 3.83e-04 Helmet 8% 3% [0.0096,0.0888] 2.34e-03 Cattle 2% 0% [0.0037,0.0432] 4.73e-03 * Labels associated with Race=White: Label Black White DIFF p-value Face Powder 1% 10% [-0.1339,-0.0525] 5.60e-12 Maillot 4% 15% [-0.1590,-0.0575] 3.46e-10 Person 96% 99% [-0.0563,-0.0042] 6.06e-03 Lipstick 1% 4% [-0.0622,-0.0034] 1.03e-02 Fig. 9: Racial Label Associations in the Image Tagger. Shows par- tial report of a Discovery (top k=35); the four most strongly associated labels (for the binary difference metric DIFF) are shown for each race. Fig.9 shows part of FairTest’s report. It lists the labels most disparately applied to images of black people (first table) and white people (second table); we show only 4 (of 35) labels per race. A developer could inspect all top k labels and judge which ones deserve further scrutiny. In Fig.9, the ‘cattle’ label might draw attention due to its potentially negative connotation; upon inspec- tion, we find that none of the tagged images depict farm animals. Moreover, black people receive the ‘person’ tag less often, thus the model seems less accurate at de- tecting them. Further work is needed to understand these Report of associations of O=Income on Si=Race: Global Population of size 24,421 p-value = 1.39e-53 ; NMI = [0.0063, 0.0139] Income Asian Black ... White Total <=50K 556(73%) 2061(88%) 15647(75%) 18640 (76%) >50K 206(27%) 287(12%) 5238(25%) 5781 (24%) Total 762 (3%) 2348(10%) ... 20885(86%) 24421(100%) 1. Subpopulation of size 341 Context = Age <= 42, Hours <= 55, Job: Fed-gov p-value = 3.24e-03 ; NMI = [0.0085, 0.1310] Income Asian Black ... White Total <=50K 10(71%) 62(91%) 153(63%) 239 (70%) >50K 4(29%) 6 (9%) 91(37%) 102 (30%) Total 14 (4%) 68(20%) ... 244(72%) 341(100%) 2. Subpopulation of size 14,477 Context = Age <= 42, Hours <= 55 p-value = 7.50e-31 ; NMI = [0.0070, 0.0187] Income Asian Black ... White Total <=50K 362(79%) 1408(93%) 10113(83%) 12157 (84%) >50K 97(21%) 101 (7%) 2098(17%) 2320 (16%) Total 459 (3%) 1509(10%) ... 12211(84%) 14477(100%) Report of associations of O=Income on Si=Gender: Global Population of size 24,421 p-value = 1.44e-178 ; NMI = [0.0381, 0.0540] Income Female Male Total <=50K 7218(89%) 11422(70%) 18640 (76%) >50K 876(11%) 4905(30%) 5781 (24%) Total 8094(33%) 16327(67%) 24421(100%) 1. Subpopulation of size 1,371 Context = 9 <= Education <= 11, Age >= 47 p-value = 2.23e-35 ; NMI = [0.0529, 0.1442] Income Female Male Total <=50K 423(88%) 497(56%) 920 (67%) Data   6  
  • 7. Example:  healthcare  applica4on Predictor  of  whether  pa4ent  will  visit  hospital  again  in  next  year   (from  winner  of  2012  Heritage  Health  Prize  CompeQQon)   ⇒  FairTest’s  finding:  strong  associaQons  between  age  and  predicQon   error  rate   AssociaQon  may  translate  to  quanQfiable  harms   (e.g.,  if  app  is  used  to  adjust  insurance  premiums)!   Hospital   re-­‐admission   predictor   age,  gender,   #  emergencies,  …   Will  paQent  be   re-­‐admiSed?   This is a confidential draft. Please do no Report of associations of O=Abs. Error Global Population of size 28,930 p-value = 3.30e-179 ; CORR = [0.2057, 0 1. Subpopulation of size 6,252 Context = Urgent Care Treatments >= 1 p-value = 1.85e-141 ; CORR = [0.2724, 0 7  
  • 8. Debugging  with  FairTest Are  there  confounding  factors?      Do  associaQons  disappear  aher  condiQoning?    ⇒  AdapQve  Data  Analysis!     Example:  the  healthcare  applicaQon  (again)   •  EsQmate  predicQon  target  variance   •  Does  this  explain  the  predictor’s  behavior?   •  Yes,  parQally         FairTest  helps  developers  understand  &  evaluate   potenQal  associaQon  bugs.   of size 6,252 Care Treatments >= 1 141 ; CORR = [0.2724, 0.3492] e for Health Predictions. Shows the global population with highest effect size (correlation). correlation between age and prediction error, 1 + number of visits). For each age-decade, we ots (box from the 1st to 3rd quantile with a line skers at 1.5 interquantile-ranges). The straight * Low Confidence: Population of size 14,48 p-value = 2.27e-128 ; CORR = [0.1722, 0. * High Confidence: Population of size 14,4 p-value = 2.44e-13 ; CORR = [0.0377, 0.0 Fig. 8: Error Profile for Health Predictions usin confidence as an explanatory attribute. Shows correla prediction error and user age, broken down by prediction High  confidence  in  predic#on   8  
  • 9. Closing  remarks •  Other  applica4ons  studied  using  FairTest  (hSp://arxiv.org/abs/1510.02377):   •  Image  tagger  based  on  deep  learning  (on  ImageNet  data)   •  Simple  movie  recommender  system  (on  MovieLens  data)   •  SimulaQon  of  Staple’s  pricing  system   •  Other  features  in  FairTest:   •  Exploratory  studies  (e.g.,  find  image  tags  with  offensive  associaQons)   •  AdapQve  data  analysis  (preliminary)  –  i.e.,  staQsQcal  validity  with  data  re-­‐use   •  IntegraQon  with  SciPy  library   Developers  need  beDer  sta4s4cal  training  and  tools   to  make  beDer  sta4s4cal  decisions  and  applica4ons.   9  
  • 10. Example:  Berkeley  graduate  admissions Admission  into  UC  Berkeley  graduate  programs   (Bickel,  Hammel,  and  O’Connell,  1975)   Bickel  et  al’s  (and  also  FairTest’s)  findings:  gender  bias  in  admissions   at  university  level,  but  mostly  gone  aher  condiQoning  on  department   FairTest  helps  developers  understand  &  evaluate   potenQal  associaQon  bugs.   Graduate   admissions   commiSees   age,  gender,  GPA,  …   Admit  applicant?   10