SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Ancestral Causal Inference
Sara Magliacane1,2
, Tom Claassen1,3
, Joris M. Mooij1
1
University of Amsterdam; 2
VU Amsterdam 3
Radboud University Nijmegen
Current'best'choice'
CausAM
Causality@AmsterdaM
Main Contributions
• Ancestral Causal Discovery (ACI), a causal discovery method as accurate as the
state-of-the-art but much more scalable
• A method for scoring causal relations that approximates marginal probability
Causal discovery methods
• Score-based: evaluate models using a penalized likelihood score
• Constraint-based causal discovery: use statistical independences to express
constraints over possible causal models
Advantages of constraint-based w.r.t. score-based methods:
• can handle latent confounders naturally
• easy integration of background knowledge
Disadvantages of constraint-based methods:
• vulnerability to errors in statistical independence tests
• No estimation of confidence in the causal predictions
Causal inference as an optimization problem
To solve the vulnerability to errors in statistical tests Hyttinen et al. [2014] propose HEJ,
which formulates causal discovery as an optimization problem:
• Weighted list of statistical independence results: I = {(ij, wj)}:
– E.g. I = { (Y ⊥⊥ Z | X, 0.2), (Y ⊥⊥ X, 0.1)}
• For any possible causal structure C, define a loss function:
loss(C, I) :=
(ij,wj)∈I: ij is not satisfied in C
wj
• “ij is not satisfied in C” = defined by causal reasoning rules
• Causal inference = Find causal structure minimizing loss function
C∗
= arg min
C∈C
loss(C, I)
Problem: Scalability, e.g. HEJ is very slow already for 8 random variables.
Ancestral Causal Inference (ACI)
Instead of direct causal relations use a more coarse-grained representation, e.g., an
ancestral structure, i.e. the transitive closure of the observed variables of the DAG:
(reflexivity) : X X,
(transitivity) : X Y ∧ Y Z =⇒ X Z,
(antisymmetry) : X Y ∧ Y X =⇒ X = Y,
Ancestral Causal Inference (ACI)
We reformulate the causal discovery as an optimization problem in terms of ancestral
structures, which reduce drastically the search space (e.g. for 7 variables: 2.3 × 1015
→ 6 × 106
possible structures). This requires new ancestral reasoning rules:
For X, Y , W disjoint (sets of) variables:
1. (X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y
2. X ⊥⊥ Y | W ∪ [Z] =⇒ (X ⊥⊥ Z | W ) ∧ (Z {X, Y } ∪ W )
3. X ⊥⊥ Y | W ∪ [Z] =⇒ (X ⊥⊥ Z | W ) ∧ (Z {X, Y } ∪ W )
4. (X ⊥⊥ Y | W ∪ [Z]) ∧ (X ⊥⊥ Z | W ∪ U) =⇒ (X ⊥⊥ Y | W ∪ U)
5. (Z ⊥⊥ X | W ) ∧ (Z ⊥⊥ Y | W ) ∧ (X ⊥⊥ Y | W ) =⇒ X ⊥⊥ Y | W ∪ Z
Possible weighting schemes for inputs
ACI supports two types of weighted input statements: statistical independence results
and ancestral relations. We propose two simple weighting schemes:
• a frequentist approach, in which for any appropriate frequentist statistical test with
independence as null hypothesis, we define the weight:
w = | log p − log α|, where p = p-value of the test, α = significance level (e.g., 5%);
• a Bayesian approach, in which the weight of each input i using data set D is:
w = log
p(i|D)
p(¬i|D)
= log
p(D|i)
p(D|¬i)
p(i)
p(¬i)
,
where the prior probability p(i) can be used as a tuning parameter.
For X Y we test the independence of Y and IX, an indicator variable (0 for
observational samples, 1 for samples from the distribution where X is intervened upon).
A method for scoring causal predictions
• Score the confidence in a predicted statement s (e.g. X Y ) as:
C(f) = min
C∈C
loss(C, I + (¬s, ∞))
− min
C∈C
loss(C, I + (s, ∞))
• ≈ MAP approximation of the log-odds ratio of s
• Asymptotically consistent, when consistent input weights
• Can be used with any method that solves an optimization problem
Simulated data
• Generate randomly 2000 linear acyclic models of n observed variables, with latent
variables and Gaussian noise
• Per model: sample 500 data points and perform independence tests up to order c
Evaluation on Simulated data
We compare ACI, HEJ [Hyttinen et al., 2014] equipped with our scoring method, and
bootstrapped versions of FCI and CFCI.
Recall
0 0.05 0.1 0.15 0.2
Precision
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Bootstrapped (100) CFCI
Bootstrapped (100) FCI
HEJ (c=1)
ACI (c=1)
Standard CFCI
Standard FCI
Recall
0 0.005 0.01 0.015 0.02
Precision
0.6
0.65
0.7
0.75
0.8
0.85
0.9
0.95
1
Precision recall curves for ancestral (left) and nonancestral (right) relations. The middle
column is a zoom of ancestral PR curve.
• ACI is as accurate as HEJ for c = 1, outperforming bootstrapped C/FCI
0.01
0.1
1
10
100
1000
6 6.5 7 7.5 8 8.5 9
Executiontime(s)
Number of variables
HEJ
ACI
• ACI is orders of magnitude faster than HEJ
• The difference grows exponentially as the
number of variables n increases (log-scale)
• HEJ is not feasible for 8 variables
• ACI can scale up to 12 variables
Application on real data
We apply ACI to reconstruct a signalling network from flow cytometry data.
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
BCFCI (indep. <= 1)
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
Bootstrapped CFCI (in-
dependences c = 1)
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
ACI (ancestral relations)
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
ACI (ancestral rela-
tions)
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
ACI (ancestral r. + indep. <= 1)
Raf
Mek
PLCg
PIP2
PIP3
Erk
Akt
PKA
PKC
p38
JNK
ACI (ancestral relations
and indep. c = 1)
• ACI can take advantage of weighted ancestral re-
lations from experimental data
• CFCI cannot, so it predicts much less
• ACI is consistent with other methods, e.g.
[MooijHeskes2013]
Raf
Mek
Erk
Akt
JNK
PIP3
PLCg
PIP2
PKC
PKA
p38
References
Antti Hyttinen, Frederick Eberhardt, and Matti J¨arvisalo. Constraint-based causal dis-
covery: Conflict resolution with Answer Set Programming. In UAI, 2014.
ACI source code: http://github.com/caus-am/aci

Contenu connexe

En vedette

ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"Sara Magliacane
 
ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"Sara Magliacane
 
Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015Valeria Fionda
 
Formal Maps and their Algebra
Formal Maps and their AlgebraFormal Maps and their Algebra
Formal Maps and their AlgebraValeria Fionda
 
NIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learningNIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learningzukun
 
Alexis Ohanian talks about the early days of reddit at Mass Challenge
Alexis Ohanian talks about the early days of reddit at Mass ChallengeAlexis Ohanian talks about the early days of reddit at Mass Challenge
Alexis Ohanian talks about the early days of reddit at Mass ChallengeAlexis Ohanian
 
Research project ppt
Research project pptResearch project ppt
Research project pptJohn McKeown
 
HR / Talent Analytics
HR / Talent AnalyticsHR / Talent Analytics
HR / Talent AnalyticsAkshay Raje
 

En vedette (8)

ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"
 
ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"
 
Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015
 
Formal Maps and their Algebra
Formal Maps and their AlgebraFormal Maps and their Algebra
Formal Maps and their Algebra
 
NIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learningNIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learning
 
Alexis Ohanian talks about the early days of reddit at Mass Challenge
Alexis Ohanian talks about the early days of reddit at Mass ChallengeAlexis Ohanian talks about the early days of reddit at Mass Challenge
Alexis Ohanian talks about the early days of reddit at Mass Challenge
 
Research project ppt
Research project pptResearch project ppt
Research project ppt
 
HR / Talent Analytics
HR / Talent AnalyticsHR / Talent Analytics
HR / Talent Analytics
 

Similaire à Ancestral Causal Inference - NIPS 2016 poster

Data classification sammer
Data classification sammer Data classification sammer
Data classification sammer Sammer Qader
 
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...SYRTO Project
 
Supervised Machine learning Algorithm.pptx
Supervised Machine learning Algorithm.pptxSupervised Machine learning Algorithm.pptx
Supervised Machine learning Algorithm.pptxKing Khalid University
 
supervised-learning.pptx
supervised-learning.pptxsupervised-learning.pptx
supervised-learning.pptxGandhiMathy6
 
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...Arif Rahman
 
SLR Assumptions:Model Check Using SPSS
SLR Assumptions:Model Check Using SPSSSLR Assumptions:Model Check Using SPSS
SLR Assumptions:Model Check Using SPSSNermin Osman
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis TestingRyan Herzog
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarAzmi Mohd Tamil
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selectionchenhm
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVAStephen Senn
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsUniversity of Glasgow
 

Similaire à Ancestral Causal Inference - NIPS 2016 poster (20)

Data classification sammer
Data classification sammer Data classification sammer
Data classification sammer
 
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
 
Supervised Machine learning Algorithm.pptx
Supervised Machine learning Algorithm.pptxSupervised Machine learning Algorithm.pptx
Supervised Machine learning Algorithm.pptx
 
supervised-learning.pptx
supervised-learning.pptxsupervised-learning.pptx
supervised-learning.pptx
 
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...
Modul Ajar Statistika Inferensia ke-12: Uji Asumsi Klasik pada Regresi Linier...
 
9618821.ppt
9618821.ppt9618821.ppt
9618821.ppt
 
9618821.pdf
9618821.pdf9618821.pdf
9618821.pdf
 
Data Analysis Assignment Help
Data Analysis Assignment Help Data Analysis Assignment Help
Data Analysis Assignment Help
 
SLR Assumptions:Model Check Using SPSS
SLR Assumptions:Model Check Using SPSSSLR Assumptions:Model Check Using SPSS
SLR Assumptions:Model Check Using SPSS
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
MUMS: Transition & SPUQ Workshop - Some Strategies to Quantify Uncertainty fo...
 
Errors2
Errors2Errors2
Errors2
 
Factorial Experiments
Factorial ExperimentsFactorial Experiments
Factorial Experiments
 
Corrleation and regression
Corrleation and regressionCorrleation and regression
Corrleation and regression
 
Chi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemarChi-square, Yates, Fisher & McNemar
Chi-square, Yates, Fisher & McNemar
 
JISA_Paper
JISA_PaperJISA_Paper
JISA_Paper
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVA
 
Chapter6.pdf.pdf
Chapter6.pdf.pdfChapter6.pdf.pdf
Chapter6.pdf.pdf
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamics
 

Dernier

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 

Dernier (20)

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 

Ancestral Causal Inference - NIPS 2016 poster

  • 1. Ancestral Causal Inference Sara Magliacane1,2 , Tom Claassen1,3 , Joris M. Mooij1 1 University of Amsterdam; 2 VU Amsterdam 3 Radboud University Nijmegen Current'best'choice' CausAM Causality@AmsterdaM Main Contributions • Ancestral Causal Discovery (ACI), a causal discovery method as accurate as the state-of-the-art but much more scalable • A method for scoring causal relations that approximates marginal probability Causal discovery methods • Score-based: evaluate models using a penalized likelihood score • Constraint-based causal discovery: use statistical independences to express constraints over possible causal models Advantages of constraint-based w.r.t. score-based methods: • can handle latent confounders naturally • easy integration of background knowledge Disadvantages of constraint-based methods: • vulnerability to errors in statistical independence tests • No estimation of confidence in the causal predictions Causal inference as an optimization problem To solve the vulnerability to errors in statistical tests Hyttinen et al. [2014] propose HEJ, which formulates causal discovery as an optimization problem: • Weighted list of statistical independence results: I = {(ij, wj)}: – E.g. I = { (Y ⊥⊥ Z | X, 0.2), (Y ⊥⊥ X, 0.1)} • For any possible causal structure C, define a loss function: loss(C, I) := (ij,wj)∈I: ij is not satisfied in C wj • “ij is not satisfied in C” = defined by causal reasoning rules • Causal inference = Find causal structure minimizing loss function C∗ = arg min C∈C loss(C, I) Problem: Scalability, e.g. HEJ is very slow already for 8 random variables. Ancestral Causal Inference (ACI) Instead of direct causal relations use a more coarse-grained representation, e.g., an ancestral structure, i.e. the transitive closure of the observed variables of the DAG: (reflexivity) : X X, (transitivity) : X Y ∧ Y Z =⇒ X Z, (antisymmetry) : X Y ∧ Y X =⇒ X = Y, Ancestral Causal Inference (ACI) We reformulate the causal discovery as an optimization problem in terms of ancestral structures, which reduce drastically the search space (e.g. for 7 variables: 2.3 × 1015 → 6 × 106 possible structures). This requires new ancestral reasoning rules: For X, Y , W disjoint (sets of) variables: 1. (X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y 2. X ⊥⊥ Y | W ∪ [Z] =⇒ (X ⊥⊥ Z | W ) ∧ (Z {X, Y } ∪ W ) 3. X ⊥⊥ Y | W ∪ [Z] =⇒ (X ⊥⊥ Z | W ) ∧ (Z {X, Y } ∪ W ) 4. (X ⊥⊥ Y | W ∪ [Z]) ∧ (X ⊥⊥ Z | W ∪ U) =⇒ (X ⊥⊥ Y | W ∪ U) 5. (Z ⊥⊥ X | W ) ∧ (Z ⊥⊥ Y | W ) ∧ (X ⊥⊥ Y | W ) =⇒ X ⊥⊥ Y | W ∪ Z Possible weighting schemes for inputs ACI supports two types of weighted input statements: statistical independence results and ancestral relations. We propose two simple weighting schemes: • a frequentist approach, in which for any appropriate frequentist statistical test with independence as null hypothesis, we define the weight: w = | log p − log α|, where p = p-value of the test, α = significance level (e.g., 5%); • a Bayesian approach, in which the weight of each input i using data set D is: w = log p(i|D) p(¬i|D) = log p(D|i) p(D|¬i) p(i) p(¬i) , where the prior probability p(i) can be used as a tuning parameter. For X Y we test the independence of Y and IX, an indicator variable (0 for observational samples, 1 for samples from the distribution where X is intervened upon). A method for scoring causal predictions • Score the confidence in a predicted statement s (e.g. X Y ) as: C(f) = min C∈C loss(C, I + (¬s, ∞)) − min C∈C loss(C, I + (s, ∞)) • ≈ MAP approximation of the log-odds ratio of s • Asymptotically consistent, when consistent input weights • Can be used with any method that solves an optimization problem Simulated data • Generate randomly 2000 linear acyclic models of n observed variables, with latent variables and Gaussian noise • Per model: sample 500 data points and perform independence tests up to order c Evaluation on Simulated data We compare ACI, HEJ [Hyttinen et al., 2014] equipped with our scoring method, and bootstrapped versions of FCI and CFCI. Recall 0 0.05 0.1 0.15 0.2 Precision 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Bootstrapped (100) CFCI Bootstrapped (100) FCI HEJ (c=1) ACI (c=1) Standard CFCI Standard FCI Recall 0 0.005 0.01 0.015 0.02 Precision 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 Precision recall curves for ancestral (left) and nonancestral (right) relations. The middle column is a zoom of ancestral PR curve. • ACI is as accurate as HEJ for c = 1, outperforming bootstrapped C/FCI 0.01 0.1 1 10 100 1000 6 6.5 7 7.5 8 8.5 9 Executiontime(s) Number of variables HEJ ACI • ACI is orders of magnitude faster than HEJ • The difference grows exponentially as the number of variables n increases (log-scale) • HEJ is not feasible for 8 variables • ACI can scale up to 12 variables Application on real data We apply ACI to reconstruct a signalling network from flow cytometry data. Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK BCFCI (indep. <= 1) Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK Bootstrapped CFCI (in- dependences c = 1) Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK ACI (ancestral relations) Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK ACI (ancestral rela- tions) Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK ACI (ancestral r. + indep. <= 1) Raf Mek PLCg PIP2 PIP3 Erk Akt PKA PKC p38 JNK ACI (ancestral relations and indep. c = 1) • ACI can take advantage of weighted ancestral re- lations from experimental data • CFCI cannot, so it predicts much less • ACI is consistent with other methods, e.g. [MooijHeskes2013] Raf Mek Erk Akt JNK PIP3 PLCg PIP2 PKC PKA p38 References Antti Hyttinen, Frederick Eberhardt, and Matti J¨arvisalo. Constraint-based causal dis- covery: Conflict resolution with Answer Set Programming. In UAI, 2014. ACI source code: http://github.com/caus-am/aci