SlideShare a Scribd company logo
1 of 15
Download to read offline
Sampling based appr ximation of
confidence intervals for functions of
genetic covariance matrices
Karin Meyer 1
David Houle 2
1
Animal Genetics and Breeding Unit, University of New England, Armidale NSW 2351
2
Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295
AAABG 2013
Sampling standard errors | Introduction
REML sampling variances
REML estimates of covariance components
– multivariate normal distribution: ˆθθθ ∼ N (θθθ, I(θθθ)−1)
– inverse of information matrix −→ sampling errors
– large sample theory; asymptotic lower bounds
Linear functions of estimates
– sampling variances readily obtained
Non-linear functions
– obtain 1st order Taylor series expansion
– evaluate sampling variance of linear approximation
– needs partial derivatives w.r.t. all variables
−→ can be complicated / tedious
−→ options for evaluating in REML software limited
Confidence intervals: ±zα s.e.
– misleading at boundary of parameter space?
K. M. | 2 / 12
“Delta method”
Sampling standard errors | Introduction
Alternatives
Dealing with boundary conditions
– Derive confidence intervals from profile likelihood
– Bayesian estimation
General procedure
– Sample data, repeat analysis −→ distribution over reps
– slow & laborious!
K. M. | 3 / 12
Sampling standard errors | Introduction
Alternatives
Dealing with boundary conditions
– Derive confidence intervals from profile likelihood
– Bayesian estimation
General procedure
– Sample data, repeat analysis −→ distribution over reps
– slow & laborious!
Objectives
1 Propose new scheme
– sample from (theoretical) distribution of estimates
– simple & fast
2 Examine quality of approximation of sampling errors
K. M. | 3 / 12
Sampling standard errors | Method
Sampling scheme
Large sample theory
– (RE)ML estimates have MVN distribution
– Sampling covariance ∝ inverse of information matrix
Sample from this distribution
˜θθθ ∼ N ˆθθθ, H(ˆθθθ)−1
Information matrix
– Use same parameterisation as REML analysis
→ eliminate linear approx., account for constraints
– Evaluate function(s) of interest for ˜θθθ
– Examine distribution over replicates
K. M. | 4 / 12
Sampling standard errors | Method
Sampling scheme
Large sample theory
– (RE)ML estimates have MVN distribution
– Sampling covariance ∝ inverse of information matrix
Sample from this distribution
˜θθθ ∼ N ˆθθθ, H(ˆθθθ)−1
Information matrix
– Use same parameterisation as REML analysis
→ eliminate linear approx., account for constraints
– Evaluate function(s) of interest for ˜θθθ
– Examine distribution over replicates
Mandel, M. (2013) Simulation-based confidence intervals for
functions with complicated derivatives. American Statistician
67, 76–81.
K. M. | 4 / 12
Sampling standard errors | Simulation
Does it work?
Simulate two data sets
– 4000 animals, 6 traits
– h2
= 2 × (0.2, 0.3, 0.4)
– σ2
P
= 100
– rE = 0.3
– a) rG = 0.5, b) rG = |0.7||i−j|
REML analysis
– AI algorithm
– Cholesky factor
Estimates
– ˆθθθ
– H(ˆθθθ)
Compare estimates of sampling variances
REML Based on H(ˆθθθ), “Delta” method
Empirical Re-sample data using estimates as popul.
values, repeat analysis; 10000 replicates
Approx. Sample from MVN distribution, N(ˆθθθ, H(ˆθθθ)−1
)
200000 replicates
K. M. | 5 / 12
Sampling standard errors | Results
Sampling covariances for ˆΣΣΣG - a∗
Empirical vs. REML Approximate vs. REML Approximate vs. Empirical
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●●
●●●●
●●
●●
●●●
●
●
●
●
●
●
●●
●
●●
●●●
●
●●
●
●
●●
●
●●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●●●●
●●
●●●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●●●
●
●●
●●
●
●●●
●
●
●
●
●
●●
●●●
●
●
●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●●●
REML
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●●
●●●●
●●●●●●●
●
●●
●
●
●
●●
●
●●
●●●
●
●●
●
●
●●
●
●●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●●●
●●●●●●
●
●
●●
●
●
●
●●
●
●
●
●●
●
●●
●●●
●
●●
●●
●
●●●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●●●
REML
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●●
●●●●
●●●●●●●
●
●●
●
●
●
●●
●
●●
●●●
●
●●
●
●
●●
●
●●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●●●
●●●●●●
●
●
●●
●
●
●
●●
●
●
●
●●
●
●●
●●●
●
●●
●●
●
●●●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●●●
Empirical0
5
10
15
0 5 10 15 0 5 10 15 0 5 10 15
6 traits, 21 (co)variance components, 231 sampling (co)variances
variance, ◦ covariance
∗Case a: all genetic eigenvalues > 0
K. M. | 6 / 12
Sampling standard errors | Results
Sampling covariances for ˆΣΣΣG - b†
Rank 6 Rank 5
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●●●
●●
●●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●●●●
●●
●
●●
●
●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●●
●
●●●
●
●
●
●
●
●
●
●
●
●●●
0
5
10
15
0 5 10 15 0 5 10 15
Empirical
Approximate
Approximation unreliable if model is over-parameterised
†Case b: one genetic eigenvalue ≈ 0
K. M. | 7 / 12
Sampling standard errors | Results
Delta method for ˆrij
Estimate elements of Cholesky L factor of ΣΣΣ = LL
– H(ˆθθθ)−1
gives Cov(ˆlij,ˆlmn)
– covariances between σij
Cov(ˆσij, ˆσkl) ≈
f(i,j)
t=1
f(k,m)
s=1
ˆljt
ˆlms Cov ˆlit,ˆlks +ˆljt
ˆlks Cov ˆlit,ˆlms
+ˆlit
ˆlms Cov ˆljt,ˆlks +ˆlit
ˆlks Cov ˆljt,ˆlms
For ˆrij = ˆσij/ ˆσ2
i
ˆσ2
j
Var(ˆrij) ≈ 4ˆσ4
i
ˆσ4
j
Var(ˆσij) + ˆσ2
ij
ˆσ4
j
Var(ˆσ2
i
) + ˆσ2
ij
ˆσ4
i
Var(ˆσ2
j
)
− 4ˆσij ˆσ2
i
ˆσ4
j
Cov(ˆσij, ˆσ2
i
) − 4ˆσij ˆσ4
i
ˆσ2
j
Cov(ˆσij, ˆσ2
j
)
+ 2ˆσ2
ij
ˆσ2
i
ˆσ2
j
Cov(ˆσ2
i
, ˆσ2
j
) / 4ˆσ6
i
ˆσ6
j
K. M. | 8 / 12
Sampling standard errors | Results
Approximation for ˆrij
Let ΣΣΣ = LL and θθθ = vech(L)
For many replicates
– Sample ˜θθθ ∼ N(ˆθθθ, H(ˆθθθ)−1
)
– Construct ˜L from ˜θθθ
– Calculate ˜ΣΣΣ = ˜L˜L
– Calculate correlation ˜rij = ˜σij/ ˜σ2
i
˜σ2
j
Evaluate Var(ˆrij) as emprical variance of ˜rij across
replicates
K. M. | 9 / 12
Sampling standard errors | Results
Distribution of ˆrG12 - b
Empirical
0.5 0.6 0.7 0.8 0.9 1.0
Correlation
Approximate
0.5 0.6 0.7 0.8 0.9 1.0
Correlation
REML Empirical Approxim.
ˆrG12 0.897 0.873 0.866
s.e. 0.059 0.066 0.063
K. M. | 10 / 12
Sampling standard errors | Results
Distribution of second eigenvalue
Empirical
20 30 40
Eigenvalue
Approximate
20 30 40
Eigenvalue
REML Empirical Approxim.
ˆλ2 32.93 33.25 33.84
s.e. – 3.27 3.30
K. M. | 11 / 12
Sampling standard errors | Results | Conclusions
Conclusions
Sampling from MVN distribution
– accommodates arbitrary functions
– yields good approximation of sampling variances
– easier than Delta method for complicated derivatives
– more appropriate confidence interval at boundary of
parameter space
– but:
−→ relies on large sample theory
−→ information matrix needs to be safely p.d.
−→ assumes ˆθθθ ≈ θθθ
Simple but useful addition to our toolkit
– implemented in WOMBAT
K. M. | 12 / 12
Sampling based approximation of confidence intervals for functions of genetic covariance matrices

More Related Content

What's hot

Regularization Methods to Solve
Regularization Methods to SolveRegularization Methods to Solve
Regularization Methods to Solve
Komal Goyal
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlation
nszakir
 

What's hot (20)

Regularization Methods to Solve
Regularization Methods to SolveRegularization Methods to Solve
Regularization Methods to Solve
 
SEM model examination
SEM model examinationSEM model examination
SEM model examination
 
Specific topics in optimisation
Specific topics in optimisationSpecific topics in optimisation
Specific topics in optimisation
 
Seattle.Slides.7
Seattle.Slides.7Seattle.Slides.7
Seattle.Slides.7
 
Final relation1 m_tech(cse)
Final relation1 m_tech(cse)Final relation1 m_tech(cse)
Final relation1 m_tech(cse)
 
Ridge regression
Ridge regressionRidge regression
Ridge regression
 
G03405049058
G03405049058G03405049058
G03405049058
 
SEM
SEMSEM
SEM
 
Varaiational formulation fem
Varaiational formulation fem Varaiational formulation fem
Varaiational formulation fem
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Eigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delaysEigenvalues for HIV-1 dynamic model with two delays
Eigenvalues for HIV-1 dynamic model with two delays
 
Ch6 transform and conquer
Ch6 transform and conquerCh6 transform and conquer
Ch6 transform and conquer
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlation
 
An approach to Fuzzy clustering of the iris petals by using Ac-means
An approach to Fuzzy clustering of the iris petals by using Ac-meansAn approach to Fuzzy clustering of the iris petals by using Ac-means
An approach to Fuzzy clustering of the iris petals by using Ac-means
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
 
Forth Lecture
Forth LectureForth Lecture
Forth Lecture
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
Rayleigh Ritz Method
Rayleigh Ritz MethodRayleigh Ritz Method
Rayleigh Ritz Method
 

Viewers also liked

Lyman_Reiner_Morrow_Crawford_annalsApril2015
Lyman_Reiner_Morrow_Crawford_annalsApril2015Lyman_Reiner_Morrow_Crawford_annalsApril2015
Lyman_Reiner_Morrow_Crawford_annalsApril2015
Maureen Reiner
 
çİkolatali tatlilar
çİkolatali tatlilarçİkolatali tatlilar
çİkolatali tatlilar
htcuysl13
 
công ty dịch vụ giúp việc quận 3 ở hcm
công ty dịch vụ giúp việc quận 3 ở hcmcông ty dịch vụ giúp việc quận 3 ở hcm
công ty dịch vụ giúp việc quận 3 ở hcm
tory860
 
Top 8 client service associate resume samples
Top 8 client service associate resume samplesTop 8 client service associate resume samples
Top 8 client service associate resume samples
jombinri
 
công ty dịch vụ giúp việc văn phòng sài gòn
công ty dịch vụ giúp việc văn phòng sài gòncông ty dịch vụ giúp việc văn phòng sài gòn
công ty dịch vụ giúp việc văn phòng sài gòn
cristen241
 
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
Sutra Sutra
 
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcmnơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
warren406
 
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minhở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
teodoro567
 
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcmnhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
filiberto448
 
Portfolio_FilmEditor_EfinDeLandmeter
Portfolio_FilmEditor_EfinDeLandmeterPortfolio_FilmEditor_EfinDeLandmeter
Portfolio_FilmEditor_EfinDeLandmeter
Efin de Landmeter
 

Viewers also liked (19)

Slides meyer116
Slides meyer116Slides meyer116
Slides meyer116
 
ppt on heart
ppt on heartppt on heart
ppt on heart
 
Lyman_Reiner_Morrow_Crawford_annalsApril2015
Lyman_Reiner_Morrow_Crawford_annalsApril2015Lyman_Reiner_Morrow_Crawford_annalsApril2015
Lyman_Reiner_Morrow_Crawford_annalsApril2015
 
çİkolatali tatlilar
çİkolatali tatlilarçİkolatali tatlilar
çİkolatali tatlilar
 
Networking interview
Networking interviewNetworking interview
Networking interview
 
Scout Trail
Scout TrailScout Trail
Scout Trail
 
công ty dịch vụ giúp việc quận 3 ở hcm
công ty dịch vụ giúp việc quận 3 ở hcmcông ty dịch vụ giúp việc quận 3 ở hcm
công ty dịch vụ giúp việc quận 3 ở hcm
 
Top 8 client service associate resume samples
Top 8 client service associate resume samplesTop 8 client service associate resume samples
Top 8 client service associate resume samples
 
công ty dịch vụ giúp việc văn phòng sài gòn
công ty dịch vụ giúp việc văn phòng sài gòncông ty dịch vụ giúp việc văn phòng sài gòn
công ty dịch vụ giúp việc văn phòng sài gòn
 
The pitch
The pitchThe pitch
The pitch
 
Jib Cranes
Jib CranesJib Cranes
Jib Cranes
 
2016 MCV
2016 MCV2016 MCV
2016 MCV
 
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
Undang undang-nomor-7-tahun-1992-tentang-perbankan-sebagaimana-diubah-dengan-...
 
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcmnơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
nơi nào dịch vụ giúp việc gia đình có kinh nghiệm ở hcm
 
Texual research.
Texual research.Texual research.
Texual research.
 
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minhở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
ở đâu dịch vụ giúp việc gia đình trọn gói ở hồ chí minh
 
Tidak perlu mucikari
Tidak perlu mucikariTidak perlu mucikari
Tidak perlu mucikari
 
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcmnhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
nhận làm dịch vụ giúp việc nhà tốt giá rẻ hcm
 
Portfolio_FilmEditor_EfinDeLandmeter
Portfolio_FilmEditor_EfinDeLandmeterPortfolio_FilmEditor_EfinDeLandmeter
Portfolio_FilmEditor_EfinDeLandmeter
 

Similar to Sampling based approximation of confidence intervals for functions of genetic covariance matrices

Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
chenhm
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

Maxim Kazantsev
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
Harry Potter
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
James Wong
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
Young Alista
 
Data miningmaximumlikelihood
Data miningmaximumlikelihoodData miningmaximumlikelihood
Data miningmaximumlikelihood
Fraboni Ec
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
Tony Nguyen
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
Luis Goldster
 
Intro to Feature Selection
Intro to Feature SelectionIntro to Feature Selection
Intro to Feature Selection
chenhm
 

Similar to Sampling based approximation of confidence intervals for functions of genetic covariance matrices (20)

Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
MEM and SEM in the GME framework: Modelling Perception and Satisfaction - Car...
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS

 
Summery of Robust and Effective Metric Learning Using Capped Trace Norm
Summery of  Robust and Effective Metric Learning Using Capped Trace NormSummery of  Robust and Effective Metric Learning Using Capped Trace Norm
Summery of Robust and Effective Metric Learning Using Capped Trace Norm
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Data miningmaximumlikelihood
Data miningmaximumlikelihoodData miningmaximumlikelihood
Data miningmaximumlikelihood
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Data mining maximumlikelihood
Data mining maximumlikelihoodData mining maximumlikelihood
Data mining maximumlikelihood
 
Intro to Feature Selection
Intro to Feature SelectionIntro to Feature Selection
Intro to Feature Selection
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
A walk in the black forest - during which I explain the fundamental problem o...
A walk in the black forest - during which I explain the fundamental problem o...A walk in the black forest - during which I explain the fundamental problem o...
A walk in the black forest - during which I explain the fundamental problem o...
 
Unbiased Bayes for Big Data
Unbiased Bayes for Big DataUnbiased Bayes for Big Data
Unbiased Bayes for Big Data
 
Statistics chm 235
Statistics chm 235Statistics chm 235
Statistics chm 235
 
0 introduction
0  introduction0  introduction
0 introduction
 
Dataanalysis2
Dataanalysis2Dataanalysis2
Dataanalysis2
 
Bias-Variance_relted_to_ML.pdf
Bias-Variance_relted_to_ML.pdfBias-Variance_relted_to_ML.pdf
Bias-Variance_relted_to_ML.pdf
 
ABC-Gibbs
ABC-GibbsABC-Gibbs
ABC-Gibbs
 

Recently uploaded

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 

Recently uploaded (20)

Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 

Sampling based approximation of confidence intervals for functions of genetic covariance matrices

  • 1. Sampling based appr ximation of confidence intervals for functions of genetic covariance matrices Karin Meyer 1 David Houle 2 1 Animal Genetics and Breeding Unit, University of New England, Armidale NSW 2351 2 Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295 AAABG 2013
  • 2. Sampling standard errors | Introduction REML sampling variances REML estimates of covariance components – multivariate normal distribution: ˆθθθ ∼ N (θθθ, I(θθθ)−1) – inverse of information matrix −→ sampling errors – large sample theory; asymptotic lower bounds Linear functions of estimates – sampling variances readily obtained Non-linear functions – obtain 1st order Taylor series expansion – evaluate sampling variance of linear approximation – needs partial derivatives w.r.t. all variables −→ can be complicated / tedious −→ options for evaluating in REML software limited Confidence intervals: ±zα s.e. – misleading at boundary of parameter space? K. M. | 2 / 12 “Delta method”
  • 3. Sampling standard errors | Introduction Alternatives Dealing with boundary conditions – Derive confidence intervals from profile likelihood – Bayesian estimation General procedure – Sample data, repeat analysis −→ distribution over reps – slow & laborious! K. M. | 3 / 12
  • 4. Sampling standard errors | Introduction Alternatives Dealing with boundary conditions – Derive confidence intervals from profile likelihood – Bayesian estimation General procedure – Sample data, repeat analysis −→ distribution over reps – slow & laborious! Objectives 1 Propose new scheme – sample from (theoretical) distribution of estimates – simple & fast 2 Examine quality of approximation of sampling errors K. M. | 3 / 12
  • 5. Sampling standard errors | Method Sampling scheme Large sample theory – (RE)ML estimates have MVN distribution – Sampling covariance ∝ inverse of information matrix Sample from this distribution ˜θθθ ∼ N ˆθθθ, H(ˆθθθ)−1 Information matrix – Use same parameterisation as REML analysis → eliminate linear approx., account for constraints – Evaluate function(s) of interest for ˜θθθ – Examine distribution over replicates K. M. | 4 / 12
  • 6. Sampling standard errors | Method Sampling scheme Large sample theory – (RE)ML estimates have MVN distribution – Sampling covariance ∝ inverse of information matrix Sample from this distribution ˜θθθ ∼ N ˆθθθ, H(ˆθθθ)−1 Information matrix – Use same parameterisation as REML analysis → eliminate linear approx., account for constraints – Evaluate function(s) of interest for ˜θθθ – Examine distribution over replicates Mandel, M. (2013) Simulation-based confidence intervals for functions with complicated derivatives. American Statistician 67, 76–81. K. M. | 4 / 12
  • 7. Sampling standard errors | Simulation Does it work? Simulate two data sets – 4000 animals, 6 traits – h2 = 2 × (0.2, 0.3, 0.4) – σ2 P = 100 – rE = 0.3 – a) rG = 0.5, b) rG = |0.7||i−j| REML analysis – AI algorithm – Cholesky factor Estimates – ˆθθθ – H(ˆθθθ) Compare estimates of sampling variances REML Based on H(ˆθθθ), “Delta” method Empirical Re-sample data using estimates as popul. values, repeat analysis; 10000 replicates Approx. Sample from MVN distribution, N(ˆθθθ, H(ˆθθθ)−1 ) 200000 replicates K. M. | 5 / 12
  • 8. Sampling standard errors | Results Sampling covariances for ˆΣΣΣG - a∗ Empirical vs. REML Approximate vs. REML Approximate vs. Empirical ● ● ● ●●● ● ● ● ● ● ● ● ● ●● ●● ●●●● ●● ●● ●●● ● ● ● ● ● ● ●● ● ●● ●●● ● ●● ● ● ●● ● ●● ● ●●● ● ●● ● ● ● ● ● ● ● ● ● ●●● ● ●● ● ● ● ● ● ● ● ● ●● ●● ● ●● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ●● ● ● ● ● ● ●● ●● ● ●● ●● ● ●●●●● ●● ●●● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ●● ●●● ● ●● ●● ● ●●● ● ● ● ● ● ●● ●●● ● ● ● ●● ● ●●● ● ● ● ● ● ● ● ● ● ●●● REML ● ● ● ●●● ● ●● ● ● ● ● ● ● ● ●● ●●●● ●●●●●●● ● ●● ● ● ● ●● ● ●● ●●● ● ●● ● ● ●● ● ●● ● ●●● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●●● ●● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ●● ● ● ● ● ● ●● ●● ● ●● ●● ● ●●●● ●●●●●● ● ● ●● ● ● ● ●● ● ● ● ●● ● ●● ●●● ● ●● ●● ● ●●● ● ● ● ● ● ●● ●●● ● ●● ●● ● ●●● ● ● ● ● ● ● ● ● ● ●●● REML ● ● ● ●●● ● ●● ● ● ● ● ● ● ● ●● ●●●● ●●●●●●● ● ●● ● ● ● ●● ● ●● ●●● ● ●● ● ● ●● ● ●● ● ●●● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●●● ●● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ●● ● ● ● ● ● ●● ●● ● ●● ●● ● ●●●● ●●●●●● ● ● ●● ● ● ● ●● ● ● ● ●● ● ●● ●●● ● ●● ●● ● ●●● ● ● ● ● ● ●● ●●● ● ●● ●● ● ●●● ● ● ● ● ● ● ● ● ● ●●● Empirical0 5 10 15 0 5 10 15 0 5 10 15 0 5 10 15 6 traits, 21 (co)variance components, 231 sampling (co)variances variance, ◦ covariance ∗Case a: all genetic eigenvalues > 0 K. M. | 6 / 12
  • 9. Sampling standard errors | Results Sampling covariances for ˆΣΣΣG - b† Rank 6 Rank 5 ● ● ● ●●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●●●● ●● ● ● ● ● ● ● ● ● ●● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●●● ● ● ●● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ●●● ● ●● ● ● ● ● ● ●● ● ●● ● ● ● ●●● ●● ●● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ●●●●●● ●● ● ●● ● ● ● ● ● ●● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●●●● ● ●●● ● ● ● ● ● ● ● ● ● ●●● 0 5 10 15 0 5 10 15 0 5 10 15 Empirical Approximate Approximation unreliable if model is over-parameterised †Case b: one genetic eigenvalue ≈ 0 K. M. | 7 / 12
  • 10. Sampling standard errors | Results Delta method for ˆrij Estimate elements of Cholesky L factor of ΣΣΣ = LL – H(ˆθθθ)−1 gives Cov(ˆlij,ˆlmn) – covariances between σij Cov(ˆσij, ˆσkl) ≈ f(i,j) t=1 f(k,m) s=1 ˆljt ˆlms Cov ˆlit,ˆlks +ˆljt ˆlks Cov ˆlit,ˆlms +ˆlit ˆlms Cov ˆljt,ˆlks +ˆlit ˆlks Cov ˆljt,ˆlms For ˆrij = ˆσij/ ˆσ2 i ˆσ2 j Var(ˆrij) ≈ 4ˆσ4 i ˆσ4 j Var(ˆσij) + ˆσ2 ij ˆσ4 j Var(ˆσ2 i ) + ˆσ2 ij ˆσ4 i Var(ˆσ2 j ) − 4ˆσij ˆσ2 i ˆσ4 j Cov(ˆσij, ˆσ2 i ) − 4ˆσij ˆσ4 i ˆσ2 j Cov(ˆσij, ˆσ2 j ) + 2ˆσ2 ij ˆσ2 i ˆσ2 j Cov(ˆσ2 i , ˆσ2 j ) / 4ˆσ6 i ˆσ6 j K. M. | 8 / 12
  • 11. Sampling standard errors | Results Approximation for ˆrij Let ΣΣΣ = LL and θθθ = vech(L) For many replicates – Sample ˜θθθ ∼ N(ˆθθθ, H(ˆθθθ)−1 ) – Construct ˜L from ˜θθθ – Calculate ˜ΣΣΣ = ˜L˜L – Calculate correlation ˜rij = ˜σij/ ˜σ2 i ˜σ2 j Evaluate Var(ˆrij) as emprical variance of ˜rij across replicates K. M. | 9 / 12
  • 12. Sampling standard errors | Results Distribution of ˆrG12 - b Empirical 0.5 0.6 0.7 0.8 0.9 1.0 Correlation Approximate 0.5 0.6 0.7 0.8 0.9 1.0 Correlation REML Empirical Approxim. ˆrG12 0.897 0.873 0.866 s.e. 0.059 0.066 0.063 K. M. | 10 / 12
  • 13. Sampling standard errors | Results Distribution of second eigenvalue Empirical 20 30 40 Eigenvalue Approximate 20 30 40 Eigenvalue REML Empirical Approxim. ˆλ2 32.93 33.25 33.84 s.e. – 3.27 3.30 K. M. | 11 / 12
  • 14. Sampling standard errors | Results | Conclusions Conclusions Sampling from MVN distribution – accommodates arbitrary functions – yields good approximation of sampling variances – easier than Delta method for complicated derivatives – more appropriate confidence interval at boundary of parameter space – but: −→ relies on large sample theory −→ information matrix needs to be safely p.d. −→ assumes ˆθθθ ≈ θθθ Simple but useful addition to our toolkit – implemented in WOMBAT K. M. | 12 / 12