Johnny Aqm Presentation

•

0 j'aime•266 vues

guestbeb22e

Technologie Spirituel

Eﬀect of Number of Categories and Category
Boundaries on Recovery of Latent Linear
Correlations from Optimally Weighted
Categorical Data

Johnny Lin
Advisor: Peter Bentler

November 19, 2008

Outline

Introduction
LINEALS
Forming a Hypothesis

Method
Description
Simulation
Analysis

Results
Main Eﬀects
Interactions

Introducing LINEALS
A Method of Optimal Scaling

Algorithm
An iterative process that minimizes m m 2 2 2
l=1 (ηjl − rjl ) where ηjl
j=1
is a measure of nonlinearity.
Developed by Jan de Leeuw and implemented by Patrick Mair.

Assumption
That bi-linearization is possible. No assumption of normality.

Plot of LINEALS Transformation
Criterion: Linearize both X on Y and Y on X simultaneously.

Figure: Red: X on Y , Blue: Y on X

Questions to ask

First, deﬁne good recovery as small deviation from true score.
1. Does LINEALS recover true population correlations better
than Pearson for categorical data?
2. Is the performance of LINEALS robust?
3. What factors inﬂuence good recovery?

Conditions tested

Correlation Type, True Population Correlation, Number of
Categories, and Homogeneity

Condition Parameters
{0=LINEALS, 1=Pearson}
1. Correlation Type (r)
{0.3,0.5,0.7,0.9}
2. True Population Correlation (P)
{2,3,5,7,10}
3. Number of Categories (V)
{0=Non-Homogeneous, 1=Homogeneous}
4. Homogeneity (h)

Total of 80 combinations (2x4x5x2).

Creating functions in R

For each combination (total of 80):
1. Generate 1000 sets of bivariate normal data.
2. Make “cuts” (homogeneous vs. non-homogeneous).
3. Run through LINEALS / Pearson.
4. Calculate deviation of result and true population correlation.
5. Repeat Steps 1 - 4 twenty-ﬁve times.
Result: Total of 2000 deviations (80x25).

Hierarchical Regression
Description

DV: deviation of sample correlation from true population
correlation |ρ12 | − |ˆ12 |
ρ
IVs: main eﬀect and interactions of four conditions (total of
15)
Four main eﬀects (h,r,P,V)
Six 2-way interactions (hr, hP, hV, . . . )
Four 3-way interactions (hrP, hrV, . . . )
One 4-way interaction (hrPV)

Hierarchical Regression
Model Selection

Tested full model against nested models.
Conﬁrmed with Best Subset Regression.
Optimal Adj. R 2 and Mallow’s CP found with 7-8 parameters.

(a) Adj. R 2 (b) Mallow’s CP

Final Model
SPSS Output

Coefficients(a)

Unstandardized Standardized
Model Coefficients Coefficients t Sig.

B Std. Error Beta
1 (Constant) .189 .006 31.240 .000
h -.113 .012 -.620 -9.299 .000
r .007 .002 .041 3.054 .002
V -.024 .001 -.773 -40.558 .000
P .098 .008 .241 12.655 .000
hV .013 .002 .487 7.164 .000
hP .117 .018 .435 6.392 .000
hPV -.017 .003 -.422 -6.326 .000
a Dependent Variable: difference

Diﬀerence between LINEALS and Pearson deviations is .007
controlling for other factors.

Plot of Main Eﬀects I

Figure: Main Eﬀect of Number of
Figure: Main Eﬀect of Population
Categories V
Correlation P

Plot of Main Eﬀects II

Figure: Main Eﬀect of Homogeneity h Figure: Main Eﬀect of Correlation Type r

Plot of Signiﬁcant Interactions

Note: The signiﬁcant 3-way interaction hPV is not plotted.

Figure: Population Correlation by Levels Figure: Number of Categories by Levels
of Homogeneity hP of Homogeneity hV

Interaction of Correlation Type and Number of Categories
When rV added into regression model, the main eﬀect of
Correlation Type r goes away.
Suggests that number of categories may contribute to the LINEALS vs.
Pearson diﬀerence.

Figure: Number of Categories by Correlation Type (rV, marginally sig.)

Summary

1. LINEALS performs slightly better than Pearson under
bivariate normal categorizations.
2. The non-signiﬁcant interactions with Correlation Type suggest
that LINEALS is robust.
3. Recovery of true population correlations is highly inﬂuenced by
homogeneity (i.e., the underlying equality of interval widths).

Future Studies
How does it compare against polychoric correlations?
Is the resulting matrix positive deﬁnite?

Contenu connexe

Tendances

Regression analysissayantansarkar50

Applications of regression analysis - Measurement of validity of relationshipRithish Kumar

Regression analysisAmany El-seoud

Regression Ali Raza

04 regressionFiras Husseini

Linear regression without tearsAnkit Sharma

Simple linear regression ShubhamBhardwaj195

Regression AnalysisSalim Azad

Regression: A skin-deep diveabulyomon

Regression AnalysisMuhammad Fazeel

Regression analysis algorithm Sammer Qader

Statistics-Regression analysisRabin BK

Chap5 correlationSemurt Ensem

Regression analysis.sonia gupta

Regressionsimran sakshi

Chap12 multiple regressionJudianto Nugroho

Chap11 simple regressionJudianto Nugroho

Regression analysisUniversity of Jaffna

Tendances (18)

Regression analysis

Applications of regression analysis - Measurement of validity of relationship

Regression analysis

Regression

04 regression

Linear regression without tears

Simple linear regression

Regression Analysis

Regression: A skin-deep dive

Regression Analysis

Regression analysis algorithm

Statistics-Regression analysis

Chap5 correlation

Regression analysis.

Regression

Chap12 multiple regression

Chap11 simple regression

Regression analysis

En vedette

Social Media "Playbook" Outline - From Week 2 Guest Speaker Clint SchaffSocialMediaUCLA

AQM Presentation by Johnny Lin on Jan 9, 2009guestbeb22e

BookadsSuman Girdhar

Rosa galindez presentacionRosaGalindez

Zespół pałacowo-parkowy w Dobrociniespmaldyty

Manufacturing-RoboticsKeith Bradford

http://es.slideshare.net/E.Prego/caligramaEnriquePrego

3. respuesta intimidacion de cronixAnibal Carrera

Comunicadores indigenes ley consulta previaCrónicas del despojo

Product Overview Brochure[Wais]wais31

Wings & more menu2phanelson

Aleksej Kovaliov - When the Whole World is Against YouAgile Lietuva

PresentacionManuel J. García Palomo

RGD Ontario Webinar: Strategy In Design: How To Create Meaningful & Successfu...MLD/Mel Lim Design

UCLA X469.21 - FALL '16 WEEK 5SocialMediaUCLA

Defects in timberVikul Puri

Réseau de capteurs sans fils wsnAchref Ben helel

En vedette (17)

Social Media "Playbook" Outline - From Week 2 Guest Speaker Clint Schaff

AQM Presentation by Johnny Lin on Jan 9, 2009

Bookads

Rosa galindez presentacion

Zespół pałacowo-parkowy w Dobrocinie

Manufacturing-Robotics

http://es.slideshare.net/E.Prego/caligrama

3. respuesta intimidacion de cronix

Comunicadores indigenes ley consulta previa

Product Overview Brochure[Wais]

Wings & more menu2

Aleksej Kovaliov - When the Whole World is Against You

Presentacion

RGD Ontario Webinar: Strategy In Design: How To Create Meaningful & Successfu...

UCLA X469.21 - FALL '16 WEEK 5

Defects in timber

Réseau de capteurs sans fils wsn

Similaire à Johnny Aqm Presentation

Ders 2 ols .pptErgin Akalpler

Chapter 9 Regressionghalan

Multiple Regression.pptTanyaWadhwani4

What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?Smarten Augmented Analytics

manecohuhuhuhubasicEstimation-1.pptxasdfg hjkl

Ch8 Regression Revby RaoSumit Prajapati

Mba2216 week 11 data analysis part 02Stephen Ong

Regression Long Beach City College

Lesson07_newshengvn

Unit 03 - Consolidated.pptxChristopherDevakumar1

Intro to econometricsGaetan Lion

CFA Fit Statisticsnicolalritter

RegressionSAURABH KUMAR

unit 3 regression.pptxssuser5c580e1

RegressionICFAI Business School

Multiple regressionAntoine De Henau

Simple lin regress_inferenceKemal İnciroğlu

Linear regression.pptxssuserb8a904

A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENTSavas Papadopoulos, Ph.D

Correlation and Regression pptSantosh Bhaskar

Similaire à Johnny Aqm Presentation (20)

Ders 2 ols .ppt

Chapter 9 Regression

Multiple Regression.ppt

What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?

manecohuhuhuhubasicEstimation-1.pptx

Ch8 Regression Revby Rao

Mba2216 week 11 data analysis part 02

Regression

Lesson07_new

Unit 03 - Consolidated.pptx

Intro to econometrics

CFA Fit Statistics

Regression

unit 3 regression.pptx

Regression

Multiple regression

Simple lin regress_inference

Linear regression.pptx

A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT

Correlation and Regression ppt

Dernier

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Developing An App To Navigate The Roads of BrazilV3cube

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Dernier (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

Unblocking The Main Thread Solving ANRs and Frozen Frames

Injustice - Developers Among Us (SciFiDevCon 2024)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Automating Google Workspace (GWS) & more with Apps Script

Presentation on how to chat with PDF using ChatGPT code interpreter

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Developing An App To Navigate The Roads of Brazil

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

Finology Group – Insurtech Innovation Award 2024

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

The Codex of Business Writing Software for Real-World Solutions 2.pptx

CNv6 Instructor Chapter 6 Quality of Service

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Breaking the Kubernetes Kill Chain: Host Path Mount

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Exploring the Future Potential of AI-Enabled Smartphone Processors

Johnny Aqm Presentation

1. Eﬀect of Number of Categories and Category Boundaries on Recovery of Latent Linear Correlations from Optimally Weighted Categorical Data Johnny Lin Advisor: Peter Bentler November 19, 2008

2. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

3. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

4. Introducing LINEALS A Method of Optimal Scaling Algorithm An iterative process that minimizes m m 2 2 2 l=1 (ηjl − rjl ) where ηjl j=1 is a measure of nonlinearity. Developed by Jan de Leeuw and implemented by Patrick Mair. Assumption That bi-linearization is possible. No assumption of normality.

5. Plot of LINEALS Transformation Criterion: Linearize both X on Y and Y on X simultaneously. Figure: Red: X on Y , Blue: Y on X

6. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

7. Questions to ask First, deﬁne good recovery as small deviation from true score. 1. Does LINEALS recover true population correlations better than Pearson for categorical data? 2. Is the performance of LINEALS robust? 3. What factors inﬂuence good recovery?

8. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

9. Conditions tested Correlation Type, True Population Correlation, Number of Categories, and Homogeneity Condition Parameters {0=LINEALS, 1=Pearson} 1. Correlation Type (r) {0.3,0.5,0.7,0.9} 2. True Population Correlation (P) {2,3,5,7,10} 3. Number of Categories (V) {0=Non-Homogeneous, 1=Homogeneous} 4. Homogeneity (h) Total of 80 combinations (2x4x5x2).

10. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

11. Creating functions in R For each combination (total of 80): 1. Generate 1000 sets of bivariate normal data. 2. Make “cuts” (homogeneous vs. non-homogeneous). 3. Run through LINEALS / Pearson. 4. Calculate deviation of result and true population correlation. 5. Repeat Steps 1 - 4 twenty-ﬁve times. Result: Total of 2000 deviations (80x25).

12. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

13. Hierarchical Regression Description DV: deviation of sample correlation from true population correlation |ρ12 | − |ˆ12 | ρ IVs: main eﬀect and interactions of four conditions (total of 15) Four main eﬀects (h,r,P,V) Six 2-way interactions (hr, hP, hV, . . . ) Four 3-way interactions (hrP, hrV, . . . ) One 4-way interaction (hrPV)

14. Hierarchical Regression Model Selection Tested full model against nested models. Conﬁrmed with Best Subset Regression. Optimal Adj. R 2 and Mallow’s CP found with 7-8 parameters. (a) Adj. R 2 (b) Mallow’s CP

15. Final Model SPSS Output Coefficients(a) Unstandardized Standardized Model Coefficients Coefficients t Sig. B Std. Error Beta 1 (Constant) .189 .006 31.240 .000 h -.113 .012 -.620 -9.299 .000 r .007 .002 .041 3.054 .002 V -.024 .001 -.773 -40.558 .000 P .098 .008 .241 12.655 .000 hV .013 .002 .487 7.164 .000 hP .117 .018 .435 6.392 .000 hPV -.017 .003 -.422 -6.326 .000 a Dependent Variable: difference Diﬀerence between LINEALS and Pearson deviations is .007 controlling for other factors.

16. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

17. Plot of Main Effects I Figure: Main Effect of Number of Figure: Main Effect of Population Categories V Correlation P

18. Plot of Main Effects II Figure: Main Effect of Homogeneity h Figure: Main Effect of Correlation Type r

19. Outline Introduction LINEALS Forming a Hypothesis Method Description Simulation Analysis Results Main Eﬀects Interactions

20. Plot of Signiﬁcant Interactions Note: The signiﬁcant 3-way interaction hPV is not plotted. Figure: Population Correlation by Levels Figure: Number of Categories by Levels of Homogeneity hP of Homogeneity hV

21. Interaction of Correlation Type and Number of Categories When rV added into regression model, the main eﬀect of Correlation Type r goes away. Suggests that number of categories may contribute to the LINEALS vs. Pearson diﬀerence. Figure: Number of Categories by Correlation Type (rV, marginally sig.)

22. Summary 1. LINEALS performs slightly better than Pearson under bivariate normal categorizations. 2. The non-significant interactions with Correlation Type suggest that LINEALS is robust. 3. Recovery of true population correlations is highly influenced by homogeneity (i.e., the underlying equality of interval widths). Future Studies How does it compare against polychoric correlations? Is the resulting matrix positive definite?

Johnny Aqm Presentation

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (18)

En vedette

En vedette (17)

Similaire à Johnny Aqm Presentation

Similaire à Johnny Aqm Presentation (20)

Dernier

Dernier (20)

Johnny Aqm Presentation