SlideShare une entreprise Scribd logo
1  sur  19
Correspondence Analysis with XLSTAT Reporter : Nguyen Van Chuc - BI Lab
Summarization theory of  Correspondence  Analysis (CA) ,[object Object],[object Object],[object Object],[object Object]
Two stages and three steps of each stage in CA process
Basic concepts
Basic concepts
How to run Correspondence Analysis with XLSTAT Now, we use XLSTAT Tool to describe how to run CA and explain the result base on an example step by step. In order to illustrate the interpretation of output from correspondence analysis, the following example is worked through in detail.
The following contingency table showing the frequency of usage of four brands of toothpaste in three geographic regions among a random sample of 120 users Table 1. Brand by Region Contingency table
Analyzing Data – Correspondence Analysis CA
Table 2. Row and Column profiles
1.Significance of Dependencies The first step in the interpretation of correspondence analysis is to establish whether  there is a significance dependency between rows and columns
2.Dimensionality of the solution The second step in interpretation is to determine the appropriate number of dimension to use to describe the points. This is achieved by examining eigenvalue and percentage of inertia In this example, two dimensions explain 100% of inertia since two dimensions are sufficient to explain the total inertia
2.Dimensionality of the solution In this example two dimensions are sufficient to explain the total inertia
3. Interpreting the axes The axes are interpreted by way of the contribution that each element (in this case each Brand) makes towards the total inertia accounted for by the axis. In this example there are 4 brands, thus, any distribution greater than 100/4 = 25% would represent significance greater than what would be expected  in the case of a purely random distribution of Brands over axes. In this case, brand A meets (satisfies) this criterion and determines the first axis and Brand B determines the second axis
3. Interpreting the axes Likewise, for columns, Region 3 determines the first axis and the second axis is determined by Region 2 and Region 1 (Because of the contributions > 100/3=33.3%) Note that, Brand A determines the first axis(F1) and F1 is determined by Region 3, thus it is obvious to understand that Brand A strongly associated with region 3 (see symmetric plot).
4. Graphical Representation of a contingency table Brands C and D are positioned relatively closely indicates a similarity in their regional usage profiles (60%, 75% respectively) and Brand A is positioned relatively far away from Brands C and D indicates that Brand A has a very different regional usage profile from Brands C and D Categories with similar distributions will be represented as points that are close in space, and categories that have very dissimilar distributions will be positioned far apart If a profile is very  different from the average profile (centroid) , then the  point will lie far from the origin , whereas, profile that are close to the average will be represented  by points close to the centroid. If all the categories have equal profiles then all the points will fall in the centroid.
4. Graphical Representation of a contingency table The proximity of Brand A to the Region 3 indicates that  Brand A is strongly associated with Region 3  which is clearly because profile presented in table 2 with 75 % brand A users reside in region 3. Likewise, the proximity of brand B with region 2 and Brands C and D with region 1 indicate that the higher frequency of usage  of those brands in those regions. The positions of the points in the map tell you something about similarities between the rows, similarities between the columns and the association between rows and columns
4. Graphical Representation of a contingency table In the Asymmetric row plot map, rows are plotted base on principal coordinates and columns are plotted base on standard coordinates.
4. The quality of representation The higher of total of two (or first n dimensions) the higher quality of the representation. In this example, two axes explain 100% of the inertia  (The first dimension explains 61.8% of inertia and the second dimension explains 38.2% of the inertia).
4. The quality of representation The quality of representation is easily calculated from the  correlations  or  squared correlations  given in the output. The squared correlation presented for any column measures the degree of association between that column and a particular axis. So, for instance, the squared correlation between Brand A and the first and second axes is 0.986 and 0.014 respectively. This implies that Brand A are  strongly associated with the first axis (Region 3)   but only weakly associated with the second axis (Region 1 and Region 2).

Contenu connexe

Tendances

Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlationdomsr
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysisMurali Raj
 
simple discriminant
simple discriminantsimple discriminant
simple discriminantneha singh
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regressionA M
 
Principal Component Analysis and Clustering
Principal Component Analysis and ClusteringPrincipal Component Analysis and Clustering
Principal Component Analysis and ClusteringUsha Vijay
 
Linear Regression Using SPSS
Linear Regression Using SPSSLinear Regression Using SPSS
Linear Regression Using SPSSDr Athar Khan
 
Multinomial Logistic Regression
Multinomial Logistic RegressionMultinomial Logistic Regression
Multinomial Logistic RegressionDr Athar Khan
 
Chap15 analysis of variance
Chap15 analysis of varianceChap15 analysis of variance
Chap15 analysis of varianceJudianto Nugroho
 
Factor analysis
Factor analysis Factor analysis
Factor analysis Nima
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisASAD ALI
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAiden Yeh
 
Standard deviationnormal distributionshow
Standard deviationnormal distributionshowStandard deviationnormal distributionshow
Standard deviationnormal distributionshowBiologyIB
 
Mean Squared Error (MSE) of an Estimator
Mean Squared Error (MSE) of an EstimatorMean Squared Error (MSE) of an Estimator
Mean Squared Error (MSE) of an EstimatorSuruchi Somwanshi
 

Tendances (20)

Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
 
Model selection
Model selectionModel selection
Model selection
 
MANOVA SPSS
MANOVA SPSSMANOVA SPSS
MANOVA SPSS
 
Pca ppt
Pca pptPca ppt
Pca ppt
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Principal Component Analysis
Principal Component AnalysisPrincipal Component Analysis
Principal Component Analysis
 
Discriminant analysis
Discriminant analysisDiscriminant analysis
Discriminant analysis
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
simple discriminant
simple discriminantsimple discriminant
simple discriminant
 
4.5. logistic regression
4.5. logistic regression4.5. logistic regression
4.5. logistic regression
 
Ordinal Logistic Regression
Ordinal Logistic RegressionOrdinal Logistic Regression
Ordinal Logistic Regression
 
Principal Component Analysis and Clustering
Principal Component Analysis and ClusteringPrincipal Component Analysis and Clustering
Principal Component Analysis and Clustering
 
Linear Regression Using SPSS
Linear Regression Using SPSSLinear Regression Using SPSS
Linear Regression Using SPSS
 
Multinomial Logistic Regression
Multinomial Logistic RegressionMultinomial Logistic Regression
Multinomial Logistic Regression
 
Chap15 analysis of variance
Chap15 analysis of varianceChap15 analysis of variance
Chap15 analysis of variance
 
Factor analysis
Factor analysis Factor analysis
Factor analysis
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Standard deviationnormal distributionshow
Standard deviationnormal distributionshowStandard deviationnormal distributionshow
Standard deviationnormal distributionshow
 
Mean Squared Error (MSE) of an Estimator
Mean Squared Error (MSE) of an EstimatorMean Squared Error (MSE) of an Estimator
Mean Squared Error (MSE) of an Estimator
 

Similaire à Correspondence analysis(step by step)

Statistik Chapter 2
Statistik Chapter 2Statistik Chapter 2
Statistik Chapter 2WanBK Leo
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regressionLong Beach City College
 
For Problem 2, you are to evaluate the given analysis and inte.docx
For Problem 2, you are to evaluate the given analysis and inte.docxFor Problem 2, you are to evaluate the given analysis and inte.docx
For Problem 2, you are to evaluate the given analysis and inte.docxAKHIL969626
 
Churn Analysis in Telecom Industry
Churn Analysis in Telecom IndustryChurn Analysis in Telecom Industry
Churn Analysis in Telecom IndustrySatyam Barsaiyan
 
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataPG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataAashish Patel
 
how do you interpret a scatter gramSolution Contents Quality.pdf
how do you interpret a scatter gramSolution  Contents  Quality.pdfhow do you interpret a scatter gramSolution  Contents  Quality.pdf
how do you interpret a scatter gramSolution Contents Quality.pdfAMITJWELLER123
 
Fill in the blanks8.1.      The magnitude of the correlation .docx
Fill in the blanks8.1.      The magnitude of the correlation .docxFill in the blanks8.1.      The magnitude of the correlation .docx
Fill in the blanks8.1.      The magnitude of the correlation .docxmglenn3
 
Alg II 2-5 Linear Models
Alg II 2-5 Linear ModelsAlg II 2-5 Linear Models
Alg II 2-5 Linear Modelsjtentinger
 
Quantitative techniques in business
Quantitative techniques in businessQuantitative techniques in business
Quantitative techniques in businesssameer sheikh
 
Chapter Two PPT Lecture - Part One.ppt
Chapter Two PPT Lecture - Part One.pptChapter Two PPT Lecture - Part One.ppt
Chapter Two PPT Lecture - Part One.pptjosh658552
 
Qt graphical representation of data
Qt   graphical representation of dataQt   graphical representation of data
Qt graphical representation of dataJoel Pais
 
Qt graphical representation of data
Qt   graphical representation of dataQt   graphical representation of data
Qt graphical representation of dataJoel Pais
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inferenceKemal İnciroğlu
 
PRESENTATION OF DATA.pptx
PRESENTATION OF DATA.pptxPRESENTATION OF DATA.pptx
PRESENTATION OF DATA.pptxajesh ps
 

Similaire à Correspondence analysis(step by step) (20)

Statistik Chapter 2
Statistik Chapter 2Statistik Chapter 2
Statistik Chapter 2
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression
 
Statistics
StatisticsStatistics
Statistics
 
Charts in excel
Charts in excelCharts in excel
Charts in excel
 
Relational Algebra-23-04-2023.pdf
Relational Algebra-23-04-2023.pdfRelational Algebra-23-04-2023.pdf
Relational Algebra-23-04-2023.pdf
 
For Problem 2, you are to evaluate the given analysis and inte.docx
For Problem 2, you are to evaluate the given analysis and inte.docxFor Problem 2, you are to evaluate the given analysis and inte.docx
For Problem 2, you are to evaluate the given analysis and inte.docx
 
Churn Analysis in Telecom Industry
Churn Analysis in Telecom IndustryChurn Analysis in Telecom Industry
Churn Analysis in Telecom Industry
 
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataPG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
 
Data Mind Traps
Data Mind TrapsData Mind Traps
Data Mind Traps
 
Graphs in Biostatistics
Graphs in Biostatistics Graphs in Biostatistics
Graphs in Biostatistics
 
how do you interpret a scatter gramSolution Contents Quality.pdf
how do you interpret a scatter gramSolution  Contents  Quality.pdfhow do you interpret a scatter gramSolution  Contents  Quality.pdf
how do you interpret a scatter gramSolution Contents Quality.pdf
 
Fill in the blanks8.1.      The magnitude of the correlation .docx
Fill in the blanks8.1.      The magnitude of the correlation .docxFill in the blanks8.1.      The magnitude of the correlation .docx
Fill in the blanks8.1.      The magnitude of the correlation .docx
 
Alg II 2-5 Linear Models
Alg II 2-5 Linear ModelsAlg II 2-5 Linear Models
Alg II 2-5 Linear Models
 
Quantitative techniques in business
Quantitative techniques in businessQuantitative techniques in business
Quantitative techniques in business
 
Chapter05
Chapter05Chapter05
Chapter05
 
Chapter Two PPT Lecture - Part One.ppt
Chapter Two PPT Lecture - Part One.pptChapter Two PPT Lecture - Part One.ppt
Chapter Two PPT Lecture - Part One.ppt
 
Qt graphical representation of data
Qt   graphical representation of dataQt   graphical representation of data
Qt graphical representation of data
 
Qt graphical representation of data
Qt   graphical representation of dataQt   graphical representation of data
Qt graphical representation of data
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
PRESENTATION OF DATA.pptx
PRESENTATION OF DATA.pptxPRESENTATION OF DATA.pptx
PRESENTATION OF DATA.pptx
 

Dernier

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Dernier (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Correspondence analysis(step by step)

  • 1. Correspondence Analysis with XLSTAT Reporter : Nguyen Van Chuc - BI Lab
  • 2.
  • 3. Two stages and three steps of each stage in CA process
  • 6. How to run Correspondence Analysis with XLSTAT Now, we use XLSTAT Tool to describe how to run CA and explain the result base on an example step by step. In order to illustrate the interpretation of output from correspondence analysis, the following example is worked through in detail.
  • 7. The following contingency table showing the frequency of usage of four brands of toothpaste in three geographic regions among a random sample of 120 users Table 1. Brand by Region Contingency table
  • 8. Analyzing Data – Correspondence Analysis CA
  • 9. Table 2. Row and Column profiles
  • 10. 1.Significance of Dependencies The first step in the interpretation of correspondence analysis is to establish whether there is a significance dependency between rows and columns
  • 11. 2.Dimensionality of the solution The second step in interpretation is to determine the appropriate number of dimension to use to describe the points. This is achieved by examining eigenvalue and percentage of inertia In this example, two dimensions explain 100% of inertia since two dimensions are sufficient to explain the total inertia
  • 12. 2.Dimensionality of the solution In this example two dimensions are sufficient to explain the total inertia
  • 13. 3. Interpreting the axes The axes are interpreted by way of the contribution that each element (in this case each Brand) makes towards the total inertia accounted for by the axis. In this example there are 4 brands, thus, any distribution greater than 100/4 = 25% would represent significance greater than what would be expected in the case of a purely random distribution of Brands over axes. In this case, brand A meets (satisfies) this criterion and determines the first axis and Brand B determines the second axis
  • 14. 3. Interpreting the axes Likewise, for columns, Region 3 determines the first axis and the second axis is determined by Region 2 and Region 1 (Because of the contributions > 100/3=33.3%) Note that, Brand A determines the first axis(F1) and F1 is determined by Region 3, thus it is obvious to understand that Brand A strongly associated with region 3 (see symmetric plot).
  • 15. 4. Graphical Representation of a contingency table Brands C and D are positioned relatively closely indicates a similarity in their regional usage profiles (60%, 75% respectively) and Brand A is positioned relatively far away from Brands C and D indicates that Brand A has a very different regional usage profile from Brands C and D Categories with similar distributions will be represented as points that are close in space, and categories that have very dissimilar distributions will be positioned far apart If a profile is very different from the average profile (centroid) , then the point will lie far from the origin , whereas, profile that are close to the average will be represented by points close to the centroid. If all the categories have equal profiles then all the points will fall in the centroid.
  • 16. 4. Graphical Representation of a contingency table The proximity of Brand A to the Region 3 indicates that Brand A is strongly associated with Region 3 which is clearly because profile presented in table 2 with 75 % brand A users reside in region 3. Likewise, the proximity of brand B with region 2 and Brands C and D with region 1 indicate that the higher frequency of usage of those brands in those regions. The positions of the points in the map tell you something about similarities between the rows, similarities between the columns and the association between rows and columns
  • 17. 4. Graphical Representation of a contingency table In the Asymmetric row plot map, rows are plotted base on principal coordinates and columns are plotted base on standard coordinates.
  • 18. 4. The quality of representation The higher of total of two (or first n dimensions) the higher quality of the representation. In this example, two axes explain 100% of the inertia (The first dimension explains 61.8% of inertia and the second dimension explains 38.2% of the inertia).
  • 19. 4. The quality of representation The quality of representation is easily calculated from the correlations or squared correlations given in the output. The squared correlation presented for any column measures the degree of association between that column and a particular axis. So, for instance, the squared correlation between Brand A and the first and second axes is 0.986 and 0.014 respectively. This implies that Brand A are strongly associated with the first axis (Region 3) but only weakly associated with the second axis (Region 1 and Region 2).