RM MLM PPT March_22nd 2023.pptx

Multivariate Methods
Ordinary and Multinomial logistics regression
And Multilevel models

BASIC Terminologies drill- Examples
• Univariate, Bivariate, Multivariate - Examples
• Logistic regression and Logit - Difference
• Multivariate- Multiple regression – Multinomial – Multilevel -Difference
• Ordinary least squares vs Ordered logistic regression - Difference
• Multinomial regression and Polynomial regression - Difference
• Multilevel models
BASIC Terminologies drill- Examples

Linear Vs Logistics regression
Question Linear Regression Logistics regression
• What is it used for ?
Used to predict a dependent output variable based on
independent input variable
Used to classify a dependent output variable based on Independent
input variable
• How the accuracy is measured ? Accuracy is measured using Least squares estimation (OLS) Accuracy is measured using Maximum Likelihood estimation (MLE)
• How the best fit line look like ? The best fit line is a straight line The best fit Is given by a curve
• What is the outcome value look like ? The output is a predicted integer value The output Is a binary value between O and 1 value. Odds > Odds ratio
• Where it is used commonly ?
Used in business domain, forecasting stocks . Multiple linear,
Simple linear regression
Used for classification, Health services research eg Binary, multiple,
Ordinal

Linear and Logistics Basic difference : OLS vs MLE

Outcome categories example
• Settings outcomes
• Nursing home, Informal care , Homecare
• Primary care- Tertiary care
• Primary care or not primary care
• Disease
• Diabetes , Hypertension, Cardiac Heart failure
• Absent, mild, moderate, or severe
• Fee structure
• High , mid , low
• Below 500- above 500 and below 1000- Above 1000
• Age categories
• Below 20 – above 20 and below 65- Above 65

Multinomial Logistics regression : Introduction
• DV Multiple categories
• OLS can not be used
• DV not in natural order
• MLE is use not the OLS
• MLE also used in MPM
• Extension of the simple Logit model ( 2 outcomes
• Categories can be more than 2 (Binary)
• Binary example : Depression, disease status, mortality
• Yes/No
• Multiple outcome example :
• Diabetes , Hypertension, Cardiac Heart failure
• Nursing home, Informal care , Homecare

• Choose model if categories are truly discreet, nominal and unordered
• 5 types of LTC
• Nursing home
• Paid homecare
• Informal care from family
• Mixed care paid-homecare + informal
• No LTC
• All independent of each other
• Individual utility level of alternatives is not observed rather Instead its an
index
Multinomial Logistics regression : Choosing the Model

• Data needs to meet the diagnostic test first
• Hausman test to choose between random effect model and Fixed
effect model
• IIA – Independence of alternative assumptions
• Excluding one category doesn’t influence the other
• Run unconstrained model
• Drop one dependent – The coefficients remain (Statistically) identical
to the unconstrained model
• Partial model = Full Model IIA is correct
• Can use random effect model otherwise fixed effect ( MPM can be
used)
Multinomial Logistics regression : Diagnostic test (Hausman test)

Multinomial Logistics regression : Diagnostic test (Hausman test)

• There is no well-specified procedure
• Previous research
• Expert opinion
• Theory
• Perform the various tests to find the best
• Relevant findings Theory used to build the model
• OREM Selfcare deficit theory
• Bivariate analysis – Chi square and t test
• Created another variable (Income square) based on findings ( Parabola )
• Tested interaction effects ( effect of one depend on the level of other )
• High p value – Not used in model
Multinomial Logistics regression : Building and choosing best model

• How to run the model ?
• Computer will run the model
• Reference category is selected
• Makes no difference in estimated
coefficients- what is chosen as
reference category Once the
coefficients are determined the rest is
math
• Modern day software – machine
learning
Multinomial Logistics regression : Running the model

Output give coefficient and P value for each coefficient
IN SIMPLE LOGIT MODEL
• the coefficient represents the effect of a unit change in the IV on the natural logarithm of the odds of using one type of LTC
service.
IN MLM (Model)
• the coefficients and their exponential transformations that yield the odds ratios are always relative to the reference
category.
• E.g A vs B , A vs C , A vs D , A vs E
• a/b , Odds, OR.
Multinomial Logistics regression : Interpretation of coeffecients

Wald test
Wald test is used to compare models on best fit criteria in case of logistic
regression. This technique is used to determine 'significant' variables from the set of
predictors used in to a variety of models with binary variables or models with continuous
variables.
Likelihood ratio test
The Likelihood-Ratio Test (LRT) is a statistical test used to compare the
goodness of fit of two models based on the ratio of their likelihoods.
Multinomial Logistics regression : Predicted Probabilities and analysis of results

Multinomial Model results interpretation

• The calculation and interpretation of odd
ratio is easy
• The odd and probabilities don't change in
same direction
• Odds may be increasing when both
probabilities forming it may be decreasing
• Large odd ratio doesn’t mean change in
probabilities is large
• The change in probabilities may be large
proptionaly, but small in absolute terms.
• To examine the result of each
independent Variable on each category
Multinomial Logistics regression : Predicted Probabilities

Ordered logistic regression: Example

MULTILEVEL MODELING : What and Why
Aggregate Analysis
• Example : Time spent on physical activity – age, sex, education,
greenspace available, Area deprivation
• 100 observations in 10 neighborhoods
• Can run 10 models – Loss of power
Individual Analysis
• Artificially small standard errors and confidence intervals around
those regression coefficients
• If something is available in all clusters – Area deprivation , Green
spaces

MLA makes it possible to test different kinds of hypotheses
• Hypotheses about variation
• Hypotheses about the relationship between an outcome variable and
individual level independent variables
• Hypotheses about the relationship between an outcome variable and higher
level (contextual) independent variables.
• Hypotheses about cross-level interactions

• Context Hypotheses
• Aggregated Individual-Level Characteristics
• 1- Diabetic patients in GP- Competing for resoucres
• 2-the more diabetics there are in a practice, the greater the chances are that an individual
diabetic is better regulated.
• Higher Level Characteristics
• Cross-Level Interactions
• These are combinations of (or interactions between) variables at different levels. It is the
combination of a particular characteristic of the higher level with a particular individual level
variable that is hypothesized to have a specific effect on the dependent variable of interest
• The ability to analyze cross-level interactions is a major advantage of MLA that follows on
from the ability to incorporate both individual and contextual independent variables in an
analysis. In our thinking and theorizing about health and healthcare, the relationships
between context, individual characteristics and outcomes are of central importance. MLA
affords the opportunity to test our ideas about these relationships.

MULTILEVEL MODELING : Practical Approach
• The seven major steps involved in a multilevel analysis:
• Clarifying the research question
• Choosing the appropriate parameter estimator
• Assessing the need for MLM
• Building the level-1 model
• Building the level-2 model
• Multilevel effect size reporting
• Likelihood ratio model testing.

• Example of Multilevel data
• Patients nested in hospitals
• Hospitals nested in geographical regions
• Cross sectional MLM
• Patients nested in hospitals
• Longitudinal MLM
• Example of nested data where repeated measurements (i.e., the level-1 units)
are nested within individuals
MULTILEVEL MODELING : Macro Micro, pseudo

• Nested datasets do not automatically require multilevel modeling.
• If there is no variation in response variable scores across level-2 units
(e.g., hospitals)
• The data can be analyzed using OLS multiple regression
• Patient satisfaction score varies for one hospital
• If the mean score is across hospitals in widely varied – MLM is needed
• School example : Math score in one school- mean score variation across many schools
• “How much response variable variation is present at level-2?”
• Answer: This question involves the calculation of the intraclass
correlation (ICC) and the design effect statistics
MULTILEVEL MODELING : Why MLM

• Conceptually, the ICC is similar to the R2 effect size from regression
• ICC value of zero Indicates:
• No mean science achievement score variation across hospitals (Macro Level-
Hospital level),
• All score variation occurs across patients (Micro level- Patients)
• Traditional analysis techniques such as ANOVA and regression can be used to analyze the
student data.
• The ICC value increases
• The proportion score variation across hospitals increases
• Resulting in violations of the independence assumption
• MLM Partition the total score variation into “Variation across patients” and
Variation across hospitals”
MULTILEVEL MODELING : When to use MLM

MULTILEVEL MODELING : When to use MLM
• The ICC (.18) and the design effect 2.30 both indicate the need for
multilevel modeling.
• There are formulae to calculate the ICC and design effect
• Some researchers believe that design effect estimates greater than
2.0 indicate a need for MLM.
• What is design effect then ?
• The design effect quantifies the effect of independence violations on standard error estimates
and is an estimate of the multiplier that needs to be applied to standard errors to correct for
the negative bias that results from nested data.

Multinomial Logistics regression : Interpretation of results
Effect sizes in MLM analyses are not as straightforward, and currently no consensus
exists as to the effect sizes that are most appropriate.
Two categories: Global and local.
• In multiple regression, the global effect size R2 quantifies the response
variable variance explained by a model containing multiple predictors, while a
squared semi partial correlation coefficient quantifies the response variable
variance accounted for by asingle predictor variable, holding the influence of
additional predictor variables constant.

• In multiple regression, F test is used to test whether the explained
variance is statistically different from zero.
• likelihood ratio test do the same in MLM
• A likelihood ratio test is a statistical test of two nested models
• a “reduced” model is nested within a “full” model if the parameters
estimated in the reduced model are a subset of the parameters
estimated in the full model.
MULTILEVEL MODELING : Likelihood Ratio model testing

• Basic 2-Level Model
MULTILEVEL MODELING : Hierarchies

• Designs Including Time
MULTILEVEL MODELING : Hierarchies

• Pseudo-level
• Correlated Cross-Classified Model
MULTILEVEL MODELING : Non Hierarchies

• Nested data violate the independence assumption
• For example, Response variables more correlated in one hospital , one
department or one county
• The independence violations tend to create more type one errors and biased
parameters estimates
MULTILEVEL MODELING :Hypothesis testing in MLM

Methods
• What are the dependent variables ?
• Rating of care
• How the rating was converted into categorical variables
• 0-4, 5-8, 9,10
• What are the independent variables ?
• Hispanic Medicaid, Hispanic commercial, (non-Hispanic) White Medicaid, and
(non-Hispanic) White commercial.
• Confounders – What and Why ?
• age, education, self-rated health, survey mode, and survey language.

Methods Used
• Multinomial logistic regression was used to test for differences in
extreme response styles.
• Why Multinomial and not ordinal ?

Results interpretation ODDs Ratio

RM MLM PPT March_22nd 2023.pptx

Recommandé

Recommandé

Contenu connexe

Similaire à RM MLM PPT March_22nd 2023.pptx

Similaire à RM MLM PPT March_22nd 2023.pptx (20)

Dernier

Dernier (20)

RM MLM PPT March_22nd 2023.pptx

Notes de l'éditeur