SlideShare une entreprise Scribd logo
1  sur  15
Multinomial Logistic Regression Models




January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Logistic regression CAN handle dependant variables
   with more than two categories
  It is important to note whether the response variable
   is ordinal (consisting of ordered categories like young,
   middle-aged, old) or nominal (dependant is unordered
   like red, blue, black)
  Some multinomial logistic models are appropriate only
   for ordered response
  It is not mathematically necessary to consider the
   natural ordering when modeling ordinal response but,
  Considering the natural ordering
       Leads to a more parsimonious model
       Increase power to detect relationships with other variables

January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Applying logistic regression considering the natural
   order is done using a modeling technique called the
   “Proportional Odds Model”
  Say the dependant variable Y has 4 states measuring
   the impact of radiation on the human body; fine,
   sick, serious,dead
  Let p1=prob of fine, p2=prob of sick, p3=prob of
   serious, p4=prob of dead
  Let us define a baseline category: fine, since this is
   the normal stage (we shall see why we need this
   later)
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    What if we break up the modeling of the 4 level
      ordered dependant into 3 binary logistic
      situations: 1 – (fine,sick), 2 – (fine,serious),3 –
      (fine,dead)?
     Then we would have 3 logit equations:
       Log(p2/p1)=B11+B12X1+B13X2
       Log(p3/p1)=B21+B22X1+B23X2
       Log(p4/p1)=B31+B32X1+B33X2
      X is the degree of radiation dummy with 3 levels so
        broken into 2 binary dummies
     So, 9 parameters to be estmated

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Now consider an alternative model for the same
      situation
     Cumulative logit model:
       L1=log(p1/p2+p3+p4)
       L2=log(p1+p2/p3+p4)
       L3=log(p1+p2+p3/p4)
     The obvious way to introduce covariates is
       L1=B11+B12X1+B13X2
       L2=B21+B22X1+B23X2
       L3=B31+B32X1+B33X2

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us simplyfy the model by specifying that
      the slope parameters are identical over the
      logit equations. Then,
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     This is the proportional odds cumulative logit
      model

January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
 Suppose that the categorical outcome is actually a
   categorized version of an unobservable (latent)
   continuous variable which has a logistic distribution
  The continuous scale is divided into five regions by
   four cut-points c1, c2, c3, c4 which are determined by
   nature
  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y =
   2; and so on
  Suppose that the Z is related to the X’s through a linear
   regression
  Then, the coarsened categorical variable would be
   related Y will be related to the X’s by a proportional-
   odds cumulative logit model
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Let us go back to the model
       L1=A1+B1X1+B2X2
       L2=A2+B1X1+B2X2
       L3=A3+B1X1+B2X2
     Note that Lj is the log-odds of falling into or below category j
      versus falling above it
     Aj is the log-odds of falling into or below category j when X1 =
      X2 = 0
     B1 is the increase in log-odds of falling into or below any
      category associated with a one-unit increase in Xk, holding all
      the other X-variables constant.
     Therefore, a positive slope indicates a tendency for the
      response level to decrease as the variable decreases
January 1, 2013     ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Our example of 4 levels of impact of radiation
      corresponding to 3 levels of radiation

 proc logistic data=radiation_impact;
 freq count;
 class radiation / order=data param=ref ref=first;
 model sickness (order=data descending) = radiation /
   link=logit
 aggregate=(radiation) scale=none;
 run;
January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Freq=count
       This is important for specifying grouped data
       Count is the variable that contains the frequency of
        occurrance of each observation
       In its absence, each row would be considered a
        separate row of data
     Class=radiation
       Specifies that radiation is a classification variable to
        be used in the analysis
       SAS would automatically generate n-1 binary
        dummies for n categories of radiation with param=ref
        option
January 1, 2013    ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data
       Simply tells SAS to arrange the response categories in
          the order they occur in the input data 1,2,3,4
     Param=ref
       This implies that there is going to be dummy coding
          for the classification variable ‘radiation’listed in class
     Ref=first
       Designates the first ordered level, in this case ‘fine’ as
          the reference level

January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Order=data descending
         This tells SAS to reverse the order of the logits
         So, instead of the cumulative logit model being
         L1=log(p1/p2+p3+p4)
         L2=log(p1+p2/p3+p4)
         L3=log(p1+p2+p3/p4), it becomes
         L1=log(p4/p1+p2+p3)
         L2=log(p4+p3/p1+p2)
         L3=log(p4+p3+p2/p1)
     Now, a positive B1 indicates that a higher value of X1
      leads to greater chance of radiation sickness
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Link=logit
       fits the cumulative logit model when there are more
          than two response categories
     Aggregate=radiation
       Indicates that the goodness of fit statistics are to be
          calculated on the subpopulations of the variable:
          radiation
     Scale=none
       No correction is need for the dispersion parameter
       To understand this, read up. This happens when the
          goodness of fit statistic exceeds its degrees of freedom
          and need to be corrected for
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    When we fit this model, the first output we
      see:
 Score Test for the Proportional Odds Assumption
 Chi-Square DF         Pr > ChiSq
 17.2866        21     0.6936

  Null hypothesis is that the current proportional-odds
   cumulative logit model is true
  Seems like we fail to reject the null and so can proceed to the
   rest of the output under the current assumption


January 1, 2013   ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
    Ultimately we are interested in the predicted
      probabilities
          OUTPUT <OUT=SAS-data-set><options>
     Predicted=
       For a cumulative model, it is the predicted cumulative
          probability (that is, the probability that the response
          variable is less than or equal to the value of _LEVEL_);
     PREDPROBS=I or C
       Individual|I requests the predicted probability of each
        response level.
       CUMULATIVE | C requests the cumulative predicted
        probability of each response level
January 1, 2013      ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India

Contenu connexe

Tendances

ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ijsc
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.inventionjournals
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...inventionjournals
 
A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationssuserfa7e73
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSA NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSorajjournal
 
Linear programming
Linear programmingLinear programming
Linear programmingVARUN KUMAR
 
Fractional calculus and applications
Fractional calculus and applicationsFractional calculus and applications
Fractional calculus and applicationsPlusOrMinusZero
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...ijtsrd
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1irsa javed
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodYogeshIJTSRD
 
Linear programming class 12 investigatory project
Linear programming class 12 investigatory projectLinear programming class 12 investigatory project
Linear programming class 12 investigatory projectDivyans890
 
Certified global minima
Certified global minimaCertified global minima
Certified global minimassuserfa7e73
 
Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Zahra Amini
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...Navodaya Institute of Technology
 

Tendances (19)

ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
ANALYTICAL FORMULATIONS FOR THE LEVEL BASED WEIGHTED AVERAGE VALUE OF DISCRET...
 
Mm2521542158
Mm2521542158Mm2521542158
Mm2521542158
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
 
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
Application of Adomian Decomposition Method in Solving Second Order Nonlinear...
 
A review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementationA review of automatic differentiationand its efficient implementation
A review of automatic differentiationand its efficient implementation
 
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMSA NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
A NEW ALGORITHM FOR SOLVING FULLY FUZZY BI-LEVEL QUADRATIC PROGRAMMING PROBLEMS
 
Linear programming
Linear programmingLinear programming
Linear programming
 
Cg32519523
Cg32519523Cg32519523
Cg32519523
 
Chapter two
Chapter twoChapter two
Chapter two
 
Fractional calculus and applications
Fractional calculus and applicationsFractional calculus and applications
Fractional calculus and applications
 
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...An Approach to Mathematically Establish the Practical Use of Assignment Probl...
An Approach to Mathematically Establish the Practical Use of Assignment Probl...
 
Linear Programming 1
Linear Programming 1Linear Programming 1
Linear Programming 1
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min Method
 
Linear programming class 12 investigatory project
Linear programming class 12 investigatory projectLinear programming class 12 investigatory project
Linear programming class 12 investigatory project
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
 
linear programming
linear programming linear programming
linear programming
 
Certified global minima
Certified global minimaCertified global minima
Certified global minima
 
Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2Dr azimifar pattern recognition lect2
Dr azimifar pattern recognition lect2
 
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
A SYSTEMATIC APPROACH FOR SOLVING MIXED INTUITIONISTIC FUZZY TRANSPORTATION P...
 

En vedette

Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataBoosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataJay (Jianqiang) Wang
 
The power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoThe power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoWessel Luijben
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNJosh Patterson
 
Statisticsfor businessproject solution
Statisticsfor businessproject solutionStatisticsfor businessproject solution
Statisticsfor businessproject solutionhuynguyenbac
 
Financialmodeling
FinancialmodelingFinancialmodeling
FinancialmodelingTalal Tahir
 
Logistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationLogistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationMichael Hankin
 
Qam formulas
Qam formulasQam formulas
Qam formulasAshu Jain
 
Transparency7
Transparency7Transparency7
Transparency7A M
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic conceptsA M
 
Regression
Regression Regression
Regression Ali Raza
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regressionmothersafe
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep diveabulyomon
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsDeepThought, Inc.
 
Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Anthony Kilili
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)mohamedchaouche
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Aakash Bagchi
 

En vedette (20)

Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market DataBoosted Tree-based Multinomial Logit Model for Aggregated Market Data
Boosted Tree-based Multinomial Logit Model for Aggregated Market Data
 
The power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demoThe power of RapidMiner, showing the direct marketing demo
The power of RapidMiner, showing the direct marketing demo
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
 
Statisticsfor businessproject solution
Statisticsfor businessproject solutionStatisticsfor businessproject solution
Statisticsfor businessproject solution
 
Financialmodeling
FinancialmodelingFinancialmodeling
Financialmodeling
 
Chapter 16
Chapter 16Chapter 16
Chapter 16
 
Midterm
MidtermMidterm
Midterm
 
Ordinal Logistic Regression
Ordinal Logistic RegressionOrdinal Logistic Regression
Ordinal Logistic Regression
 
Logistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentationLogistic Regression/Markov Chain presentation
Logistic Regression/Markov Chain presentation
 
Qam formulas
Qam formulasQam formulas
Qam formulas
 
Transparency7
Transparency7Transparency7
Transparency7
 
1.5.1 measures basic concepts
1.5.1 measures basic concepts1.5.1 measures basic concepts
1.5.1 measures basic concepts
 
Regression
Regression Regression
Regression
 
(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression(마더세이프 라운드) Logistic regression
(마더세이프 라운드) Logistic regression
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
 
ForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model StatisticsForecastIT 2. Linear Regression & Model Statistics
ForecastIT 2. Linear Regression & Model Statistics
 
Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)Generalized Logistic Regression - by example (Anthony Kilili)
Generalized Logistic Regression - by example (Anthony Kilili)
 
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
 
C2.1 intro
C2.1 introC2.1 intro
C2.1 intro
 
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
Mode Choice analysis for work trips using Multinomial Logit model for Windsor...
 

Similaire à Logistic regression for ordered dependant variable with more than 2 levels

How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factorArup Guha
 
Supervised Learning.pdf
Supervised Learning.pdfSupervised Learning.pdf
Supervised Learning.pdfgadissaassefa
 
A Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC LearningA Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC Learningdhruvgairola
 
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMGRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMijscai
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET Journal
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET Journal
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Colleen Ganley
 
Design and analysis of ra sort
Design and analysis of ra sortDesign and analysis of ra sort
Design and analysis of ra sortijfcstjournal
 
2018 p 2019-ee-a2
2018 p 2019-ee-a22018 p 2019-ee-a2
2018 p 2019-ee-a2uetian12
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data AnalysisNBER
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix IJECEIAES
 
A new generalized lindley distribution
A new generalized lindley distributionA new generalized lindley distribution
A new generalized lindley distributionAlexander Decker
 

Similaire à Logistic regression for ordered dependant variable with more than 2 levels (20)

How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factor
 
Supervised Learning.pdf
Supervised Learning.pdfSupervised Learning.pdf
Supervised Learning.pdf
 
ML MODULE 4.pdf
ML MODULE 4.pdfML MODULE 4.pdf
ML MODULE 4.pdf
 
A Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC LearningA Theory of the Learnable; PAC Learning
A Theory of the Learnable; PAC Learning
 
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHMGRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
GRADIENT OMISSIVE DESCENT IS A MINIMIZATION ALGORITHM
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
IRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms ComparisonIRJET- Supervised Learning Classification Algorithms Comparison
IRJET- Supervised Learning Classification Algorithms Comparison
 
autocorrelation.pptx
autocorrelation.pptxautocorrelation.pptx
autocorrelation.pptx
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Study the different
Study the differentStudy the different
Study the different
 
Yahya Almalki SOED 2016
Yahya Almalki SOED 2016Yahya Almalki SOED 2016
Yahya Almalki SOED 2016
 
Design and analysis of ra sort
Design and analysis of ra sortDesign and analysis of ra sort
Design and analysis of ra sort
 
working with python
working with pythonworking with python
working with python
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 
2018 p 2019-ee-a2
2018 p 2019-ee-a22018 p 2019-ee-a2
2018 p 2019-ee-a2
 
Ica group 3[1]
Ica group 3[1]Ica group 3[1]
Ica group 3[1]
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
ppt0320defenseday
ppt0320defensedayppt0320defenseday
ppt0320defenseday
 
Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix Prediction model of algal blooms using logistic regression and confusion matrix
Prediction model of algal blooms using logistic regression and confusion matrix
 
A new generalized lindley distribution
A new generalized lindley distributionA new generalized lindley distribution
A new generalized lindley distribution
 

Plus de Arup Guha

Structural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesStructural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesArup Guha
 
Partnership with a Premier Business School
Partnership with a Premier Business SchoolPartnership with a Premier Business School
Partnership with a Premier Business SchoolArup Guha
 
Beer Market Analytics Solutions
Beer Market Analytics SolutionsBeer Market Analytics Solutions
Beer Market Analytics SolutionsArup Guha
 
Database marketing
Database marketingDatabase marketing
Database marketingArup Guha
 
Using survival analysis results
Using survival analysis results Using survival analysis results
Using survival analysis results Arup Guha
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Measuring Actual Effect Of Tv Ads On Sales   LkMeasuring Actual Effect Of Tv Ads On Sales   Lk
Measuring Actual Effect Of Tv Ads On Sales LkArup Guha
 

Plus de Arup Guha (6)

Structural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time seriesStructural breaks, unit root tests and long time series
Structural breaks, unit root tests and long time series
 
Partnership with a Premier Business School
Partnership with a Premier Business SchoolPartnership with a Premier Business School
Partnership with a Premier Business School
 
Beer Market Analytics Solutions
Beer Market Analytics SolutionsBeer Market Analytics Solutions
Beer Market Analytics Solutions
 
Database marketing
Database marketingDatabase marketing
Database marketing
 
Using survival analysis results
Using survival analysis results Using survival analysis results
Using survival analysis results
 
Measuring Actual Effect Of Tv Ads On Sales Lk
Measuring Actual Effect Of Tv Ads On Sales   LkMeasuring Actual Effect Of Tv Ads On Sales   Lk
Measuring Actual Effect Of Tv Ads On Sales Lk
 

Dernier

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 

Dernier (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

Logistic regression for ordered dependant variable with more than 2 levels

  • 1. Multinomial Logistic Regression Models January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 2.  Logistic regression CAN handle dependant variables with more than two categories  It is important to note whether the response variable is ordinal (consisting of ordered categories like young, middle-aged, old) or nominal (dependant is unordered like red, blue, black)  Some multinomial logistic models are appropriate only for ordered response  It is not mathematically necessary to consider the natural ordering when modeling ordinal response but,  Considering the natural ordering  Leads to a more parsimonious model  Increase power to detect relationships with other variables January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 3.  Applying logistic regression considering the natural order is done using a modeling technique called the “Proportional Odds Model”  Say the dependant variable Y has 4 states measuring the impact of radiation on the human body; fine, sick, serious,dead  Let p1=prob of fine, p2=prob of sick, p3=prob of serious, p4=prob of dead  Let us define a baseline category: fine, since this is the normal stage (we shall see why we need this later) January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 4. What if we break up the modeling of the 4 level ordered dependant into 3 binary logistic situations: 1 – (fine,sick), 2 – (fine,serious),3 – (fine,dead)?  Then we would have 3 logit equations:  Log(p2/p1)=B11+B12X1+B13X2  Log(p3/p1)=B21+B22X1+B23X2  Log(p4/p1)=B31+B32X1+B33X2 X is the degree of radiation dummy with 3 levels so broken into 2 binary dummies  So, 9 parameters to be estmated January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 5. Now consider an alternative model for the same situation  Cumulative logit model:  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4)  The obvious way to introduce covariates is  L1=B11+B12X1+B13X2  L2=B21+B22X1+B23X2  L3=B31+B32X1+B33X2 January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 6. Let us simplyfy the model by specifying that the slope parameters are identical over the logit equations. Then,  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  This is the proportional odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 7.  Suppose that the categorical outcome is actually a categorized version of an unobservable (latent) continuous variable which has a logistic distribution  The continuous scale is divided into five regions by four cut-points c1, c2, c3, c4 which are determined by nature  If Z ≤ c1 we observe Y = 1; if c1 < Z ≤ c2 we observe Y = 2; and so on  Suppose that the Z is related to the X’s through a linear regression  Then, the coarsened categorical variable would be related Y will be related to the X’s by a proportional- odds cumulative logit model January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 8. Let us go back to the model  L1=A1+B1X1+B2X2  L2=A2+B1X1+B2X2  L3=A3+B1X1+B2X2  Note that Lj is the log-odds of falling into or below category j versus falling above it  Aj is the log-odds of falling into or below category j when X1 = X2 = 0  B1 is the increase in log-odds of falling into or below any category associated with a one-unit increase in Xk, holding all the other X-variables constant.  Therefore, a positive slope indicates a tendency for the response level to decrease as the variable decreases January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 9. Our example of 4 levels of impact of radiation corresponding to 3 levels of radiation proc logistic data=radiation_impact; freq count; class radiation / order=data param=ref ref=first; model sickness (order=data descending) = radiation / link=logit aggregate=(radiation) scale=none; run; January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 10. Freq=count  This is important for specifying grouped data  Count is the variable that contains the frequency of occurrance of each observation  In its absence, each row would be considered a separate row of data  Class=radiation  Specifies that radiation is a classification variable to be used in the analysis  SAS would automatically generate n-1 binary dummies for n categories of radiation with param=ref option January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 11. Order=data  Simply tells SAS to arrange the response categories in the order they occur in the input data 1,2,3,4  Param=ref  This implies that there is going to be dummy coding for the classification variable ‘radiation’listed in class  Ref=first  Designates the first ordered level, in this case ‘fine’ as the reference level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 12. Order=data descending  This tells SAS to reverse the order of the logits  So, instead of the cumulative logit model being  L1=log(p1/p2+p3+p4)  L2=log(p1+p2/p3+p4)  L3=log(p1+p2+p3/p4), it becomes  L1=log(p4/p1+p2+p3)  L2=log(p4+p3/p1+p2)  L3=log(p4+p3+p2/p1)  Now, a positive B1 indicates that a higher value of X1 leads to greater chance of radiation sickness January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 13. Link=logit  fits the cumulative logit model when there are more than two response categories  Aggregate=radiation  Indicates that the goodness of fit statistics are to be calculated on the subpopulations of the variable: radiation  Scale=none  No correction is need for the dispersion parameter  To understand this, read up. This happens when the goodness of fit statistic exceeds its degrees of freedom and need to be corrected for January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 14. When we fit this model, the first output we see: Score Test for the Proportional Odds Assumption Chi-Square DF Pr > ChiSq 17.2866 21 0.6936  Null hypothesis is that the current proportional-odds cumulative logit model is true  Seems like we fail to reject the null and so can proceed to the rest of the output under the current assumption January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India
  • 15. Ultimately we are interested in the predicted probabilities OUTPUT <OUT=SAS-data-set><options>  Predicted=  For a cumulative model, it is the predicted cumulative probability (that is, the probability that the response variable is less than or equal to the value of _LEVEL_);  PREDPROBS=I or C  Individual|I requests the predicted probability of each response level.  CUMULATIVE | C requests the cumulative predicted probability of each response level January 1, 2013 ©Arup Guha - Indian Institute of Foreign Trade - New Delhi, India