SlideShare une entreprise Scribd logo
1  sur  24
REGRESSION
MODELS
        By:
        Ayush Sharma 09
        Mickey Haldia 19
        Prerna Makhijani 29
        Sanoj George 39
        Sushant Jaggi 49
        Nitish Dorle 59
Example


Year      Population on Farm (in
          millions)
1935      32.1
1940      30.5
1945      24.4
1950      23.0
1955      19.1
1960      15.6
1965      12.5
Scatter Plot
               Population(in millions)
35
30
25
20
15                                         Poplation(in millions)

10
5
0
 1930   1940     1950      1960     1970
Correlation Coefficient (r)

   It is a measure of strength of the linear
    relationship between two variables and is
    calculated using the following formula:
Interpretation

   After calculating we find r = -0.993

   There is a strong negative correlation.
Coefficient of Determination
   Squaring the correlation coefficient (r) gives us
    the percent variation in the y-variable that is
    described by the variation in the x-variable
   To relate x and y, the Regression Equation is
    calculated using Least Squares technique.
   Regression Equation: Y’ = a +bX
   Slope of the regression line:
To continue with the example
   We found r = -0.993. By squaring we get the
    Coefficient of Determination (R^2) = 0.987
                      35       Regression
                                            y = -0.671 x + 1,330.350
    Population on Farm (in




                      30
                                                   R² = 0.987
          millions)




                      25

                      20

                      15

                      10
                        1930   1940   Year 1950        1960       1970
Interpretation

   We conclude that 98.7% of the decrease in
    farm population can be explained by timeline
    progression.
   Theoretically, population is a dependent
    variable (y-axis) and timeline is an independent
    variable (x-axis).
Assumptions of the Regression Model

 The following assumptions are made about the
  errors:
a) The errors are independent
b) The errors are normally distributed
c) The errors have a mean of zero
d) The errors have a constant variance(regardless
   of the value of X)
Patterns of Indicating Errors



Error




                 X
Estimating the Variance
 The error variance is measured by the MSE
 s2 = MSE= SSE
                n-k-1
where n = number of observations in the sample
        k = number of independent variables

Therefore the standard deviation will be
s = sqrt (MSE)
Multiple regression Analysis
  More than one independent variable
                    Y=β0+β1X1+β2X2+……+βkXk+ϵ

      Where,
              Y=dependent variable(response variable)
           Xi=ith independent variable(predictor variable or explanatory
      variable)
           β0= intercept(value of Y when all Xi = 0)
           βi= coefficient of the ith independent variable
           k= number of independent variables
            ϵ= random error

  To estimate the values of these coefficients, a sample is taken and the
  following equation is developed :
                    Ῡ= b0+b1X1+b2X2+…….+bkXk
                    where,
                              Ῡ= predicted value of Y
                              b0= sample intercept (and is an estimate of
                                        β0)
                              bi= sample coefficient of ith variable(and is an
                                   estimate of βi)
Testing the Model for Significance
•   MSE and co-efficient of determination (r2) does not
    provide a good measure of accuracy when the
    sample size is small
•   In this case, it is necessary to test the model for
    significance
•   Linear Model is given by,

           Y=β0 + β1X + ε

Null Hypothesis :If β1 = 0, then there is no linear relationship
between X and Y
Alternate Hypothesis : If β1 ≠ 0, then there is a linear relationship
Steps in Hypothesis Test for a Significant
 Regression Model

1. Specify null and alternative hypothesis.
2. Select the level of significance (α). Common
   values are between 0.01 and 0.05
3. Calculate the value of the test statistic using the
   formula:
        F = MSE/MSE
4. Make a decision using one of the following
methods:
a) Reject if Fcalculated > Ftable
b) Reject if p-value < α
Triple A Construction Example


   Step 1:
    H0 :β1 = 0, (no linear relationship between X and Y)
    H1 :β1 ≠ 0, (linear relationship between X and Y)


   Step 2
       Select α = 0.05
Triple A Construction Example

   Step 3: Calculate the value of the test statistic

    MSR = SSR/k
           = 15.6250/1
           = 15.6250

        F = MSR/MSE
           = 15.6250/1.7188
           = 9.09
Triple A Construction Example
   Step 4: Reject the null hypothesis if the test statistic
    is greater than the F value from the table.

To find table value, we need :
    Level of Significance (α) = 0.05
    df1 = k = 1
    df2 = n – k – 1 = 4

   where k = number of independent variables
           n = sample size
Using these values, we find
          Ftable = 7.71

    Hence, we reject H0 because 9.09 > 7.71
Selling Price ($)   Suare Footage         AGE       Condition
       95000             1926               30        GOOD              SUMMARY OUTPUT                   Jenny Wilson Reality
      119000             2069               40       Excellent
      124800             1720               30       Excellent
      135000             1396               15        GOOD
      142800             1706               32         Mint                                           Regression Statistics
      145000             1847               38         Mint
      159000             1950               27         Mint
                                                                        Multiple R             The coefficient of                          0.819680305
      165000             2323               30       Excellent
                                                                        R Square               determination r2                            0.671875802
      182000             2285               26         Mint
      183000             3752               35        GOOD              Adjusted R Square                                                  0.612216857
      200000             2300               18        GOOD
      211000             2525               17        GOOD              Standard Error                                                     24312.60729
      215000             3800               40       Excellent
      219000             1740               12         Mint             Observations                                                                  14

ANOVA


                           df                         SS              MS               F           Significance F
                                                                  The regression                                              The p-values are
Regression                             2               13313936968     6.7E+09 11.262                  0.002178765
                                                                  coefficients                                                used to test the
Residual                              11                 6502131603    5.9E+08                                                individual
Total                                 13               19816068571                                                            variables for
                                                                                                                              significance

                       Coefficients              Standard Error       t Stat         P-value         Lower 95%         Upper 95%       Lower 95.0%   Upper 95.0%


Intercept                 146630.89                     25482.08287    5.75427         0.0001          90545.20735        202717             90545         202717

SF                        43.819366                     10.28096507    4.26218         0.0013          21.19111495            66.448        21.191         66.448

AGE                        -2898.686                    796.5649421       -3.639       0.0039           -4651.91386           -1145        -4651.9       -1145.5
Binary or Dummy Variables
   Indicator Variable
   Assigned a value of 1 if a particular condition is
    met, 0 otherwise
   The number of dummy variables must equal one
    less than the number of categories of a
    qualitative variable
   The Jenny Wilson realty example :
     – X3= 1 for excellent condition
          = 0 otherwise
     – X4= 1 for mint condition
          = 0 otherwise
Selling Price
               Suare Footage                  AGE             X3(Exc.)     X4(Mint)             Condition
                                                                                                                         Jenny Wilson Reality
      ($)                                                                                                               SUMMARY OUTPUT
    95000          1926                       30                     0            0              GOOD
   119000          2069                       40                     1            0             Excellent
   124800          1720                       30                     1            0             Excellent
   135000          1396                       15                     0            0              GOOD                         Regression Statistics
   142800          1706                       32                     0            1               Mint
   145000          1847                       38                     0            1               Mint                  Multiple R                        0.94762
   159000          1950                       27                     0            1               Mint
   165000          2323                       30                     1            0             Excellent               R Square                          0.89798
   182000          2285                       26                     0            1               Mint
   183000          3752                       35                     0            0              GOOD                   Adjusted R Square                 0.85264
   200000          2300                       18                     0            0              GOOD
   211000          2525                       17                     0            0              GOOD                   Standard Error                    14987.6
   215000          3800                       40                     1            0             Excellent
   219000          1740                       12                     0            1               Mint                  Observations                             14
        The coefficients of age is negative, indicating
ANOVA   that the price decreases as a house gets older
                                   df                    SS              MS             F             Significance F

Regression                                     4       17794427451           4E+09       19.8044               0.000174421

Residual                                       9         2021641120          2E+08

Total                                         13       19816068571


                               Coefficients         Standard Error       t Stat       P-value               Lower 95%         Upper 95%     Lower 95.0%    Upper 95.0%


Intercept                             121658            17426.61432         6.9812       6.5E-05               82236.71393        161080        82236.71       161080

SF                                   56.4276            6.947516792           8.122          2E-05             40.71122594         72.144       40.71123        72.144

AGE                                 -3962.82            596.0278736       -6.6487        9.4E-05                -5311.12866       -2614.5      -5311.129       -2614.5

X3(Exc.)                             33162.6            12179.62073         2.7228          0.0235             5610.432651       60714.9        5610.433         60715

X4(Mint)                             47369.2            10649.26942         4.4481          0.0016             23278.92699       71459.6        23278.93         71460
Model Building

   The value of r2 can never decrease when more
    variables are added to the model
   Adjusted r2 often used to determine if an additional
    independent variable is beneficial



   The adjusted r2 is


   A variable should not be added to the model if it
    causes the adjusted r2 to decrease
Multiple Regression
   Sales/Decision to buy = B0+ B1* Price
        Sales/Decision to buy = B0+ B1* (Price)3+
        B2*(Design)2+B3*(Performance)


                       L = (Price)3
                       M = (Design)2
                       N = (Performance)




Sales/Decision to buy = B0+ B1* L+ B2* M+ B3* N
Pitfalls In Regression
A High Correlation does not mean one variable is causing a
change in another (Some regressions have shown a
significantly positive relation between individuals' college
GPA and future salary. )


Values of the dependent variable should not be used that
are above or below the ones from the sample



The number of independent variables that should be used
in the model is limited by the number of observations.
Presentation2 stats

Contenu connexe

Tendances (17)

Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
 
Coefficient of correlation
Coefficient of correlationCoefficient of correlation
Coefficient of correlation
 
4. regression analysis1
4. regression analysis14. regression analysis1
4. regression analysis1
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Sintering
SinteringSintering
Sintering
 
Correlation and Regrretion
Correlation and RegrretionCorrelation and Regrretion
Correlation and Regrretion
 
Scatter plot
Scatter plotScatter plot
Scatter plot
 
Chapter7
Chapter7Chapter7
Chapter7
 
2012-TFM1 Economic Gas-like Models
2012-TFM1 Economic Gas-like Models2012-TFM1 Economic Gas-like Models
2012-TFM1 Economic Gas-like Models
 
DOC a a dsfzxgsdfzgx
DOC  a a dsfzxgsdfzgxDOC  a a dsfzxgsdfzgx
DOC a a dsfzxgsdfzgx
 
Chi square using excel
Chi square using excelChi square using excel
Chi square using excel
 
Statr session14, Jan 11
Statr session14, Jan 11Statr session14, Jan 11
Statr session14, Jan 11
 
Bivariate
BivariateBivariate
Bivariate
 
Mung bean susan (1)
Mung bean susan (1)Mung bean susan (1)
Mung bean susan (1)
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Regression
Regression  Regression
Regression
 

Similaire à Presentation2 stats

Simple Regression Years with Midwest and Shelf Space Winter .docx
Simple Regression Years with Midwest and Shelf Space Winter .docxSimple Regression Years with Midwest and Shelf Space Winter .docx
Simple Regression Years with Midwest and Shelf Space Winter .docxbudabrooks46239
 
My favorite online teaching tool
My favorite online teaching toolMy favorite online teaching tool
My favorite online teaching toollittlebay
 
Regression analysis
Regression analysisRegression analysis
Regression analysisRavi shankar
 
15 regression basics
15 regression basics15 regression basics
15 regression basicsgaurav arora
 
Multiple Regression Case
Multiple Regression CaseMultiple Regression Case
Multiple Regression CaseKusuma Arifiani
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysisRabin BK
 
regressionanalysis-110723130213-phpapp02.pdf
regressionanalysis-110723130213-phpapp02.pdfregressionanalysis-110723130213-phpapp02.pdf
regressionanalysis-110723130213-phpapp02.pdfAdikesavaperumal
 
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).ppt
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).pptEE301 Lesson 15 Phasors Complex Numbers and Impedance (2).ppt
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).pptRyanAnderson41811
 
linear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annovalinear Regression, multiple Regression and Annova
linear Regression, multiple Regression and AnnovaMansi Rastogi
 
Duel simplex method_operations research .pptx
Duel simplex method_operations research .pptxDuel simplex method_operations research .pptx
Duel simplex method_operations research .pptxRaja Manyam
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regressionnszakir
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysisnadiazaheer
 
Housing Starts Forecast
Housing Starts ForecastHousing Starts Forecast
Housing Starts ForecastJohnMonty15
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)Harsh Upadhyay
 

Similaire à Presentation2 stats (20)

Simple Regression Years with Midwest and Shelf Space Winter .docx
Simple Regression Years with Midwest and Shelf Space Winter .docxSimple Regression Years with Midwest and Shelf Space Winter .docx
Simple Regression Years with Midwest and Shelf Space Winter .docx
 
My favorite online teaching tool
My favorite online teaching toolMy favorite online teaching tool
My favorite online teaching tool
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
15 regression basics
15 regression basics15 regression basics
15 regression basics
 
Multiple Regression Case
Multiple Regression CaseMultiple Regression Case
Multiple Regression Case
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
regressionanalysis-110723130213-phpapp02.pdf
regressionanalysis-110723130213-phpapp02.pdfregressionanalysis-110723130213-phpapp02.pdf
regressionanalysis-110723130213-phpapp02.pdf
 
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).ppt
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).pptEE301 Lesson 15 Phasors Complex Numbers and Impedance (2).ppt
EE301 Lesson 15 Phasors Complex Numbers and Impedance (2).ppt
 
linear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annovalinear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annova
 
Regression
RegressionRegression
Regression
 
ML Module 3.pdf
ML Module 3.pdfML Module 3.pdf
ML Module 3.pdf
 
Duel simplex method_operations research .pptx
Duel simplex method_operations research .pptxDuel simplex method_operations research .pptx
Duel simplex method_operations research .pptx
 
Chapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares RegressionChapter 2 part3-Least-Squares Regression
Chapter 2 part3-Least-Squares Regression
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Input analysis
Input analysisInput analysis
Input analysis
 
Housing Starts Forecast
Housing Starts ForecastHousing Starts Forecast
Housing Starts Forecast
 
01_SLR_final (1).pptx
01_SLR_final (1).pptx01_SLR_final (1).pptx
01_SLR_final (1).pptx
 
Presentech'10
Presentech'10Presentech'10
Presentech'10
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)
 

Plus de Prerna Makhijani (16)

Apple
AppleApple
Apple
 
Cola wars 2010
Cola wars 2010Cola wars 2010
Cola wars 2010
 
Ethics in HR, Medical
Ethics in HR, MedicalEthics in HR, Medical
Ethics in HR, Medical
 
Pepsi's entry into India
Pepsi's entry into IndiaPepsi's entry into India
Pepsi's entry into India
 
Carrefour
CarrefourCarrefour
Carrefour
 
Fm analysis itc
Fm analysis itcFm analysis itc
Fm analysis itc
 
E globuz z-vol1_issue_iii
E globuz z-vol1_issue_iiiE globuz z-vol1_issue_iii
E globuz z-vol1_issue_iii
 
E globuzz issue-iv
E globuzz issue-ivE globuzz issue-iv
E globuzz issue-iv
 
E globuzz vol ii issue iii
E globuzz vol ii issue iiiE globuzz vol ii issue iii
E globuzz vol ii issue iii
 
E globuz z vol ii issue ii
E globuz z vol ii issue iiE globuz z vol ii issue ii
E globuz z vol ii issue ii
 
E globuzz vol ii issue i
E globuzz vol ii issue iE globuzz vol ii issue i
E globuzz vol ii issue i
 
E globuz z vol 1, issue v
E globuz z vol 1, issue vE globuz z vol 1, issue v
E globuz z vol 1, issue v
 
E globuz z-vol1_issue2
E globuz z-vol1_issue2E globuz z-vol1_issue2
E globuz z-vol1_issue2
 
Essence of macroeconomics
Essence of macroeconomicsEssence of macroeconomics
Essence of macroeconomics
 
E commerce strategy
E commerce strategyE commerce strategy
E commerce strategy
 
E business strategy
E business strategyE business strategy
E business strategy
 

Dernier

8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Servicecallgirls2057
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In.../:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...lizamodels9
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...lizamodels9
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCRashishs7044
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCRashishs7044
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...lizamodels9
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfRbc Rbcua
 

Dernier (20)

Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In.../:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad ➥9990211544 Independent Best Escorts In...
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
Lowrate Call Girls In Sector 18 Noida ❤️8860477959 Escorts 100% Genuine Servi...
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdf
 

Presentation2 stats

  • 1. REGRESSION MODELS By: Ayush Sharma 09 Mickey Haldia 19 Prerna Makhijani 29 Sanoj George 39 Sushant Jaggi 49 Nitish Dorle 59
  • 2. Example Year Population on Farm (in millions) 1935 32.1 1940 30.5 1945 24.4 1950 23.0 1955 19.1 1960 15.6 1965 12.5
  • 3. Scatter Plot Population(in millions) 35 30 25 20 15 Poplation(in millions) 10 5 0 1930 1940 1950 1960 1970
  • 4. Correlation Coefficient (r)  It is a measure of strength of the linear relationship between two variables and is calculated using the following formula:
  • 5. Interpretation  After calculating we find r = -0.993  There is a strong negative correlation.
  • 6. Coefficient of Determination  Squaring the correlation coefficient (r) gives us the percent variation in the y-variable that is described by the variation in the x-variable  To relate x and y, the Regression Equation is calculated using Least Squares technique.  Regression Equation: Y’ = a +bX  Slope of the regression line:
  • 7. To continue with the example  We found r = -0.993. By squaring we get the Coefficient of Determination (R^2) = 0.987 35 Regression y = -0.671 x + 1,330.350 Population on Farm (in 30 R² = 0.987 millions) 25 20 15 10 1930 1940 Year 1950 1960 1970
  • 8. Interpretation  We conclude that 98.7% of the decrease in farm population can be explained by timeline progression.  Theoretically, population is a dependent variable (y-axis) and timeline is an independent variable (x-axis).
  • 9. Assumptions of the Regression Model  The following assumptions are made about the errors: a) The errors are independent b) The errors are normally distributed c) The errors have a mean of zero d) The errors have a constant variance(regardless of the value of X)
  • 10. Patterns of Indicating Errors Error X
  • 11. Estimating the Variance  The error variance is measured by the MSE  s2 = MSE= SSE n-k-1 where n = number of observations in the sample k = number of independent variables Therefore the standard deviation will be s = sqrt (MSE)
  • 12. Multiple regression Analysis More than one independent variable Y=β0+β1X1+β2X2+……+βkXk+ϵ Where, Y=dependent variable(response variable) Xi=ith independent variable(predictor variable or explanatory variable) β0= intercept(value of Y when all Xi = 0) βi= coefficient of the ith independent variable k= number of independent variables ϵ= random error To estimate the values of these coefficients, a sample is taken and the following equation is developed : Ῡ= b0+b1X1+b2X2+…….+bkXk where, Ῡ= predicted value of Y b0= sample intercept (and is an estimate of β0) bi= sample coefficient of ith variable(and is an estimate of βi)
  • 13. Testing the Model for Significance • MSE and co-efficient of determination (r2) does not provide a good measure of accuracy when the sample size is small • In this case, it is necessary to test the model for significance • Linear Model is given by, Y=β0 + β1X + ε Null Hypothesis :If β1 = 0, then there is no linear relationship between X and Y Alternate Hypothesis : If β1 ≠ 0, then there is a linear relationship
  • 14. Steps in Hypothesis Test for a Significant Regression Model 1. Specify null and alternative hypothesis. 2. Select the level of significance (α). Common values are between 0.01 and 0.05 3. Calculate the value of the test statistic using the formula: F = MSE/MSE 4. Make a decision using one of the following methods: a) Reject if Fcalculated > Ftable b) Reject if p-value < α
  • 15. Triple A Construction Example  Step 1: H0 :β1 = 0, (no linear relationship between X and Y) H1 :β1 ≠ 0, (linear relationship between X and Y)  Step 2 Select α = 0.05
  • 16. Triple A Construction Example  Step 3: Calculate the value of the test statistic MSR = SSR/k = 15.6250/1 = 15.6250 F = MSR/MSE = 15.6250/1.7188 = 9.09
  • 17. Triple A Construction Example  Step 4: Reject the null hypothesis if the test statistic is greater than the F value from the table. To find table value, we need : Level of Significance (α) = 0.05 df1 = k = 1 df2 = n – k – 1 = 4 where k = number of independent variables n = sample size Using these values, we find Ftable = 7.71 Hence, we reject H0 because 9.09 > 7.71
  • 18. Selling Price ($) Suare Footage AGE Condition 95000 1926 30 GOOD SUMMARY OUTPUT Jenny Wilson Reality 119000 2069 40 Excellent 124800 1720 30 Excellent 135000 1396 15 GOOD 142800 1706 32 Mint Regression Statistics 145000 1847 38 Mint 159000 1950 27 Mint Multiple R The coefficient of 0.819680305 165000 2323 30 Excellent R Square determination r2 0.671875802 182000 2285 26 Mint 183000 3752 35 GOOD Adjusted R Square 0.612216857 200000 2300 18 GOOD 211000 2525 17 GOOD Standard Error 24312.60729 215000 3800 40 Excellent 219000 1740 12 Mint Observations 14 ANOVA df SS MS F Significance F The regression The p-values are Regression 2 13313936968 6.7E+09 11.262 0.002178765 coefficients used to test the Residual 11 6502131603 5.9E+08 individual Total 13 19816068571 variables for significance Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 146630.89 25482.08287 5.75427 0.0001 90545.20735 202717 90545 202717 SF 43.819366 10.28096507 4.26218 0.0013 21.19111495 66.448 21.191 66.448 AGE -2898.686 796.5649421 -3.639 0.0039 -4651.91386 -1145 -4651.9 -1145.5
  • 19. Binary or Dummy Variables  Indicator Variable  Assigned a value of 1 if a particular condition is met, 0 otherwise  The number of dummy variables must equal one less than the number of categories of a qualitative variable  The Jenny Wilson realty example : – X3= 1 for excellent condition = 0 otherwise – X4= 1 for mint condition = 0 otherwise
  • 20. Selling Price Suare Footage AGE X3(Exc.) X4(Mint) Condition Jenny Wilson Reality ($) SUMMARY OUTPUT 95000 1926 30 0 0 GOOD 119000 2069 40 1 0 Excellent 124800 1720 30 1 0 Excellent 135000 1396 15 0 0 GOOD Regression Statistics 142800 1706 32 0 1 Mint 145000 1847 38 0 1 Mint Multiple R 0.94762 159000 1950 27 0 1 Mint 165000 2323 30 1 0 Excellent R Square 0.89798 182000 2285 26 0 1 Mint 183000 3752 35 0 0 GOOD Adjusted R Square 0.85264 200000 2300 18 0 0 GOOD 211000 2525 17 0 0 GOOD Standard Error 14987.6 215000 3800 40 1 0 Excellent 219000 1740 12 0 1 Mint Observations 14 The coefficients of age is negative, indicating ANOVA that the price decreases as a house gets older df SS MS F Significance F Regression 4 17794427451 4E+09 19.8044 0.000174421 Residual 9 2021641120 2E+08 Total 13 19816068571 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept 121658 17426.61432 6.9812 6.5E-05 82236.71393 161080 82236.71 161080 SF 56.4276 6.947516792 8.122 2E-05 40.71122594 72.144 40.71123 72.144 AGE -3962.82 596.0278736 -6.6487 9.4E-05 -5311.12866 -2614.5 -5311.129 -2614.5 X3(Exc.) 33162.6 12179.62073 2.7228 0.0235 5610.432651 60714.9 5610.433 60715 X4(Mint) 47369.2 10649.26942 4.4481 0.0016 23278.92699 71459.6 23278.93 71460
  • 21. Model Building  The value of r2 can never decrease when more variables are added to the model  Adjusted r2 often used to determine if an additional independent variable is beneficial  The adjusted r2 is  A variable should not be added to the model if it causes the adjusted r2 to decrease
  • 22. Multiple Regression Sales/Decision to buy = B0+ B1* Price Sales/Decision to buy = B0+ B1* (Price)3+ B2*(Design)2+B3*(Performance) L = (Price)3 M = (Design)2 N = (Performance) Sales/Decision to buy = B0+ B1* L+ B2* M+ B3* N
  • 23. Pitfalls In Regression A High Correlation does not mean one variable is causing a change in another (Some regressions have shown a significantly positive relation between individuals' college GPA and future salary. ) Values of the dependent variable should not be used that are above or below the ones from the sample The number of independent variables that should be used in the model is limited by the number of observations.

Notes de l'éditeur

  1. We take an example of farm population in USA, which has been declining over a period of 30 years. We take Year as the independent variable and Population as dependent variable. We explain the correlation coefficient and coefficient of determination through this example.