SlideShare une entreprise Scribd logo
1  sur  131
Télécharger pour lire hors ligne
Demographic forecasting
using functional data
analysis

Rob J Hyndman

Joint work with: Heather Booth, Han Lin Shang,
                 Shahid Ullah, Farah Yasmeen.
Demographic forecasting using functional data analysis   1
Mortality rates
                                                    France: male mortality (1816)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8




                       0                 20                40               60   80   100

                                                                      Age

             Demographic forecasting using functional data analysis                         2
Fertility rates
                                                    Australia: fertility rates (1921)
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15            20             25                30         35   40   45       50

                                                                           Age

             Demographic forecasting using functional data analysis                             3
Outline

 1    A functional linear model

 2    Bagplots, boxplots and outliers

 3    Functional forecasting

 4    Forecasting groups

 5    Population forecasting

 6    References


Demographic forecasting using functional data analysis   4
Outline

 1    A functional linear model

 2    Bagplots, boxplots and outliers

 3    Functional forecasting

 4    Forecasting groups

 5    Population forecasting

 6    References


Demographic forecasting using functional data analysis   A functional linear model   5
Some notation
Let yt,x be the observed (smoothed) data in period t
at age x, t = 1, . . . , n.
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


     Estimate ft (x) using penalized regression splines.
     Estimate µ(x) as me(di)an ft (x) across years.
     Estimate βt,k and φk (x) using (robust) functional
     principal components.
          iid                  iid
     εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)).
Demographic forecasting using functional data analysis          A functional linear model   6
Some notation
Let yt,x be the observed (smoothed) data in period t
at age x, t = 1, . . . , n.
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


     Estimate ft (x) using penalized regression splines.
     Estimate µ(x) as me(di)an ft (x) across years.
     Estimate βt,k and φk (x) using (robust) functional
     principal components.
          iid                  iid
     εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)).
Demographic forecasting using functional data analysis          A functional linear model   6
Some notation
Let yt,x be the observed (smoothed) data in period t
at age x, t = 1, . . . , n.
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


     Estimate ft (x) using penalized regression splines.
     Estimate µ(x) as me(di)an ft (x) across years.
     Estimate βt,k and φk (x) using (robust) functional
     principal components.
          iid                  iid
     εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)).
Demographic forecasting using functional data analysis          A functional linear model   6
Some notation
Let yt,x be the observed (smoothed) data in period t
at age x, t = 1, . . . , n.
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


     Estimate ft (x) using penalized regression splines.
     Estimate µ(x) as me(di)an ft (x) across years.
     Estimate βt,k and φk (x) using (robust) functional
     principal components.
          iid                  iid
     εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)).
Demographic forecasting using functional data analysis          A functional linear model   6
Some notation
Let yt,x be the observed (smoothed) data in period t
at age x, t = 1, . . . , n.
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


     Estimate ft (x) using penalized regression splines.
     Estimate µ(x) as me(di)an ft (x) across years.
     Estimate βt,k and φk (x) using (robust) functional
     principal components.
          iid                  iid
     εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)).
Demographic forecasting using functional data analysis          A functional linear model   6
French mortality components




                                                                                                                    0.2
        −1




                                                         0.20




                                                                                                                    0.1
        −2




                                                         0.15
        −3




                                                 φ1(x)




                                                                                                            φ2(x)

                                                                                                                    0.0
µ(x)




                                                         0.10
        −4




                                                                                                                    −0.1
        −5




                                                         0.05
        −6




                                                                                                                    −0.2
                                                         0.00
             0   20   40         60   80   100                     0    20    40         60   80      100                  0    20    40         60   80       100
                           Age                                                     Age                                                     Age




                                                                                                                    8
                                                         10




                                                                                                                    6
                                                         5
                                                         0




                                                                                                                    4
                                                 βt1




                                                                                                            βt2
                                                         −5




                                                                                                                    2
                                                         −15 −10




                                                                                                                    0
                                                                                                                    −2
                                                                       1850   1900        1950      2000                       1850   1900        1950     2000
                                                                                    t                                                       t


       Demographic forecasting using functional data analysis                                      A functional linear model                               7
French mortality components
                                                           Residuals
      100
      80
      60
Age

      40
      20
      0




                          1850                        1900                      1950               2000

                                                             Year

  Demographic forecasting using functional data analysis               A functional linear model     7
Outline

 1    A functional linear model

 2    Bagplots, boxplots and outliers

 3    Functional forecasting

 4    Forecasting groups

 5    Population forecasting

 6    References


Demographic forecasting using functional data analysis   Bagplots, boxplots and outliers   8
French male mortality rates
                                            France: male death rates (1900−2009)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8




                      0                20                40               60               80                100

                                                                    Age

           Demographic forecasting using functional data analysis          Bagplots, boxplots and outliers         9
French male mortality rates
                                            France: male death rates (1900−2009)
                 0




                                            War years
                 −2
Log death rate

                 −4
                 −6
                 −8




                      0                20                40               60               80                100

                                                                    Age

           Demographic forecasting using functional data analysis          Bagplots, boxplots and outliers         9
French male mortality rates
                                            France: male death rates (1900−2009)
                 0




                                            War years
                 −2
Log death rate

                 −4
                 −6
                 −8




                                                   Aims
                                                     1 “Boxplots” for functional data

                      0                20            240 Tools for detecting outliers in
                                                                 60      80      100

                                                         functional data
                                                             Age

           Demographic forecasting using functional data analysis   Bagplots, boxplots and outliers   9
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
Let {ft (x)}, t = 1, . . . , n, be a set of curves.
 1 Apply a robust principal component algorithm

                                      n−1
       ft (x) = µ(x) +                       βt,k φk (x)
                                      k =1

                 µ(x) is median curve
                 {φk (x)} are principal components
                 {βt,k } are PC scores
  2    Plot βi ,2 vs βi ,1
   ¯ Each point in scatterplot represents one curve.
   ¯ Outliers show up in bivariate score space.

Demographic forecasting using functional data analysis     Bagplots, boxplots and outliers   10
Robust principal components
                                           Scatterplot of first two PC scores

                                                                                                                       q    q
             6



                                                                                                                       q
                                                                                                                                q

                                                                                                               q
                                                                                                                   q
             4




                                                                                                       q
                                                                                                 q
PC score 2




                                                                                                       q
             2




                                    q
                                   q q
                              q q qq
                             qq       q                                                      q
                       q qq q qq q
                        q
                       q q             q                                                 q
                                       q q
                                       qq q
                                         q q
                                           q q
                                              q
                                             qq                                          q q qq
                                                qq
                                               qq                                        qqq
                                                 q                                   q q q q q qq
                                                                                     q
                                                                                                qq
                                                                                     q qq q q q qq q q
                                                                                                 qq
             0




                                                 q                                 q
                                                 qq
                                                 q
                                                 q
                                                 q                   q   q
                                                  q
                                                  q         q
                                                   q
                                                   qq q
                                                   q      q
                                                  q   q     q
                                                      q q
                                                      q q q
             −2




                     −10                −5                 0                   5                 10                    15

                                                                PC score 1

       Demographic forecasting using functional data analysis                Bagplots, boxplots and outliers                11
Robust principal components
                                           Scatterplot of first two PC scores

                                                                                                                 1915
                                                                                                               1914q q
             6



                                                                                                               1916q
                                                                                                                   1918q
                                                                                                      1944q q
                                                                                                         1917
             4




                                                                                                   1940q
                                                                                             1943q
PC score 2




                                                                                                   1919q
             2




                                    q
                                   q q
                              q q qq
                             qq       q                                             1945q
                       q qq q qq q
                        q
                       q q             q
                                       q q
                                       qq q
                                                                                   1942q
                                         q q
                                           q q
                                              q
                                             qq                                    1941qqqqqqq q
                                                                                       qq
                                                                                        q
                                                qq
                                               qqq                                     qq    q
                                                                                       q qq q q   q qqq
                                                                                                   qqq
                                                                                       q           qq q
             0




                                                 q                                 q
                                                 qq
                                                 q
                                                 q
                                                 q                   q   q
                                                  q
                                                  q         q
                                                   q
                                                   qq q
                                                   q      q
                                                  q   q     q
                                                      q q
                                                      q q q
             −2




                     −10                −5                 0                   5                   10             15

                                                                PC score 1

       Demographic forecasting using functional data analysis                Bagplots, boxplots and outliers           11
Functional bagplot
          Bivariate bagplot due to Rousseeuw et al. (1999).
          Rank points by halfspace location depth.
          Display median, 50% convex hull and outer
          convex hull (with 99% coverage if bivariate
          normal).
                                                                                                                                                                                                 1915
                                                                                                                                                                                               1914q q      q
                                       6

                                                                                                                                                                                                        q




                                                                                                                                                                                               1916qq



                                                                                                                                                                                                   1918q        q




                                                                                                                                                                            1944q q
                                                                                                                                                                               1917
                                                                                                                                                                                           q
                                                                                                                                                                                                q
                                       4




                                                                                                                                                                         1940q         q



                                                                                                                                                           1943q         q
                          PC score 2




                                                                                                                                                                         1919q
                                       2




                                                                                                                                                                                       q

                                                                            q
                                                                        q

                                             q
                                                 q
                                             qq q q   q
                                                          q qqq q
                                                            q       q           q q
                                                                                q
                                                                                  q
                                                                                    q
                                                                                                                                                1945q  q
                                                                                                                                                           q

                                                                                    q
                                                                                     q q q q
                                                                                        q  q
                                                                                             qq
                                                                                              q
                                                                                               q                                                                 q
                                                                                              q q                                                    q       q
                                                                                                                                                        q
                                                                                                     q                                               q q q
                                                                                                                                                     q q q
                                                                                                  q qq                                                     q         q
                                                                                                                                                                     q   q qq
                                                                                                                                                q q q q              q
                                                                                                                                                                             q q
                                                                                                      q                                         q qq                     q qq
                                       0




                                                                                                                                                q         q                        q
                                                                                                     q                                                                   q
                                                                                                                                            q
                                                                                                     q q
                                                                                                     qq
                                                                                                      q                         q
                                                                                                       q                    q       q
                                                                                                       q
                                                                                                       qqq
                                                                                                        q
                                                                                                              q         q
                                                                                                      q       q
                                                                                                              q             q
                                                                                                             q     q   q
                                                                                                                  q
                                       −2




                                            −10                                 −5                                0                     5                                10                         15

                                                                                                                       PC score 1
Demographic forecasting using functional data analysis                                                                 Bagplots, boxplots and outliers                                                              12
Functional bagplot
                                 Bivariate bagplot due to Rousseeuw et al. (1999).
                                 Rank points by halfspace location depth.
                                 Display median, 50% convex hull and outer
                                 convex hull (with 99% coverage if bivariate
                                 normal).
                                 Boundaries contain all curves inside bags.
                                 95% CI for median curve also shown.



                                                                                                                                                                                                          0
                                                                                                                                                                      1915
                                                                                                                                                                    1914q q      q
             6




                                                                                                                                                                             q




                                                                                                                                                                    1916qq



                                                                                                                                                                        1918q        q




                                                                                                                                                                                                          −2
                                                                                                                                                 1944q q
                                                                                                                                                    1917
                                                                                                                                                                q
                                                                                                                                                                     q
             4




                                                                                                                                              1940q         q



                                                                                                                                1943q         q
                                                                                                                                                                                         Log death rate
PC score 2




                                                                                                                                                                                                          −4
                                                                                                                                              1919q
             2




                                                                                                                                                            q

                                                  q
                                              q
                       q
                   qq q q
                   q
                                q qqq q
                                  q       q           q q
                                                      q
                                                        q                                                            1945q      q
                                                                                                                                                                                                          −6

                            q                             q
                                                                                                                            q
                                                          q
                                                           q q q q
                                                              q  q
                                                                   qq
                                                                    q
                                                                     q                                                                q
                                                                    q q                                                   q       q
                                                                                                                             q
                                                                           q                                              q q q
                                                                                                                          q q q
                                                                        q qq                                                    q         q
                                                                                                                                          q   q qq
                                                                                                                     q q q q              q
                                                                                                                                                  q q
                                                                            q                                        q qq                     q qq
             0




                                                                                                                     q         q                        q
                                                                           q                                                                  q
                                                                                                                 q
                                                                           q q
                                                                           qq
                                                                            q
                                                                             q
                                                                                                     q
                                                                                                         q
                                                                                                                                                                                                                                                      1914        1918
                                                                                                                                                                                                          −8




                                                                                                 q
                                                                             q


                                                                            q
                                                                             qqq
                                                                              q
                                                                                    q
                                                                                    q
                                                                                    q
                                                                                             q
                                                                                                 q
                                                                                                                                                                                                                                                      1915        1940
                                                                                   q     q   q
                                                                                        q
                                                                                                                                                                                                                                                      1916        1943
                                                                                                                                                                                                                                                      1917        1944
             −2




                  −10                                 −5                                0                    5                                10                         15                                    0           20        40         60   80           100

                                                                                             PC score 1                                                                                                                                   Age


             Demographic forecasting using functional data analysis                                                                                                                                                Bagplots, boxplots and outliers           12
Functional bagplot
                 0
                 −2
Log death rate

                 −4
                 −6




                                                                                                    1914    1918
                 −8




                                                                                                    1915    1940
                                                                                                    1916    1943
                                                                                                    1917    1944

                      0                  20                  40             60                   80         100

                                                                    Age

           Demographic forecasting using functional data analysis         Bagplots, boxplots and outliers    13
Functional HDR boxplot
                    Bivariate HDR boxplot due to Hyndman (1996).
                    Rank points by value of kernel density estimate.
                    Display mode, 50% and (usually) 99% highest
                    density regions (HDRs) and mode.
                          99% outer region                                                                                                                                             q
                    6




                                                                                                                                                                                   q



                                                                                                                                                                               q


                                                                                                                                                                                           q



                                                                                                                                                                       q
                                                                                                                                                                           q
                    4




                                                                                                                                                                   q



                                                                                                                                       1943q         q
       PC score 2




                                                                                                                                                     1919q
                    2




                                                                                                                                                                   q

                                                         q
                                                     q
                                                 q           q q
                                                             q
                                       q qqq q
                                         q                                                                                             q
                              q                                q
                          qq q q
                          q        q                             q
                                                                                                                                   q
                                                                 q
                                                                  q q q q
                                                                     q  q
                                                                          qq
                                                                           q
                                                                            q                                                                q
                                                                           q q                                                   q       q
                                                                                                                                    q
                                                                                                                                 q q q
                                                                                  q
                                                                               q qq
                                                                                   q
                                                                                                                                 q q q
                                                                                                                            q q q q
                                                                                                                            q qq       o
                                                                                                                                       q         q
                                                                                                                                                 q
                                                                                                                                                 q   q qq
                                                                                                                                                     q qqq q
                    0




                                                                                                                            q         q                        q
                                                                                  q                                                                  q
                                                                                                                        q
                                                                                  q q
                                                                                  qq
                                                                                   q                        q
                                                                                    q                   q       q
                                                                                    q
                                                                                    qqq
                                                                                     q
                                                                                           q        q
                                                                                   q       q
                                                                                           q            q
                                                                                          q     q   q
                                                                                               q
                    −2




                         −10                                 −5                                0                    5                                10                        15

                                                    PC score 1
Demographic forecasting using functional data analysis                                                                      Bagplots, boxplots and outliers                                    14
Functional HDR boxplot
                    Bivariate HDR boxplot due to Hyndman (1996).
                    Rank points by value of kernel density estimate.
                    Display mode, 50% and (usually) 99% highest
                    density regions (HDRs) and mode.
                          90% outer region                                                                                                                                   1915
                                                                                                                                                                           1914q q      q
                    6




                                                                                                                                                                                    q




                                                                                                                                                                           1916qq



                                                                                                                                                                               1918q        q




                                                                                                                                                        1944q q
                                                                                                                                                           1917
                                                                                                                                                                       q
                                                                                                                                                                            q
                    4




                                                                                                                                                     1940q         q



                                                                                                                                       1943q         q
       PC score 2




                                                                                                                                                     1919q
                    2




                                                                                                                                                                   q

                                                         q
                                                     q
                              q
                          qq q q
                          q
                                       q qqq q
                                         q       q           q q
                                                             q
                                                               q                                                         1945q         q
                                   q                             q
                                                                 q
                                                                  q q q q
                                                                     q
                                                                                                                        1942q      q

                                                                        q
                                                                          qq
                                                                           q
                                                                            q                                                                q
                                                                           q q                                                   q       q
                                                                                                                                    q
                                                                                                                                 q q q
                                                                                  q
                                                                               q qq
                                                                                   q
                                                                                                                                 q q q
                                                                                                                            q q q q
                                                                                                                            q qq       o
                                                                                                                                       q         q
                                                                                                                                                 q
                                                                                                                                                 q   q qq
                                                                                                                                                     q qqq q
                    0




                                                                                                                            q         q                        q
                                                                                  q                                                                  q
                                                                                                                        q
                                                                                  q q
                                                                                  qq
                                                                                   q                        q
                                                                                    q                   q       q
                                                                                    q
                                                                                    qqq
                                                                                     q
                                                                                           q        q
                                                                                   q       q
                                                                                           q            q
                                                                                          q     q   q
                                                                                               q
                    −2




                         −10                                 −5                                0                    5                                10                         15

                                                    PC score 1
Demographic forecasting using functional data analysis                                                                      Bagplots, boxplots and outliers                                     14
Functional HDR boxplot
                                 Bivariate HDR boxplot due to Hyndman (1996).
                                 Rank points by value of kernel density estimate.
                                 Display mode, 50% and (usually) 99% highest
                                 density regions (HDRs) and mode.
                                 Boundaries contain all curves inside HDRs.




                                                                                                                                                                                                             0
                   90% outer region                                                                                                                                      1915
                                                                                                                                                                       1914q q      q
             6




                                                                                                                                                                                q




                                                                                                                                                                       1916qq



                                                                                                                                                                           1918q        q




                                                                                                                                                                                                             −2
                                                                                                                                                           1944q q
                                                                                                                                                             1917
                                                                                                                                                                   q
                                                                                                                                                                        q
             4




                                                                                                                                                 1940q         q




                                                                                                                                                                                            Log death rate
                                                                                                                                  1943q          q
PC score 2




                                                                                                                                                                                                             −4
                                                                                                                                                 1919q
             2




                                                                                                                                                               q

                                                  q
                                              q
                       q
                   qq q q
                   q
                                q qqq q
                                  q       q           q q
                                                      q
                                                        q                                                         1945q           q
                                                                                                                                                                                                             −6
                            q                             q
                                                          q
                                                           q q q q
                                                              q
                                                                                                                 1942q        q

                                                                 q
                                                                   qq
                                                                    q
                                                                     q
                                                                    q q                                                      q
                                                                                                                                q
                                                                                                                             q q q
                                                                                                                                     q
                                                                                                                                         q
                                                                                                                                                                                                                                                         1914        1940
                                                                           q
                                                                        q qq
                                                                            q                                        q q q
                                                                                                                     q qq         o
                                                                                                                             q qq qq         q
                                                                                                                                             q
                                                                                                                                             q   q qq
                                                                                                                                                 q qqq q
                                                                                                                                                                                                                                                         1915        1943
             0




                                                                                                                     q            q                        q
                                                                           q                                                                     q
                                                                                                                 q
                                                                           q q
                                                                           qq
                                                                            q
                                                                             q
                                                                                                     q
                                                                                                         q
                                                                                                                                                                                                                                                         1916        1944
                                                                                                                                                                                                             −8




                                                                                                 q
                                                                             q


                                                                            q
                                                                             qqq
                                                                              q
                                                                                    q
                                                                                    q
                                                                                    q
                                                                                             q
                                                                                                 q
                                                                                                                                                                                                                                                         1917        1945
                                                                                   q     q   q
                                                                                        q
                                                                                                                                                                                                                                                         1918        1948
                                                                                                                                                                                                                                                         1919
             −2




                  −10                                 −5                                0                    5                                   10                         15                                    0           20        40         60   80           100

                                                                                             PC score 1                                                                                                                                      Age



             Demographic forecasting using functional data analysis                                                                                                                                                   Bagplots, boxplots and outliers           14
Functional HDR boxplot
                 0
                 −2
Log death rate

                 −4
                 −6




                                                                                                    1914    1940
                                                                                                    1915    1943
                                                                                                    1916    1944
                 −8




                                                                                                    1917    1945
                                                                                                    1918    1948
                                                                                                    1919

                      0                  20                  40             60                   80         100

                                                                    Age

           Demographic forecasting using functional data analysis         Bagplots, boxplots and outliers    15
Outline

 1    A functional linear model

 2    Bagplots, boxplots and outliers

 3    Functional forecasting

 4    Forecasting groups

 5    Population forecasting

 6    References


Demographic forecasting using functional data analysis   Functional forecasting   16
Functional time series model
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1

          The eigenfunctions φk (x) show the main
          regions of variation.
          The scores {βt,k } are uncorrelated by
          construction. So we can forecast each βt,k
          using a univariate time series model.
          Outliers are treated as missing values.
          Univariate ARIMA models are used for
          forecasting.
Demographic forecasting using functional data analysis           Functional forecasting   17
Functional time series model
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1

          The eigenfunctions φk (x) show the main
          regions of variation.
          The scores {βt,k } are uncorrelated by
          construction. So we can forecast each βt,k
          using a univariate time series model.
          Outliers are treated as missing values.
          Univariate ARIMA models are used for
          forecasting.
Demographic forecasting using functional data analysis           Functional forecasting   17
Functional time series model
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1

          The eigenfunctions φk (x) show the main
          regions of variation.
          The scores {βt,k } are uncorrelated by
          construction. So we can forecast each βt,k
          using a univariate time series model.
          Outliers are treated as missing values.
          Univariate ARIMA models are used for
          forecasting.
Demographic forecasting using functional data analysis           Functional forecasting   17
Functional time series model
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1

          The eigenfunctions φk (x) show the main
          regions of variation.
          The scores {βt,k } are uncorrelated by
          construction. So we can forecast each βt,k
          using a univariate time series model.
          Outliers are treated as missing values.
          Univariate ARIMA models are used for
          forecasting.
Demographic forecasting using functional data analysis           Functional forecasting   17
Functional time series model
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1

          The eigenfunctions φk (x) show the main
          regions of variation.
          The scores {βt,k } are uncorrelated by
          construction. So we can forecast each βt,k
          using a univariate time series model.
          Outliers are treated as missing values.
          Univariate ARIMA models are used for
          forecasting.
Demographic forecasting using functional data analysis           Functional forecasting   17
Forecasts
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1




Demographic forecasting using functional data analysis           Functional forecasting   18
Forecasts
                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1
                                                    K
     E[yn+h ,x | y] = µ(x) +
                      ˆ                                  ˆ       ˆ
                                                         βn+h ,k φk (x)
                                                  k =1
                                                     K
                   ˆ2
Var[yn+h ,x | y] = σµ (x) +                                        ˆ2
                                                           vn+h ,k φk (x) + σt2 (x) + v(x)
                                                    k =1

where vn+h ,k = Var(βn+h ,k | β1,k , . . . , βn,k )
and y = [y1,x , . . . , yn,x ].
Demographic forecasting using functional data analysis           Functional forecasting   18
Forecasting the PC scores
                      Main effects                                                        Interaction
        0




                                                                                                                                          0.2
                                                                0.20
        −2




                                             Basis function 1




                                                                                                                       Basis function 2
                                                                                                                                          0.1
                                                                0.15
Mean




                                                                                                                                          0.0
        −4




                                                                0.10




                                                                                                                                          −0.1
                                                                0.05
        −6




                                                                                                                                          −0.2
             0   20    40    60   80   100                               0          20        40     60   80   100                                0             20        40     60   80   100
                            Age                                                                    Age                                                                         Age
                                                                10 15
                                                                                    q




                                                                                                                                          2
                                                                                q
                                                                               qq
                                                                                q
                                                                                              q
                                                                                          q

                                                                                              q




                                                                                                                                          0
                                                                5
                                             Coefficient 1




                                                                                                                       Coefficient 2
                                                                0




                                                                                                                                          −2
                                                                                                                                                                          q


                                                                                                                                                                      q
                                                                −10




                                                                                                                                                                          q




                                                                                                                                          −4
                                                                                                                                                            q




                                                                                                                                                                q




                                                                                                                                          −6
                                                                                                                                                            q
                                                                −20




                                                                                                                                                        q
                                                                                                                                                        q




                                                                        1900             1940         1980     2020                              1900                1940         1980     2020
                                                                                                   Year                                                                        Year


       Demographic forecasting using functional data analysis                                                  Functional forecasting                                                      19
Forecasts of ft (x)
                                            France: male death rates (1900−2009)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8
                 −10




                       0                 20                  40           60                 80    100

                                                                    Age

           Demographic forecasting using functional data analysis         Functional forecasting   20
Forecasts of ft (x)
                                            France: male death rates (1900−2009)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8
                 −10




                       0                 20                  40           60                 80    100

                                                                    Age

           Demographic forecasting using functional data analysis         Functional forecasting   20
Forecasts of ft (x)
                                        France: male death forecasts (2010−2029)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8
                 −10




                       0                 20                  40           60                 80    100

                                                                    Age

           Demographic forecasting using functional data analysis         Functional forecasting   20
Forecasts of ft (x)
                                       France: male death forecasts (2010 & 2029)
                 0
                 −2
Log death rate

                 −4
                 −6
                 −8
                 −10




                                                          80% prediction intervals

                       0                 20                  40              60                  80    100

                                                                    Age

           Demographic forecasting using functional data analysis             Functional forecasting   20
Fertility application
                                             Australia fertility rates (1921−2009)
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15           20            25                30         35           40           45        50

                                                                         Age

           Demographic forecasting using functional data analysis               Functional forecasting        21
Fertility model




                                                          0.2
     15




                                                                                                                             0.25
                                                          0.1
     10




                                                          0.0
                                                  Φ1(x)




                                                                                                                     Φ2(x)

                                                                                                                             0.15
Μ




                                                          −0.1
     5




                                                                                                                             0.05
     0




                                                          −0.3
          15   20   25   30   35   40   45   50                  15     20     25   30       35   40     45    50                    15     20     25   30       35   40     45    50
                          Age                                                        Age                                                                 Age
                                                          10




                                                                                                                             8
                                                                                                                             6
                                                          5




                                                                                                                             4
                                                          0
                                                  Βt1




                                                                                                                     Βt2

                                                                                                                             2
                                                          −5




                                                                                                                             0
                                                                                                                             −4 −2
                                                          −10




                                                                 1970        1980   1990          2000        2010                   1970        1980   1990          2000        2010
                                                                                         t                                                                   t


    Demographic forecasting using functional data analysis                                                Functional forecasting                                           22
Fertility model
                                                           Residuals
      45
      40
      35
Age

      30
      25
      20
      15




        1970                       1980                       1990                     2000

                                                             Year

  Demographic forecasting using functional data analysis               Functional forecasting   23
Fertility model
                       Main effects                                                                Interaction




                                                                        0.2
        15




                                                                                                                                                      0.25
                                                                        0.1
                                                     Basis function 1




                                                                                                                                   Basis function 2
        10




                                                                        0.0
Mean




                                                                                                                                                      0.15
                                                                        −0.1
        5




                                                                                                                                                      0.05
        0




                                                                        −0.3
             15   20   25   30   35   40   45   50                                    15     20   25   30   35    40   45    50                               15     20   25   30   35    40   45    50
                             Age                                                                         Age                                                                     Age
                                                                        10 15 20 25




                                                                                                                                                      8
                                                                                                                                                      6
                                                     Coefficient 1




                                                                                                                                   Coefficient 2
                                                                                                                                                      4
                                                                                                                                                      2
                                                                        5




                                                                                                                                                      0
                                                                        0




                                                                                                                                                      −4 −2
                                                                        −10




                                                                                      1970        1990          2010        2030                              1970        1990          2010        2030
                                                                                                         Year                                                                    Year


       Demographic forecasting using functional data analysis                                                           Functional forecasting                                                 24
Forecasts of ft (x)
                                             Australia fertility rates (1921−2009)
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15           20            25                30         35           40           45        50

                                                                         Age

           Demographic forecasting using functional data analysis               Functional forecasting        25
Forecasts of ft (x)
                                             Australia fertility rates (1921−2009)
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15           20            25                30         35           40           45        50

                                                                         Age

           Demographic forecasting using functional data analysis               Functional forecasting        25
Forecasts of ft (x)
                                              Australia fertility rates: 2010−2029
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15           20            25                30         35           40           45        50

                                                                         Age

           Demographic forecasting using functional data analysis               Functional forecasting        25
Forecasts of ft (x)
                                            Australia fertility rates: 2010 and 2029
                                                                                        80% prediction intervals
                 250
                 200
Fertility rate

                 150
                 100
                 50
                 0




                       15           20            25                30         35           40           45             50

                                                                         Age

           Demographic forecasting using functional data analysis               Functional forecasting             25
Outline

 1    A functional linear model

 2    Bagplots, boxplots and outliers

 3    Functional forecasting

 4    Forecasting groups

 5    Population forecasting

 6    References


Demographic forecasting using functional data analysis   Forecasting groups   26
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
The problem

Let ft,j (x) be the smoothed mortality rate for age x in
group j in year t.
          Groups may be males and females.
          Groups may be states within a country.
          Expected that groups will behave similarly.
          Coherent forecasts do not diverge over time.
          Existing functional models do not impose
          coherence.



Demographic forecasting using functional data analysis   Forecasting groups   27
Forecasting the coefficients

                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


          We use ARIMA models for each coefficient
          {β1,j ,k , . . . , βn,j ,k }.
          The ARIMA models are non-stationary for the
          first few coefficients (k = 1, 2)
          Non-stationary ARIMA forecasts will diverge.
          Hence the mortality forecasts are not coherent.

Demographic forecasting using functional data analysis            Forecasting groups   28
Forecasting the coefficients

                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


          We use ARIMA models for each coefficient
          {β1,j ,k , . . . , βn,j ,k }.
          The ARIMA models are non-stationary for the
          first few coefficients (k = 1, 2)
          Non-stationary ARIMA forecasts will diverge.
          Hence the mortality forecasts are not coherent.

Demographic forecasting using functional data analysis            Forecasting groups   28
Forecasting the coefficients

                       yt,x = ft (x) + σt (x)εt,x
                                                     K
                     ft (x) = µ(x) +                      βt,k φk (x) + et (x)
                                                   k =1


          We use ARIMA models for each coefficient
          {β1,j ,k , . . . , βn,j ,k }.
          The ARIMA models are non-stationary for the
          first few coefficients (k = 1, 2)
          Non-stationary ARIMA forecasts will diverge.
          Hence the mortality forecasts are not coherent.

Demographic forecasting using functional data analysis            Forecasting groups   28
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting
Demographic forecasting

Contenu connexe

Tendances

Fcv hist zhu
Fcv hist zhuFcv hist zhu
Fcv hist zhu
zukun
 
7. toda yamamoto-granger causality
7. toda yamamoto-granger causality7. toda yamamoto-granger causality
7. toda yamamoto-granger causality
Quang Hoang
 
5. cem granger causality ecm
5. cem granger causality  ecm 5. cem granger causality  ecm
5. cem granger causality ecm
Quang Hoang
 
Nonlinear Filtering and Path Integral Method (Paper Review)
Nonlinear Filtering and Path Integral Method (Paper Review)Nonlinear Filtering and Path Integral Method (Paper Review)
Nonlinear Filtering and Path Integral Method (Paper Review)
Kohta Ishikawa
 
4. standard granger causality
4. standard granger causality4. standard granger causality
4. standard granger causality
Quang Hoang
 
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Chiheb Ben Hammouda
 
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Patrick Diehl
 
Visualizing, Modeling and Forecasting of Functional Time Series
Visualizing, Modeling and Forecasting of Functional Time SeriesVisualizing, Modeling and Forecasting of Functional Time Series
Visualizing, Modeling and Forecasting of Functional Time Series
hanshang
 

Tendances (20)

Fcv hist zhu
Fcv hist zhuFcv hist zhu
Fcv hist zhu
 
Levy processes in the energy markets
Levy processes in the energy marketsLevy processes in the energy markets
Levy processes in the energy markets
 
7. toda yamamoto-granger causality
7. toda yamamoto-granger causality7. toda yamamoto-granger causality
7. toda yamamoto-granger causality
 
5. cem granger causality ecm
5. cem granger causality  ecm 5. cem granger causality  ecm
5. cem granger causality ecm
 
Random Matrix Theory and Machine Learning - Part 1
Random Matrix Theory and Machine Learning - Part 1Random Matrix Theory and Machine Learning - Part 1
Random Matrix Theory and Machine Learning - Part 1
 
Andreas Eberle
Andreas EberleAndreas Eberle
Andreas Eberle
 
Random Matrix Theory and Machine Learning - Part 2
Random Matrix Theory and Machine Learning - Part 2Random Matrix Theory and Machine Learning - Part 2
Random Matrix Theory and Machine Learning - Part 2
 
Nonlinear Filtering and Path Integral Method (Paper Review)
Nonlinear Filtering and Path Integral Method (Paper Review)Nonlinear Filtering and Path Integral Method (Paper Review)
Nonlinear Filtering and Path Integral Method (Paper Review)
 
4. standard granger causality
4. standard granger causality4. standard granger causality
4. standard granger causality
 
Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4
 
Random Matrix Theory and Machine Learning - Part 3
Random Matrix Theory and Machine Learning - Part 3Random Matrix Theory and Machine Learning - Part 3
Random Matrix Theory and Machine Learning - Part 3
 
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
 
On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...
 
Gwt sdm public
Gwt sdm publicGwt sdm public
Gwt sdm public
 
Sparse Binary Zero Sum Games (ACML2014)
Sparse Binary Zero Sum Games (ACML2014)Sparse Binary Zero Sum Games (ACML2014)
Sparse Binary Zero Sum Games (ACML2014)
 
Optimal Control and COVID-19
Optimal Control and COVID-19Optimal Control and COVID-19
Optimal Control and COVID-19
 
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
 
Visualizing, Modeling and Forecasting of Functional Time Series
Visualizing, Modeling and Forecasting of Functional Time SeriesVisualizing, Modeling and Forecasting of Functional Time Series
Visualizing, Modeling and Forecasting of Functional Time Series
 
An Introduction to HSIC for Independence Testing
An Introduction to HSIC for Independence TestingAn Introduction to HSIC for Independence Testing
An Introduction to HSIC for Independence Testing
 
05 Random Variables
05 Random Variables05 Random Variables
05 Random Variables
 

En vedette

Living Longer At What Price- Mortality Modelling
Living Longer At What Price- Mortality ModellingLiving Longer At What Price- Mortality Modelling
Living Longer At What Price- Mortality Modelling
Redington
 
Senior project
Senior projectSenior project
Senior project
Amy Ko
 
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
hanshang
 
Water demand and factor affecting water demand
Water demand and factor affecting water demandWater demand and factor affecting water demand
Water demand and factor affecting water demand
Ankit Gola
 

En vedette (16)

Living Longer At What Price- Mortality Modelling
Living Longer At What Price- Mortality ModellingLiving Longer At What Price- Mortality Modelling
Living Longer At What Price- Mortality Modelling
 
Senior project
Senior projectSenior project
Senior project
 
Public Safety, Public Spending: Forecasting America’s Prison Population, 2007...
Public Safety, Public Spending: Forecasting America’s Prison Population, 2007...Public Safety, Public Spending: Forecasting America’s Prison Population, 2007...
Public Safety, Public Spending: Forecasting America’s Prison Population, 2007...
 
Population forecasting
Population forecastingPopulation forecasting
Population forecasting
 
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
Modeling and forecasting age-specific mortality: Lee-Carter method vs. Functi...
 
Exploring the feature space of large collections of time series
Exploring the feature space of large collections of time seriesExploring the feature space of large collections of time series
Exploring the feature space of large collections of time series
 
Forecasting Hierarchical Time Series
Forecasting Hierarchical Time SeriesForecasting Hierarchical Time Series
Forecasting Hierarchical Time Series
 
Advances in automatic time series forecasting
Advances in automatic time series forecastingAdvances in automatic time series forecasting
Advances in automatic time series forecasting
 
Water demand and Population forecasting
Water demand and Population forecastingWater demand and Population forecasting
Water demand and Population forecasting
 
POPULATION PROJECTION
POPULATION PROJECTION POPULATION PROJECTION
POPULATION PROJECTION
 
L 7 population forecasting
L 7 population forecastingL 7 population forecasting
L 7 population forecasting
 
Poulation forecasting
Poulation forecastingPoulation forecasting
Poulation forecasting
 
L 3 and l-4 water demand
L 3  and l-4 water demandL 3  and l-4 water demand
L 3 and l-4 water demand
 
Water requirements
Water requirementsWater requirements
Water requirements
 
Water demand forecasting
Water demand forecastingWater demand forecasting
Water demand forecasting
 
Water demand and factor affecting water demand
Water demand and factor affecting water demandWater demand and factor affecting water demand
Water demand and factor affecting water demand
 

Similaire à Demographic forecasting

Thesis presentation hikmet
Thesis presentation hikmetThesis presentation hikmet
Thesis presentation hikmet
sshikmet
 
Introduction to matlab
Introduction to matlabIntroduction to matlab
Introduction to matlab
krishna_093
 
Graphs in physics
Graphs in physicsGraphs in physics
Graphs in physics
simonandisa
 
分かりやすいパターン認識第8章 学習アルゴリズムの一般化
分かりやすいパターン認識第8章 学習アルゴリズムの一般化分かりやすいパターン認識第8章 学習アルゴリズムの一般化
分かりやすいパターン認識第8章 学習アルゴリズムの一般化
Yohei Sato
 
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
ohenebabismark508
 

Similaire à Demographic forecasting (20)

Thesis presentation hikmet
Thesis presentation hikmetThesis presentation hikmet
Thesis presentation hikmet
 
Actuarial analysis in Social Security by Hikmet Tagiyev
Actuarial analysis in Social Security by Hikmet TagiyevActuarial analysis in Social Security by Hikmet Tagiyev
Actuarial analysis in Social Security by Hikmet Tagiyev
 
1010n3a
1010n3a1010n3a
1010n3a
 
Ph.D. Presentation
Ph.D. PresentationPh.D. Presentation
Ph.D. Presentation
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
22nd BSS meeting poster
22nd BSS meeting poster 22nd BSS meeting poster
22nd BSS meeting poster
 
Calibrating the Lee-Carter and the Poisson Lee-Carter models via Neural Netw...
Calibrating the Lee-Carter and the Poisson Lee-Carter models  via Neural Netw...Calibrating the Lee-Carter and the Poisson Lee-Carter models  via Neural Netw...
Calibrating the Lee-Carter and the Poisson Lee-Carter models via Neural Netw...
 
Introduction to matlab
Introduction to matlabIntroduction to matlab
Introduction to matlab
 
Sequential Selection of Correlated Ads by POMDPs
Sequential Selection of Correlated Ads by POMDPsSequential Selection of Correlated Ads by POMDPs
Sequential Selection of Correlated Ads by POMDPs
 
Graphs in physics
Graphs in physicsGraphs in physics
Graphs in physics
 
分かりやすいパターン認識第8章 学習アルゴリズムの一般化
分かりやすいパターン認識第8章 学習アルゴリズムの一般化分かりやすいパターン認識第8章 学習アルゴリズムの一般化
分かりやすいパターン認識第8章 学習アルゴリズムの一般化
 
YSC 2013
YSC 2013YSC 2013
YSC 2013
 
Yasser y thesis
Yasser y thesisYasser y thesis
Yasser y thesis
 
Application of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scoresApplication of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scores
 
11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scores11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scores
 
Applied Business Statistics ,ken black , ch 5
Applied Business Statistics ,ken black , ch 5Applied Business Statistics ,ken black , ch 5
Applied Business Statistics ,ken black , ch 5
 
Bayesian Core: Chapter 7
Bayesian Core: Chapter 7Bayesian Core: Chapter 7
Bayesian Core: Chapter 7
 
ISI MSQE Entrance Question Paper (2010)
ISI MSQE Entrance Question Paper (2010)ISI MSQE Entrance Question Paper (2010)
ISI MSQE Entrance Question Paper (2010)
 
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
Ch 6 Slides.doc/9929292929292919299292@:&:&:&9/92
 
Ses 3 quadratic equations
Ses 3 quadratic equationsSes 3 quadratic equations
Ses 3 quadratic equations
 

Plus de Rob Hyndman

Forecasting electricity demand distributions using a semiparametric additive ...
Forecasting electricity demand distributions using a semiparametric additive ...Forecasting electricity demand distributions using a semiparametric additive ...
Forecasting electricity demand distributions using a semiparametric additive ...
Rob Hyndman
 

Plus de Rob Hyndman (16)

Automatic algorithms for time series forecasting
Automatic algorithms for time series forecastingAutomatic algorithms for time series forecasting
Automatic algorithms for time series forecasting
 
Exploring the boundaries of predictability
Exploring the boundaries of predictabilityExploring the boundaries of predictability
Exploring the boundaries of predictability
 
MEFM: An R package for long-term probabilistic forecasting of electricity demand
MEFM: An R package for long-term probabilistic forecasting of electricity demandMEFM: An R package for long-term probabilistic forecasting of electricity demand
MEFM: An R package for long-term probabilistic forecasting of electricity demand
 
Visualization of big time series data
Visualization of big time series dataVisualization of big time series data
Visualization of big time series data
 
Probabilistic forecasting of long-term peak electricity demand
Probabilistic forecasting of long-term peak electricity demandProbabilistic forecasting of long-term peak electricity demand
Probabilistic forecasting of long-term peak electricity demand
 
Visualization and forecasting of big time series data
Visualization and forecasting of big time series dataVisualization and forecasting of big time series data
Visualization and forecasting of big time series data
 
Academia sinica jan-2015
Academia sinica jan-2015Academia sinica jan-2015
Academia sinica jan-2015
 
Automatic time series forecasting
Automatic time series forecastingAutomatic time series forecasting
Automatic time series forecasting
 
Automatic time series forecasting
Automatic time series forecastingAutomatic time series forecasting
Automatic time series forecasting
 
Forecasting using R
Forecasting using RForecasting using R
Forecasting using R
 
SimpleR: tips, tricks & tools
SimpleR: tips, tricks & toolsSimpleR: tips, tricks & tools
SimpleR: tips, tricks & tools
 
Forecasting without forecasters
Forecasting without forecastersForecasting without forecasters
Forecasting without forecasters
 
R tools for hierarchical time series
R tools for hierarchical time seriesR tools for hierarchical time series
R tools for hierarchical time series
 
Ysc2013
Ysc2013Ysc2013
Ysc2013
 
Forecasting electricity demand distributions using a semiparametric additive ...
Forecasting electricity demand distributions using a semiparametric additive ...Forecasting electricity demand distributions using a semiparametric additive ...
Forecasting electricity demand distributions using a semiparametric additive ...
 
FPP 1. Getting started
FPP 1. Getting startedFPP 1. Getting started
FPP 1. Getting started
 

Dernier

Dernier (20)

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 

Demographic forecasting

  • 1. Demographic forecasting using functional data analysis Rob J Hyndman Joint work with: Heather Booth, Han Lin Shang, Shahid Ullah, Farah Yasmeen. Demographic forecasting using functional data analysis 1
  • 2. Mortality rates France: male mortality (1816) 0 −2 Log death rate −4 −6 −8 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis 2
  • 3. Fertility rates Australia: fertility rates (1921) 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis 3
  • 4. Outline 1 A functional linear model 2 Bagplots, boxplots and outliers 3 Functional forecasting 4 Forecasting groups 5 Population forecasting 6 References Demographic forecasting using functional data analysis 4
  • 5. Outline 1 A functional linear model 2 Bagplots, boxplots and outliers 3 Functional forecasting 4 Forecasting groups 5 Population forecasting 6 References Demographic forecasting using functional data analysis A functional linear model 5
  • 6. Some notation Let yt,x be the observed (smoothed) data in period t at age x, t = 1, . . . , n. yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Estimate ft (x) using penalized regression splines. Estimate µ(x) as me(di)an ft (x) across years. Estimate βt,k and φk (x) using (robust) functional principal components. iid iid εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)). Demographic forecasting using functional data analysis A functional linear model 6
  • 7. Some notation Let yt,x be the observed (smoothed) data in period t at age x, t = 1, . . . , n. yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Estimate ft (x) using penalized regression splines. Estimate µ(x) as me(di)an ft (x) across years. Estimate βt,k and φk (x) using (robust) functional principal components. iid iid εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)). Demographic forecasting using functional data analysis A functional linear model 6
  • 8. Some notation Let yt,x be the observed (smoothed) data in period t at age x, t = 1, . . . , n. yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Estimate ft (x) using penalized regression splines. Estimate µ(x) as me(di)an ft (x) across years. Estimate βt,k and φk (x) using (robust) functional principal components. iid iid εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)). Demographic forecasting using functional data analysis A functional linear model 6
  • 9. Some notation Let yt,x be the observed (smoothed) data in period t at age x, t = 1, . . . , n. yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Estimate ft (x) using penalized regression splines. Estimate µ(x) as me(di)an ft (x) across years. Estimate βt,k and φk (x) using (robust) functional principal components. iid iid εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)). Demographic forecasting using functional data analysis A functional linear model 6
  • 10. Some notation Let yt,x be the observed (smoothed) data in period t at age x, t = 1, . . . , n. yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Estimate ft (x) using penalized regression splines. Estimate µ(x) as me(di)an ft (x) across years. Estimate βt,k and φk (x) using (robust) functional principal components. iid iid εt,x ∼ N(0, 1) and et (x) ∼ N(0, v(x)). Demographic forecasting using functional data analysis A functional linear model 6
  • 11. French mortality components 0.2 −1 0.20 0.1 −2 0.15 −3 φ1(x) φ2(x) 0.0 µ(x) 0.10 −4 −0.1 −5 0.05 −6 −0.2 0.00 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 Age Age Age 8 10 6 5 0 4 βt1 βt2 −5 2 −15 −10 0 −2 1850 1900 1950 2000 1850 1900 1950 2000 t t Demographic forecasting using functional data analysis A functional linear model 7
  • 12. French mortality components Residuals 100 80 60 Age 40 20 0 1850 1900 1950 2000 Year Demographic forecasting using functional data analysis A functional linear model 7
  • 13. Outline 1 A functional linear model 2 Bagplots, boxplots and outliers 3 Functional forecasting 4 Forecasting groups 5 Population forecasting 6 References Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 8
  • 14. French male mortality rates France: male death rates (1900−2009) 0 −2 Log death rate −4 −6 −8 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 9
  • 15. French male mortality rates France: male death rates (1900−2009) 0 War years −2 Log death rate −4 −6 −8 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 9
  • 16. French male mortality rates France: male death rates (1900−2009) 0 War years −2 Log death rate −4 −6 −8 Aims 1 “Boxplots” for functional data 0 20 240 Tools for detecting outliers in 60 80 100 functional data Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 9
  • 17. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 18. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 19. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 20. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 21. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 22. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 23. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 24. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 25. Robust principal components Let {ft (x)}, t = 1, . . . , n, be a set of curves. 1 Apply a robust principal component algorithm n−1 ft (x) = µ(x) + βt,k φk (x) k =1 µ(x) is median curve {φk (x)} are principal components {βt,k } are PC scores 2 Plot βi ,2 vs βi ,1 ¯ Each point in scatterplot represents one curve. ¯ Outliers show up in bivariate score space. Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 10
  • 26. Robust principal components Scatterplot of first two PC scores q q 6 q q q q 4 q q PC score 2 q 2 q q q q q qq qq q q q qq q qq q q q q q q q q qq q q q q q q qq q q qq qq qq qqq q q q q q q qq q qq q qq q q q qq q q qq 0 q q qq q q q q q q q q q qq q q q q q q q q q q q −2 −10 −5 0 5 10 15 PC score 1 Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 11
  • 27. Robust principal components Scatterplot of first two PC scores 1915 1914q q 6 1916q 1918q 1944q q 1917 4 1940q 1943q PC score 2 1919q 2 q q q q q qq qq q 1945q q qq q qq q q q q q q q qq q 1942q q q q q q qq 1941qqqqqqq q qq q qq qqq qq q q qq q q q qqq qqq q qq q 0 q q qq q q q q q q q q q qq q q q q q q q q q q q −2 −10 −5 0 5 10 15 PC score 1 Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 11
  • 28. Functional bagplot Bivariate bagplot due to Rousseeuw et al. (1999). Rank points by halfspace location depth. Display median, 50% convex hull and outer convex hull (with 99% coverage if bivariate normal). 1915 1914q q q 6 q 1916qq 1918q q 1944q q 1917 q q 4 1940q q 1943q q PC score 2 1919q 2 q q q q q qq q q q q qqq q q q q q q q q 1945q q q q q q q q q q qq q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q qq q qq 0 q q q q q q q q qq q q q q q q qqq q q q q q q q q q q q −2 −10 −5 0 5 10 15 PC score 1 Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 12
  • 29. Functional bagplot Bivariate bagplot due to Rousseeuw et al. (1999). Rank points by halfspace location depth. Display median, 50% convex hull and outer convex hull (with 99% coverage if bivariate normal). Boundaries contain all curves inside bags. 95% CI for median curve also shown. 0 1915 1914q q q 6 q 1916qq 1918q q −2 1944q q 1917 q q 4 1940q q 1943q q Log death rate PC score 2 −4 1919q 2 q q q q qq q q q q qqq q q q q q q q 1945q q −6 q q q q q q q q q q qq q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q qq q qq 0 q q q q q q q q qq q q q q 1914 1918 −8 q q q qqq q q q q q q 1915 1940 q q q q 1916 1943 1917 1944 −2 −10 −5 0 5 10 15 0 20 40 60 80 100 PC score 1 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 12
  • 30. Functional bagplot 0 −2 Log death rate −4 −6 1914 1918 −8 1915 1940 1916 1943 1917 1944 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 13
  • 31. Functional HDR boxplot Bivariate HDR boxplot due to Hyndman (1996). Rank points by value of kernel density estimate. Display mode, 50% and (usually) 99% highest density regions (HDRs) and mode. 99% outer region q 6 q q q q q 4 q 1943q q PC score 2 1919q 2 q q q q q q q q qqq q q q q q qq q q q q q q q q q q q q q qq q q q q q q q q q q q q q qq q q q q q q q q q qq o q q q q q qq q qqq q 0 q q q q q q q q qq q q q q q q qqq q q q q q q q q q q q −2 −10 −5 0 5 10 15 PC score 1 Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 14
  • 32. Functional HDR boxplot Bivariate HDR boxplot due to Hyndman (1996). Rank points by value of kernel density estimate. Display mode, 50% and (usually) 99% highest density regions (HDRs) and mode. 90% outer region 1915 1914q q q 6 q 1916qq 1918q q 1944q q 1917 q q 4 1940q q 1943q q PC score 2 1919q 2 q q q q qq q q q q qqq q q q q q q q 1945q q q q q q q q q q 1942q q q qq q q q q q q q q q q q q q qq q q q q q q q q q qq o q q q q q qq q qqq q 0 q q q q q q q q qq q q q q q q qqq q q q q q q q q q q q −2 −10 −5 0 5 10 15 PC score 1 Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 14
  • 33. Functional HDR boxplot Bivariate HDR boxplot due to Hyndman (1996). Rank points by value of kernel density estimate. Display mode, 50% and (usually) 99% highest density regions (HDRs) and mode. Boundaries contain all curves inside HDRs. 0 90% outer region 1915 1914q q q 6 q 1916qq 1918q q −2 1944q q 1917 q q 4 1940q q Log death rate 1943q q PC score 2 −4 1919q 2 q q q q qq q q q q qqq q q q q q q q 1945q q −6 q q q q q q q q 1942q q q qq q q q q q q q q q q q 1914 1940 q q qq q q q q q qq o q qq qq q q q q qq q qqq q 1915 1943 0 q q q q q q q q qq q q q q 1916 1944 −8 q q q qqq q q q q q q 1917 1945 q q q q 1918 1948 1919 −2 −10 −5 0 5 10 15 0 20 40 60 80 100 PC score 1 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 14
  • 34. Functional HDR boxplot 0 −2 Log death rate −4 −6 1914 1940 1915 1943 1916 1944 −8 1917 1945 1918 1948 1919 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Bagplots, boxplots and outliers 15
  • 35. Outline 1 A functional linear model 2 Bagplots, boxplots and outliers 3 Functional forecasting 4 Forecasting groups 5 Population forecasting 6 References Demographic forecasting using functional data analysis Functional forecasting 16
  • 36. Functional time series model yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 The eigenfunctions φk (x) show the main regions of variation. The scores {βt,k } are uncorrelated by construction. So we can forecast each βt,k using a univariate time series model. Outliers are treated as missing values. Univariate ARIMA models are used for forecasting. Demographic forecasting using functional data analysis Functional forecasting 17
  • 37. Functional time series model yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 The eigenfunctions φk (x) show the main regions of variation. The scores {βt,k } are uncorrelated by construction. So we can forecast each βt,k using a univariate time series model. Outliers are treated as missing values. Univariate ARIMA models are used for forecasting. Demographic forecasting using functional data analysis Functional forecasting 17
  • 38. Functional time series model yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 The eigenfunctions φk (x) show the main regions of variation. The scores {βt,k } are uncorrelated by construction. So we can forecast each βt,k using a univariate time series model. Outliers are treated as missing values. Univariate ARIMA models are used for forecasting. Demographic forecasting using functional data analysis Functional forecasting 17
  • 39. Functional time series model yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 The eigenfunctions φk (x) show the main regions of variation. The scores {βt,k } are uncorrelated by construction. So we can forecast each βt,k using a univariate time series model. Outliers are treated as missing values. Univariate ARIMA models are used for forecasting. Demographic forecasting using functional data analysis Functional forecasting 17
  • 40. Functional time series model yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 The eigenfunctions φk (x) show the main regions of variation. The scores {βt,k } are uncorrelated by construction. So we can forecast each βt,k using a univariate time series model. Outliers are treated as missing values. Univariate ARIMA models are used for forecasting. Demographic forecasting using functional data analysis Functional forecasting 17
  • 41. Forecasts yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 Demographic forecasting using functional data analysis Functional forecasting 18
  • 42. Forecasts yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 K E[yn+h ,x | y] = µ(x) + ˆ ˆ ˆ βn+h ,k φk (x) k =1 K ˆ2 Var[yn+h ,x | y] = σµ (x) + ˆ2 vn+h ,k φk (x) + σt2 (x) + v(x) k =1 where vn+h ,k = Var(βn+h ,k | β1,k , . . . , βn,k ) and y = [y1,x , . . . , yn,x ]. Demographic forecasting using functional data analysis Functional forecasting 18
  • 43. Forecasting the PC scores Main effects Interaction 0 0.2 0.20 −2 Basis function 1 Basis function 2 0.1 0.15 Mean 0.0 −4 0.10 −0.1 0.05 −6 −0.2 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 Age Age Age 10 15 q 2 q qq q q q q 0 5 Coefficient 1 Coefficient 2 0 −2 q q −10 q −4 q q −6 q −20 q q 1900 1940 1980 2020 1900 1940 1980 2020 Year Year Demographic forecasting using functional data analysis Functional forecasting 19
  • 44. Forecasts of ft (x) France: male death rates (1900−2009) 0 −2 Log death rate −4 −6 −8 −10 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Functional forecasting 20
  • 45. Forecasts of ft (x) France: male death rates (1900−2009) 0 −2 Log death rate −4 −6 −8 −10 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Functional forecasting 20
  • 46. Forecasts of ft (x) France: male death forecasts (2010−2029) 0 −2 Log death rate −4 −6 −8 −10 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Functional forecasting 20
  • 47. Forecasts of ft (x) France: male death forecasts (2010 & 2029) 0 −2 Log death rate −4 −6 −8 −10 80% prediction intervals 0 20 40 60 80 100 Age Demographic forecasting using functional data analysis Functional forecasting 20
  • 48. Fertility application Australia fertility rates (1921−2009) 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis Functional forecasting 21
  • 49. Fertility model 0.2 15 0.25 0.1 10 0.0 Φ1(x) Φ2(x) 0.15 Μ −0.1 5 0.05 0 −0.3 15 20 25 30 35 40 45 50 15 20 25 30 35 40 45 50 15 20 25 30 35 40 45 50 Age Age Age 10 8 6 5 4 0 Βt1 Βt2 2 −5 0 −4 −2 −10 1970 1980 1990 2000 2010 1970 1980 1990 2000 2010 t t Demographic forecasting using functional data analysis Functional forecasting 22
  • 50. Fertility model Residuals 45 40 35 Age 30 25 20 15 1970 1980 1990 2000 Year Demographic forecasting using functional data analysis Functional forecasting 23
  • 51. Fertility model Main effects Interaction 0.2 15 0.25 0.1 Basis function 1 Basis function 2 10 0.0 Mean 0.15 −0.1 5 0.05 0 −0.3 15 20 25 30 35 40 45 50 15 20 25 30 35 40 45 50 15 20 25 30 35 40 45 50 Age Age Age 10 15 20 25 8 6 Coefficient 1 Coefficient 2 4 2 5 0 0 −4 −2 −10 1970 1990 2010 2030 1970 1990 2010 2030 Year Year Demographic forecasting using functional data analysis Functional forecasting 24
  • 52. Forecasts of ft (x) Australia fertility rates (1921−2009) 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis Functional forecasting 25
  • 53. Forecasts of ft (x) Australia fertility rates (1921−2009) 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis Functional forecasting 25
  • 54. Forecasts of ft (x) Australia fertility rates: 2010−2029 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis Functional forecasting 25
  • 55. Forecasts of ft (x) Australia fertility rates: 2010 and 2029 80% prediction intervals 250 200 Fertility rate 150 100 50 0 15 20 25 30 35 40 45 50 Age Demographic forecasting using functional data analysis Functional forecasting 25
  • 56. Outline 1 A functional linear model 2 Bagplots, boxplots and outliers 3 Functional forecasting 4 Forecasting groups 5 Population forecasting 6 References Demographic forecasting using functional data analysis Forecasting groups 26
  • 57. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 58. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 59. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 60. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 61. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 62. The problem Let ft,j (x) be the smoothed mortality rate for age x in group j in year t. Groups may be males and females. Groups may be states within a country. Expected that groups will behave similarly. Coherent forecasts do not diverge over time. Existing functional models do not impose coherence. Demographic forecasting using functional data analysis Forecasting groups 27
  • 63. Forecasting the coefficients yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 We use ARIMA models for each coefficient {β1,j ,k , . . . , βn,j ,k }. The ARIMA models are non-stationary for the first few coefficients (k = 1, 2) Non-stationary ARIMA forecasts will diverge. Hence the mortality forecasts are not coherent. Demographic forecasting using functional data analysis Forecasting groups 28
  • 64. Forecasting the coefficients yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 We use ARIMA models for each coefficient {β1,j ,k , . . . , βn,j ,k }. The ARIMA models are non-stationary for the first few coefficients (k = 1, 2) Non-stationary ARIMA forecasts will diverge. Hence the mortality forecasts are not coherent. Demographic forecasting using functional data analysis Forecasting groups 28
  • 65. Forecasting the coefficients yt,x = ft (x) + σt (x)εt,x K ft (x) = µ(x) + βt,k φk (x) + et (x) k =1 We use ARIMA models for each coefficient {β1,j ,k , . . . , βn,j ,k }. The ARIMA models are non-stationary for the first few coefficients (k = 1, 2) Non-stationary ARIMA forecasts will diverge. Hence the mortality forecasts are not coherent. Demographic forecasting using functional data analysis Forecasting groups 28