SlideShare une entreprise Scribd logo
1  sur  55
Télécharger pour lire hors ligne
A copula-based simulation method
      for clustered multi-state survival data
    F. Rotolo• , C. Legrand , I. Van Keilegom , M. Chiogna•


• Dipartmento   di Scienze Statistiche      Institut de Statistique, Biostatistique
                                                   et Sciences Actuarielles




 Universit` degli Studi di Padova
          a                                  Universit´ Catholique de Louvain
                                                      e




                               September 23, 2011
Clustered Multi-State Survival Data                                                    F. Rotolo


                                               Survival Data
      Time since an origin event until an event of interest.
      Example: from birth to death, since beginning of therapy until remission, etc.
                                                          Time
                         q                                                      q

                                                                               T=5


                         0             1             2              3      4    5




A copula-based simulation method for clustered multi-state survival data                 2/ 22
Clustered Multi-State Survival Data                                                    F. Rotolo


                                               Survival Data
      Time since an origin event until an event of interest.
      Example: from birth to death, since beginning of therapy until remission, etc.
                                                          Time
                         q                                                        q

                                                                                 T=5


                         0             1             2              3        4    5


      Censoring: some observations cannot be observed, the only
      available information being a lower bound.
      Example: migration, change of therapy, loss to follow-up, etc.
                                                          Time
                         q                                            x           q

                                                                    T>3.25


                         0             1             2              3        4    5


A copula-based simulation method for clustered multi-state survival data                 2/ 22
Clustered Multi-State Survival Data                                              F. Rotolo


                                        Modeling Survival Data
      Because of this peculiarity, instead of modeling the density f (t) of
      T , the hazard is considered
                                 P[t ≤ T < t + ∆t|T ≥ t]   f (t)    d
         h(t) = lim                                      =       = − log S(t),
                     ∆t      0             ∆t              S(t)     dt
                         ∞
      with S(t) =       t    f (u)du = P[T > t].
                                    t
      Note: S(t) = exp{−           0    h(u)du}.




A copula-based simulation method for clustered multi-state survival data           3/ 22
Clustered Multi-State Survival Data                                              F. Rotolo


                                        Modeling Survival Data
      Because of this peculiarity, instead of modeling the density f (t) of
      T , the hazard is considered
                                 P[t ≤ T < t + ∆t|T ≥ t]   f (t)    d
         h(t) = lim                                      =       = − log S(t),
                     ∆t      0             ∆t              S(t)     dt
                         ∞
      with S(t) =       t    f (u)du = P[T > t].
                                    t
      Note: S(t) = exp{−           0    h(u)du}.




      The basic regression model for the hazard is the Proportional
      Hazards (PH) Model (Cox, 1972)

                                          h(t|X ) = h0 (t) exp{β X }.



A copula-based simulation method for clustered multi-state survival data           3/ 22
Clustered Multi-State Survival Data                                        F. Rotolo


                                             Survival Models
      Complications of Cox models have been developed

             Frailty Models (FMs)
         account for overdispersion
           or clustering by means
              of random effects

           h(t|Xij ) = h0 (t)Zi e β Xij ,

            similar to GLMM
   log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij ,

                    with Zi = e Wi
   (Duchateau & Janssen, 2008; Wienke, 2010)




A copula-based simulation method for clustered multi-state survival data     4/ 22
Clustered Multi-State Survival Data                                                                              F. Rotolo


                                             Survival Models
      Complications of Cox models have been developed

             Frailty Models (FMs)                                    Multi-State Models (MSMs)
         account for overdispersion                                        consider several events
           or clustering by means                                           and their interactions
              of random effects                                                               LR


                                                                                    T1                 T4


           h(t|Xij ) = h0 (t)Zi e β Xij ,
                                                                                             T3
                                                                              NED                           De
            similar to GLMM
   log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij ,
                                                                                    T2                 T5


                                      Wi
                    with Zi = e                                                             DM




   (Duchateau & Janssen, 2008; Wienke, 2010)
                                                                 (Putter et al., 2007; de Wreede et al., 2010)




A copula-based simulation method for clustered multi-state survival data                                           4/ 22
Clustered Multi-State Survival Data                                                                              F. Rotolo


                                             Survival Models
      Complications of Cox models have been developed

             Frailty Models (FMs)                                    Multi-State Models (MSMs)
         account for overdispersion                                        consider several events
           or clustering by means                                           and their interactions
              of random effects                                                               LR


                                                                                    T1                 T4


           h(t|Xij ) = h0 (t)Zi e β Xij ,
                                                                                             T3
                                                                              NED                           De
            similar to GLMM
   log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij ,
                                                                                    T2                 T5


                                      Wi
                    with Zi = e                                                             DM




   (Duchateau & Janssen, 2008; Wienke, 2010)
                                                                 (Putter et al., 2007; de Wreede et al., 2010)




      Possible integration?
A copula-based simulation method for clustered multi-state survival data                                           4/ 22
Clustered Multi-State Survival Data                                                                              F. Rotolo


                                             Survival Models
      Complications of Cox models have been developed

             Frailty Models (FMs)                                    Multi-State Models (MSMs)
         account for overdispersion                                        consider several events
           or clustering by means                                           and their interactions
              of random effects                                                               LR


                                                                                    T1                 T4


           h(t|Xij ) = h0 (t)Zi e β Xij ,
                                                                                             T3
                                                                              NED                           De
            similar to GLMM
   log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij ,
                                                                                    T2                 T5


                                      Wi
                    with Zi = e                                                             DM




   (Duchateau & Janssen, 2008; Wienke, 2010)
                                                                 (Putter et al., 2007; de Wreede et al., 2010)




      Possible integration?                    Simulation studies
A copula-based simulation method for clustered multi-state survival data                                           4/ 22
Clustered Multi-State Survival Data                                                                     F. Rotolo


                                          Simulation of data
      A simulation method should be able to generate

                                                                           the dependence of times of
                             LR
                                                                           competing events




         NED                                    De




                             DM




A copula-based simulation method for clustered multi-state survival data                                  5/ 22
Clustered Multi-State Survival Data                                                                     F. Rotolo


                                          Simulation of data
      A simulation method should be able to generate

                                                                           the dependence of times of
                             LR
                                                                           competing events
                                                                           the dependence of times of
                                                                           subsequent events


         NED                                    De




                             DM




A copula-based simulation method for clustered multi-state survival data                                  5/ 22
Clustered Multi-State Survival Data                                                                               F. Rotolo


                                                               Simulation of data

      A simulation method should be able to generate

                                                                                     the dependence of times of
           LR                      LR              LR                      LR        competing events
     NED          De    NED             De   NED          De    NED             De

                       LR                                      LR
                                                                                     the dependence of times of
           DM                      DM              DM                      DM

            NED               De                    NED               De             subsequent events
           LR                      LR              LR                      LR
                       DM                                      DM

     NED          De    NED             De   NED          De    NED             De
                                                                                     the dependence between clustered
           DM                      DM              DM                      DM        observations

           LR                      LR              LR                      LR


     NED          De    NED             De   NED          De    NED             De

                       LR                                      LR
           DM                      DM              DM                      DM

            NED               De                    NED               De

           LR                      LR              LR                      LR
                       DM                                      DM

     NED          De    NED             De   NED          De    NED             De


           DM                      DM              DM                      DM




A copula-based simulation method for clustered multi-state survival data                                            5/ 22
Clustered Multi-State Survival Data                                                                     F. Rotolo


                                          Simulation of data

      A simulation method should be able to generate

                                                                           the dependence of times of
                            LR
                                                                           competing events
                                                                           the dependence of times of
                                                                           subsequent events
                                                                           the dependence between clustered
                                                                           observations
        NED                  x                  De
                                                                           the censoring due to competing
                                                                           events occurrence
                 x



                            DM




A copula-based simulation method for clustered multi-state survival data                                    5/ 22
Clustered Multi-State Survival Data                                                                      F. Rotolo


                                          Simulation of data

      A simulation method should be able to generate

                                                                           the dependence of times of
                             LR
                                                                           competing events
                                                                           the dependence of times of
                  x                                                        subsequent events
                                                                           the dependence between clustered
                                                                           observations
         NED                  x                 De
                                                                           the censoring due to competing
                                                                           events occurrence
                  x                                                        the censoring due to end of the
                                                                           study or loss to follow up
                             DM




A copula-based simulation method for clustered multi-state survival data                                     5/ 22
Clustered Multi-State Survival Data                                                                       F. Rotolo


                                          Simulation of data
      A simulation method should be able to generate

                                                                           the dependence of times of
                             LR
                                                                           competing events
                                                                           the dependence of times of
                T1                        T4
                                                                           subsequent events
                                                                           the dependence between clustered
                             T3
                                                                           observations
         NED                                    De
                                                                           the censoring due to competing
                                                                           events occurrence
                                                                           the censoring due to end of the
                T2                        T5
                                                                           study or loss to follow up
                             DM                                            the event-specific covariates effect




A copula-based simulation method for clustered multi-state survival data                                     5/ 22
Simulation Algorithm                                                       F. Rotolo


                                                      Outline

      Clustered Multi-State Survival Data


      Simulation Algorithm

            Clustering


      Choice of Parameters


      Example




A copula-based simulation method for clustered multi-state survival data     6/ 22
Simulation Algorithm                                                                           F. Rotolo


                                               Copula Model


                    LR

                                                   Marginal survival functions freely chosen
           T1
                                                   S1 (t), S2 (t) and S3 (t)


                       T3
     NED                         De




           T2



                   DM




A copula-based simulation method for clustered multi-state survival data                         7/ 22
Simulation Algorithm                                                                           F. Rotolo


                                               Copula Model


                    LR

                                                   Marginal survival functions freely chosen
           T1
                                                   S1 (t), S2 (t) and S3 (t)
                                                   Joint survival function by Clayton Copula
                                                                 3            −θ    −1/θ
                                                   S123 (t) =    i=1 Si (ti )    −2
                       T3
     NED                         De




           T2



                   DM




A copula-based simulation method for clustered multi-state survival data                         7/ 22
Simulation Algorithm                                                                                             F. Rotolo


                                               Copula Model


                    LR
                                                   Marginal survival functions freely chosen
           T1                                      S1 (t), S2 (t) and S3 (t)
                                                   Joint survival function by Clayton Copula
                                                                 3            −θ    −1/θ
                                                   S123 (t) =    i=1 Si (ti )    −2

     NED
                       T3
                                 De
                                                   Conditional survivals from the joint
                                                                                      θ                 −1/θ−1
                                                                           S1 (t1 )
                                                   S2|1 (t2 |t1 ) = 1 +    S2 (t2 )
                                                                                          − S1 (t1 )θ


           T2



                   DM




A copula-based simulation method for clustered multi-state survival data                                           7/ 22
Simulation Algorithm                                                                                             F. Rotolo


                                               Copula Model


                    LR
                                                   Marginal survival functions freely chosen
           T1                                      S1 (t), S2 (t) and S3 (t)
                                                   Joint survival function by Clayton Copula
                                                                 3            −θ    −1/θ
                                                   S123 (t) =    i=1 Si (ti )    −2

     NED
                       T3
                                 De
                                                   Conditional survivals from the joint
                                                                                      θ                 −1/θ−1
                                                                           S1 (t1 )
                                                   S2|1 (t2 |t1 ) = 1 +    S2 (t2 )
                                                                                          − S1 (t1 )θ
                                                                                                            −1/θ−2
                                                                                       S3 (t3 )−θ −1
                                                   S3|12 (t3 |t1 , t2 ) = 1 +   S1 (t1 )−θ +S2 (t2 )−θ −1
           T2



                   DM




A copula-based simulation method for clustered multi-state survival data                                             7/ 22
Simulation Algorithm                                                       F. Rotolo


                                                   Algorithm
      Data from the copula model (Kpanzou, 2007) are simulated as
      follows

                    −1
      1       T1 = S1 (U1 )




      with U1 , U2 , U3 , UC i.i.d. U(0, 1)
A copula-based simulation method for clustered multi-state survival data     8/ 22
Simulation Algorithm                                                              F. Rotolo


                                                   Algorithm
      Data from the copula model (Kpanzou, 2007) are simulated as
      follows

                    −1
      1       T1 = S1 (U1 )
                        −1
      2       T2 |t1 = S2|1 (U2 |t1 ) =
                                    θ                                      −1/θ
               −1                − 1+θ
              S2              U2         − 1 S1 (t1 )−θ + 1




      with U1 , U2 , U3 , UC i.i.d. U(0, 1)
A copula-based simulation method for clustered multi-state survival data            8/ 22
Simulation Algorithm                                                                          F. Rotolo


                                                   Algorithm
      Data from the copula model (Kpanzou, 2007) are simulated as
      follows

                    −1
      1       T1 = S1 (U1 )
                        −1
      2       T2 |t1 = S2|1 (U2 |t1 ) =
                                    θ                                      −1/θ
               −1                − 1+θ
              S2              U2          − 1 S1 (t1 )−θ + 1

                             −1
      3       T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) =
                                     θ                                                 −1/θ
               −1                − 1+2θ
              S3              U3           −1        S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1




      with U1 , U2 , U3 , UC i.i.d. U(0, 1)
A copula-based simulation method for clustered multi-state survival data                        8/ 22
Simulation Algorithm                                                                          F. Rotolo


                                                   Algorithm
      Data from the copula model (Kpanzou, 2007) are simulated as
      follows

                    −1
      1       T1 = S1 (U1 )
                        −1
      2       T2 |t1 = S2|1 (U2 |t1 ) =
                                    θ                                      −1/θ
               −1                − 1+θ
              S2              U2          − 1 S1 (t1 )−θ + 1

                             −1
      3       T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) =
                                     θ                                                 −1/θ
               −1                − 1+2θ
              S3              U3           −1        S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1

                    −1
      C       TC = FC (UC )


      with U1 , U2 , U3 , UC i.i.d. U(0, 1)
A copula-based simulation method for clustered multi-state survival data                        8/ 22
Simulation Algorithm                                                                          F. Rotolo


                                                   Algorithm
      Data from the copula model (Kpanzou, 2007) are simulated as
      follows

                    −1
      1       T1 = S1 (U1 )
                        −1
      2       T2 |t1 = S2|1 (U2 |t1 ) =
                                    θ                                      −1/θ
               −1                − 1+θ
              S2              U2          − 1 S1 (t1 )−θ + 1

                             −1
      3       T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) =
                                     θ                                                 −1/θ
               −1                − 1+2θ
              S3              U3           −1        S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1

                    −1
      C       TC = FC (UC )
      T       min(TC , T1 , T2 , T3 )

      with U1 , U2 , U3 , UC i.i.d. U(0, 1)
A copula-based simulation method for clustered multi-state survival data                        8/ 22
Simulation Algorithm                                                                                             F. Rotolo


                                          Second transitions
      For patients with a transition into state LR or DM, an analogous
      copula model is used for second transition to state De



                    LR
                                           The following conditional survivals can be obtained
           T1                T4                                                                         −1/θ−1
                                                                                      θ
                                                                           S1 (t1 )
                                                   S4|1 (t4 |t1 ) = 1 +    S4 (t4 )
                                                                                          − S1 (t1 )θ

                                                                                      θ                 −1/θ−1
                                                                           S2 (t2 )
     NED                          De
                                                   S5|2 (t5 |t2 ) = 1 +    S5 (t5 )
                                                                                          − S2 (t2 )θ

                                           and the same algorithm is used to simulate second
                                           transition times, conditionally on first transition ones.



                   DM



A copula-based simulation method for clustered multi-state survival data                                           9/ 22
Simulation Algorithm                                                                                             F. Rotolo


                                          Second transitions
      For patients with a transition into state LR or DM, an analogous
      copula model is used for second transition to state De



                    LR
                                           The following conditional survivals can be obtained
                                                                                      θ                 −1/θ−1
                                                                           S1 (t1 )
                                                   S4|1 (t4 |t1 ) = 1 +    S4 (t4 )
                                                                                          − S1 (t1 )θ

                                                                                      θ                 −1/θ−1
                                                                           S2 (t2 )
     NED                          De
                                                   S5|2 (t5 |t2 ) = 1 +    S5 (t5 )
                                                                                          − S2 (t2 )θ

                                           and the same algorithm is used to simulate second
                                           transition times, conditionally on first transition ones.
           T2                T5



                   DM



A copula-based simulation method for clustered multi-state survival data                                           9/ 22
Simulation Algorithm                                                          F. Rotolo


                                                   Clustering
      The algorithm allows to freely specify the marginal survivals Si (t).
      How can we insert clustering?




A copula-based simulation method for clustered multi-state survival data       10/ 22
Simulation Algorithm                                                          F. Rotolo


                                                   Clustering
      The algorithm allows to freely specify the marginal survivals Si (t).
      How can we insert clustering?

      In a PH way
                                              hi (t|Z ) = Z h0i (t),
      with h0i (t) the baseline hazard for transition i.




A copula-based simulation method for clustered multi-state survival data       10/ 22
Simulation Algorithm                                                                          F. Rotolo


                                                   Clustering
      The algorithm allows to freely specify the marginal survivals Si (t).
      How can we insert clustering?

      In a PH way
                                              hi (t|Z ) = Z h0i (t),
      with h0i (t) the baseline hazard for transition i.
                                             t
      Since S0i (t) = exp{−                 0    h0i (u)du}, then
                                                               t
                       Si (t|Z ) = exp −Z                          h0i (u)du   = [S0i (t)]Z
                                                           0




A copula-based simulation method for clustered multi-state survival data                       10/ 22
Simulation Algorithm                                                                          F. Rotolo


                                                   Clustering
      The algorithm allows to freely specify the marginal survivals Si (t).
      How can we insert clustering?

      In a PH way
                                              hi (t|Z ) = Z h0i (t),
      with h0i (t) the baseline hazard for transition i.
                                             t
      Since S0i (t) = exp{−                 0    h0i (u)du}, then
                                                               t
                       Si (t|Z ) = exp −Z                          h0i (u)du   = [S0i (t)]Z
                                                           0


      The copula model can be used for conditional survivals
      {Si (t|Z )}i∈{1,2,3,4,5} and the same algorithm can be used,
      conditionally on Z .
A copula-based simulation method for clustered multi-state survival data                       10/ 22
Simulation Algorithm                                                              F. Rotolo


                                   Clustering and covariates



      The effect of covariates X can be inserted in an analogous way.
      The marginals are then
                                                                           βi X
                                         Si (t|X , Z ) = S0i (t)Ze

      and simulation via the copula model is done conditionally on
      (X , Z ).




A copula-based simulation method for clustered multi-state survival data           11/ 22
Simulation Algorithm                                                       F. Rotolo


                                The Clayton–Weibull model


      Despite the model is quite general, we consider in the following a
      particular case:
           Ti ∼ Wei(λi , ρi ), i ∈ {1, 2, 3, 4, 5}
              TC ∼ Wei(λC , 1) ∼ Exp(λC )
              72 months (6 years) of administrative censoring




A copula-based simulation method for clustered multi-state survival data    12/ 22
Simulation Algorithm                                                           F. Rotolo


                                The Clayton–Weibull model


      Despite the model is quite general, we consider in the following a
      particular case:
           Ti ∼ Wei(λi , ρi ), i ∈ {1, 2, 3, 4, 5}
              TC ∼ Wei(λC , 1) ∼ Exp(λC )
              72 months (6 years) of administrative censoring

      This model
       1. gives simple forms of conditional distributions
                                                                           T
         2. implies that Si|X ,Z (t|x, z) = exp{−λi ze βi x t ρi },
                                        T
            i.e. Ti |X , Z ∼ Wei(λi ze βi x , ρi ) is still a Weibull r.v.




A copula-based simulation method for clustered multi-state survival data        12/ 22
Choice of Parameters                                                       F. Rotolo


                                                      Outline

      Clustered Multi-State Survival Data


      Simulation Algorithm

            Clustering


      Choice of Parameters


      Example




A copula-based simulation method for clustered multi-state survival data    13/ 22
Choice of Parameters                                                                             F. Rotolo


                                       Choice of parameters
      When simulating a dataset, one should be able to choose parameters in
      order to obtain particular target values for


                           LR                             pi       probabilities of LR, DM, De and
                                                                   censoring from NED
               T1                        T4




                            T3
       NED                                     De




               T2                        T5


                           DM




A copula-based simulation method for clustered multi-state survival data                             14/ 22
Choice of Parameters                                                                             F. Rotolo


                                       Choice of parameters
      When simulating a dataset, one should be able to choose parameters in
      order to obtain particular target values for


                           LR                             pi       probabilities of LR, DM, De and
                                                                   censoring from NED
               T1                        T4

                                                         mi        median of uncensored LR, DM
                                                                   and De times from NED
                            T3
       NED                                     De




               T2                        T5


                           DM




A copula-based simulation method for clustered multi-state survival data                             14/ 22
Choice of Parameters                                                                               F. Rotolo


                                       Choice of parameters
      When simulating a dataset, one should be able to choose parameters in
      order to obtain particular target values for


                           LR                             pi       probabilities of LR, DM, De and
                                                                   censoring from NED
               T1                        T4

                                                         mi        median of uncensored LR, DM
                                                                   and De times from NED
                            T3
       NED                                     De
                                                          pi       probabilities of De and censoring
                                                                   from LR and from DM

               T2                        T5


                           DM




A copula-based simulation method for clustered multi-state survival data                               14/ 22
Choice of Parameters                                                                               F. Rotolo


                                       Choice of parameters
      When simulating a dataset, one should be able to choose parameters in
      order to obtain particular target values for


                           LR                             pi       probabilities of LR, DM, De and
                                                                   censoring from NED
               T1                        T4

                                                         mi        median of uncensored LR, DM
                                                                   and De times from NED
                            T3
       NED                                     De
                                                          pi       probabilities of De and censoring
                                                                   from LR and from DM

               T2                        T5
                                                         mi        median of uncensored De times
                                                                   from LR and from DM
                           DM




      It is not possible to analytically express these quantities as functions of
      the parameters.
A copula-based simulation method for clustered multi-state survival data                               14/ 22
Choice of Parameters                                                                                F. Rotolo


                                           Criterion function
      In order to find appropriate parameters for given target values
      {pi , mi }, we want to minimize the criterion function
                                                                           2                    2
                                                               pi                        mi
                  Υ(Π) =                               log                     + log
                                                             pi (Π)
                                                             ˆ                         mi (Π)
                                                                                       ˆ
                                i∈{1,2,3,4,5}

                            ≥0

       with

               Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13
                                                                         +




       and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with
            p       ˆ
      parameters Π

A copula-based simulation method for clustered multi-state survival data                             15/ 22
Choice of Parameters                                                                                F. Rotolo


                                           Criterion function
      In order to find appropriate parameters for given target values
      {pi , mi }, we want to minimize the criterion function
                                                                           2                    2
                                                               pi                        mi
                  Υ(Π) =                               log                     + log
                                                             pi (Π)
                                                             ˆ                         mi (Π)
                                                                                       ˆ
                                i∈{1,2,3,4,5}

                            = Υ123 (Π123 ) + Υ4 (Π4 ) + Υ5 (Π5 ) ≥ 0

       with

               Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13
                                                                         +

                                                 Π = Π123 ∪ Π4 ∪ Π5 ∈ R7 × R3 × R3
                                                                       +    +    +




       and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with
            p       ˆ
      parameters Π

A copula-based simulation method for clustered multi-state survival data                             15/ 22
Choice of Parameters                                                                                F. Rotolo


                                           Criterion function
      In order to find appropriate parameters for given target values
      {pi , mi }, we want to minimize the criterion function
                                                                           2                    2
                                                               pi                        mi
                  Υ(Π) =                               log                     + log
                                                             pi (Π)
                                                             ˆ                         mi (Π)
                                                                                       ˆ
                                i∈{1,2,3,4,5}

                            = Υ123 (Π123 ) + Υ4 (Π4 ) + Υ5 (Π5 ) ≥ 0

       with

               Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13
                                                                         +

                                                 Π = Π123 ∪ Π4 ∪ Π5 ∈ R7 × R3 × R3
                                                                       +    +    +

                  Further reduction of problem dimension...
                                                 Π = Π123 ∪ Π4 ∪ Π5 ∈ R4+3 × R2+1 × R2+1
                                                                       +      +      +


       and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with
            p       ˆ
      parameters Π

A copula-based simulation method for clustered multi-state survival data                             15/ 22
Choice of Parameters                                                       F. Rotolo


                          Minimization of criterion function
      In order to further reduce the dimension of the problem, each of
      the parameter sets ΠK , K ∈ {{123}, {4}, {5}} is split into the
      scale {λi } and the shape parameters {ρi }. The optimization of the
      criterion function ΥK (ΠK ) is iterated on each subset

      Example: algorithm for K = {123}
              Set J = 1
                       (0)
              λ(0) = {λi }i∈{C ,1,2,3} = {1, 1, 1, 1}
                             (0)
              ρ(0) = {ρi }i∈{1,2,3} = {1, 1, 1}




A copula-based simulation method for clustered multi-state survival data    16/ 22
Choice of Parameters                                                        F. Rotolo


                          Minimization of criterion function
      In order to further reduce the dimension of the problem, each of
      the parameter sets ΠK , K ∈ {{123}, {4}, {5}} is split into the
      scale {λi } and the shape parameters {ρi }. The optimization of the
      criterion function ΥK (ΠK ) is iterated on each subset

      Example: algorithm for K = {123}
              Set J = 1
                       (0)
              λ(0) = {λi }i∈{C ,1,2,3} = {1, 1, 1, 1}
                             (0)
              ρ(0) = {ρi }i∈{1,2,3} = {1, 1, 1}
              Repeat until J = maxit or Υ123 (λ(J−1) , ρ(J−1) ) < th
                       Obtain λ(J) by minimizing Υ123 (λ, ρ(J−1) ) over λ
                       Obtain ρ(J) by minimizing Υ123 (λ(J) , ρ) over ρ
                       Set J = J + 1
              where maxit and th are arbitrary termination parameters.

A copula-based simulation method for clustered multi-state survival data     16/ 22
Example                                                                                          F. Rotolo


                                                  An example
      A dataset of size 44 is available from a multi-center study on head and neck cancer.

                                                                 Target values {pi } and {mi }


                                LR
                            8


                15                           7




                                3
          NED                                     De
       22                                        14




                4                            4




                             DM
     Tot: 44                0




A copula-based simulation method for clustered multi-state survival data                          17/ 22
Example                                                                                          F. Rotolo


                                                  An example
      A dataset of size 44 is available from a multi-center study on head and neck cancer.

                                                                 Target values {pi } and {mi }
                                                                 Frailty term
                                                                           40 Hospitals
                                LR
                            8
                                                                           random sizes
                15                           7                             Z ∼ Gam(1, 0.5)



                                3
          NED                                     De
       22                                        14




                4                            4




                             DM
     Tot: 44                0




A copula-based simulation method for clustered multi-state survival data                          17/ 22
Example                                                                                                          F. Rotolo


                                                  An example
      A dataset of size 44 is available from a multi-center study on head and neck cancer.

                                                                 Target values {pi } and {mi }
                                                                 Frailty term
                                                                           40 Hospitals
                                LR
                            8
                                                                           random sizes
                15                           7                             Z ∼ Gam(1, 0.5)

                                                                 Covariates
                                                                           Age ∼ N (60, 7)
                                3
          NED                                     De                       with    
       22                                        14                                log(0.8)/10
                                                                                                   i =1
                                                                           βi,Age = log(0.9)/10     i =2
                                                                                   
                                                                                     log(1.2)/10    i = 3, 4, 5
                                                                                   
                4                            4

                                                                           Treat ∼ Bin(0.5)
                                                                           with      
                             DM                                                      log(1/3)     i =1
                            0
                                                                                     
     Tot: 44
                                                                           βi,Treat = 0            i =2
                                                                                     
                                                                                      log(1.2)     i = 3, 4, 5
                                                                                     


A copula-based simulation method for clustered multi-state survival data                                          17/ 22
Example                                                                                    F. Rotolo


                                                      Results
      First transitions. The algorithm is run with datasets of size 104 ,
      maxit = 10 and th = 0.1. The time of execution was 11:57’ hours

                                            NED→ {LR,DM,De}
                  λ1           λ2            λ3    λC     ρ1                 ρ2      ρ3
                 0.276        0.019         0.013 0.031 0.851              1.076   0.569




A copula-based simulation method for clustered multi-state survival data                    18/ 22
Example                                                                                          F. Rotolo


                                                      Results
      First transitions. The algorithm is run with datasets of size 104 ,
      maxit = 10 and th = 0.1. The time of execution was 11:57’ hours

                                            NED→ {LR,DM,De}
                  λ1           λ2            λ3    λC     ρ1                   ρ2      ρ3
                 0.276        0.019         0.013 0.031 0.851                1.076   0.569


                                                       NED→ {LR,DM,De}
                                                     pi                            mi
                                   LR        DM            De          C    LR     DM      De
            Target                0.34       0.09         0.07      0.50   6.00   10.00   3.00
            Simulated             0.33       0.12         0.09      0.46   5.41    9.33   2.29


      Υ123 (Π123 ) = 0.24
A copula-based simulation method for clustered multi-state survival data                          18/ 22
Example                                                                                   F. Rotolo


                                                      Results
      Second transitions. Conditionally on first transitions data, the
      algorithm is run for second transitions from LR and DM with
      maxit = 6 and th = 0.05. The times of execution were 4:31’ and
      3:57’ hours, respectively.

                                  LR→De                                    DM→De
                        λ4          λC 4   ρ4                    λ5          λC 5   ρ5
                       0.029       0.099 1.078                  0.192       0.039 1.000




A copula-based simulation method for clustered multi-state survival data                   19/ 22
Example                                                                                         F. Rotolo


                                                      Results
      Second transitions. Conditionally on first transitions data, the
      algorithm is run for second transitions from LR and DM with
      maxit = 6 and th = 0.05. The times of execution were 4:31’ and
      3:57’ hours, respectively.

                                  LR→De                                    DM→De
                        λ4          λC 4   ρ4                    λ5          λC 5   ρ5
                       0.029       0.099 1.078                  0.192       0.039 1.000


                      LR→De                                                    DM→De
                      pi                  mi                                   pi     mi
                   De    C                De                                De    C   De
                  0.53 0.47              3.25         Target               0.95 0.05 0.50
                  0.50 0.50              3.32       Simulated              0.97 0.03 0.54

                 Υ4 (Π4 ) = 0.0043                                          Υ5 (Π5 ) = 0.0064
A copula-based simulation method for clustered multi-state survival data                         19/ 22
Conclusion                                                                               F. Rotolo


                                                  Conclusion
      The proposed simulation procedure for clustered MS allows to

  MSMs        generate dependence between times of the same subject
              (between both competing and subsequent event times)

    FMs       generate dependence between times of clustered subjects
              (with arbitrary number and size of groups and free frailty distribution)

      PH      insert covariates via proportional hazards
parMod        choose marginal distributions of time variables




A copula-based simulation method for clustered multi-state survival data                  20/ 22
Conclusion                                                                               F. Rotolo


                                                  Conclusion
      The proposed simulation procedure for clustered MS allows to

  MSMs        generate dependence between times of the same subject
              (between both competing and subsequent event times)

    FMs       generate dependence between times of clustered subjects
              (with arbitrary number and size of groups and free frailty distribution)

      PH      insert covariates via proportional hazards
parMod        choose marginal distributions of time variables

              automatically find appropriate parameters, given arbitrary
              target values for probabilities of censoring, of competing
              events and for medians of uncensored times

              generate censoring, both random and administrative

A copula-based simulation method for clustered multi-state survival data                  20/ 22
References                                                                    F. Rotolo


                                                  References

      Cox, D. R. (1972). Regression models and life-tables. Journal of the
        Royal Statistical Society. Series B (Methodological) 34, 187–220.
      de Wreede, L. C., Fiocco, M. & Putter, H. (2010). The mstate
        package for estimation and prediction in non- and semi-parametric
        multi-state and competing risks models. Comput Methods Programs
        Biomed 99, 261–74.
      Duchateau, L. & Janssen, P. (2008). The frailty model. Springer.
      Kpanzou, T. A. (2007). Copulas in statistics. African Institute for
        Mathematical Sciences (AIMS) .
      Putter, H., Fiocco, M. & Geskus, R. B. (2007). Tutorial in
        biostatistics: competing risks and multi-state models. Stat Med 26,
        2389–430.
      Wienke, A. (2010). Frailty Models in Survival Analysis. Chapman &
       Hall/CRC biostatistics series. Taylor and Francis.


A copula-based simulation method for clustered multi-state survival data       21/ 22
F. Rotolo [federico.rotolo@stat.unipd.it – federico.rotolo@uclouvain.be]
PhD Student at University of Padova and Visiting PhD Student at UCL


under the supervision of
 prof. C. Legrand, UCL
 prof. I. Van Keilegom, UCL
 prof. M. Chiogna, UniPd

Contenu connexe

Tendances

Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011BigMC
 
Doering Savov
Doering SavovDoering Savov
Doering Savovgh
 
Logics of the laplace transform
Logics of the laplace transformLogics of the laplace transform
Logics of the laplace transformTarun Gehlot
 
Optimalpolicyhandout
OptimalpolicyhandoutOptimalpolicyhandout
OptimalpolicyhandoutNBER
 
Tele3113 wk1tue
Tele3113 wk1tueTele3113 wk1tue
Tele3113 wk1tueVin Voro
 
Discussion of Faming Liang's talk
Discussion of Faming Liang's talkDiscussion of Faming Liang's talk
Discussion of Faming Liang's talkChristian Robert
 
Ada boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataAda boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataShadhin Rahman
 
Chapter 2 pertubation
Chapter 2 pertubationChapter 2 pertubation
Chapter 2 pertubationNBER
 
Chapter 5 heterogeneous
Chapter 5 heterogeneousChapter 5 heterogeneous
Chapter 5 heterogeneousNBER
 
Chapter 1 nonlinear
Chapter 1 nonlinearChapter 1 nonlinear
Chapter 1 nonlinearNBER
 
Likelihood survey-nber-0713101
Likelihood survey-nber-0713101Likelihood survey-nber-0713101
Likelihood survey-nber-0713101NBER
 
Optimal control of coupled PDE networks with automated code generation
Optimal control of coupled PDE networks with automated code generationOptimal control of coupled PDE networks with automated code generation
Optimal control of coupled PDE networks with automated code generationDelta Pi Systems
 
PAWL - GPU meeting @ Warwick
PAWL - GPU meeting @ WarwickPAWL - GPU meeting @ Warwick
PAWL - GPU meeting @ WarwickPierre Jacob
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Chapter 4 likelihood
Chapter 4 likelihoodChapter 4 likelihood
Chapter 4 likelihoodNBER
 
Tele3113 wk2wed
Tele3113 wk2wedTele3113 wk2wed
Tele3113 wk2wedVin Voro
 

Tendances (20)

PhD defense talk slides
PhD  defense talk slidesPhD  defense talk slides
PhD defense talk slides
 
Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011
 
Hw4sol
Hw4solHw4sol
Hw4sol
 
PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
 
Doering Savov
Doering SavovDoering Savov
Doering Savov
 
Logics of the laplace transform
Logics of the laplace transformLogics of the laplace transform
Logics of the laplace transform
 
Optimalpolicyhandout
OptimalpolicyhandoutOptimalpolicyhandout
Optimalpolicyhandout
 
Tele3113 wk1tue
Tele3113 wk1tueTele3113 wk1tue
Tele3113 wk1tue
 
Discussion of Faming Liang's talk
Discussion of Faming Liang's talkDiscussion of Faming Liang's talk
Discussion of Faming Liang's talk
 
Ada boost brown boost performance with noisy data
Ada boost brown boost performance with noisy dataAda boost brown boost performance with noisy data
Ada boost brown boost performance with noisy data
 
Mixture Models for Image Analysis
Mixture Models for Image AnalysisMixture Models for Image Analysis
Mixture Models for Image Analysis
 
Chapter 2 pertubation
Chapter 2 pertubationChapter 2 pertubation
Chapter 2 pertubation
 
Chapter 5 heterogeneous
Chapter 5 heterogeneousChapter 5 heterogeneous
Chapter 5 heterogeneous
 
Chapter 1 nonlinear
Chapter 1 nonlinearChapter 1 nonlinear
Chapter 1 nonlinear
 
Likelihood survey-nber-0713101
Likelihood survey-nber-0713101Likelihood survey-nber-0713101
Likelihood survey-nber-0713101
 
Optimal control of coupled PDE networks with automated code generation
Optimal control of coupled PDE networks with automated code generationOptimal control of coupled PDE networks with automated code generation
Optimal control of coupled PDE networks with automated code generation
 
PAWL - GPU meeting @ Warwick
PAWL - GPU meeting @ WarwickPAWL - GPU meeting @ Warwick
PAWL - GPU meeting @ Warwick
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Chapter 4 likelihood
Chapter 4 likelihoodChapter 4 likelihood
Chapter 4 likelihood
 
Tele3113 wk2wed
Tele3113 wk2wedTele3113 wk2wed
Tele3113 wk2wed
 

En vedette

Applying the Scientific Method to Simulation Experiments
Applying the Scientific Method to Simulation ExperimentsApplying the Scientific Method to Simulation Experiments
Applying the Scientific Method to Simulation ExperimentsFrank Bergmann
 
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...SeriousGamesAssoc
 
A368 afa1 b0f8-4b79-812f-980f7dc622bf
A368 afa1 b0f8-4b79-812f-980f7dc622bfA368 afa1 b0f8-4b79-812f-980f7dc622bf
A368 afa1 b0f8-4b79-812f-980f7dc622bfShatabdi Mahanta
 
Simulation As a Method To Support Complex Organizational Transformations in H...
Simulation As a Method To Support Complex Organizational Transformations in H...Simulation As a Method To Support Complex Organizational Transformations in H...
Simulation As a Method To Support Complex Organizational Transformations in H...Jos van Hillegersberg
 
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...ArchiLab 7
 
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...iosrjce
 
A High-speed Verilog HDL Simulation Method using a Lightweight Translator
A High-speed Verilog HDL Simulation Method using a Lightweight TranslatorA High-speed Verilog HDL Simulation Method using a Lightweight Translator
A High-speed Verilog HDL Simulation Method using a Lightweight TranslatorRyohei Kobayashi
 
Games and Serious Games in Urban Planning: Study Cases
Games and Serious Games in Urban Planning: Study CasesGames and Serious Games in Urban Planning: Study Cases
Games and Serious Games in Urban Planning: Study CasesBeniamino Murgante
 
Architecture and urban planning (3 d) representation
Architecture and urban planning (3 d) representationArchitecture and urban planning (3 d) representation
Architecture and urban planning (3 d) representationMaria Bostenaru
 
Midsquare method- simulation system
Midsquare method- simulation systemMidsquare method- simulation system
Midsquare method- simulation systemArman Hossain
 
Parallel Simulation of Urban Dynamics on the GPU Ivan Blečić, Arnaldo Cecchi...
Parallel Simulation of Urban Dynamics on the GPU  Ivan Blečić, Arnaldo Cecchi...Parallel Simulation of Urban Dynamics on the GPU  Ivan Blečić, Arnaldo Cecchi...
Parallel Simulation of Urban Dynamics on the GPU Ivan Blečić, Arnaldo Cecchi...Beniamino Murgante
 
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...NeISSProject
 
Simulations and Educational Games
Simulations and Educational GamesSimulations and Educational Games
Simulations and Educational GamesCynthia Calongne
 
Simulation of urban mobility (sumo) prest
Simulation of urban mobility (sumo) prestSimulation of urban mobility (sumo) prest
Simulation of urban mobility (sumo) prestJaskaranpreet Singh
 
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...ServDes
 

En vedette (20)

EGLACOMM / MEVIA APP TEASER
EGLACOMM / MEVIA APP TEASEREGLACOMM / MEVIA APP TEASER
EGLACOMM / MEVIA APP TEASER
 
Applying the Scientific Method to Simulation Experiments
Applying the Scientific Method to Simulation ExperimentsApplying the Scientific Method to Simulation Experiments
Applying the Scientific Method to Simulation Experiments
 
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...
Eswaran Subrahmanian - Serious Games in Complex Design of Urban Systems and P...
 
Urban simulation by ANN-CA
Urban simulation by ANN-CAUrban simulation by ANN-CA
Urban simulation by ANN-CA
 
A368 afa1 b0f8-4b79-812f-980f7dc622bf
A368 afa1 b0f8-4b79-812f-980f7dc622bfA368 afa1 b0f8-4b79-812f-980f7dc622bf
A368 afa1 b0f8-4b79-812f-980f7dc622bf
 
Simulation As a Method To Support Complex Organizational Transformations in H...
Simulation As a Method To Support Complex Organizational Transformations in H...Simulation As a Method To Support Complex Organizational Transformations in H...
Simulation As a Method To Support Complex Organizational Transformations in H...
 
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...
Brudaru, o. 2011: cellular genetic algorithm with communicating grids for a d...
 
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...
Simulation of Urban Mobility (Sumo) For Evaluating Qos Parameters For Vehicul...
 
Cfms project (asrc)
Cfms project (asrc)Cfms project (asrc)
Cfms project (asrc)
 
Serious games
Serious gamesSerious games
Serious games
 
A High-speed Verilog HDL Simulation Method using a Lightweight Translator
A High-speed Verilog HDL Simulation Method using a Lightweight TranslatorA High-speed Verilog HDL Simulation Method using a Lightweight Translator
A High-speed Verilog HDL Simulation Method using a Lightweight Translator
 
Games and Serious Games in Urban Planning: Study Cases
Games and Serious Games in Urban Planning: Study CasesGames and Serious Games in Urban Planning: Study Cases
Games and Serious Games in Urban Planning: Study Cases
 
charge simulation method
charge simulation methodcharge simulation method
charge simulation method
 
Architecture and urban planning (3 d) representation
Architecture and urban planning (3 d) representationArchitecture and urban planning (3 d) representation
Architecture and urban planning (3 d) representation
 
Midsquare method- simulation system
Midsquare method- simulation systemMidsquare method- simulation system
Midsquare method- simulation system
 
Parallel Simulation of Urban Dynamics on the GPU Ivan Blečić, Arnaldo Cecchi...
Parallel Simulation of Urban Dynamics on the GPU  Ivan Blečić, Arnaldo Cecchi...Parallel Simulation of Urban Dynamics on the GPU  Ivan Blečić, Arnaldo Cecchi...
Parallel Simulation of Urban Dynamics on the GPU Ivan Blečić, Arnaldo Cecchi...
 
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...
Spatial Microsimulation for City Modelling, Social Forecasting and Urban Poli...
 
Simulations and Educational Games
Simulations and Educational GamesSimulations and Educational Games
Simulations and Educational Games
 
Simulation of urban mobility (sumo) prest
Simulation of urban mobility (sumo) prestSimulation of urban mobility (sumo) prest
Simulation of urban mobility (sumo) prest
 
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...
Empowering Stakeholders – Simulation Games As a Participatory Method - Jan Ec...
 

Similaire à A copula-based Simulation Method for Clustered Multi-State Survival Data

Thegeneralizedinverse weibulldistribution ....
Thegeneralizedinverse weibulldistribution ....Thegeneralizedinverse weibulldistribution ....
Thegeneralizedinverse weibulldistribution ....fitriya rizki
 
lecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 classlecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 classTroyTeo1
 
1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docxstilliegeorgiana
 
Sequential experimentation in clinical trials
Sequential experimentation in clinical trialsSequential experimentation in clinical trials
Sequential experimentation in clinical trialsSpringer
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Umberto Picchini
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility usingkkislas
 
On Some Measures of Genetic Distance Based on Rates of Nucleotide Substitution
On Some Measures of Genetic Distance Based on Rates of Nucleotide SubstitutionOn Some Measures of Genetic Distance Based on Rates of Nucleotide Substitution
On Some Measures of Genetic Distance Based on Rates of Nucleotide SubstitutionJustine Leon Uro
 
Show that Greenwoods formula reduces to the binomial variance formul.pdf
Show that Greenwoods formula reduces to the binomial variance formul.pdfShow that Greenwoods formula reduces to the binomial variance formul.pdf
Show that Greenwoods formula reduces to the binomial variance formul.pdfakshitent
 
Poster for Information, probability and inference in systems biology (IPISB 2...
Poster for Information, probability and inference in systems biology (IPISB 2...Poster for Information, probability and inference in systems biology (IPISB 2...
Poster for Information, probability and inference in systems biology (IPISB 2...Colin Gillespie
 
07.12.2012 - Aprajit Mahajan
07.12.2012 - Aprajit Mahajan07.12.2012 - Aprajit Mahajan
07.12.2012 - Aprajit MahajanAMDSeminarSeries
 
SURVIVAL ANALYSIS 1.pptx
SURVIVAL ANALYSIS 1.pptxSURVIVAL ANALYSIS 1.pptx
SURVIVAL ANALYSIS 1.pptxDrVikasKaushik1
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceresearchinventy
 
Survival analysis 1
Survival analysis 1Survival analysis 1
Survival analysis 1KyusonLim
 
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...Carlos M Martínez M
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsJulyan Arbel
 
The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsColin Gillespie
 
Considerate Approaches to ABC Model Selection
Considerate Approaches to ABC Model SelectionConsiderate Approaches to ABC Model Selection
Considerate Approaches to ABC Model SelectionMichael Stumpf
 
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...Jorge Quintanilla
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionJeremyHeng10
 

Similaire à A copula-based Simulation Method for Clustered Multi-State Survival Data (20)

Thegeneralizedinverse weibulldistribution ....
Thegeneralizedinverse weibulldistribution ....Thegeneralizedinverse weibulldistribution ....
Thegeneralizedinverse weibulldistribution ....
 
lecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 classlecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 class
 
1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx
 
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI), International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI),
 
Sequential experimentation in clinical trials
Sequential experimentation in clinical trialsSequential experimentation in clinical trials
Sequential experimentation in clinical trials
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility using
 
On Some Measures of Genetic Distance Based on Rates of Nucleotide Substitution
On Some Measures of Genetic Distance Based on Rates of Nucleotide SubstitutionOn Some Measures of Genetic Distance Based on Rates of Nucleotide Substitution
On Some Measures of Genetic Distance Based on Rates of Nucleotide Substitution
 
Show that Greenwoods formula reduces to the binomial variance formul.pdf
Show that Greenwoods formula reduces to the binomial variance formul.pdfShow that Greenwoods formula reduces to the binomial variance formul.pdf
Show that Greenwoods formula reduces to the binomial variance formul.pdf
 
Poster for Information, probability and inference in systems biology (IPISB 2...
Poster for Information, probability and inference in systems biology (IPISB 2...Poster for Information, probability and inference in systems biology (IPISB 2...
Poster for Information, probability and inference in systems biology (IPISB 2...
 
07.12.2012 - Aprajit Mahajan
07.12.2012 - Aprajit Mahajan07.12.2012 - Aprajit Mahajan
07.12.2012 - Aprajit Mahajan
 
SURVIVAL ANALYSIS 1.pptx
SURVIVAL ANALYSIS 1.pptxSURVIVAL ANALYSIS 1.pptx
SURVIVAL ANALYSIS 1.pptx
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Survival analysis 1
Survival analysis 1Survival analysis 1
Survival analysis 1
 
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...
STATISTICAL TESTS TO COMPARE k SURVIVAL ANALYSIS FUNCTIONS INVOLVING RECURREN...
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian Nonparametrics
 
The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic models
 
Considerate Approaches to ABC Model Selection
Considerate Approaches to ABC Model SelectionConsiderate Approaches to ABC Model Selection
Considerate Approaches to ABC Model Selection
 
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...
Double Occupancy as a Probe of the Mott Transition for Fermions in One-dimens...
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
 

A copula-based Simulation Method for Clustered Multi-State Survival Data

  • 1. A copula-based simulation method for clustered multi-state survival data F. Rotolo• , C. Legrand , I. Van Keilegom , M. Chiogna• • Dipartmento di Scienze Statistiche Institut de Statistique, Biostatistique et Sciences Actuarielles Universit` degli Studi di Padova a Universit´ Catholique de Louvain e September 23, 2011
  • 2. Clustered Multi-State Survival Data F. Rotolo Survival Data Time since an origin event until an event of interest. Example: from birth to death, since beginning of therapy until remission, etc. Time q q T=5 0 1 2 3 4 5 A copula-based simulation method for clustered multi-state survival data 2/ 22
  • 3. Clustered Multi-State Survival Data F. Rotolo Survival Data Time since an origin event until an event of interest. Example: from birth to death, since beginning of therapy until remission, etc. Time q q T=5 0 1 2 3 4 5 Censoring: some observations cannot be observed, the only available information being a lower bound. Example: migration, change of therapy, loss to follow-up, etc. Time q x q T>3.25 0 1 2 3 4 5 A copula-based simulation method for clustered multi-state survival data 2/ 22
  • 4. Clustered Multi-State Survival Data F. Rotolo Modeling Survival Data Because of this peculiarity, instead of modeling the density f (t) of T , the hazard is considered P[t ≤ T < t + ∆t|T ≥ t] f (t) d h(t) = lim = = − log S(t), ∆t 0 ∆t S(t) dt ∞ with S(t) = t f (u)du = P[T > t]. t Note: S(t) = exp{− 0 h(u)du}. A copula-based simulation method for clustered multi-state survival data 3/ 22
  • 5. Clustered Multi-State Survival Data F. Rotolo Modeling Survival Data Because of this peculiarity, instead of modeling the density f (t) of T , the hazard is considered P[t ≤ T < t + ∆t|T ≥ t] f (t) d h(t) = lim = = − log S(t), ∆t 0 ∆t S(t) dt ∞ with S(t) = t f (u)du = P[T > t]. t Note: S(t) = exp{− 0 h(u)du}. The basic regression model for the hazard is the Proportional Hazards (PH) Model (Cox, 1972) h(t|X ) = h0 (t) exp{β X }. A copula-based simulation method for clustered multi-state survival data 3/ 22
  • 6. Clustered Multi-State Survival Data F. Rotolo Survival Models Complications of Cox models have been developed Frailty Models (FMs) account for overdispersion or clustering by means of random effects h(t|Xij ) = h0 (t)Zi e β Xij , similar to GLMM log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij , with Zi = e Wi (Duchateau & Janssen, 2008; Wienke, 2010) A copula-based simulation method for clustered multi-state survival data 4/ 22
  • 7. Clustered Multi-State Survival Data F. Rotolo Survival Models Complications of Cox models have been developed Frailty Models (FMs) Multi-State Models (MSMs) account for overdispersion consider several events or clustering by means and their interactions of random effects LR T1 T4 h(t|Xij ) = h0 (t)Zi e β Xij , T3 NED De similar to GLMM log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij , T2 T5 Wi with Zi = e DM (Duchateau & Janssen, 2008; Wienke, 2010) (Putter et al., 2007; de Wreede et al., 2010) A copula-based simulation method for clustered multi-state survival data 4/ 22
  • 8. Clustered Multi-State Survival Data F. Rotolo Survival Models Complications of Cox models have been developed Frailty Models (FMs) Multi-State Models (MSMs) account for overdispersion consider several events or clustering by means and their interactions of random effects LR T1 T4 h(t|Xij ) = h0 (t)Zi e β Xij , T3 NED De similar to GLMM log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij , T2 T5 Wi with Zi = e DM (Duchateau & Janssen, 2008; Wienke, 2010) (Putter et al., 2007; de Wreede et al., 2010) Possible integration? A copula-based simulation method for clustered multi-state survival data 4/ 22
  • 9. Clustered Multi-State Survival Data F. Rotolo Survival Models Complications of Cox models have been developed Frailty Models (FMs) Multi-State Models (MSMs) account for overdispersion consider several events or clustering by means and their interactions of random effects LR T1 T4 h(t|Xij ) = h0 (t)Zi e β Xij , T3 NED De similar to GLMM log[h(t|Xij )] = log[h0 (t)]+Wi +β Xij , T2 T5 Wi with Zi = e DM (Duchateau & Janssen, 2008; Wienke, 2010) (Putter et al., 2007; de Wreede et al., 2010) Possible integration? Simulation studies A copula-based simulation method for clustered multi-state survival data 4/ 22
  • 10. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR competing events NED De DM A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 11. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR competing events the dependence of times of subsequent events NED De DM A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 12. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR LR LR LR competing events NED De NED De NED De NED De LR LR the dependence of times of DM DM DM DM NED De NED De subsequent events LR LR LR LR DM DM NED De NED De NED De NED De the dependence between clustered DM DM DM DM observations LR LR LR LR NED De NED De NED De NED De LR LR DM DM DM DM NED De NED De LR LR LR LR DM DM NED De NED De NED De NED De DM DM DM DM A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 13. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR competing events the dependence of times of subsequent events the dependence between clustered observations NED x De the censoring due to competing events occurrence x DM A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 14. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR competing events the dependence of times of x subsequent events the dependence between clustered observations NED x De the censoring due to competing events occurrence x the censoring due to end of the study or loss to follow up DM A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 15. Clustered Multi-State Survival Data F. Rotolo Simulation of data A simulation method should be able to generate the dependence of times of LR competing events the dependence of times of T1 T4 subsequent events the dependence between clustered T3 observations NED De the censoring due to competing events occurrence the censoring due to end of the T2 T5 study or loss to follow up DM the event-specific covariates effect A copula-based simulation method for clustered multi-state survival data 5/ 22
  • 16. Simulation Algorithm F. Rotolo Outline Clustered Multi-State Survival Data Simulation Algorithm Clustering Choice of Parameters Example A copula-based simulation method for clustered multi-state survival data 6/ 22
  • 17. Simulation Algorithm F. Rotolo Copula Model LR Marginal survival functions freely chosen T1 S1 (t), S2 (t) and S3 (t) T3 NED De T2 DM A copula-based simulation method for clustered multi-state survival data 7/ 22
  • 18. Simulation Algorithm F. Rotolo Copula Model LR Marginal survival functions freely chosen T1 S1 (t), S2 (t) and S3 (t) Joint survival function by Clayton Copula 3 −θ −1/θ S123 (t) = i=1 Si (ti ) −2 T3 NED De T2 DM A copula-based simulation method for clustered multi-state survival data 7/ 22
  • 19. Simulation Algorithm F. Rotolo Copula Model LR Marginal survival functions freely chosen T1 S1 (t), S2 (t) and S3 (t) Joint survival function by Clayton Copula 3 −θ −1/θ S123 (t) = i=1 Si (ti ) −2 NED T3 De Conditional survivals from the joint θ −1/θ−1 S1 (t1 ) S2|1 (t2 |t1 ) = 1 + S2 (t2 ) − S1 (t1 )θ T2 DM A copula-based simulation method for clustered multi-state survival data 7/ 22
  • 20. Simulation Algorithm F. Rotolo Copula Model LR Marginal survival functions freely chosen T1 S1 (t), S2 (t) and S3 (t) Joint survival function by Clayton Copula 3 −θ −1/θ S123 (t) = i=1 Si (ti ) −2 NED T3 De Conditional survivals from the joint θ −1/θ−1 S1 (t1 ) S2|1 (t2 |t1 ) = 1 + S2 (t2 ) − S1 (t1 )θ −1/θ−2 S3 (t3 )−θ −1 S3|12 (t3 |t1 , t2 ) = 1 + S1 (t1 )−θ +S2 (t2 )−θ −1 T2 DM A copula-based simulation method for clustered multi-state survival data 7/ 22
  • 21. Simulation Algorithm F. Rotolo Algorithm Data from the copula model (Kpanzou, 2007) are simulated as follows −1 1 T1 = S1 (U1 ) with U1 , U2 , U3 , UC i.i.d. U(0, 1) A copula-based simulation method for clustered multi-state survival data 8/ 22
  • 22. Simulation Algorithm F. Rotolo Algorithm Data from the copula model (Kpanzou, 2007) are simulated as follows −1 1 T1 = S1 (U1 ) −1 2 T2 |t1 = S2|1 (U2 |t1 ) = θ −1/θ −1 − 1+θ S2 U2 − 1 S1 (t1 )−θ + 1 with U1 , U2 , U3 , UC i.i.d. U(0, 1) A copula-based simulation method for clustered multi-state survival data 8/ 22
  • 23. Simulation Algorithm F. Rotolo Algorithm Data from the copula model (Kpanzou, 2007) are simulated as follows −1 1 T1 = S1 (U1 ) −1 2 T2 |t1 = S2|1 (U2 |t1 ) = θ −1/θ −1 − 1+θ S2 U2 − 1 S1 (t1 )−θ + 1 −1 3 T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) = θ −1/θ −1 − 1+2θ S3 U3 −1 S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1 with U1 , U2 , U3 , UC i.i.d. U(0, 1) A copula-based simulation method for clustered multi-state survival data 8/ 22
  • 24. Simulation Algorithm F. Rotolo Algorithm Data from the copula model (Kpanzou, 2007) are simulated as follows −1 1 T1 = S1 (U1 ) −1 2 T2 |t1 = S2|1 (U2 |t1 ) = θ −1/θ −1 − 1+θ S2 U2 − 1 S1 (t1 )−θ + 1 −1 3 T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) = θ −1/θ −1 − 1+2θ S3 U3 −1 S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1 −1 C TC = FC (UC ) with U1 , U2 , U3 , UC i.i.d. U(0, 1) A copula-based simulation method for clustered multi-state survival data 8/ 22
  • 25. Simulation Algorithm F. Rotolo Algorithm Data from the copula model (Kpanzou, 2007) are simulated as follows −1 1 T1 = S1 (U1 ) −1 2 T2 |t1 = S2|1 (U2 |t1 ) = θ −1/θ −1 − 1+θ S2 U2 − 1 S1 (t1 )−θ + 1 −1 3 T3 |t1 , t2 = S3|12 (U3 |t1 , t2 ) = θ −1/θ −1 − 1+2θ S3 U3 −1 S1 (t1 )−θ + S2 (t2 )−θ − 1 + 1 −1 C TC = FC (UC ) T min(TC , T1 , T2 , T3 ) with U1 , U2 , U3 , UC i.i.d. U(0, 1) A copula-based simulation method for clustered multi-state survival data 8/ 22
  • 26. Simulation Algorithm F. Rotolo Second transitions For patients with a transition into state LR or DM, an analogous copula model is used for second transition to state De LR The following conditional survivals can be obtained T1 T4 −1/θ−1 θ S1 (t1 ) S4|1 (t4 |t1 ) = 1 + S4 (t4 ) − S1 (t1 )θ θ −1/θ−1 S2 (t2 ) NED De S5|2 (t5 |t2 ) = 1 + S5 (t5 ) − S2 (t2 )θ and the same algorithm is used to simulate second transition times, conditionally on first transition ones. DM A copula-based simulation method for clustered multi-state survival data 9/ 22
  • 27. Simulation Algorithm F. Rotolo Second transitions For patients with a transition into state LR or DM, an analogous copula model is used for second transition to state De LR The following conditional survivals can be obtained θ −1/θ−1 S1 (t1 ) S4|1 (t4 |t1 ) = 1 + S4 (t4 ) − S1 (t1 )θ θ −1/θ−1 S2 (t2 ) NED De S5|2 (t5 |t2 ) = 1 + S5 (t5 ) − S2 (t2 )θ and the same algorithm is used to simulate second transition times, conditionally on first transition ones. T2 T5 DM A copula-based simulation method for clustered multi-state survival data 9/ 22
  • 28. Simulation Algorithm F. Rotolo Clustering The algorithm allows to freely specify the marginal survivals Si (t). How can we insert clustering? A copula-based simulation method for clustered multi-state survival data 10/ 22
  • 29. Simulation Algorithm F. Rotolo Clustering The algorithm allows to freely specify the marginal survivals Si (t). How can we insert clustering? In a PH way hi (t|Z ) = Z h0i (t), with h0i (t) the baseline hazard for transition i. A copula-based simulation method for clustered multi-state survival data 10/ 22
  • 30. Simulation Algorithm F. Rotolo Clustering The algorithm allows to freely specify the marginal survivals Si (t). How can we insert clustering? In a PH way hi (t|Z ) = Z h0i (t), with h0i (t) the baseline hazard for transition i. t Since S0i (t) = exp{− 0 h0i (u)du}, then t Si (t|Z ) = exp −Z h0i (u)du = [S0i (t)]Z 0 A copula-based simulation method for clustered multi-state survival data 10/ 22
  • 31. Simulation Algorithm F. Rotolo Clustering The algorithm allows to freely specify the marginal survivals Si (t). How can we insert clustering? In a PH way hi (t|Z ) = Z h0i (t), with h0i (t) the baseline hazard for transition i. t Since S0i (t) = exp{− 0 h0i (u)du}, then t Si (t|Z ) = exp −Z h0i (u)du = [S0i (t)]Z 0 The copula model can be used for conditional survivals {Si (t|Z )}i∈{1,2,3,4,5} and the same algorithm can be used, conditionally on Z . A copula-based simulation method for clustered multi-state survival data 10/ 22
  • 32. Simulation Algorithm F. Rotolo Clustering and covariates The effect of covariates X can be inserted in an analogous way. The marginals are then βi X Si (t|X , Z ) = S0i (t)Ze and simulation via the copula model is done conditionally on (X , Z ). A copula-based simulation method for clustered multi-state survival data 11/ 22
  • 33. Simulation Algorithm F. Rotolo The Clayton–Weibull model Despite the model is quite general, we consider in the following a particular case: Ti ∼ Wei(λi , ρi ), i ∈ {1, 2, 3, 4, 5} TC ∼ Wei(λC , 1) ∼ Exp(λC ) 72 months (6 years) of administrative censoring A copula-based simulation method for clustered multi-state survival data 12/ 22
  • 34. Simulation Algorithm F. Rotolo The Clayton–Weibull model Despite the model is quite general, we consider in the following a particular case: Ti ∼ Wei(λi , ρi ), i ∈ {1, 2, 3, 4, 5} TC ∼ Wei(λC , 1) ∼ Exp(λC ) 72 months (6 years) of administrative censoring This model 1. gives simple forms of conditional distributions T 2. implies that Si|X ,Z (t|x, z) = exp{−λi ze βi x t ρi }, T i.e. Ti |X , Z ∼ Wei(λi ze βi x , ρi ) is still a Weibull r.v. A copula-based simulation method for clustered multi-state survival data 12/ 22
  • 35. Choice of Parameters F. Rotolo Outline Clustered Multi-State Survival Data Simulation Algorithm Clustering Choice of Parameters Example A copula-based simulation method for clustered multi-state survival data 13/ 22
  • 36. Choice of Parameters F. Rotolo Choice of parameters When simulating a dataset, one should be able to choose parameters in order to obtain particular target values for LR pi probabilities of LR, DM, De and censoring from NED T1 T4 T3 NED De T2 T5 DM A copula-based simulation method for clustered multi-state survival data 14/ 22
  • 37. Choice of Parameters F. Rotolo Choice of parameters When simulating a dataset, one should be able to choose parameters in order to obtain particular target values for LR pi probabilities of LR, DM, De and censoring from NED T1 T4 mi median of uncensored LR, DM and De times from NED T3 NED De T2 T5 DM A copula-based simulation method for clustered multi-state survival data 14/ 22
  • 38. Choice of Parameters F. Rotolo Choice of parameters When simulating a dataset, one should be able to choose parameters in order to obtain particular target values for LR pi probabilities of LR, DM, De and censoring from NED T1 T4 mi median of uncensored LR, DM and De times from NED T3 NED De pi probabilities of De and censoring from LR and from DM T2 T5 DM A copula-based simulation method for clustered multi-state survival data 14/ 22
  • 39. Choice of Parameters F. Rotolo Choice of parameters When simulating a dataset, one should be able to choose parameters in order to obtain particular target values for LR pi probabilities of LR, DM, De and censoring from NED T1 T4 mi median of uncensored LR, DM and De times from NED T3 NED De pi probabilities of De and censoring from LR and from DM T2 T5 mi median of uncensored De times from LR and from DM DM It is not possible to analytically express these quantities as functions of the parameters. A copula-based simulation method for clustered multi-state survival data 14/ 22
  • 40. Choice of Parameters F. Rotolo Criterion function In order to find appropriate parameters for given target values {pi , mi }, we want to minimize the criterion function 2 2 pi mi Υ(Π) = log + log pi (Π) ˆ mi (Π) ˆ i∈{1,2,3,4,5} ≥0 with Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13 + and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with p ˆ parameters Π A copula-based simulation method for clustered multi-state survival data 15/ 22
  • 41. Choice of Parameters F. Rotolo Criterion function In order to find appropriate parameters for given target values {pi , mi }, we want to minimize the criterion function 2 2 pi mi Υ(Π) = log + log pi (Π) ˆ mi (Π) ˆ i∈{1,2,3,4,5} = Υ123 (Π123 ) + Υ4 (Π4 ) + Υ5 (Π5 ) ≥ 0 with Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13 + Π = Π123 ∪ Π4 ∪ Π5 ∈ R7 × R3 × R3 + + + and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with p ˆ parameters Π A copula-based simulation method for clustered multi-state survival data 15/ 22
  • 42. Choice of Parameters F. Rotolo Criterion function In order to find appropriate parameters for given target values {pi , mi }, we want to minimize the criterion function 2 2 pi mi Υ(Π) = log + log pi (Π) ˆ mi (Π) ˆ i∈{1,2,3,4,5} = Υ123 (Π123 ) + Υ4 (Π4 ) + Υ5 (Π5 ) ≥ 0 with Π = {λi }i∈{1,2,3,C ,4,C 4,5,C 5} ∪ {ρi }i∈{1,2,3,4,5} ∈ R13 + Π = Π123 ∪ Π4 ∪ Π5 ∈ R7 × R3 × R3 + + + Further reduction of problem dimension... Π = Π123 ∪ Π4 ∪ Π5 ∈ R4+3 × R2+1 × R2+1 + + + and {ˆi (Π), mi (Π)} the observed values in a simulated dataset with p ˆ parameters Π A copula-based simulation method for clustered multi-state survival data 15/ 22
  • 43. Choice of Parameters F. Rotolo Minimization of criterion function In order to further reduce the dimension of the problem, each of the parameter sets ΠK , K ∈ {{123}, {4}, {5}} is split into the scale {λi } and the shape parameters {ρi }. The optimization of the criterion function ΥK (ΠK ) is iterated on each subset Example: algorithm for K = {123} Set J = 1 (0) λ(0) = {λi }i∈{C ,1,2,3} = {1, 1, 1, 1} (0) ρ(0) = {ρi }i∈{1,2,3} = {1, 1, 1} A copula-based simulation method for clustered multi-state survival data 16/ 22
  • 44. Choice of Parameters F. Rotolo Minimization of criterion function In order to further reduce the dimension of the problem, each of the parameter sets ΠK , K ∈ {{123}, {4}, {5}} is split into the scale {λi } and the shape parameters {ρi }. The optimization of the criterion function ΥK (ΠK ) is iterated on each subset Example: algorithm for K = {123} Set J = 1 (0) λ(0) = {λi }i∈{C ,1,2,3} = {1, 1, 1, 1} (0) ρ(0) = {ρi }i∈{1,2,3} = {1, 1, 1} Repeat until J = maxit or Υ123 (λ(J−1) , ρ(J−1) ) < th Obtain λ(J) by minimizing Υ123 (λ, ρ(J−1) ) over λ Obtain ρ(J) by minimizing Υ123 (λ(J) , ρ) over ρ Set J = J + 1 where maxit and th are arbitrary termination parameters. A copula-based simulation method for clustered multi-state survival data 16/ 22
  • 45. Example F. Rotolo An example A dataset of size 44 is available from a multi-center study on head and neck cancer. Target values {pi } and {mi } LR 8 15 7 3 NED De 22 14 4 4 DM Tot: 44 0 A copula-based simulation method for clustered multi-state survival data 17/ 22
  • 46. Example F. Rotolo An example A dataset of size 44 is available from a multi-center study on head and neck cancer. Target values {pi } and {mi } Frailty term 40 Hospitals LR 8 random sizes 15 7 Z ∼ Gam(1, 0.5) 3 NED De 22 14 4 4 DM Tot: 44 0 A copula-based simulation method for clustered multi-state survival data 17/ 22
  • 47. Example F. Rotolo An example A dataset of size 44 is available from a multi-center study on head and neck cancer. Target values {pi } and {mi } Frailty term 40 Hospitals LR 8 random sizes 15 7 Z ∼ Gam(1, 0.5) Covariates Age ∼ N (60, 7) 3 NED De with  22 14 log(0.8)/10  i =1 βi,Age = log(0.9)/10 i =2  log(1.2)/10 i = 3, 4, 5  4 4 Treat ∼ Bin(0.5) with  DM log(1/3) i =1 0  Tot: 44 βi,Treat = 0 i =2  log(1.2) i = 3, 4, 5  A copula-based simulation method for clustered multi-state survival data 17/ 22
  • 48. Example F. Rotolo Results First transitions. The algorithm is run with datasets of size 104 , maxit = 10 and th = 0.1. The time of execution was 11:57’ hours NED→ {LR,DM,De} λ1 λ2 λ3 λC ρ1 ρ2 ρ3 0.276 0.019 0.013 0.031 0.851 1.076 0.569 A copula-based simulation method for clustered multi-state survival data 18/ 22
  • 49. Example F. Rotolo Results First transitions. The algorithm is run with datasets of size 104 , maxit = 10 and th = 0.1. The time of execution was 11:57’ hours NED→ {LR,DM,De} λ1 λ2 λ3 λC ρ1 ρ2 ρ3 0.276 0.019 0.013 0.031 0.851 1.076 0.569 NED→ {LR,DM,De} pi mi LR DM De C LR DM De Target 0.34 0.09 0.07 0.50 6.00 10.00 3.00 Simulated 0.33 0.12 0.09 0.46 5.41 9.33 2.29 Υ123 (Π123 ) = 0.24 A copula-based simulation method for clustered multi-state survival data 18/ 22
  • 50. Example F. Rotolo Results Second transitions. Conditionally on first transitions data, the algorithm is run for second transitions from LR and DM with maxit = 6 and th = 0.05. The times of execution were 4:31’ and 3:57’ hours, respectively. LR→De DM→De λ4 λC 4 ρ4 λ5 λC 5 ρ5 0.029 0.099 1.078 0.192 0.039 1.000 A copula-based simulation method for clustered multi-state survival data 19/ 22
  • 51. Example F. Rotolo Results Second transitions. Conditionally on first transitions data, the algorithm is run for second transitions from LR and DM with maxit = 6 and th = 0.05. The times of execution were 4:31’ and 3:57’ hours, respectively. LR→De DM→De λ4 λC 4 ρ4 λ5 λC 5 ρ5 0.029 0.099 1.078 0.192 0.039 1.000 LR→De DM→De pi mi pi mi De C De De C De 0.53 0.47 3.25 Target 0.95 0.05 0.50 0.50 0.50 3.32 Simulated 0.97 0.03 0.54 Υ4 (Π4 ) = 0.0043 Υ5 (Π5 ) = 0.0064 A copula-based simulation method for clustered multi-state survival data 19/ 22
  • 52. Conclusion F. Rotolo Conclusion The proposed simulation procedure for clustered MS allows to MSMs generate dependence between times of the same subject (between both competing and subsequent event times) FMs generate dependence between times of clustered subjects (with arbitrary number and size of groups and free frailty distribution) PH insert covariates via proportional hazards parMod choose marginal distributions of time variables A copula-based simulation method for clustered multi-state survival data 20/ 22
  • 53. Conclusion F. Rotolo Conclusion The proposed simulation procedure for clustered MS allows to MSMs generate dependence between times of the same subject (between both competing and subsequent event times) FMs generate dependence between times of clustered subjects (with arbitrary number and size of groups and free frailty distribution) PH insert covariates via proportional hazards parMod choose marginal distributions of time variables automatically find appropriate parameters, given arbitrary target values for probabilities of censoring, of competing events and for medians of uncensored times generate censoring, both random and administrative A copula-based simulation method for clustered multi-state survival data 20/ 22
  • 54. References F. Rotolo References Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society. Series B (Methodological) 34, 187–220. de Wreede, L. C., Fiocco, M. & Putter, H. (2010). The mstate package for estimation and prediction in non- and semi-parametric multi-state and competing risks models. Comput Methods Programs Biomed 99, 261–74. Duchateau, L. & Janssen, P. (2008). The frailty model. Springer. Kpanzou, T. A. (2007). Copulas in statistics. African Institute for Mathematical Sciences (AIMS) . Putter, H., Fiocco, M. & Geskus, R. B. (2007). Tutorial in biostatistics: competing risks and multi-state models. Stat Med 26, 2389–430. Wienke, A. (2010). Frailty Models in Survival Analysis. Chapman & Hall/CRC biostatistics series. Taylor and Francis. A copula-based simulation method for clustered multi-state survival data 21/ 22
  • 55. F. Rotolo [federico.rotolo@stat.unipd.it – federico.rotolo@uclouvain.be] PhD Student at University of Padova and Visiting PhD Student at UCL under the supervision of prof. C. Legrand, UCL prof. I. Van Keilegom, UCL prof. M. Chiogna, UniPd