SlideShare une entreprise Scribd logo
1  sur  69
Applications of Contemporary
  Statistical Approaches in
   Environmental Health




                        B. Rey de Castro, Sc.D.
                              before the
              CDC Emergency Response & Air Toxicants Branch
                              Atlanta, GA
                             April 28, 2011
Non-Independent Data
Normal & Non-Normal Data
Environmental Health Data
Missing Data
Filling In Missing Data
Statistical Characterization of an
   MCF-7 Cell Culture Assay
MCF-7 Cell Culture Assay
• Cell culture assay for estrogenic potency
  – E-SCREEN
• MCF-7 cell number increases with dose of
  17 -estradiol, xenoestrogens
MCF-7 Assay Data
• 17 -Estradiol
• 396 observed cell counts
                                               1998 17 -Estradiol E-SCREEN Data
  – 11 plates                     12 wells/plate            11 Plates      3 counts/well
                             1000000




                              800000
            Cells per Well




                              600000




                              400000




                              200000




                                   0
                                       CNTL   1e-13     1e-12      1e-11    1e-10   1e-9

                                                         Log10 Dose [M]
Estrogenic PCB Data

                 250000                                                      250000                                                                       250000

                                          PCB17                                                              PCB49                                                                 PCB66
                 200000                                                      200000                                                                       200000



                 150000                                                      150000                                                                       150000



                 100000                                                      100000                                                                       100000
Cells per Well




                 50000                                                       50000                                                                        50000



                     0                                                           0                                                                            0
                          CNTL   2.5e-6    5.0e-6   7.5e-61.0e-5    2.5e-5            CNTL 1.0e-6             2.5e-6             5.0e-6   7.5e-6 1.0e-5            CNTL   2.5e-6    5.0e-6   7.5e-6 1.0e-5   2.5e-5


                 250000                                                      250000

                                          PCB74                                                              PCB128
                 200000                                                      200000



                 150000                                                      150000



                 100000                                                      100000



                  50000                                                      50000



                      0                                                          0
                          CNTL   2.5e-6    5.0e-6   7.5e-6 1.0e-5   2.5e-5            CNTL          2.5e-6      5.0e-6   7.5e-6 1.0e-5           2.5e-5



                                                                                                    Log10 Dose [M]
MCF-7 Cell Culture Assay
• Dependent data – MCF-7 cell number
• Fixed effect – dose
  – 5 estradiol dose levels
  – 1 control dose
12-Well Plate
• Random effects, or variance components
  – Plate
  – Plate Dose interaction
  – Well within (Plate Dose)

                         CD FBS 5%    1E-11M E2


                          1E-9M E2    1E-12M E2


                          1E-10M E2   1E-13M E2
Generalized Linear Mixed Effects Model


g (Yijkm)          ijkm     d0       di      Pj      ( PD)ij W ( PD)k (ij)
where   Yijkm = cell number
        d0 = mean cell number of no-dose control (intercept; i = 0)
        di = fixed effect of ith dose (i = 1, 2, … , 5)
        Pj = random effect of jth plate (j = 1, 2, … , 21), P ~ N(0, P2)
        (PD)ij       = joint random effect of ith dose with jth plate, , PD ~ N(0, PD2)
        W(PD)k(ij) = random effect of kth well (k = 1, 2) nested within the ith dose
        and jth plate, , W ~ N(0, W2)
        In addition, the error term from Y = + is as follows:
         m(ijk) = random error of the mth count (m = 1, 2, 3)
Findings
• MCF-7 assay data
  – Gamma error distribution & reciprocal link
  – COV = 3.1 %
  – All variance components significant
Estrogenic PCB Data

                 250000                                                      250000                                                                       250000

                                          PCB17                                                              PCB49                                                                 PCB66
                 200000                                                      200000                                                                       200000



                 150000                                                      150000                                                                       150000



                 100000                                                      100000                                                                       100000
Cells per Well




                 50000                                                       50000                                                                        50000



                     0                                                           0                                                                            0
                          CNTL   2.5e-6    5.0e-6   7.5e-61.0e-5    2.5e-5            CNTL 1.0e-6             2.5e-6             5.0e-6   7.5e-6 1.0e-5            CNTL   2.5e-6    5.0e-6   7.5e-6 1.0e-5   2.5e-5


                 250000                                                      250000

                                          PCB74                                                              PCB128
                 200000                                                      200000



                 150000                                                      150000



                 100000                                                      100000



                  50000                                                      50000



                      0                                                          0
                          CNTL   2.5e-6    5.0e-6   7.5e-6 1.0e-5   2.5e-5            CNTL          2.5e-6      5.0e-6   7.5e-6 1.0e-5           2.5e-5



                                                                                                    Log10 Dose [M]
Increased sensitivity for
       detecting
  weakly estrogenic
environmental pollutants
Ambient Black Carbon From
   Traffic in an Urban
     Neighborhood
Baltimore Traffic Study
• Observe dynamics of ambient traffic-related
  pollutants at a location embedded within an
  urban residential neighborhood with high
  vehicular volume
Baltimore Traffic Study
• 2nd floor row house on commuter street
• Real-time sampling
• Near-simultaneous indoor/outdoor sampling
Baltimore Traffic Study
• Black carbon, PM, particle-bound
  PAH, CO, O3, NOx, VOCs
• Vehicle counts
• Meteorology
Missing Data

                                                              Observational
                                                                Interval        N       Missing
Black Carbon [ng/m3] ..................................             5         103,975     1,145
Vehicle Count [100s]....................................            5          93,482    11,638
Dew Point [5 C] ...........................................        30          15,801     1,719
Temperature [5 C] ........................................         30          15,801     1,719
BWI Atm. Pressure [10 hPa] ........................                60           8,758         2
BWI Precipitation [5 mm] ............................              60           8,756         4
BWI Wind Speed [m/s] ................................              60           8,760         0
AERMET Mixing Height [100 m] ................                      60           8,760         0
Impute Missing Data
• Regression prediction + N(0,      2)

    – Add pseudorandom variation
    – Minimize bias
•   2   estimated from regression
Meteorology Imputation
• External, simultaneous reference data
• BTS imputations estimated from BWI
  observations
• R2s = 0.98
Traffic Imputation
•   Internal, non-simultaneous reference data
•   Imputations estimated from own street’s data
•   Season, day-of-week, rush hour, time-of-day
•   Calvert R2 = 0.52, St. Paul R2 = 0.65
Optimal Time Series Model
Findings
• Neighborhood-level exposure to black carbon
  from mobile sources was 65.82 ng/m3 per 100
  vehicles
• Background exposure to black carbon without
  traffic was estimated to be 899.06 ng/m3
Findings
• Winds from the SW-S-SE quarter were
  associated with the greatest increases in
  black carbon
• Implicates atmospheric processes in
  transporting black carbon from
  – Baltimore’s central business district
  – Interstate highways
  – Regional and inter-regional sources
Longest
and highest resolution
  time series analysis
    I have ever seen
Without the
   autocorrelation term,
 the statistical relationship
       between traffic
and black carbon is reversed
Microenvironment Exposure
 Weights Can Be Obtained from a
Straightforward Statistical Model of
        Time-Location Data
Time-Location Data
• Basis for estimating total exposure
  – Amount of time in each microenvironment
  – Concentration in each microenvironment
• Structured diaries
Time-Weighted Exposure
• ith time interval
• jth microenvironment


                  N M
   Exposure twa = ∑ ∑ timeWeight ij × concentration ij
                  i   j
Generalized Logit Model
• Regression framework


                        P
       P[Y = j]
   log          = α j + ∑ β jp X p   j = 2, 3, ...,K
      P[Y = 1]          p=1
Time-Weighted Exposure
                                        time ij
 timeWeight ij =
                                    time total
                                   subjects ij
              =
                                  subjects total
              =                           p ij
                                    P
                            α j + ∑ β jpX p
                        ℯ          p= 1

                                           P
                                                    (j > 1)
              =         K         α k + ∑ β kpX p
                   1+   ∑     ℯ           p= 1


                        k=2
Outcome: Microenvironments
•   Indoor-home
•   Indoor-school
•   Indoor-other
•   Commuting
•   Outdoors
Subjects
• 95 children                       Sex
• 7 to 11-years-old    Age      Male Female
                      7 years    8      9
                         8       8      12
                         9       6      9
                        10       11     13
                        11       8      11
Time-Location
• 12 months
  – June 1995 – May 1996
• 4 days
  – Thursday - Monday
• 30-minute intervals
  – 0600 to 2030
Data
• N = 171,000
• 1,800 longitudinal observations/subject
• Missing observations
  – Imputation
Generalized Logit Model



   indoor home
                           time of day   sex
   indoor school
Pr indoor other = β 0j +   day of week + age
                           month         nonwhite
   commuting
                           lags 1 - 6    televisions
   outdoor
Time-of-Day: Weekday
                                    100%

                                                        Outdoor
• June                              90%
                                                               Commuting

• Thursday
             Percent Children [%]
                                    80%
                                             Indoor
                                              Other
                                    70%
                                               Indoor School

                                    60%                                      Indoor Home


                                    50%


                                    40%
                                       00

                                               00

                                               00

                                               00

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0

                                                 0
                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0

                                              :0
                                     6:

                                             7:

                                             8:

                                             9:
                                            10

                                            11

                                            12

                                            13

                                            14

                                            15

                                            16

                                            17

                                            18

                                            19

                                            20
                                                               Time-of-Day
Time-of-Day: Weekend
                                      100%
                                                                          Outdoor

• June                                90%                                            Commuting
                                                                                     Indoor Other
• Saturday
               Percent-Children [%]
                                      80%
                                                   Indoor
                                                   School

                                      70%

                                                                                                               Indoor Home
                                      60%


                                      50%


                                      40%
                                        00

                                             00

                                                  00

                                                       00

                                                                 0

                                                                       0

                                                                               0

                                                                                     0

                                                                                             0

                                                                                                   0

                                                                                                           0

                                                                                                                 0

                                                                                                                         0

                                                                                                                               0

                                                                                                                                       0
                                                             :0

                                                                     :0

                                                                           :0

                                                                                   :0

                                                                                         :0

                                                                                                 :0

                                                                                                       :0

                                                                                                               :0

                                                                                                                     :0

                                                                                                                             :0

                                                                                                                                   :0
                                       6:

                                             7:

                                                  8:

                                                       9:
                                                            10

                                                                  11

                                                                          12

                                                                                13

                                                                                        14

                                                                                              15

                                                                                                      16

                                                                                                            17

                                                                                                                    18

                                                                                                                          19

                                                                                                                                  20
                                                                                   Time-of-Day
Directly provides
      weights for
    estimating total
time-weighted exposure
Acrolein and Adult Asthma in a
  Nationally Representative
 Sample of the United States

       WORK IN PROGRESS
Acrolein
•   Air toxic
•   Aldehyde
•   Respiratory irritant
•   Hard to measure in ambient air
    – Improved methods becoming available
Acrolein
•   Industrial uses
•   Tobacco smoke
•   Mobile sources
•   Indoor air pollutant
2005 TRI: Acrolein
NATA 2005
• US EPA National-Scale Air Toxics Assessment
• Sources
  – Point
  – Non-point
  – Mobile
     • On-road
     • Off-road
• Secondary formation and decay
NATA 2005
•   National Emissions Inventory
•   Air monitoring data
•   Atmospheric dispersion modeling
•   Modeled exposure estimates
    – Every United States census tract
• No indoor sources assessed
NATA 2005
• Acrolein
  – Responsible for 75% respiratory non-cancer health
    effects nationwide
NHIS 2000 - 2009
• National Health Interview Survey
• Representative
  – United States
  – Non-institutionalized
  – Civilian
• Cross-sectional prevalence
NHIS 2000 - 2009
• Adults 18 years-old and over
• Self-reported asthma attack in previous 12
  months
NATA & NHIS
• NHIS subjects geographically linked to
  NATA acrolein exposure estimates
• Census tract
  – Survey subject residences
  – Area exposure estimates
• Individual-level analysis
Preliminary Findings
• At highest quintile of acrolein exposure
  – >0.055 g/m3
• pOR 1.11 [1.00:1.23]
• Controlling for
  smoking, sex, age, education, race, poverty,
  insurance, access to care, urban/rural
  residence, survey year
Feasible to conduct
national epidemiologic
   analysis for air toxic
with little measured data
First epidemiologic
evaluation of acrolein
Experimental Laboratory
Traffic Air Pollution
Multinomial Time-Location
Probabalistic National Survey
Biomarkers in Human Tissue
Toxicology
Gene Expression
Exposure Assessment
7 days x 24 hr/day @900 LPM
PM2.5                   SAEC 1i    Microarray 1i

                        SAEC 2i    Microarray 2i   Summer
 HVCI
                        SAEC 3i    Microarray 3i

  PUF
 Organic
 Extract
                        SAEC 1c    Microarray 1c
Clean PUF
                                                     Control
 Organic                SAEC 2c    Microarray 2c
 Extract                                           Field Blank
                        SAEC 3c    Microarray 3c
B. Rey de Castro, Sc.D.

rey.decastro@comcast.net

    +1 410 929 3583

  www.slideshare.net

Contenu connexe

Plus de REY DECASTRO

Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...
Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...
Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...REY DECASTRO
 
Association of Urinary Arsenic Species with Diet in a Representative Sample o...
Association of Urinary Arsenic Species with Diet in a Representative Sample o...Association of Urinary Arsenic Species with Diet in a Representative Sample o...
Association of Urinary Arsenic Species with Diet in a Representative Sample o...REY DECASTRO
 
Acrolein and COPD in a Nationally Representative Sample of United States Adul...
Acrolein and COPD in a Nationally Representative Sample of United States Adul...Acrolein and COPD in a Nationally Representative Sample of United States Adul...
Acrolein and COPD in a Nationally Representative Sample of United States Adul...REY DECASTRO
 
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...REY DECASTRO
 
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...REY DECASTRO
 
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...Acrolein and Adult Asthma in a Nationally Representative Sample of the United...
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...REY DECASTRO
 
Applications of Contemporary Statistical Approaches in Environmental Health M...
Applications of Contemporary Statistical Approaches in Environmental Health M...Applications of Contemporary Statistical Approaches in Environmental Health M...
Applications of Contemporary Statistical Approaches in Environmental Health M...REY DECASTRO
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data AnalysisREY DECASTRO
 
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...REY DECASTRO
 

Plus de REY DECASTRO (9)

Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...
Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...
Population-Weighted Exposure to 174 Air Toxics in a Representative Sample of...
 
Association of Urinary Arsenic Species with Diet in a Representative Sample o...
Association of Urinary Arsenic Species with Diet in a Representative Sample o...Association of Urinary Arsenic Species with Diet in a Representative Sample o...
Association of Urinary Arsenic Species with Diet in a Representative Sample o...
 
Acrolein and COPD in a Nationally Representative Sample of United States Adul...
Acrolein and COPD in a Nationally Representative Sample of United States Adul...Acrolein and COPD in a Nationally Representative Sample of United States Adul...
Acrolein and COPD in a Nationally Representative Sample of United States Adul...
 
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...
Bootstrap estimation of variance from ROC curve analysis of NHANES complex su...
 
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...
Perchlorate Exposure from Diet and Drinking Water in a Representative Sample ...
 
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...Acrolein and Adult Asthma in a Nationally Representative Sample of the United...
Acrolein and Adult Asthma in a Nationally Representative Sample of the United...
 
Applications of Contemporary Statistical Approaches in Environmental Health M...
Applications of Contemporary Statistical Approaches in Environmental Health M...Applications of Contemporary Statistical Approaches in Environmental Health M...
Applications of Contemporary Statistical Approaches in Environmental Health M...
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data Analysis
 
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...
Microenvironment Exposure Weights Can Be Obtained from a Straightforward Stat...
 

Dernier

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 

Dernier (20)

Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 

Applications of Contemporary Statistical Approaches in Environmental Health April 28 2011

  • 1. Applications of Contemporary Statistical Approaches in Environmental Health B. Rey de Castro, Sc.D. before the CDC Emergency Response & Air Toxicants Branch Atlanta, GA April 28, 2011
  • 7. Statistical Characterization of an MCF-7 Cell Culture Assay
  • 8. MCF-7 Cell Culture Assay • Cell culture assay for estrogenic potency – E-SCREEN • MCF-7 cell number increases with dose of 17 -estradiol, xenoestrogens
  • 9. MCF-7 Assay Data • 17 -Estradiol • 396 observed cell counts 1998 17 -Estradiol E-SCREEN Data – 11 plates 12 wells/plate 11 Plates 3 counts/well 1000000 800000 Cells per Well 600000 400000 200000 0 CNTL 1e-13 1e-12 1e-11 1e-10 1e-9 Log10 Dose [M]
  • 10. Estrogenic PCB Data 250000 250000 250000 PCB17 PCB49 PCB66 200000 200000 200000 150000 150000 150000 100000 100000 100000 Cells per Well 50000 50000 50000 0 0 0 CNTL 2.5e-6 5.0e-6 7.5e-61.0e-5 2.5e-5 CNTL 1.0e-6 2.5e-6 5.0e-6 7.5e-6 1.0e-5 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 250000 250000 PCB74 PCB128 200000 200000 150000 150000 100000 100000 50000 50000 0 0 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 Log10 Dose [M]
  • 11. MCF-7 Cell Culture Assay • Dependent data – MCF-7 cell number • Fixed effect – dose – 5 estradiol dose levels – 1 control dose
  • 12. 12-Well Plate • Random effects, or variance components – Plate – Plate Dose interaction – Well within (Plate Dose) CD FBS 5% 1E-11M E2 1E-9M E2 1E-12M E2 1E-10M E2 1E-13M E2
  • 13. Generalized Linear Mixed Effects Model g (Yijkm) ijkm d0 di Pj ( PD)ij W ( PD)k (ij) where Yijkm = cell number d0 = mean cell number of no-dose control (intercept; i = 0) di = fixed effect of ith dose (i = 1, 2, … , 5) Pj = random effect of jth plate (j = 1, 2, … , 21), P ~ N(0, P2) (PD)ij = joint random effect of ith dose with jth plate, , PD ~ N(0, PD2) W(PD)k(ij) = random effect of kth well (k = 1, 2) nested within the ith dose and jth plate, , W ~ N(0, W2) In addition, the error term from Y = + is as follows: m(ijk) = random error of the mth count (m = 1, 2, 3)
  • 14. Findings • MCF-7 assay data – Gamma error distribution & reciprocal link – COV = 3.1 % – All variance components significant
  • 15. Estrogenic PCB Data 250000 250000 250000 PCB17 PCB49 PCB66 200000 200000 200000 150000 150000 150000 100000 100000 100000 Cells per Well 50000 50000 50000 0 0 0 CNTL 2.5e-6 5.0e-6 7.5e-61.0e-5 2.5e-5 CNTL 1.0e-6 2.5e-6 5.0e-6 7.5e-6 1.0e-5 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 250000 250000 PCB74 PCB128 200000 200000 150000 150000 100000 100000 50000 50000 0 0 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 CNTL 2.5e-6 5.0e-6 7.5e-6 1.0e-5 2.5e-5 Log10 Dose [M]
  • 16. Increased sensitivity for detecting weakly estrogenic environmental pollutants
  • 17. Ambient Black Carbon From Traffic in an Urban Neighborhood
  • 18. Baltimore Traffic Study • Observe dynamics of ambient traffic-related pollutants at a location embedded within an urban residential neighborhood with high vehicular volume
  • 19. Baltimore Traffic Study • 2nd floor row house on commuter street • Real-time sampling • Near-simultaneous indoor/outdoor sampling
  • 20. Baltimore Traffic Study • Black carbon, PM, particle-bound PAH, CO, O3, NOx, VOCs • Vehicle counts • Meteorology
  • 21. Missing Data Observational Interval N Missing Black Carbon [ng/m3] .................................. 5 103,975 1,145 Vehicle Count [100s].................................... 5 93,482 11,638 Dew Point [5 C] ........................................... 30 15,801 1,719 Temperature [5 C] ........................................ 30 15,801 1,719 BWI Atm. Pressure [10 hPa] ........................ 60 8,758 2 BWI Precipitation [5 mm] ............................ 60 8,756 4 BWI Wind Speed [m/s] ................................ 60 8,760 0 AERMET Mixing Height [100 m] ................ 60 8,760 0
  • 22. Impute Missing Data • Regression prediction + N(0, 2) – Add pseudorandom variation – Minimize bias • 2 estimated from regression
  • 23. Meteorology Imputation • External, simultaneous reference data • BTS imputations estimated from BWI observations • R2s = 0.98
  • 24. Traffic Imputation • Internal, non-simultaneous reference data • Imputations estimated from own street’s data • Season, day-of-week, rush hour, time-of-day • Calvert R2 = 0.52, St. Paul R2 = 0.65
  • 25.
  • 26.
  • 28.
  • 29. Findings • Neighborhood-level exposure to black carbon from mobile sources was 65.82 ng/m3 per 100 vehicles • Background exposure to black carbon without traffic was estimated to be 899.06 ng/m3
  • 30. Findings • Winds from the SW-S-SE quarter were associated with the greatest increases in black carbon • Implicates atmospheric processes in transporting black carbon from – Baltimore’s central business district – Interstate highways – Regional and inter-regional sources
  • 31. Longest and highest resolution time series analysis I have ever seen
  • 32. Without the autocorrelation term, the statistical relationship between traffic and black carbon is reversed
  • 33. Microenvironment Exposure Weights Can Be Obtained from a Straightforward Statistical Model of Time-Location Data
  • 34. Time-Location Data • Basis for estimating total exposure – Amount of time in each microenvironment – Concentration in each microenvironment • Structured diaries
  • 35. Time-Weighted Exposure • ith time interval • jth microenvironment N M Exposure twa = ∑ ∑ timeWeight ij × concentration ij i j
  • 36. Generalized Logit Model • Regression framework P P[Y = j] log = α j + ∑ β jp X p j = 2, 3, ...,K P[Y = 1] p=1
  • 37. Time-Weighted Exposure time ij timeWeight ij = time total subjects ij = subjects total = p ij P α j + ∑ β jpX p ℯ p= 1 P (j > 1) = K α k + ∑ β kpX p 1+ ∑ ℯ p= 1 k=2
  • 38. Outcome: Microenvironments • Indoor-home • Indoor-school • Indoor-other • Commuting • Outdoors
  • 39. Subjects • 95 children Sex • 7 to 11-years-old Age Male Female 7 years 8 9 8 8 12 9 6 9 10 11 13 11 8 11
  • 40. Time-Location • 12 months – June 1995 – May 1996 • 4 days – Thursday - Monday • 30-minute intervals – 0600 to 2030
  • 41. Data • N = 171,000 • 1,800 longitudinal observations/subject • Missing observations – Imputation
  • 42. Generalized Logit Model indoor home time of day sex indoor school Pr indoor other = β 0j + day of week + age month nonwhite commuting lags 1 - 6 televisions outdoor
  • 43. Time-of-Day: Weekday 100% Outdoor • June 90% Commuting • Thursday Percent Children [%] 80% Indoor Other 70% Indoor School 60% Indoor Home 50% 40% 00 00 00 00 0 0 0 0 0 0 0 0 0 0 0 :0 :0 :0 :0 :0 :0 :0 :0 :0 :0 :0 6: 7: 8: 9: 10 11 12 13 14 15 16 17 18 19 20 Time-of-Day
  • 44. Time-of-Day: Weekend 100% Outdoor • June 90% Commuting Indoor Other • Saturday Percent-Children [%] 80% Indoor School 70% Indoor Home 60% 50% 40% 00 00 00 00 0 0 0 0 0 0 0 0 0 0 0 :0 :0 :0 :0 :0 :0 :0 :0 :0 :0 :0 6: 7: 8: 9: 10 11 12 13 14 15 16 17 18 19 20 Time-of-Day
  • 45. Directly provides weights for estimating total time-weighted exposure
  • 46. Acrolein and Adult Asthma in a Nationally Representative Sample of the United States WORK IN PROGRESS
  • 47. Acrolein • Air toxic • Aldehyde • Respiratory irritant • Hard to measure in ambient air – Improved methods becoming available
  • 48. Acrolein • Industrial uses • Tobacco smoke • Mobile sources • Indoor air pollutant
  • 50. NATA 2005 • US EPA National-Scale Air Toxics Assessment • Sources – Point – Non-point – Mobile • On-road • Off-road • Secondary formation and decay
  • 51. NATA 2005 • National Emissions Inventory • Air monitoring data • Atmospheric dispersion modeling • Modeled exposure estimates – Every United States census tract • No indoor sources assessed
  • 52. NATA 2005 • Acrolein – Responsible for 75% respiratory non-cancer health effects nationwide
  • 53.
  • 54. NHIS 2000 - 2009 • National Health Interview Survey • Representative – United States – Non-institutionalized – Civilian • Cross-sectional prevalence
  • 55. NHIS 2000 - 2009 • Adults 18 years-old and over • Self-reported asthma attack in previous 12 months
  • 56. NATA & NHIS • NHIS subjects geographically linked to NATA acrolein exposure estimates • Census tract – Survey subject residences – Area exposure estimates • Individual-level analysis
  • 57. Preliminary Findings • At highest quintile of acrolein exposure – >0.055 g/m3 • pOR 1.11 [1.00:1.23] • Controlling for smoking, sex, age, education, race, poverty, insurance, access to care, urban/rural residence, survey year
  • 58. Feasible to conduct national epidemiologic analysis for air toxic with little measured data
  • 68. 7 days x 24 hr/day @900 LPM PM2.5 SAEC 1i Microarray 1i SAEC 2i Microarray 2i Summer HVCI SAEC 3i Microarray 3i PUF Organic Extract SAEC 1c Microarray 1c Clean PUF Control Organic SAEC 2c Microarray 2c Extract Field Blank SAEC 3c Microarray 3c
  • 69. B. Rey de Castro, Sc.D. rey.decastro@comcast.net +1 410 929 3583 www.slideshare.net