SlideShare une entreprise Scribd logo
1  sur  69
Chicago Health Atlas:
  The Promise, Process, and Problems of
  using electronic health record data for
            population health
                     April 4, 2013

Abel Kho, MD               Roderick (Eric) Jones, MPH
Northwestern University    Chicago Dept. Public Health
Session Preview
• What is the Chicago Health Atlas?
• The Promise:
   Contextual factors that play a role in the
   collaboration
• The Process:
   Getting started, developing matching algorithms,
   minimizing reidentification risk
• The Problems:
   Deriving meaning and delivering it to people who
   can use it
Chicago Health Atlas is a . . .
       collaboration

• Informatics researchers from multiple
  healthcare institutions
• Chicago Regional Extension Center
  (CHITREC)
• Chicago Community Trust
• Chicago Department of Public Health
Chicago Health Atlas is a . . .
         website
Chicago Health Atlas is a . . .
         database

• De-identified electronic health record
  data for ~1 million Chicagoans
• In-patient and out-patient visits spanning
  2006-2011
• Individual patient records matched
  across institutions
Chicago Context:

Person, Place, Time
Chicago: Person, Place, Time

                     Percent change, Percent of total
      Group
                        2000-2010        in 2010

Chicago                     7           [2.7 million]

Non-Hispanic black         17                32

Non-Hispanic white          6                32
Hispanic                    3                29
Non-Hispanic Asian         14                5
Chicago: Person, Place, Time
                                           229 Square miles
                                           77 neighborhood “Community areas”
                           Lake Michigan
                                           with population median of 31,000
O’Hare
                                           (range, 3,000 – 99,000)
                                           Stem Leaf                     #    Boxplot
                                                 9 9                          1     0
                                                 9 4                          1     |
                                                 8                                  |
                                                 8 02                         2     |
                              Loop
                                                 7 99                         2     |
                                                 7 23                         2     |
                                                 6                                  |
                                                 6 44                         2     |
                                                 5 556667                     6     |
Suburban Cook County                             5 223                        3     |
                                                 4 559                        3 +-----+
                                                 4 0124                       4 |     |
                  Midway                         3 5666799                    7 | + |
                                                 3 01112233                   8 *-----*
                                                 2 55669                      5 |     |
                                                 2 01123334                   8 |     |
                                                 1 568888899                  9 +-----+
                                                 1 01233334                   8     |
                                                 0 6679                       4     |
                                                 0 33                         2     |
                                                   ----+----+----+----+
                                               Multiply Stem.Leaf by 10**+4
                                           All but two community areas have
                                           larger populations than the least-
                                           populated Illinois county
Chicago Context:
Healthy Chicago sets goals for. . .


 • Public policy and legislation (n=56)
 • Health education and awareness (n=45)
 • Interventions and programs (n=92)
HEALTHY CHICAGO
Chicago Department of Public Health




                                      Infrastructure
Highlights
       Infrastructure




• Establish an Office of Epidemiology and
  Public Health Informatics

• Expand epidemiology capacity through an
  increase in staff and the development of
  strategic partnerships with other entities who
  use or collect public health data
NYC Macroscope
     Scientific Advisory Group

• New York City has embarked on a study to
  validate population health estimates from its
  Primary Care Information Project

• CDPH involvement has lead to collaboration
  on developing vision and methodology for
  more widespread use of EHR data for public
  health
Highlights
         Infrastructure



• Increase the
  availability of
  public health data
  through the City
  of Chicago
  website
Chicago Context:




Health Information Exchange
Illinois Regional
Health Information Exchanges
Even if we don’t have a mature
HIE or a Regenstrief Institute,
is it possible to . . .

• Leverage existing EHR data
• Weave together data from multiple
  institutions with publicly available data
• Measure disease burden and care delivered?
Design Considerations
• Limit sharing of any protected health
  information
• Yet account for care of the same patient
  at multiple institutions
• Protect anonymity of
  patients/providers/institutions
• Enable linkage to new information and
  sources as it becomes available
  – Patient level
  – Geographic location
Process – getting started
• Coordinated IRB approval across multiple
  institutions.
   – Constrained to adults aged 18-89
   – Limited to structured data, no free text
   – Focus on 606xx zip codes, with known
     overlapping care institutions and high
     population density
• Instead of an EMPI, create a lightweight
  software application to pass identifiers through
  a standard set of preprocessing steps, and then
  “hash” the data
Process

Hashing and Matching Methods
How we “Hashed” our Data

-Hash algorithms accept variable size input messages and produce a small
fixed-size output called a hash value or message digest

-The hash is non-degenerate; only 1 input message per final hash value

-The hash is 1-way; Easy to go from message to hash value, very hard to go
from hash value to message.

-We initially used an early hash, Secure Hash Algorithm-1 (SHA-1).




http://csrc.nist.gov/publications/nistbul/b-May-2008.pdf
Preliminary SHA-1 Single
             Institution Validation
5-Variable Hash
                                           Concatenate WilliamGalanter22732M123456789
 William   Galanter   3/31/1962 M    SSN


WilliamGalanter22732M123456789      SHA1    20802322ED366A1EFD562A6219C4D7AF993BADAD




4-Variable Hash

 William   Galanter   3/31/1962 M Concatenate & SHA112345678901234567890123456789012345
Updated Hash Method

•   SHA-1 was found to have a potential security issue, moved to a
    second generation Hash, SHA-512* (512 bit)

•   Significant focus on data pre-processing / normalization

•   Trimming spaces and non A-Z characters, lower case
         _Jimmy__ O’Brien Jr. jimmy, obrien

•   Remove “-” from SSN and remove all invalid combinations
•   Only allow Birth year >1921
•   Use “F” and “M” for sex
•   Replace missing elements with missing data indicators




*http://csrc.nist.gov/groups/ST/toolkit/secure_hashing.html
Updated Hash Method (cont.)
•   Creates 5 hash IDs (with probability weights) depending on availability of
    last name, first name, date of birth (DOB), gender, SSN.

     –   All data available (1.0)
     –   All fields except; no DOB, or no First and last name, or no SSN (0.3)
     –   All fields, but only first three letters of names available (0.1)
     –   SOUNDEX codes (phonetic equivalents) of the first and last name plus
         date of birth and gender (0.1)

•    Wrapped up into a standalone Java program
•   Can readily consume other data sources (e.g. Social Security Death Index
    Tables)
Diabetes
 (250.xx)
                        Institution A                        Institution C/
                                               Hash ID-1     Honest Broker
    John                     john
                                               Hash ID-2
  O’Dwyer      Pre-         odwyer      Hash   Hash ID-3
 6/12/1970                 06121970
              Process
                          987654329     Fxn    Hash ID-4
987-65-4329                                    Hash ID-5
     M                         m
                                                           Replace
                                                           Matched    StudyID
                                                           HashIDs    250.xx
                                                            with      401.xx
                                                           Unique
  John                       john                          StudyID
 O dwyer                                       Hash ID-1
               Pre-         odwyer      Hash   Hash ID-2
 6/12/70                   06121970
  male
              Process                   Fxn    Hash ID-3
                               m               Hash ID-4
                                               Hash ID-5


   HTN                  Institution B
 (401.xx)
Data Dictionary
• Standardized data specifications for data
  extractions from participating sites
  – Demographics
  – Vital signs
  – Diagnoses
     • Study ID | Month/Year | Encounter type | Encounter
       number | Diagnosis code
  – Medications
  – Laboratory tests
     • Study ID | Month/Year | Lab test name | Result |
       Units | Normal Range | Specimen type
Process

Privacy and Re-Identification
      Considerations

    Courtesy of Brad Malin
     Vanderbilt University
De-Identified Health Information

    De-identified health information neither identifies nor provides
    a reasonable basis to identify an individual. There are two
    ways to de-identify information; either:
(1)      a formal determination by a qualified statistician;
(2)      the removal of specified identifiers of the individual and of
         the individual’s relatives, household members, and
         employers is required, and is adequate only if the covered
         entity has no actual knowledge that the remaining
         information could be used to identify the individual.




                                                                         29
HIPAA Expert Determination
        (abridged)
 Certify via “generally accepted
 statistical and scientific principles &
 methods, that the risk is very small
 that the information could be used,
 alone or in combination with other
 reasonably available information, by
 the anticipated recipient to identify the
 subject of the information.”
                                             30
Uniqueness Analysis

     Model         Uniques (%) Uniques (People)
     Safe Harbor   0.000064%   13
Uniqueness Analysis

     Model               Uniques (%) Uniques (People)
     Safe Harbor         0.000064%   13
     Chicago Health Atlas 0.3%       8,050
Uniqueness Analysis

     Model               Uniques (%) Uniques (People)
     Safe Harbor         0.000064%   13
     Chicago Health Atlas 0.3%       8,050
Completing the Re-identification
             Requires Resources

                     Safe Harbored          •   Could link to registries
                        Records                  – Birth     – Marriage
                                                 – Death     – Divorce




   Identified          Identified                              Identified
Clinical Records   Population Records                          Resource


                                            •   What’s in vogue?
                                                      Voter registration DBs
                     Chicago Health
                      Atlas Model



                         Benitez & Malin. JAMIA. 2010.
Risk will Vary Across Regions
                Voter Registration Databases
                IL                      MN                      TN       WA     WI
WHO             Registered Political    MN Voters               Anyone Anyone Anyone
                Committees
                (ANYONE – In Person)
Format          Disk                    Disk                    Disk     Disk   Disk
Cost            $500                    $46; “use ONLY for      $2500    $30    $12,500
                                        elections, political
                                        activities, or law
                                        enforcement”
Name                                                                             
Address                                                                          
Date of Birth                                                           
Sex                                                                      
Race                                                                 
                                Benitez & Malin. JAMIA. 2010.
Uniqueness Analysis

     Model               Uniques (%) Uniques (People)
     Safe Harbor         0.000064%   13
     Chicago Health Atlas 0.3%       8,050
Uniqueness Analysis
     Model                 Uniques (%) Uniques (People)
     Safe Harbor           0.000064%      13
     Chicago Health Atlas 0.3%            8,050

     Linked to Voter Registration
     Safe Harbor           Really small   0
     Chicago Health Atlas 0.004%          80
     Linked to Voter Reg
Uniqueness Analysis
     Model                 Uniques (%) Uniques (People)
     Safe Harbor           0.000064%      13
     Chicago Health Atlas 0.3%            8,050

     Linked to Voter Registration
     Safe Harbor           Really small   0
     Chicago Health Atlas 0.004%          80
     Linked to Voter Reg
Next Steps
• Consider re-identification risk options
  – Coarsen ZIP codes
  – Coarsen Ethnicities
  – Coarsen Age groups


• Search* for tradeoffs between information
  utility (e.g., epidemiologic findings) and
  privacy (i.e., re-identification risk)
              *Benitez & Malin. JAMIA. 2011.
Findings

A promising source of prevalence estimates
Data contribution summary,
               April 2013
          Data Type                        Institution
                   1                      2 3 4 5         6
    Demographics C                       C C C C         PC
    Diagnoses      C                     C C C C         PC
    Visit type     C                     C C C C         PC
    BMI, BP        C                     PP N N N        PC
    Glucose, HbA1c C                     C C N N         PC
    Medications    C                     C C N N         PC
C: complete; N: not yet incorporated;
PP: partial time period; PC: partial cohort
How many patients receive care
    at more than one institution?
  No. of institutions Number                                               %         Cumulative %

                  4 or 5                               393                0.0                  0.0

                       3                             8,409                0.9                  0.9

                       2                           74,372                 7.6                  8.5

                       1                          892,468 91.4                                100.0
Includes the 5 institutions with all patient visits 2006-2010 submitted (as of April 2013).
Sample size/cohort comparison,
    by residential ZIP code,
    BRFSS* vs. Chicago Health Atlas
Source      Min Median Mean Max
IL BRFSS, Chicago
2011 respondents               4          15           16         33

Chicago Health
Atlas, patient with        1,339 10,031 9,270 21,289
2010 visit


*CDC Behavioral Risk Factor Surveillance System survey, Chicago
sub-sample from Illinois dataset.
Diabetes prevalence estimate
by residential ZIP


Percent=

# of patients with > 1 diabetes mellitus diagnosis code

  # of patients with visit in 2006-2010
No, patient does not
                                                        have type 2 diabetes

Finding type 2 diabetes
in the health record
• Diagnosis codes
• Labs
• Medications
• Number of visits   Yes, patient has type 2 diabetes
Diabetes prevalence estimate
by residential ZIP


Percent=

# of patients with > 1 diabetes mellitus diagnosis code
                    or lab criteria met

  # of patients with visit in 2006-2010
Percent of Atlas patients with
  diabetes diagnosis in 2006-2010
 Percent




                             Minimum number of visits recorded

Illinois BRFSS estimates the prevalence of diabetes in Chicago at 9-11%.
Hypertension prevalence estimate
by residential ZIP


Percent=

# of patients with > 1 hypertension diagnosis code

  # of patients with visit in 2006-2010
Coronary heart disease prevalence
estimate
by residential ZIP

Percent=

# of patients with > 1 CHD diagnosis code

  # of patients with visit in 2006-2010
Gun shot wound prevalence
estimate
by residential ZIP

Percent=

# of patients with > 1 gun shot wound diagnosis code

  # of patients with visit in 2006-2010
Problem

  Applying estimates to Chicago
– rather than patient – populations
Age distribution comparison, 2010
Percent




            Age groups
Race-ethnicity comparison

                        Percent of total
      Group
                     Atlas        2010 Census

Non-Hispanic black    31              32

Non-Hispanic white    20              32

Hispanic              14              29
Non-Hispanic Asian    4               5
Not given/Unknown     31              0
Geographic coverage
by residential ZIP


 Percent=

 # of patients with visit in 2010

   2010 Census population




                                    Additional text
Problem

ZIP Codes aren’t meaningful
     geographic units
Imputation of ZIP code rates to
       community area
           Diabetes hospitalization, 2010
                                               Imputed using age, sex,
Rates by ZIP         Imputed using age & sex   & race-ethnicity




   Additional text
Imputation of ZIP code rates to
       community area
           Diabetes hospitalization, 2010
                                               Imputed using age, sex,
Rates by ZIP         Imputed using age & sex   & race-ethnicity




   Additional text
Maps courtesy of Chieko Maene, University of Chicago, as part of CDPH-UC Diabetes Translational Research Collaboration.
Dasymetric areal interpolation
1. Calculate for each ZIP code
    Male & female x 19 age groups = 28 rates
                       or
   Male & female x 19 age groups x 4 race-ethnicity
   groups = 84 rates
2. Apply rates to corresponding population group
   in each census block to get counts
3. Sum counts to Community area
4. Calculate rates based on community area
   population denominators
Dataset description elements


•   Description (who, what, where, when)
•   Definitions
•   Calculations and formulas
•   Limitations, disclaimers, sources of error
•   Benchmarks and references
Chicago Health Atlas Funders


• Otho S.A. Sprague Institute

• Northwestern Memorial Hospital
  Community Engagement
Health Atlas Team
• Northwestern University: John Cashy, Anna Roberts, Sara
  Lake
• Univ. of Illinois-Chicago: Bill Galanter, John Lazaro
• Cook County Hospital System: Bala Hota, Amanda Grasso
• Univ. of Chicago Medical Center: Chris Lyttle, Ben Vekhter,
  David Meltzer
• Alliance of Chicago: Erin Kaleba, Fred Rachman, Jermaine
  Dellahousaye
• Rush University Medical Center: Shannon Sims, Aaron Tabor
• Vanderbilt University: Brad Malin
• UIC Intern team: Ariadna Garcia, Pravin Babu Karuppaiah,
  Shazia Sathar, Ulas Keles (Sid Battacharya, Faculty mentor)
facebook.com/ChicagoPublicHealth             @ChiPublicHealth


HealthyChicago@CityofChicago.org             312.747.9884



                  CityofChicago.org/Health

Contenu connexe

Plus de Chicago Department of Public Health

Plus de Chicago Department of Public Health (20)

Big City Tobacco Control
Big City Tobacco ControlBig City Tobacco Control
Big City Tobacco Control
 
What gets Measured Gets Done: Data Needs, Uses and Innovations in Large Urban...
What gets Measured Gets Done: Data Needs, Uses and Innovations in Large Urban...What gets Measured Gets Done: Data Needs, Uses and Innovations in Large Urban...
What gets Measured Gets Done: Data Needs, Uses and Innovations in Large Urban...
 
2014 National Network of Public Health Institutes Open Forum: The Launch of H...
2014 National Network of Public Health Institutes Open Forum: The Launch of H...2014 National Network of Public Health Institutes Open Forum: The Launch of H...
2014 National Network of Public Health Institutes Open Forum: The Launch of H...
 
Local Strategies to Address Homelessness
Local Strategies to Address HomelessnessLocal Strategies to Address Homelessness
Local Strategies to Address Homelessness
 
Healthy Chicago and the ACA 2.0: Creating Opportunities for Innovation
Healthy Chicago and the ACA 2.0:  Creating Opportunities for InnovationHealthy Chicago and the ACA 2.0:  Creating Opportunities for Innovation
Healthy Chicago and the ACA 2.0: Creating Opportunities for Innovation
 
Wisconsin Public Health Association Annual Conference Keynote Presentation
Wisconsin Public Health Association Annual Conference Keynote PresentationWisconsin Public Health Association Annual Conference Keynote Presentation
Wisconsin Public Health Association Annual Conference Keynote Presentation
 
Current State of the HIV Epidemic Among MSM in Chicago
Current State of the HIV Epidemic Among MSM in ChicagoCurrent State of the HIV Epidemic Among MSM in Chicago
Current State of the HIV Epidemic Among MSM in Chicago
 
Partnership to Improve Children's Health
Partnership to Improve Children's HealthPartnership to Improve Children's Health
Partnership to Improve Children's Health
 
Healthy Chicago and Health Care Reform at University of Chicago MacLean Center
Healthy Chicago and Health Care Reform at University of Chicago MacLean CenterHealthy Chicago and Health Care Reform at University of Chicago MacLean Center
Healthy Chicago and Health Care Reform at University of Chicago MacLean Center
 
Annual Healthy Chicago 2013 Year In Review and Awards Presentation
Annual Healthy Chicago 2013 Year In Review and Awards PresentationAnnual Healthy Chicago 2013 Year In Review and Awards Presentation
Annual Healthy Chicago 2013 Year In Review and Awards Presentation
 
Gold Humanism Society Lori Ann Roscetti Annual Memorial Lecture on Ethical & ...
Gold Humanism Society Lori Ann Roscetti Annual Memorial Lecture on Ethical & ...Gold Humanism Society Lori Ann Roscetti Annual Memorial Lecture on Ethical & ...
Gold Humanism Society Lori Ann Roscetti Annual Memorial Lecture on Ethical & ...
 
Interdisciplinary Medicine III Lecture – Community Engagement and Service Unit
Interdisciplinary Medicine III Lecture – Community Engagement and Service UnitInterdisciplinary Medicine III Lecture – Community Engagement and Service Unit
Interdisciplinary Medicine III Lecture – Community Engagement and Service Unit
 
Chicago's Action Plan for Healthy Adolescents
Chicago's Action Plan for Healthy AdolescentsChicago's Action Plan for Healthy Adolescents
Chicago's Action Plan for Healthy Adolescents
 
Building a Healthier Chicago
Building a Healthier ChicagoBuilding a Healthier Chicago
Building a Healthier Chicago
 
Institute for Public Health and Medicine Seminar Series
Institute for Public Health and Medicine Seminar SeriesInstitute for Public Health and Medicine Seminar Series
Institute for Public Health and Medicine Seminar Series
 
Perspectives Charter High School Internship Day
Perspectives Charter High School Internship DayPerspectives Charter High School Internship Day
Perspectives Charter High School Internship Day
 
University of Illinois at Chicago Patient Centered Medicine Scholars Program
 University of Illinois at Chicago Patient Centered Medicine Scholars Program  University of Illinois at Chicago Patient Centered Medicine Scholars Program
University of Illinois at Chicago Patient Centered Medicine Scholars Program
 
The Chicago Cultural Plan and Healthy Chicago
The Chicago Cultural Plan and Healthy ChicagoThe Chicago Cultural Plan and Healthy Chicago
The Chicago Cultural Plan and Healthy Chicago
 
Neurodevelopmental Disorders Associated with Prenatal Exposure to Alcohol
Neurodevelopmental Disorders Associated with Prenatal Exposure to AlcoholNeurodevelopmental Disorders Associated with Prenatal Exposure to Alcohol
Neurodevelopmental Disorders Associated with Prenatal Exposure to Alcohol
 
Transforming the Health of the Community Snively Lecture
Transforming the Health of the Community Snively LectureTransforming the Health of the Community Snively Lecture
Transforming the Health of the Community Snively Lecture
 

Dernier

Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
Sheetaleventcompany
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
adilkhan87451
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
mahaiklolahd
 

Dernier (20)

Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
Coimbatore Call Girls in Thudiyalur : 7427069034 High Profile Model Escorts |...
 
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
 
Kollam call girls Mallu aunty service 7877702510
Kollam call girls Mallu aunty service 7877702510Kollam call girls Mallu aunty service 7877702510
Kollam call girls Mallu aunty service 7877702510
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
 
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
 
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 8250077686 Top Class Call Girl Service Available
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
 
Call Girls Mumbai Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Mumbai Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Mumbai Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Mumbai Just Call 8250077686 Top Class Call Girl Service Available
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
 
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
Jogeshwari ! Call Girls Service Mumbai - 450+ Call Girl Cash Payment 90042684...
 
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
 
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
💕SONAM KUMAR💕Premium Call Girls Jaipur ↘️9257276172 ↙️One Night Stand With Lo...
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
Top Rated Pune Call Girls (DIPAL) ⟟ 8250077686 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls (DIPAL) ⟟ 8250077686 ⟟ Call Me For Genuine Sex Serv...Top Rated Pune Call Girls (DIPAL) ⟟ 8250077686 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls (DIPAL) ⟟ 8250077686 ⟟ Call Me For Genuine Sex Serv...
 
Call Girls Coimbatore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 8250077686 Top Class Call Girl Service Available
 

Chicago Health Atlas: The Promise, Process, and Problems of using electronic health record data for population health

  • 1. Chicago Health Atlas: The Promise, Process, and Problems of using electronic health record data for population health April 4, 2013 Abel Kho, MD Roderick (Eric) Jones, MPH Northwestern University Chicago Dept. Public Health
  • 2. Session Preview • What is the Chicago Health Atlas? • The Promise: Contextual factors that play a role in the collaboration • The Process: Getting started, developing matching algorithms, minimizing reidentification risk • The Problems: Deriving meaning and delivering it to people who can use it
  • 3. Chicago Health Atlas is a . . . collaboration • Informatics researchers from multiple healthcare institutions • Chicago Regional Extension Center (CHITREC) • Chicago Community Trust • Chicago Department of Public Health
  • 4. Chicago Health Atlas is a . . . website
  • 5. Chicago Health Atlas is a . . . database • De-identified electronic health record data for ~1 million Chicagoans • In-patient and out-patient visits spanning 2006-2011 • Individual patient records matched across institutions
  • 7. Chicago: Person, Place, Time Percent change, Percent of total Group 2000-2010 in 2010 Chicago 7 [2.7 million] Non-Hispanic black 17 32 Non-Hispanic white 6 32 Hispanic 3 29 Non-Hispanic Asian 14 5
  • 8. Chicago: Person, Place, Time 229 Square miles 77 neighborhood “Community areas” Lake Michigan with population median of 31,000 O’Hare (range, 3,000 – 99,000) Stem Leaf # Boxplot 9 9 1 0 9 4 1 | 8 | 8 02 2 | Loop 7 99 2 | 7 23 2 | 6 | 6 44 2 | 5 556667 6 | Suburban Cook County 5 223 3 | 4 559 3 +-----+ 4 0124 4 | | Midway 3 5666799 7 | + | 3 01112233 8 *-----* 2 55669 5 | | 2 01123334 8 | | 1 568888899 9 +-----+ 1 01233334 8 | 0 6679 4 | 0 33 2 | ----+----+----+----+ Multiply Stem.Leaf by 10**+4 All but two community areas have larger populations than the least- populated Illinois county
  • 10. Healthy Chicago sets goals for. . . • Public policy and legislation (n=56) • Health education and awareness (n=45) • Interventions and programs (n=92)
  • 11. HEALTHY CHICAGO Chicago Department of Public Health Infrastructure
  • 12. Highlights Infrastructure • Establish an Office of Epidemiology and Public Health Informatics • Expand epidemiology capacity through an increase in staff and the development of strategic partnerships with other entities who use or collect public health data
  • 13. NYC Macroscope Scientific Advisory Group • New York City has embarked on a study to validate population health estimates from its Primary Care Information Project • CDPH involvement has lead to collaboration on developing vision and methodology for more widespread use of EHR data for public health
  • 14. Highlights Infrastructure • Increase the availability of public health data through the City of Chicago website
  • 17. Even if we don’t have a mature HIE or a Regenstrief Institute, is it possible to . . . • Leverage existing EHR data • Weave together data from multiple institutions with publicly available data • Measure disease burden and care delivered?
  • 18.
  • 19. Design Considerations • Limit sharing of any protected health information • Yet account for care of the same patient at multiple institutions • Protect anonymity of patients/providers/institutions • Enable linkage to new information and sources as it becomes available – Patient level – Geographic location
  • 20. Process – getting started • Coordinated IRB approval across multiple institutions. – Constrained to adults aged 18-89 – Limited to structured data, no free text – Focus on 606xx zip codes, with known overlapping care institutions and high population density • Instead of an EMPI, create a lightweight software application to pass identifiers through a standard set of preprocessing steps, and then “hash” the data
  • 22. How we “Hashed” our Data -Hash algorithms accept variable size input messages and produce a small fixed-size output called a hash value or message digest -The hash is non-degenerate; only 1 input message per final hash value -The hash is 1-way; Easy to go from message to hash value, very hard to go from hash value to message. -We initially used an early hash, Secure Hash Algorithm-1 (SHA-1). http://csrc.nist.gov/publications/nistbul/b-May-2008.pdf
  • 23. Preliminary SHA-1 Single Institution Validation 5-Variable Hash Concatenate WilliamGalanter22732M123456789 William Galanter 3/31/1962 M SSN WilliamGalanter22732M123456789 SHA1 20802322ED366A1EFD562A6219C4D7AF993BADAD 4-Variable Hash William Galanter 3/31/1962 M Concatenate & SHA112345678901234567890123456789012345
  • 24. Updated Hash Method • SHA-1 was found to have a potential security issue, moved to a second generation Hash, SHA-512* (512 bit) • Significant focus on data pre-processing / normalization • Trimming spaces and non A-Z characters, lower case _Jimmy__ O’Brien Jr. jimmy, obrien • Remove “-” from SSN and remove all invalid combinations • Only allow Birth year >1921 • Use “F” and “M” for sex • Replace missing elements with missing data indicators *http://csrc.nist.gov/groups/ST/toolkit/secure_hashing.html
  • 25. Updated Hash Method (cont.) • Creates 5 hash IDs (with probability weights) depending on availability of last name, first name, date of birth (DOB), gender, SSN. – All data available (1.0) – All fields except; no DOB, or no First and last name, or no SSN (0.3) – All fields, but only first three letters of names available (0.1) – SOUNDEX codes (phonetic equivalents) of the first and last name plus date of birth and gender (0.1) • Wrapped up into a standalone Java program • Can readily consume other data sources (e.g. Social Security Death Index Tables)
  • 26. Diabetes (250.xx) Institution A Institution C/ Hash ID-1 Honest Broker John john Hash ID-2 O’Dwyer Pre- odwyer Hash Hash ID-3 6/12/1970 06121970 Process 987654329 Fxn Hash ID-4 987-65-4329 Hash ID-5 M m Replace Matched StudyID HashIDs 250.xx with 401.xx Unique John john StudyID O dwyer Hash ID-1 Pre- odwyer Hash Hash ID-2 6/12/70 06121970 male Process Fxn Hash ID-3 m Hash ID-4 Hash ID-5 HTN Institution B (401.xx)
  • 27. Data Dictionary • Standardized data specifications for data extractions from participating sites – Demographics – Vital signs – Diagnoses • Study ID | Month/Year | Encounter type | Encounter number | Diagnosis code – Medications – Laboratory tests • Study ID | Month/Year | Lab test name | Result | Units | Normal Range | Specimen type
  • 28. Process Privacy and Re-Identification Considerations Courtesy of Brad Malin Vanderbilt University
  • 29. De-Identified Health Information De-identified health information neither identifies nor provides a reasonable basis to identify an individual. There are two ways to de-identify information; either: (1) a formal determination by a qualified statistician; (2) the removal of specified identifiers of the individual and of the individual’s relatives, household members, and employers is required, and is adequate only if the covered entity has no actual knowledge that the remaining information could be used to identify the individual. 29
  • 30. HIPAA Expert Determination (abridged) Certify via “generally accepted statistical and scientific principles & methods, that the risk is very small that the information could be used, alone or in combination with other reasonably available information, by the anticipated recipient to identify the subject of the information.” 30
  • 31.
  • 32. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13
  • 33. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13 Chicago Health Atlas 0.3% 8,050
  • 34. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13 Chicago Health Atlas 0.3% 8,050
  • 35. Completing the Re-identification Requires Resources Safe Harbored • Could link to registries Records – Birth – Marriage – Death – Divorce Identified Identified Identified Clinical Records Population Records Resource • What’s in vogue? Voter registration DBs Chicago Health Atlas Model Benitez & Malin. JAMIA. 2010.
  • 36. Risk will Vary Across Regions Voter Registration Databases IL MN TN WA WI WHO Registered Political MN Voters Anyone Anyone Anyone Committees (ANYONE – In Person) Format Disk Disk Disk Disk Disk Cost $500 $46; “use ONLY for $2500 $30 $12,500 elections, political activities, or law enforcement” Name      Address      Date of Birth     Sex    Race  Benitez & Malin. JAMIA. 2010.
  • 37. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13 Chicago Health Atlas 0.3% 8,050
  • 38. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13 Chicago Health Atlas 0.3% 8,050 Linked to Voter Registration Safe Harbor Really small 0 Chicago Health Atlas 0.004% 80 Linked to Voter Reg
  • 39. Uniqueness Analysis Model Uniques (%) Uniques (People) Safe Harbor 0.000064% 13 Chicago Health Atlas 0.3% 8,050 Linked to Voter Registration Safe Harbor Really small 0 Chicago Health Atlas 0.004% 80 Linked to Voter Reg
  • 40. Next Steps • Consider re-identification risk options – Coarsen ZIP codes – Coarsen Ethnicities – Coarsen Age groups • Search* for tradeoffs between information utility (e.g., epidemiologic findings) and privacy (i.e., re-identification risk) *Benitez & Malin. JAMIA. 2011.
  • 41.
  • 42. Findings A promising source of prevalence estimates
  • 43. Data contribution summary, April 2013 Data Type Institution 1 2 3 4 5 6 Demographics C C C C C PC Diagnoses C C C C C PC Visit type C C C C C PC BMI, BP C PP N N N PC Glucose, HbA1c C C C N N PC Medications C C C N N PC C: complete; N: not yet incorporated; PP: partial time period; PC: partial cohort
  • 44. How many patients receive care at more than one institution? No. of institutions Number % Cumulative % 4 or 5 393 0.0 0.0 3 8,409 0.9 0.9 2 74,372 7.6 8.5 1 892,468 91.4 100.0 Includes the 5 institutions with all patient visits 2006-2010 submitted (as of April 2013).
  • 45. Sample size/cohort comparison, by residential ZIP code, BRFSS* vs. Chicago Health Atlas Source Min Median Mean Max IL BRFSS, Chicago 2011 respondents 4 15 16 33 Chicago Health Atlas, patient with 1,339 10,031 9,270 21,289 2010 visit *CDC Behavioral Risk Factor Surveillance System survey, Chicago sub-sample from Illinois dataset.
  • 46. Diabetes prevalence estimate by residential ZIP Percent= # of patients with > 1 diabetes mellitus diagnosis code # of patients with visit in 2006-2010
  • 47. No, patient does not have type 2 diabetes Finding type 2 diabetes in the health record • Diagnosis codes • Labs • Medications • Number of visits Yes, patient has type 2 diabetes
  • 48. Diabetes prevalence estimate by residential ZIP Percent= # of patients with > 1 diabetes mellitus diagnosis code or lab criteria met # of patients with visit in 2006-2010
  • 49. Percent of Atlas patients with diabetes diagnosis in 2006-2010 Percent Minimum number of visits recorded Illinois BRFSS estimates the prevalence of diabetes in Chicago at 9-11%.
  • 50. Hypertension prevalence estimate by residential ZIP Percent= # of patients with > 1 hypertension diagnosis code # of patients with visit in 2006-2010
  • 51. Coronary heart disease prevalence estimate by residential ZIP Percent= # of patients with > 1 CHD diagnosis code # of patients with visit in 2006-2010
  • 52. Gun shot wound prevalence estimate by residential ZIP Percent= # of patients with > 1 gun shot wound diagnosis code # of patients with visit in 2006-2010
  • 53. Problem Applying estimates to Chicago – rather than patient – populations
  • 54. Age distribution comparison, 2010 Percent Age groups
  • 55. Race-ethnicity comparison Percent of total Group Atlas 2010 Census Non-Hispanic black 31 32 Non-Hispanic white 20 32 Hispanic 14 29 Non-Hispanic Asian 4 5 Not given/Unknown 31 0
  • 56. Geographic coverage by residential ZIP Percent= # of patients with visit in 2010 2010 Census population Additional text
  • 57. Problem ZIP Codes aren’t meaningful geographic units
  • 58.
  • 59. Imputation of ZIP code rates to community area Diabetes hospitalization, 2010 Imputed using age, sex, Rates by ZIP Imputed using age & sex & race-ethnicity Additional text
  • 60. Imputation of ZIP code rates to community area Diabetes hospitalization, 2010 Imputed using age, sex, Rates by ZIP Imputed using age & sex & race-ethnicity Additional text
  • 61. Maps courtesy of Chieko Maene, University of Chicago, as part of CDPH-UC Diabetes Translational Research Collaboration.
  • 62.
  • 63. Dasymetric areal interpolation 1. Calculate for each ZIP code Male & female x 19 age groups = 28 rates or Male & female x 19 age groups x 4 race-ethnicity groups = 84 rates 2. Apply rates to corresponding population group in each census block to get counts 3. Sum counts to Community area 4. Calculate rates based on community area population denominators
  • 64.
  • 65.
  • 66. Dataset description elements • Description (who, what, where, when) • Definitions • Calculations and formulas • Limitations, disclaimers, sources of error • Benchmarks and references
  • 67. Chicago Health Atlas Funders • Otho S.A. Sprague Institute • Northwestern Memorial Hospital Community Engagement
  • 68. Health Atlas Team • Northwestern University: John Cashy, Anna Roberts, Sara Lake • Univ. of Illinois-Chicago: Bill Galanter, John Lazaro • Cook County Hospital System: Bala Hota, Amanda Grasso • Univ. of Chicago Medical Center: Chris Lyttle, Ben Vekhter, David Meltzer • Alliance of Chicago: Erin Kaleba, Fred Rachman, Jermaine Dellahousaye • Rush University Medical Center: Shannon Sims, Aaron Tabor • Vanderbilt University: Brad Malin • UIC Intern team: Ariadna Garcia, Pravin Babu Karuppaiah, Shazia Sathar, Ulas Keles (Sid Battacharya, Faculty mentor)
  • 69. facebook.com/ChicagoPublicHealth @ChiPublicHealth HealthyChicago@CityofChicago.org 312.747.9884 CityofChicago.org/Health