SlideShare une entreprise Scribd logo
1  sur  18
Managing sensitive data at the
Australian Data Archive
“Making Data Social” webinar series
29 March 2017
Dr. Steven McEachern
Director, Australian Data Archive
ANU Centre for Social Research and Methods
Australian National University
Overview
• Sensitive data and the 5 Safes model
• Access to sensitive data in Australia
• Applying the 5 Safes model at ADA
• Sensitive data and the data lifecycle
Sensitive data
• “Sensitive data are data that can be
used to identify anindividual, species, object,
process,or locationthat introduces a risk
of discrimination, harm, or unwanted
attention.”
– ANDS Guide on Publishing and Sharing Sensitive
Data, p.7
– http://www.ands.org.au/__data/assets/pdf_file/0
010/489187/Sensitive-data.pdf
The 5 safes
1. Safe people: Can the researchers be trusted to do the
right thing?
2. Safe projects: Is the data to be used for an appropriate
purpose?
3. Safe settings: Is the environment in which the analysis
takes place safe?
4. Safe data: Is the data appropriately protected?
5. Safe output: Is there a low risk of disclosure in research
outputs?
Desai, T., F. Ritchie and R. Welpton (2016) Five Safes: designing data access for
research. Economics Working Paper Series
1601, University of the West of England.
http://www2.uwe.ac.uk/faculties/BBS/Documents/1601.pdf
What do researchers expect?
(or What is wanted? :-)
• “We emphasize that direct access to micro-data
is critical for success. Alternatives such as access
to synthetic data or submission of computer
programs to agency employees will not address
the key problem of restoring US leadership with
cutting-edge policy-relevant research.”
• Card, Chetty, Feldstein and Saez, 2010 (emphasis
in original)
– http://rajchetty.com/chettyfiles/NSFdataaccess.pdf
What is expected?
• “Here's what you need to do if you want an anonymised 1% sample
of the US Census
– Go to Google and type US Census 1% sample, click on link to the
Census.
– Download each of the state files from the FTP site and merge them
yourself. Or just check things out for one of the states. Whatever you
like.
– Start mucking about to test whether your pet theory is plausible.
• Here's what you need to do if you want an anoymised sample of
the NZ Census, or a Confidentialised Unit Record File (CURF) of any
of big Stats series:
– Go to Stats NZ's site, here.
– Follow the instructions below: …”
• (Followed by several pages of instructions, Application Process,
Assessment Criteria, Methods of Access, …)
Eric Crampton, the New Zealand Initiative, Wellington, formerly University of
Canterbury. http://offsettingbehaviour.blogspot.com.au/2015/10/curf-and-
turf.html
Can we bridge depositor and user
expectations?
I think so. Consider Card et al. again:
“We believe that five conditions must be satisfied to make a data
access program sustainable and efficient:
a) fair and open competition for data access based on scientific merit
b) sufficient bandwidth to accommodate a large number of projects
simultaneously
c) inclusion of younger scholars and graduate students in the
research teams that can access the data
d) direct access to de-identified micro data through local statistical
offices or, more preferably, secure remote connections
e) systematic electronic monitoring to allow immediate disclosure
of statistical results and prevent any disclosure of individual
records”
Current models in Australia
ABS:
• Confidentialised Unit
Record Files (CURFs)
• RADL
• ABSDL
• TableBuilder
ADA:
• Confidentialised Unit
Record Files (CURFs)
Shared (often remote
access) infrastructure:
• AURIN
• SURE (PHRN)
• Data linkage facilities
Ad hoc arrangements:
• “Secure rooms”
• Departmental
arrangements
Applying the 5 Safes
People Projects Settings Data Output
CURFs Yes? Yes? Yes? YES YES
TableBuilder No No YES YES YES
RADL Yes? Yes? YES YES YES
ABSDL Yes? Yes? YES YES YES
ABS Remote
Data Lab Yes Yes? YES YES YES
ADA Yes? No No YES No
AURIN No No YES YES Yes?
SURE (PHRN) Yes Yes YES No? ???
Data Linkage
facilities No? YES Yes? YES ???
Secure rooms Yes? Yes? YES No? ???
Australian experience
• Safe data
– Confidentialisation: ADA, ABS, DSS (HILDA, etc.)
– Indirect access to data: TableBuilder, ADA
• Safe settings
– Aggregated data: TableBuilder, AURIN
– Remote: RADL, ABS (Remote) Data Lab
– Secure environments: ABS (On site) Data Lab,
secure rooms
5 safes: lesser emphasis on…
• Safe outputs
– Difficult to scale (e.g. data lab output reviews)
– This is changing – e.g. TableBuilder is automated
– But need to consider replication and reproducibility
• Safe researchers and safe projects
– Considered in most models, but not closely monitored
– May be difficult to monitor? (Similar issues to the
reporting of research outputs in universities)
– Universities could (and do!!) provide imprimatur for
their staff and students
Frameworks for research practice
• There are existing Australian frameworks for
researcher accountability and responsibilities:
– the Australian Code for the Responsible Conduct of
Research (ACRCR), which sets out institutional and
researcher responsibilities for conduct of research
– (Note that this is currently under review)
– Human Research Ethics Committees (HREC)
• Increasingly, professional and journal
requirements for data sharing:
– E.g. PLOS One, AEA, DA-RT (political science)
– https://www.aeaweb.org/aer/data.php
– http://journals.plos.org/plosone/s/data-availability
Relevant content from ACRCR
• S.2: Management of research data and primary materials
– E.g. 2.7 Maintain confidentiality of research data and primary
materials
– Researchers given access to confidential information must
maintain that confidentiality. Primary materials and confidential
research data must be kept in secure storage. Confidential
information must only be used in ways agreed with those who
provided it. Particular care must be exercised when confidential
data are made available for discussion.
• S.4: Publication and dissemination of research findings
– E.g. 4.2.3 Institutions must ensure that the sponsors of research
understand the importance of publication in research and do
not delay publication beyond the time needed to protect
intellectual property and other relevant interests.
• S.9: Breaches of the Code and misconduct in research
ADA model
• Safe data: data is anonymised
(confidentialised) either prior to deposit or by
ADA archivists
• Safe people: virtually all data access is
mediated, and users must be identified and
provide contact and supervisor details
• Safe projects: users provide a project
description
• Safe settings and safe outputs: NOT applied
ABS Remote Data Lab (virtual enclave)
• Safe data: less of a focus – but the lab does not prohibit
use of safe data practices
– Risk: individual researchers can see individual records
– BUT this assumes “unsafe” people (researchers)
• Safe settings: Remote access environment hosted at
ABS
– Challenge: cost of establishing the system
• Safe outputs: outputs limited only to methods
approved through the access environment (i.e. no
printing)
– Risk: photographing the screen, taking notes
– Again, assumes “unsafe” people
– Challenge: managing output checking
• Safe people:
– Institutional support
• Note ACRCR – code of conduct, and HREC
– Training for researchers prior to access
• Intended breaches are uncommon (per background paper)
• Focus therefore on unintended breach
• Highlight also alternate access options to reduce breach due
to limitations of access method
– Assessing people: research background?
• Experience is difficult to evaluate here
• How would you build up a track record?
• Safe projects
– May be necessary for legislative reasons
– Should this matter?
• Basic research might generate just as useful insights
A suite of options
• Different existing models (each a mix of the 5
safes) all have their place
• Safe people can be incorporated into existing
models
– Many current models assume the “intruder”
– International evidence suggests this is not the case
• ADA has 2 “default” options
The principles should enable the right
mix of “safes” for a given data source
Source: http://www.shinyshiny.tv/2009/12/easymix_-_a_mix.html

Contenu connexe

Tendances

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypseENUG
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesAmanda Whitmire
 
Sharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesSharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesLancaster University Library
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Amanda Whitmire
 
Managing Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesManaging Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesMicah Altman
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19openEHR-Japan
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19openEHR-Japan
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementMichael Day
 
June2014 brownbag privacy
June2014 brownbag privacyJune2014 brownbag privacy
June2014 brownbag privacyMicah Altman
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveMicah Altman
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancyHannelore Vanhaverbeke
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsBrittany Lasseigne, Ph.D.
 

Tendances (20)

Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Medical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructuresMedical image analysis, retrieval and evaluation infrastructures
Medical image analysis, retrieval and evaluation infrastructures
 
Challenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL modelChallenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL model
 
Developing data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universitiesDeveloping data services: a tale from two Oregon universities
Developing data services: a tale from two Oregon universities
 
Sharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and OpportunitiesSharing Qualitative Data - Challenges and Opportunities
Sharing Qualitative Data - Challenges and Opportunities
 
openEHR v COVID-19
openEHR v COVID-19openEHR v COVID-19
openEHR v COVID-19
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521
 
Cairo
CairoCairo
Cairo
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Managing Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and ApproachesManaging Confidential Information – Trends and Approaches
Managing Confidential Information – Trends and Approaches
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19Guideline based CDSS for COVID-19
Guideline based CDSS for COVID-19
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
June2014 brownbag privacy
June2014 brownbag privacyJune2014 brownbag privacy
June2014 brownbag privacy
 
Reproducibility from an infomatics perspective
Reproducibility from an infomatics perspectiveReproducibility from an infomatics perspective
Reproducibility from an infomatics perspective
 
10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy10 commandments in rdm funder compliancy
10 commandments in rdm funder compliancy
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
An Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and GenomicsAn Introduction to Machine Learning and Genomics
An Introduction to Machine Learning and Genomics
 

En vedette (19)

Internacional. tema 11, 12, 13.
Internacional. tema 11, 12, 13.Internacional. tema 11, 12, 13.
Internacional. tema 11, 12, 13.
 
01 aspectos basicos
01   aspectos basicos01   aspectos basicos
01 aspectos basicos
 
S4 tarea4 grgoe
S4 tarea4 grgoeS4 tarea4 grgoe
S4 tarea4 grgoe
 
Modelo Educativo Adventista
Modelo Educativo AdventistaModelo Educativo Adventista
Modelo Educativo Adventista
 
Presentación postitulo alfabetizacion digital unse
Presentación postitulo alfabetizacion digital unsePresentación postitulo alfabetizacion digital unse
Presentación postitulo alfabetizacion digital unse
 
Ejercicio 2
Ejercicio 2Ejercicio 2
Ejercicio 2
 
Articulo62 cst
Articulo62 cstArticulo62 cst
Articulo62 cst
 
Anorexia y bulimia
Anorexia y bulimiaAnorexia y bulimia
Anorexia y bulimia
 
Gestión Ágil de Proyectos: Scrum, Kanban y XP
Gestión Ágil de Proyectos: Scrum, Kanban y XPGestión Ágil de Proyectos: Scrum, Kanban y XP
Gestión Ágil de Proyectos: Scrum, Kanban y XP
 
Situa aprend didactica critica
Situa aprend didactica criticaSitua aprend didactica critica
Situa aprend didactica critica
 
S4 tarea4 grgoe
S4 tarea4 grgoeS4 tarea4 grgoe
S4 tarea4 grgoe
 
Tercer tema
Tercer temaTercer tema
Tercer tema
 
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
ORCID for funders webinar -NIH use of ORCID to track outcomes - Richard Iiked...
 
Decreto 094 de 1989
Decreto 094 de 1989Decreto 094 de 1989
Decreto 094 de 1989
 
Taller
TallerTaller
Taller
 
Programa de Área CN 2017
Programa de Área CN 2017Programa de Área CN 2017
Programa de Área CN 2017
 
PIKASEN Clasificadora con lector óptico
PIKASEN Clasificadora con lector ópticoPIKASEN Clasificadora con lector óptico
PIKASEN Clasificadora con lector óptico
 
Impact Hub_Abidjan - Columbia Ci3
Impact Hub_Abidjan - Columbia Ci3Impact Hub_Abidjan - Columbia Ci3
Impact Hub_Abidjan - Columbia Ci3
 
Pronunciamiento apo 25 03 2017
Pronunciamiento apo 25 03 2017Pronunciamiento apo 25 03 2017
Pronunciamiento apo 25 03 2017
 

Similaire à Managing sensitive data at the Australian Data Archive

Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6ARDC
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Datacunera
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesLouise Corti
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto UniversityStephanie Simms
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesIUPUI
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research RequirementsICPSR
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Jamie Bisset
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data ManagementIzzyChad
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...Kristin Briney
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 

Similaire à Managing sensitive data at the Australian Data Archive (20)

Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Accessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five SafesAccessing data for research: data publishing pathways and the Five Safes
Accessing data for research: data publishing pathways and the Five Safes
 
Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Data Management Lab: Session 4 Slides
Data Management Lab: Session 4 SlidesData Management Lab: Session 4 Slides
Data Management Lab: Session 4 Slides
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research Requirements
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction)
 
Researh data management
Researh data managementResearh data management
Researh data management
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
Open Science and Open Data for Librarians
Open Science and Open Data for LibrariansOpen Science and Open Data for Librarians
Open Science and Open Data for Librarians
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
 
The Ethics of Digital Preservation
The Ethics of Digital PreservationThe Ethics of Digital Preservation
The Ethics of Digital Preservation
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
 
Research-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhDResearch-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhD
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 

Plus de ARDC

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADAARDC
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and StandardsARDC
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation ARDC
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)ARDC
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveARDC
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domainARDC
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharingARDC
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studiesARDC
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scopeARDC
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128ARDC
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical dataARDC
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataARDC
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesARDC
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018ARDC
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintARDC
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataARDC
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018ARDC
 

Plus de ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Dernier

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Dernier (20)

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

Managing sensitive data at the Australian Data Archive

  • 1. Managing sensitive data at the Australian Data Archive “Making Data Social” webinar series 29 March 2017 Dr. Steven McEachern Director, Australian Data Archive ANU Centre for Social Research and Methods Australian National University
  • 2. Overview • Sensitive data and the 5 Safes model • Access to sensitive data in Australia • Applying the 5 Safes model at ADA • Sensitive data and the data lifecycle
  • 3. Sensitive data • “Sensitive data are data that can be used to identify anindividual, species, object, process,or locationthat introduces a risk of discrimination, harm, or unwanted attention.” – ANDS Guide on Publishing and Sharing Sensitive Data, p.7 – http://www.ands.org.au/__data/assets/pdf_file/0 010/489187/Sensitive-data.pdf
  • 4. The 5 safes 1. Safe people: Can the researchers be trusted to do the right thing? 2. Safe projects: Is the data to be used for an appropriate purpose? 3. Safe settings: Is the environment in which the analysis takes place safe? 4. Safe data: Is the data appropriately protected? 5. Safe output: Is there a low risk of disclosure in research outputs? Desai, T., F. Ritchie and R. Welpton (2016) Five Safes: designing data access for research. Economics Working Paper Series 1601, University of the West of England. http://www2.uwe.ac.uk/faculties/BBS/Documents/1601.pdf
  • 5. What do researchers expect? (or What is wanted? :-) • “We emphasize that direct access to micro-data is critical for success. Alternatives such as access to synthetic data or submission of computer programs to agency employees will not address the key problem of restoring US leadership with cutting-edge policy-relevant research.” • Card, Chetty, Feldstein and Saez, 2010 (emphasis in original) – http://rajchetty.com/chettyfiles/NSFdataaccess.pdf
  • 6. What is expected? • “Here's what you need to do if you want an anonymised 1% sample of the US Census – Go to Google and type US Census 1% sample, click on link to the Census. – Download each of the state files from the FTP site and merge them yourself. Or just check things out for one of the states. Whatever you like. – Start mucking about to test whether your pet theory is plausible. • Here's what you need to do if you want an anoymised sample of the NZ Census, or a Confidentialised Unit Record File (CURF) of any of big Stats series: – Go to Stats NZ's site, here. – Follow the instructions below: …” • (Followed by several pages of instructions, Application Process, Assessment Criteria, Methods of Access, …) Eric Crampton, the New Zealand Initiative, Wellington, formerly University of Canterbury. http://offsettingbehaviour.blogspot.com.au/2015/10/curf-and- turf.html
  • 7. Can we bridge depositor and user expectations? I think so. Consider Card et al. again: “We believe that five conditions must be satisfied to make a data access program sustainable and efficient: a) fair and open competition for data access based on scientific merit b) sufficient bandwidth to accommodate a large number of projects simultaneously c) inclusion of younger scholars and graduate students in the research teams that can access the data d) direct access to de-identified micro data through local statistical offices or, more preferably, secure remote connections e) systematic electronic monitoring to allow immediate disclosure of statistical results and prevent any disclosure of individual records”
  • 8. Current models in Australia ABS: • Confidentialised Unit Record Files (CURFs) • RADL • ABSDL • TableBuilder ADA: • Confidentialised Unit Record Files (CURFs) Shared (often remote access) infrastructure: • AURIN • SURE (PHRN) • Data linkage facilities Ad hoc arrangements: • “Secure rooms” • Departmental arrangements
  • 9. Applying the 5 Safes People Projects Settings Data Output CURFs Yes? Yes? Yes? YES YES TableBuilder No No YES YES YES RADL Yes? Yes? YES YES YES ABSDL Yes? Yes? YES YES YES ABS Remote Data Lab Yes Yes? YES YES YES ADA Yes? No No YES No AURIN No No YES YES Yes? SURE (PHRN) Yes Yes YES No? ??? Data Linkage facilities No? YES Yes? YES ??? Secure rooms Yes? Yes? YES No? ???
  • 10. Australian experience • Safe data – Confidentialisation: ADA, ABS, DSS (HILDA, etc.) – Indirect access to data: TableBuilder, ADA • Safe settings – Aggregated data: TableBuilder, AURIN – Remote: RADL, ABS (Remote) Data Lab – Secure environments: ABS (On site) Data Lab, secure rooms
  • 11. 5 safes: lesser emphasis on… • Safe outputs – Difficult to scale (e.g. data lab output reviews) – This is changing – e.g. TableBuilder is automated – But need to consider replication and reproducibility • Safe researchers and safe projects – Considered in most models, but not closely monitored – May be difficult to monitor? (Similar issues to the reporting of research outputs in universities) – Universities could (and do!!) provide imprimatur for their staff and students
  • 12. Frameworks for research practice • There are existing Australian frameworks for researcher accountability and responsibilities: – the Australian Code for the Responsible Conduct of Research (ACRCR), which sets out institutional and researcher responsibilities for conduct of research – (Note that this is currently under review) – Human Research Ethics Committees (HREC) • Increasingly, professional and journal requirements for data sharing: – E.g. PLOS One, AEA, DA-RT (political science) – https://www.aeaweb.org/aer/data.php – http://journals.plos.org/plosone/s/data-availability
  • 13. Relevant content from ACRCR • S.2: Management of research data and primary materials – E.g. 2.7 Maintain confidentiality of research data and primary materials – Researchers given access to confidential information must maintain that confidentiality. Primary materials and confidential research data must be kept in secure storage. Confidential information must only be used in ways agreed with those who provided it. Particular care must be exercised when confidential data are made available for discussion. • S.4: Publication and dissemination of research findings – E.g. 4.2.3 Institutions must ensure that the sponsors of research understand the importance of publication in research and do not delay publication beyond the time needed to protect intellectual property and other relevant interests. • S.9: Breaches of the Code and misconduct in research
  • 14. ADA model • Safe data: data is anonymised (confidentialised) either prior to deposit or by ADA archivists • Safe people: virtually all data access is mediated, and users must be identified and provide contact and supervisor details • Safe projects: users provide a project description • Safe settings and safe outputs: NOT applied
  • 15. ABS Remote Data Lab (virtual enclave) • Safe data: less of a focus – but the lab does not prohibit use of safe data practices – Risk: individual researchers can see individual records – BUT this assumes “unsafe” people (researchers) • Safe settings: Remote access environment hosted at ABS – Challenge: cost of establishing the system • Safe outputs: outputs limited only to methods approved through the access environment (i.e. no printing) – Risk: photographing the screen, taking notes – Again, assumes “unsafe” people – Challenge: managing output checking
  • 16. • Safe people: – Institutional support • Note ACRCR – code of conduct, and HREC – Training for researchers prior to access • Intended breaches are uncommon (per background paper) • Focus therefore on unintended breach • Highlight also alternate access options to reduce breach due to limitations of access method – Assessing people: research background? • Experience is difficult to evaluate here • How would you build up a track record? • Safe projects – May be necessary for legislative reasons – Should this matter? • Basic research might generate just as useful insights
  • 17. A suite of options • Different existing models (each a mix of the 5 safes) all have their place • Safe people can be incorporated into existing models – Many current models assume the “intruder” – International evidence suggests this is not the case • ADA has 2 “default” options
  • 18. The principles should enable the right mix of “safes” for a given data source Source: http://www.shinyshiny.tv/2009/12/easymix_-_a_mix.html