SlideShare a Scribd company logo
1 of 17
Demand-Driven Open Data
More info:
Contact
http://ddod.us
David.Portnoy@HHS.gov | tw:@DPortnoy
DDOD for FOIA Organizations
(Freedom of Information Act)
Originally presented in March 2015 to FOIA directors meeting for all agencies under HHS
Concepts explored here do not necessarily
represent the views of HHS.
9/2015
Abstract
Industry and researchers can get valuable data through DDOD (Demand-Driven
Open Data) and FOIA (Freedom of Information Act). Both paths have their
advantages and limitations. A subset of FOIA requests can also be worked as
DDOD "use cases". Such requests involve structured, machine-readable
datasets that continue to be generated and are not for restricted data. This paper
explains how combining the two methods by working this subset of requests in
parallel significantly improves the effectiveness both DDOD and FOIA.
This paper was originally presented at the FOIA directors meeting for all HHS
(U.S. Dept of Health and Human Services) agencies in March 2015.
Demand-Driven Open Data (DDOD) is a
framework of tools and methods that…
Provide external data users✽ with a
systematic, ongoing and transparent way to
tell HHS what data they need
...To be managed, measured and executed in
terms of use cases, enabling allocation of limited
resources based on value
What is DDOD?
✽ Including industry, researchers, nonprofits, media and other government organizations
More effective open data initiatives
More engaged user community
More economic value & discoveries delivered
Add
What does DDOD deliver to the user community?
Implementation of a use case could fall into one of 3 categories
Time to execute
Cost/Effort
Improve
Catalog
Facilitate deployment of
● New datasets
● New APIs
For existing datasets
● Add needed fields
● Improve data quality
● Add / improve metadata
● Add / improve API
If datasets already exist in legacy systems, make
them more available and discoverable
● Publicize availability
● Index to HealthData.gov and Data.gov
● Instructions for efficient FOIA handling
Current
State
Processes for administration of use cases, such as
• Encouraging responsiveness, transparency and documentation
• Ensuring use cases and resulting datasets are indexed in HealthData.gov
Specialized tools for administering use cases
• Workflow engine, communications method, knowledge base
• Data processing, storage, hosting, versioning
Proactive outreach to industry and academia for a thriving
community
What does DDOD provide to data owners?
1
2
3
“The Freedom of Information Act (FOIA) is a law that
gives you the right to access information from the
federal government. It is often described as the law that
keeps citizens in the know about their government.”
What is FOIA?
Source: FOIA.gov
U.S. Department of Health & Human Services
What does FOIA look like within HHS?
Source: FOIA.gov
Agency Received Processed Pending
ACF 1,864 1,255 763
CMS 26,361 25,027 4,717
OIG 843 815 34
CDC 1,141 1,028 691
FDA 10,224 10,191 2,896
HRSA 355 352 92
IHS 152 160 20
NIH 1,169 1,156 84
SAMHSA 193 238 51
OS 1,531 1,605 314
OASH 437 455 120
ACL 13 11 2
Total 44,283 42,293 9,784
Some FOIA requests look quite similar to DDOD use cases
FOIA requests that have all of the following attributes could be potential DDOD use cases
1. The data is still being generated or refreshed on a regular basis.
(Not a single pull of historical information.)
2. The data can be delivered in a machine readable, structured format, such as CSV,
JSON, XML or Excel files.
(Not freeform text-only, PDF, or scanned images.)
3. The data has widespread usefulness for multiple organizations.
(Excluding complaints and investigations, which are often specific to an individual
or organization, as is often the case in lawsuits.)
Although only a fraction of the 45,000 FOIA requests
received so far by HHS have attributes that make them
applicable to DDOD, there are still plenty that could become
high value use cases.
But both FOIA and DDOD have their own challenges
DDOD’s challenges:
● DDOD is a relatively new program that doesn’t enjoy the awareness, recognition and credibility
that FOIA has developed over the years
● Proactive outreach to the user community is a big effort and expense for DDOD
● The toolset for managing use cases is early in its lifecycle and is still being developed
● While the DDOD workflow is documented and actively in use, it’s still being fine-tuned
FOIA’s challenges:
● Growing backlog, due to high demand, complexity of some requests and limited resources
● Process for spotting similar requests and posting the response to an electronic “reading room” is
subjective and time consuming
● Difficult to maintain consistency in both format and actual data delivered. Requests may be
subject to interpretation. Each request may be fulfilled by a different employee or contractor
● There’s no mechanism for automatically refreshing results for data that continues to be generated
Joining forces on applicable requests is mutually beneficial,
because it can help address many of these challenges.
DDOD can benefit FOIA by...
1. Reducing number of future requests
More data indexed and made publically available via HealthData.gov (See diagram)
Data that’s automatically being refreshed doesn’t need to be requested multiple times
Rework stemming from inconsistent format and data values between consecutive extracts on
refreshed data
Better documentation of data provenance and usage via use case methodology
Data User
runs search on
HealthData.gov
Data User creates
/ updates use case
DDOD Admin
engages Data Owner
on use case
• keywords, subject
• data dictionary
HealthData.gov
Data
Dictionary
Dataset
Inventory
EDI*
Use
Case
DDOD Admin ensures changes to
EDI get propagated to HD.gov
* Enterprise Data Inventory (EDI), which is a catalog
of HHS “Strategically Relevant Data Assets”
Data
Dictionary
Dataset
Inventory
DDOD Admin enters
use case on HD.gov with
links to specifications
Data Owner adds entry
to EDI, including
metadata
DDOD Admin
curates entry &
ensures SLAs
DDOD Admin
creates repository
for use case
Process of adding a new DDOD use case
DDOD can benefit FOIA by… (cont)
2. Fulfilling the electronic “reading room” requirement for requests made multiple times by publishing the
data using HealthData.gov infrastructure. Results end up benefiting more users.
3. Instituting best practices for open data projects in terms of machine readability, standards,
documentation and access
4. Helping address the backlog, specifically for the subset of requests applicable to DDOD. These are
also the requests that are beneficial to a wider audience.
Much of the current backlog is unrelated to machine readable datasets that serve the broader user
community. FOIA is straining under the weight of one-time requests that benefit only the few. For
example, a single lawsuit-related request to CDC required processing of 90,000 documents.
FOIA can benefit DDOD by...
1. Since FOIA is legislated and well-known, it can bring in more use cases into DDOD.
By identifying specific requests that are applicable to DDOD
By promoting DDOD on FOIA websites and materials
2. DDOD can leverage work being done anyway to fulfill FOIA requests
Data owner already needs to be involved and resources -- whether employees or contractors
-- need to be allocated to do the extract
Once a data extract is performed, it might be trivial to configure periodic runs of the same
code and perform automated pushes to HealthData.gov
3. Association with FOIA lends additional awareness, recognition, credibility to DDOD
Requester
receives
documents
Requester
can sue
Data owner
produces
documents
Submit for
fulfillment to
data owner
Requester
can appeal
We start with a simplified overview of the FOIA workflow to determine how DDOD requests
could be initiated and executed in parallel
Agree to
fulfill
Requests that are partially or incorrectly fulfilled
can also go through the same appeals process
Agency decides
whether and what
to release
Agency obtains
clarification
(if needed)
Request routed
to ultimate
agency
Respond within 20
business days
(Can extend for unusual
circumstances: Research,
volume, consultation)
Route within 10
business days
Decline
request
Agree to
fulfill
Decline
request
Requester
submits request
to FOIA
In DDOD workflow, use case execution relies on three types of participants: Data User,
DDOD Admin, and Data Owner. The Data User initiates the request
All implementation decisions ultimately are made by the Data Owner
The DDOD Admin only facilitates the process when needed
Although it’s a legislation-driven process, FOIA organizations may voluntarily notify DDOD
on incoming requests. For such requests, the DDOD workflow works in parallel to the FOIA
workflow.Data
User
Data
Owner
DDOD
Admin
DocumentImplementEngage Discuss
DocumentEngageClarify
DocumentImplementDiscussClarify
FOIA
Org
Workflow for FOIA - DDOD partnership
Typical
FOIA request

More Related Content

What's hot

Hi3 Solutions: Accelerating HIE standards conformance
Hi3 Solutions: Accelerating HIE standards conformanceHi3 Solutions: Accelerating HIE standards conformance
Hi3 Solutions: Accelerating HIE standards conformanceAbdul-Malik Shakir
 
Blockchain Applications in Healthcare
Blockchain Applications in HealthcareBlockchain Applications in Healthcare
Blockchain Applications in HealthcareCitiusTech
 
Informatics Standards And Interoperability20090325
Informatics Standards And Interoperability20090325Informatics Standards And Interoperability20090325
Informatics Standards And Interoperability20090325Abdul-Malik Shakir
 
Hcd wp-2012-better dataleadstobetteranalytics
Hcd wp-2012-better dataleadstobetteranalyticsHcd wp-2012-better dataleadstobetteranalytics
Hcd wp-2012-better dataleadstobetteranalyticsHealth Care DataWorks
 
BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...Denodo
 
Direct20: Modular Specifications - Provider Directories
Direct20: Modular Specifications - Provider DirectoriesDirect20: Modular Specifications - Provider Directories
Direct20: Modular Specifications - Provider DirectoriesBrian Ahier
 
Interoperability in health care information systems
Interoperability in health care information systemsInteroperability in health care information systems
Interoperability in health care information systemsAlexander Ask
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analyticsd-Wise Technologies
 
Recommender System in light of Big Data
Recommender System in light of Big DataRecommender System in light of Big Data
Recommender System in light of Big DataKhadija Atiya
 
Hybrid Architecture with Ike & Data Libraries
Hybrid Architecture with Ike  & Data LibrariesHybrid Architecture with Ike  & Data Libraries
Hybrid Architecture with Ike & Data LibrariesStephen Allan Weitzman
 
Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeStefan Kühn
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Philip Bourne
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHADenodo
 
Access Lab 2020: Context aware unified institutional knowledge services
Access Lab 2020: Context aware unified institutional knowledge servicesAccess Lab 2020: Context aware unified institutional knowledge services
Access Lab 2020: Context aware unified institutional knowledge servicesOpenAthens
 
Revenue opportunities in the management of healthcare data deluge
Revenue opportunities in the management of healthcare data delugeRevenue opportunities in the management of healthcare data deluge
Revenue opportunities in the management of healthcare data delugeShahid Shah
 
Unstructured Data Fact Sheet
Unstructured Data Fact SheetUnstructured Data Fact Sheet
Unstructured Data Fact SheetConnexica
 
8 Electronic Health Record (EHR) Downstream Challenges
8 Electronic Health Record (EHR) Downstream Challenges8 Electronic Health Record (EHR) Downstream Challenges
8 Electronic Health Record (EHR) Downstream ChallengesCitiusTech
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...David Peyruc
 

What's hot (20)

Hi3 Solutions: Accelerating HIE standards conformance
Hi3 Solutions: Accelerating HIE standards conformanceHi3 Solutions: Accelerating HIE standards conformance
Hi3 Solutions: Accelerating HIE standards conformance
 
Blockchain Applications in Healthcare
Blockchain Applications in HealthcareBlockchain Applications in Healthcare
Blockchain Applications in Healthcare
 
Trial io pcori doc v1
Trial io pcori doc v1Trial io pcori doc v1
Trial io pcori doc v1
 
Informatics Standards And Interoperability20090325
Informatics Standards And Interoperability20090325Informatics Standards And Interoperability20090325
Informatics Standards And Interoperability20090325
 
Hcd wp-2012-better dataleadstobetteranalytics
Hcd wp-2012-better dataleadstobetteranalyticsHcd wp-2012-better dataleadstobetteranalytics
Hcd wp-2012-better dataleadstobetteranalytics
 
BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...BioStorage Technologies Case Study: How to build an informatics platform usin...
BioStorage Technologies Case Study: How to build an informatics platform usin...
 
HIE technical infrastructure
HIE technical infrastructureHIE technical infrastructure
HIE technical infrastructure
 
Direct20: Modular Specifications - Provider Directories
Direct20: Modular Specifications - Provider DirectoriesDirect20: Modular Specifications - Provider Directories
Direct20: Modular Specifications - Provider Directories
 
Interoperability in health care information systems
Interoperability in health care information systemsInteroperability in health care information systems
Interoperability in health care information systems
 
JR's Lifetime Advanced Analytics
JR's Lifetime Advanced AnalyticsJR's Lifetime Advanced Analytics
JR's Lifetime Advanced Analytics
 
Recommender System in light of Big Data
Recommender System in light of Big DataRecommender System in light of Big Data
Recommender System in light of Big Data
 
Hybrid Architecture with Ike & Data Libraries
Hybrid Architecture with Ike  & Data LibrariesHybrid Architecture with Ike  & Data Libraries
Hybrid Architecture with Ike & Data Libraries
 
Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data Challenge
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHA
 
Access Lab 2020: Context aware unified institutional knowledge services
Access Lab 2020: Context aware unified institutional knowledge servicesAccess Lab 2020: Context aware unified institutional knowledge services
Access Lab 2020: Context aware unified institutional knowledge services
 
Revenue opportunities in the management of healthcare data deluge
Revenue opportunities in the management of healthcare data delugeRevenue opportunities in the management of healthcare data deluge
Revenue opportunities in the management of healthcare data deluge
 
Unstructured Data Fact Sheet
Unstructured Data Fact SheetUnstructured Data Fact Sheet
Unstructured Data Fact Sheet
 
8 Electronic Health Record (EHR) Downstream Challenges
8 Electronic Health Record (EHR) Downstream Challenges8 Electronic Health Record (EHR) Downstream Challenges
8 Electronic Health Record (EHR) Downstream Challenges
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 

Similar to DDOD for FOIA organizations

Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Carolyn Ten Holter
 
North American funders' DMP requirements
North American funders' DMP requirementsNorth American funders' DMP requirements
North American funders' DMP requirementsSarah Jones
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data ArchivingRainStor
 
Age Friendly Economy - Improving your business with external data
Age Friendly Economy - Improving your business with external dataAge Friendly Economy - Improving your business with external data
Age Friendly Economy - Improving your business with external dataAgeFriendlyEconomy
 
The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analyticsMarc Vael
 
Michael Josephs
Michael JosephsMichael Josephs
Michael JosephsdaveGBE
 
Veritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicVeritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicIdeba
 
Veritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicVeritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicVeritas Technologies LLC
 
The Rise of Data Ethics and Security - AIDI Webinar
The Rise of Data Ethics and Security - AIDI WebinarThe Rise of Data Ethics and Security - AIDI Webinar
The Rise of Data Ethics and Security - AIDI WebinarEryk Budi Pratama
 
Foia data sheet (002)
Foia data sheet (002)Foia data sheet (002)
Foia data sheet (002)Brent Pyle
 
LexisNexis Government Transparency Solutions
LexisNexis Government Transparency SolutionsLexisNexis Government Transparency Solutions
LexisNexis Government Transparency SolutionsMichael Gandy
 
Healthcare data challenges
Healthcare data challengesHealthcare data challenges
Healthcare data challengesAngela Boyd
 
Healthcare Data Challenges
Healthcare Data ChallengesHealthcare Data Challenges
Healthcare Data ChallengesAngela Boyd
 
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...DATAVERSITY
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big dataMark Albala
 

Similar to DDOD for FOIA organizations (20)

Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...
 
North American funders' DMP requirements
North American funders' DMP requirementsNorth American funders' DMP requirements
North American funders' DMP requirements
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
 
Data context new developments for research the social sciences
 Data context new developments for research the social sciences Data context new developments for research the social sciences
Data context new developments for research the social sciences
 
Age Friendly Economy - Improving your business with external data
Age Friendly Economy - Improving your business with external dataAge Friendly Economy - Improving your business with external data
Age Friendly Economy - Improving your business with external data
 
The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analytics
 
Michael Josephs
Michael JosephsMichael Josephs
Michael Josephs
 
Veritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicVeritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault Infographic
 
Veritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault InfographicVeritas Managed Enterprise Vault Infographic
Veritas Managed Enterprise Vault Infographic
 
The Rise of Data Ethics and Security - AIDI Webinar
The Rise of Data Ethics and Security - AIDI WebinarThe Rise of Data Ethics and Security - AIDI Webinar
The Rise of Data Ethics and Security - AIDI Webinar
 
Foia data sheet (002)
Foia data sheet (002)Foia data sheet (002)
Foia data sheet (002)
 
LexisNexis Government Transparency Solutions
LexisNexis Government Transparency SolutionsLexisNexis Government Transparency Solutions
LexisNexis Government Transparency Solutions
 
big-data.pdf
big-data.pdfbig-data.pdf
big-data.pdf
 
Healthcare data challenges
Healthcare data challengesHealthcare data challenges
Healthcare data challenges
 
Healthcare Data Challenges
Healthcare Data ChallengesHealthcare Data Challenges
Healthcare Data Challenges
 
Life Science Analytics
Life Science AnalyticsLife Science Analytics
Life Science Analytics
 
Big data
Big dataBig data
Big data
 
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
Subscribing to Your Critical Data Supply Chain - Getting Value from True Data...
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
 
Wrox Big Data Analyst
Wrox Big Data AnalystWrox Big Data Analyst
Wrox Big Data Analyst
 

More from David Portnoy

DDOD framework infographic
DDOD framework infographicDDOD framework infographic
DDOD framework infographicDavid Portnoy
 
Open Data Discoverability
Open Data DiscoverabilityOpen Data Discoverability
Open Data DiscoverabilityDavid Portnoy
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeDavid Portnoy
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsDavid Portnoy
 
Agile Business Intelligence
Agile Business IntelligenceAgile Business Intelligence
Agile Business IntelligenceDavid Portnoy
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsDavid Portnoy
 

More from David Portnoy (6)

DDOD framework infographic
DDOD framework infographicDDOD framework infographic
DDOD framework infographic
 
Open Data Discoverability
Open Data DiscoverabilityOpen Data Discoverability
Open Data Discoverability
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human Genome
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop Implementations
 
Agile Business Intelligence
Agile Business IntelligenceAgile Business Intelligence
Agile Business Intelligence
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse Platforms
 

Recently uploaded

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 

Recently uploaded (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 

DDOD for FOIA organizations

  • 1. Demand-Driven Open Data More info: Contact http://ddod.us David.Portnoy@HHS.gov | tw:@DPortnoy DDOD for FOIA Organizations (Freedom of Information Act) Originally presented in March 2015 to FOIA directors meeting for all agencies under HHS Concepts explored here do not necessarily represent the views of HHS. 9/2015
  • 2. Abstract Industry and researchers can get valuable data through DDOD (Demand-Driven Open Data) and FOIA (Freedom of Information Act). Both paths have their advantages and limitations. A subset of FOIA requests can also be worked as DDOD "use cases". Such requests involve structured, machine-readable datasets that continue to be generated and are not for restricted data. This paper explains how combining the two methods by working this subset of requests in parallel significantly improves the effectiveness both DDOD and FOIA. This paper was originally presented at the FOIA directors meeting for all HHS (U.S. Dept of Health and Human Services) agencies in March 2015.
  • 3. Demand-Driven Open Data (DDOD) is a framework of tools and methods that… Provide external data users✽ with a systematic, ongoing and transparent way to tell HHS what data they need ...To be managed, measured and executed in terms of use cases, enabling allocation of limited resources based on value What is DDOD? ✽ Including industry, researchers, nonprofits, media and other government organizations More effective open data initiatives More engaged user community More economic value & discoveries delivered
  • 4. Add What does DDOD deliver to the user community? Implementation of a use case could fall into one of 3 categories Time to execute Cost/Effort Improve Catalog Facilitate deployment of ● New datasets ● New APIs For existing datasets ● Add needed fields ● Improve data quality ● Add / improve metadata ● Add / improve API If datasets already exist in legacy systems, make them more available and discoverable ● Publicize availability ● Index to HealthData.gov and Data.gov ● Instructions for efficient FOIA handling Current State
  • 5. Processes for administration of use cases, such as • Encouraging responsiveness, transparency and documentation • Ensuring use cases and resulting datasets are indexed in HealthData.gov Specialized tools for administering use cases • Workflow engine, communications method, knowledge base • Data processing, storage, hosting, versioning Proactive outreach to industry and academia for a thriving community What does DDOD provide to data owners? 1 2 3
  • 6. “The Freedom of Information Act (FOIA) is a law that gives you the right to access information from the federal government. It is often described as the law that keeps citizens in the know about their government.” What is FOIA? Source: FOIA.gov
  • 7. U.S. Department of Health & Human Services What does FOIA look like within HHS? Source: FOIA.gov Agency Received Processed Pending ACF 1,864 1,255 763 CMS 26,361 25,027 4,717 OIG 843 815 34 CDC 1,141 1,028 691 FDA 10,224 10,191 2,896 HRSA 355 352 92 IHS 152 160 20 NIH 1,169 1,156 84 SAMHSA 193 238 51 OS 1,531 1,605 314 OASH 437 455 120 ACL 13 11 2 Total 44,283 42,293 9,784
  • 8. Some FOIA requests look quite similar to DDOD use cases FOIA requests that have all of the following attributes could be potential DDOD use cases 1. The data is still being generated or refreshed on a regular basis. (Not a single pull of historical information.) 2. The data can be delivered in a machine readable, structured format, such as CSV, JSON, XML or Excel files. (Not freeform text-only, PDF, or scanned images.) 3. The data has widespread usefulness for multiple organizations. (Excluding complaints and investigations, which are often specific to an individual or organization, as is often the case in lawsuits.)
  • 9. Although only a fraction of the 45,000 FOIA requests received so far by HHS have attributes that make them applicable to DDOD, there are still plenty that could become high value use cases.
  • 10. But both FOIA and DDOD have their own challenges DDOD’s challenges: ● DDOD is a relatively new program that doesn’t enjoy the awareness, recognition and credibility that FOIA has developed over the years ● Proactive outreach to the user community is a big effort and expense for DDOD ● The toolset for managing use cases is early in its lifecycle and is still being developed ● While the DDOD workflow is documented and actively in use, it’s still being fine-tuned FOIA’s challenges: ● Growing backlog, due to high demand, complexity of some requests and limited resources ● Process for spotting similar requests and posting the response to an electronic “reading room” is subjective and time consuming ● Difficult to maintain consistency in both format and actual data delivered. Requests may be subject to interpretation. Each request may be fulfilled by a different employee or contractor ● There’s no mechanism for automatically refreshing results for data that continues to be generated
  • 11. Joining forces on applicable requests is mutually beneficial, because it can help address many of these challenges.
  • 12. DDOD can benefit FOIA by... 1. Reducing number of future requests More data indexed and made publically available via HealthData.gov (See diagram) Data that’s automatically being refreshed doesn’t need to be requested multiple times Rework stemming from inconsistent format and data values between consecutive extracts on refreshed data Better documentation of data provenance and usage via use case methodology Data User runs search on HealthData.gov Data User creates / updates use case DDOD Admin engages Data Owner on use case • keywords, subject • data dictionary HealthData.gov Data Dictionary Dataset Inventory EDI* Use Case DDOD Admin ensures changes to EDI get propagated to HD.gov * Enterprise Data Inventory (EDI), which is a catalog of HHS “Strategically Relevant Data Assets” Data Dictionary Dataset Inventory DDOD Admin enters use case on HD.gov with links to specifications Data Owner adds entry to EDI, including metadata DDOD Admin curates entry & ensures SLAs DDOD Admin creates repository for use case Process of adding a new DDOD use case
  • 13. DDOD can benefit FOIA by… (cont) 2. Fulfilling the electronic “reading room” requirement for requests made multiple times by publishing the data using HealthData.gov infrastructure. Results end up benefiting more users. 3. Instituting best practices for open data projects in terms of machine readability, standards, documentation and access 4. Helping address the backlog, specifically for the subset of requests applicable to DDOD. These are also the requests that are beneficial to a wider audience. Much of the current backlog is unrelated to machine readable datasets that serve the broader user community. FOIA is straining under the weight of one-time requests that benefit only the few. For example, a single lawsuit-related request to CDC required processing of 90,000 documents.
  • 14. FOIA can benefit DDOD by... 1. Since FOIA is legislated and well-known, it can bring in more use cases into DDOD. By identifying specific requests that are applicable to DDOD By promoting DDOD on FOIA websites and materials 2. DDOD can leverage work being done anyway to fulfill FOIA requests Data owner already needs to be involved and resources -- whether employees or contractors -- need to be allocated to do the extract Once a data extract is performed, it might be trivial to configure periodic runs of the same code and perform automated pushes to HealthData.gov 3. Association with FOIA lends additional awareness, recognition, credibility to DDOD
  • 15. Requester receives documents Requester can sue Data owner produces documents Submit for fulfillment to data owner Requester can appeal We start with a simplified overview of the FOIA workflow to determine how DDOD requests could be initiated and executed in parallel Agree to fulfill Requests that are partially or incorrectly fulfilled can also go through the same appeals process Agency decides whether and what to release Agency obtains clarification (if needed) Request routed to ultimate agency Respond within 20 business days (Can extend for unusual circumstances: Research, volume, consultation) Route within 10 business days Decline request Agree to fulfill Decline request Requester submits request to FOIA
  • 16. In DDOD workflow, use case execution relies on three types of participants: Data User, DDOD Admin, and Data Owner. The Data User initiates the request All implementation decisions ultimately are made by the Data Owner The DDOD Admin only facilitates the process when needed
  • 17. Although it’s a legislation-driven process, FOIA organizations may voluntarily notify DDOD on incoming requests. For such requests, the DDOD workflow works in parallel to the FOIA workflow.Data User Data Owner DDOD Admin DocumentImplementEngage Discuss DocumentEngageClarify DocumentImplementDiscussClarify FOIA Org Workflow for FOIA - DDOD partnership Typical FOIA request