SlideShare une entreprise Scribd logo
1  sur  20
Télécharger pour lire hors ligne
Presented at
Executive Office of the President
Office of Science and Technology
Policy
US Gov Data Cabinet meeting
June 21, 2016
The Lean Startup
Approach to Open Data
How Demand-Driven Open Data (DDOD)
Improves Relevance, Discoverability and Usability
David Portnoy
Entrepreneur-in-Residence
U.S. Department of Health & Human Services
Twitter: @dportnoy
http://ddod.healthdata.gov
Piloted as “Lean Startup for Open Data”
at HHS IDEA Lab across Department of Health & Human Services agencies
Demonstrated capabilities in improving data quality
at White House Open Data Roundtable
Optimizing DDOD to be scalable and applicable across government
with OSTP/OMB, Data Cabinet and Center for Open Data Enterprise
Discuss data maturity model application beyond open data
at MIT Chief Data Officer conference
The Background
CIOs and agency heads must: Maintain an EDI
(Enterprise Data Inventory); Implement a
“process to evaluate and improve timeliness,
completeness, accuracy, usefulness, and
availability” of open data; Implement a method
for understanding data asset usage, responding
to quality issues, usability, recommendations for
improvements, and adherence complaints;
Ensure conformance with open data best
practices; Produce an Open Data Compliance
Report.
Agencies must: Analyze data asset usage,
including responding to quality issues, usability,
recommendations for improvements, and
complaints about adherence; Monitor public
satisfaction and performance improvement
needs; Engage the public in using open data and
encourage collaborative approaches to improving
data use; Provide information for the GAO report
on the value of information made available to the
public and additional data assets that should be
made available publicly.
Looking Ahead... OPEN Gov Data Act
Focus on measuring value of data
Engage the public in using open data and encourage
collaborative approaches to improving data use
Analyze data asset usage. “Monitor public
satisfaction and performance improvement needs”
Institute a process to continuously improve on “quality
issues, usability, recommendations, complaints...”
What happens when we don’t measure value?
Data owners focus on datasets that are:
easiest to generate and
least risky to release
Unusable and low-value datasets
Difficult to find useful data
The Reality
The Result
Take community engagement (on steroids, of course)
The Solution
And pair it with lean startup principles
The Shift
What’s a Use Case?
All metrics in DDOD are in terms of Use Cases,
...which is simply a well-defined application of a dataset
for a specific purpose in industry, research or media.
It always includes a statement of value -- both
to the requester and the general public
Each use case has core
sections…
Description
Value
Specifications
Solution
See them at:
http://ddod.healthdata.gov/
Anatomy of a Use Case
Processes for administration of use cases, such as
• Encouraging responsiveness, transparency and documentation
• Ensuring use cases and resulting datasets are indexed in HealthData.gov
Specialized tools for administering use cases
• Workflow engine, communications method, knowledge base
• Data processing, storage, hosting, versioning
Proactive outreach to industry and academia for a thriving
community
DDOD provides 3 core services to Data Owners
The Framework
Identify missing
technical capability
Manually improve
data catalog and
data assets
Contribute to
Use Case
knowledge base
External DDOD Activity
• Outreach & collaboration
• Use case administration
DDOD drives 3 types of deliverables:
cataloging of use cases,
improvements of data assets and
development of technical capabilities
Internal DDOD Activity
• Systems development
• Program evaluation
Ongoing
Systems development specification
Increase & measure value Improve capability
Knowledge Base Data Assets Technical Capability
The Process
DDOD’s workflow for a Use Case enabled by 3 types of participants:
Data User, DDOD Admin, Data Owner
Communications Platform
(Github Issues)
Data Catalog
(HealthData.gov)
Knowledge Base (MediaWiki)
The Tools
Middleware (Python)
Tied together with middleware
that monitors changes and
tracks progress
The Architecture
Data.json Hosted
charts
(Flask, Google
Charts, Bokeh)
Embed
Middleware
(Python, Flask, math libraries)
HealthData.gov
Drupal
(CMS,
workflow)
Semantic
MediaWiki
Drupal DKAN theme
SMW
API
GitHub
issues
GH API
DCAN Drupal
Extension
(DKAN data
catalog)
Requests
Library
...but it’s always changing
DDOD use cases
deliver value in 6
ways...
The Metrics
Found at: http://ddod.healthdata.gov/
The Progress
✤ As of May 2016
The Deliverables
Knowledge Base Data Catalog & Assets Technical Capability
◼ 44 use cases documenting
specific applications of open data
assets added to DDOD knowledge
base
◼ 8 agencies covered: CMS,
FDA, CDC, HRSA, ONC, ACF,
ACL, ASPE
◼ 47 users served by DDOD,
including companies, data
scientists, researchers, journalists
and nonprofits
◼ 20 use cases driving additional
datasets indexed
◼ 180 previously uncataloged
URLs identified
◼ 9 use cases driving new or
improved datasets released
◼ 2 standards for open data
resulting from 8 use cases
◼ Automated calculation and
visualization of value metrics
◼ Dataset count fluctuation
monitoring
◼ Daily catalog change reports
◼ Data asset federation report &
harvest flow visualization
◼ DDOD/HealthData.gov integration
roadmap
• Single source of truth monitoring
• Data quality notifications
• Auto sync between platforms
It started with frustration
about data quality
Can’t reconcile multiple sources
Missing unique identifiers
Refreshes change history
And ended with a release of
new data
(including an API!)
Example Use Case
Quality improvements
using machine readability
and consolidation
Medicaid enrollment
data reports have been
published only as PDFs
...with different files by
years and state!
Lots of overhead and
transcription errors
If only they could all be
that easy!
Example Use Case
Data quality improves by
eliminating manual entry
Federal poverty
guidelines are tables
published annually
Lots of organizations
enter these by hand
But community already
solved the problem
The best kind of problems
solve themselves!
Example Use Case
Insights for regulation Stimulate adoption
Sometimes, the biggest
gains come when you
observe trends.
Observe 7 use cases with
common challenge
Need standardized
provider dimension
Work with regulators and
industry
DDOD was able to
contribute to a new standard
7 use cases impacted Industry work groupExample Use Case
Your insights please!
Help fine-tune DDOD to be most
applicable to your agency’s needs
1. Channels used to reach public
2. Prioritization of releases / improvements
3. Measuring value of data assets
4. Incentives for program owner
The Request

Contenu connexe

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 

En vedette

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

DDOD - The Lean Startup Approach to Open Data

  • 1. Presented at Executive Office of the President Office of Science and Technology Policy US Gov Data Cabinet meeting June 21, 2016 The Lean Startup Approach to Open Data How Demand-Driven Open Data (DDOD) Improves Relevance, Discoverability and Usability David Portnoy Entrepreneur-in-Residence U.S. Department of Health & Human Services Twitter: @dportnoy http://ddod.healthdata.gov
  • 2. Piloted as “Lean Startup for Open Data” at HHS IDEA Lab across Department of Health & Human Services agencies Demonstrated capabilities in improving data quality at White House Open Data Roundtable Optimizing DDOD to be scalable and applicable across government with OSTP/OMB, Data Cabinet and Center for Open Data Enterprise Discuss data maturity model application beyond open data at MIT Chief Data Officer conference The Background
  • 3. CIOs and agency heads must: Maintain an EDI (Enterprise Data Inventory); Implement a “process to evaluate and improve timeliness, completeness, accuracy, usefulness, and availability” of open data; Implement a method for understanding data asset usage, responding to quality issues, usability, recommendations for improvements, and adherence complaints; Ensure conformance with open data best practices; Produce an Open Data Compliance Report. Agencies must: Analyze data asset usage, including responding to quality issues, usability, recommendations for improvements, and complaints about adherence; Monitor public satisfaction and performance improvement needs; Engage the public in using open data and encourage collaborative approaches to improving data use; Provide information for the GAO report on the value of information made available to the public and additional data assets that should be made available publicly. Looking Ahead... OPEN Gov Data Act Focus on measuring value of data Engage the public in using open data and encourage collaborative approaches to improving data use Analyze data asset usage. “Monitor public satisfaction and performance improvement needs” Institute a process to continuously improve on “quality issues, usability, recommendations, complaints...”
  • 4. What happens when we don’t measure value? Data owners focus on datasets that are: easiest to generate and least risky to release Unusable and low-value datasets Difficult to find useful data The Reality The Result
  • 5. Take community engagement (on steroids, of course) The Solution And pair it with lean startup principles The Shift
  • 6. What’s a Use Case? All metrics in DDOD are in terms of Use Cases, ...which is simply a well-defined application of a dataset for a specific purpose in industry, research or media. It always includes a statement of value -- both to the requester and the general public
  • 7. Each use case has core sections… Description Value Specifications Solution See them at: http://ddod.healthdata.gov/ Anatomy of a Use Case
  • 8. Processes for administration of use cases, such as • Encouraging responsiveness, transparency and documentation • Ensuring use cases and resulting datasets are indexed in HealthData.gov Specialized tools for administering use cases • Workflow engine, communications method, knowledge base • Data processing, storage, hosting, versioning Proactive outreach to industry and academia for a thriving community DDOD provides 3 core services to Data Owners
  • 9. The Framework Identify missing technical capability Manually improve data catalog and data assets Contribute to Use Case knowledge base External DDOD Activity • Outreach & collaboration • Use case administration DDOD drives 3 types of deliverables: cataloging of use cases, improvements of data assets and development of technical capabilities Internal DDOD Activity • Systems development • Program evaluation Ongoing Systems development specification Increase & measure value Improve capability Knowledge Base Data Assets Technical Capability
  • 10. The Process DDOD’s workflow for a Use Case enabled by 3 types of participants: Data User, DDOD Admin, Data Owner
  • 11. Communications Platform (Github Issues) Data Catalog (HealthData.gov) Knowledge Base (MediaWiki) The Tools Middleware (Python) Tied together with middleware that monitors changes and tracks progress
  • 12. The Architecture Data.json Hosted charts (Flask, Google Charts, Bokeh) Embed Middleware (Python, Flask, math libraries) HealthData.gov Drupal (CMS, workflow) Semantic MediaWiki Drupal DKAN theme SMW API GitHub issues GH API DCAN Drupal Extension (DKAN data catalog) Requests Library ...but it’s always changing
  • 13. DDOD use cases deliver value in 6 ways... The Metrics
  • 15. ✤ As of May 2016 The Deliverables Knowledge Base Data Catalog & Assets Technical Capability ◼ 44 use cases documenting specific applications of open data assets added to DDOD knowledge base ◼ 8 agencies covered: CMS, FDA, CDC, HRSA, ONC, ACF, ACL, ASPE ◼ 47 users served by DDOD, including companies, data scientists, researchers, journalists and nonprofits ◼ 20 use cases driving additional datasets indexed ◼ 180 previously uncataloged URLs identified ◼ 9 use cases driving new or improved datasets released ◼ 2 standards for open data resulting from 8 use cases ◼ Automated calculation and visualization of value metrics ◼ Dataset count fluctuation monitoring ◼ Daily catalog change reports ◼ Data asset federation report & harvest flow visualization ◼ DDOD/HealthData.gov integration roadmap • Single source of truth monitoring • Data quality notifications • Auto sync between platforms
  • 16. It started with frustration about data quality Can’t reconcile multiple sources Missing unique identifiers Refreshes change history And ended with a release of new data (including an API!) Example Use Case
  • 17. Quality improvements using machine readability and consolidation Medicaid enrollment data reports have been published only as PDFs ...with different files by years and state! Lots of overhead and transcription errors If only they could all be that easy! Example Use Case
  • 18. Data quality improves by eliminating manual entry Federal poverty guidelines are tables published annually Lots of organizations enter these by hand But community already solved the problem The best kind of problems solve themselves! Example Use Case
  • 19. Insights for regulation Stimulate adoption Sometimes, the biggest gains come when you observe trends. Observe 7 use cases with common challenge Need standardized provider dimension Work with regulators and industry DDOD was able to contribute to a new standard 7 use cases impacted Industry work groupExample Use Case
  • 20. Your insights please! Help fine-tune DDOD to be most applicable to your agency’s needs 1. Channels used to reach public 2. Prioritization of releases / improvements 3. Measuring value of data assets 4. Incentives for program owner The Request