SlideShare une entreprise Scribd logo
1  sur  43
Primer: Data-Driven Startups
Digital Incubation Centre,
Ministry of Transportation and Communications
Doha, Qatar
Heather Leson March 9, 2016
Data Examples
Cultural: Data about cultural works and artefacts — for example titles and authors —
and generally collected and held by galleries, libraries, archives and museums.
Science: Data that is produced as part of scientific research from astronomy to
zoology.
Finance: Data such as government accounts (expenditure and revenue) and
information on financial markets (stocks, shares, bonds etc).
Statistics: Data produced by statistical offices such as the census and key
socioeconomic indicators.
Weather: The many types of information used to understand and predict the weather
and climate.
Environment: Information related to the natural environment such presence and
level of pollutants, the quality and rivers and seas.
Transport: Data such as timetables, routes, on-time statistics.
Types of Open Data
(Source: okfn.org)
Kasra and QCRI: Connecting Startups & Research
Metis:
Collaborating with
CMU to get data
working within the
privacy/security
guidelines
Academic Planning Made Easier.
Mumm:
Connecting with the local
Cairo data science
community.
Data for food.
Exantium:
Strategy firm
connecting open
data to government
and business. Part
of a global network.
Data-Driven Recipes
1. How to:
Technical
Training/Business
for Data Literacy
2. How to:
Host a Data
Expedition
Storyteller
Role: Generate Ideas, interesting questions, help defining the questions and assist in the information
products/story outputs.
Scout
Role: Scouts hunt down data from across the web. They can be non-technical or technical, depending on
how difficult it is to obtain data (whether it is easily downloadable or needs to be scraped etc).
Analyst
Role: Analysts are the ones who crunch the data found by the scouts and test the hypotheses generated
by the storytellers.
“Engineers” (Optional)
Role: create information outputs (varying degrees of technical from coding to using ‘off the shelf’ tools
Designers
Role: Beautify the outputs and make sure the story really comes through the data.
3. How to:
Data Clinics to
connect
entrepreneurs,
business and
government
Data Discovery
DIY Data:
BQ Magazine’s
Faces of Qatar
DIY Data:
QCRI Social
Computing
Groundtruth Data
Collection
Phones, photos and food consumption
for Health Monitoring
You are a Smart City: Create a local map dataset
Data
Pipeline
Qatar Data Expedition
What are the questions you seek to answer?
What is the license? Can you reuse/publish the data?
Is the source credible?
Is the data credible?
Where did they get their data?
How much time do I have to search?
How am I organizing my research?
Keen to learn more about verification? http://verificationhandbook.com/ (it
is in Arabic too!)
Consider
Who is publishing about Qatar...on biodiversity?
United States 7,440 occurrences, 97.77% geo-
referenced.
United Kingdom 832 occurrences, 8.29% geo-referenced.
Sweden 620 occurrences, 0.32% geo-referenced.
Netherlands 298 occurrences, 5.03% geo-referenced.
Source: Global Biodiversity Information Facility
What about data on tourism?
Source: Knoema Data Atlas, which
aggregates the World Development
Indicators, 2015
$6, 616,000,000 USD
International Tourism
expenditures for travel items
(Time for more boutique
travel startups)
World Bank UN Data
UNESCO Institute of Statistics
HDX WEF
Forbes: Top 35 big data sources
Visually: 30 places to find Open Data
Location Data
OpenStreetMap: Free, open
Dataset
Get data: http://planet.osm.org/
GADM: Administrative Boundaries
Bing Imagery
Ministry of Development Planning and Statistics
In economic statistics:
Quarterly and annual Gross Domestic Product -GDP (constant and current) by economic
activity
Monthly, quarterly and annual Consumer Price Index, Production Price Index-PPI,
Foreign Trade Statistics (import and export), Building permits
In social statistics:
Labor force statistics (through a labor force sample survey)
Marriage, health, birth, fertility, education, disability, mortality statistics (in coordination with
other ministries)
In environmental statistics:
Monthly rainfall, Monthly and annual average concentrations of air pollutants, Capacities
of urban wastewater treatment plants
In population statistics: Population growth rate, Population sex ratio
QALM portal (Qatar Information Exchange)
QALM is an ambitious national project, developed by a number of government partners
including: The General Secretariat for Development Planning, The Statistics Authority, The
Supreme Council of Health, The Supreme Education Council, Supreme Council of Family Affairs,
ictQATAR, Ministerial Cabinet and the Permanent Population Committee.
http://www.qalm.gov.qa/
Data is available in multiple formats!
To get data from the Ministry of Development. Check their website. If you are looking for other
data, they are an email away. ICU@mdps.gov.qa
Using Data
Learn how: http://datadrivenjournalism.net/
"Expenditure Components Of GDP at Current Prices (Mn Qatari Riyal)
Source - Ministry of Development Planning and Statistics
"
"",""," ",,,,,,,,,,,,,,,,,,
"","","2004","2005","2006","2007","2008","2009","2010","2011","2012","2013",,,,,"2014",,,,
"","","Total","Total","Total","Total","Total","Total","Total","Total","Total","Q1","Q2","Q3","Q4","Total","Q1","Q2","Q3","Q4","Total"
"Gross Domestic
product","B.1G",115512.376669,162091.018049205,221610.304141365,290151.574403828,419582.826273579,355986.474251774,455445,618089.239045503,692654.670488044,186654.189573065,177830.42
0532429,185433.336051801,189857.929208376,739776,193880.888003083,189653.51105388,193080.129441538,194397.657502752,771013.233251822
"Household Final Consumption
Expenditure","P.3a",20166,25889.8602243444,36186.326795032,49728.6119489121,64675.8351579253,68622.9919301139,73645.7899114015,79905.6820538706,87682.19979384,24130.4586981125,24802.4
947262859,23572.4447936237,26368.9939206421,98874.3921386642,26807.1948166319,27414.3657651239,26424.7106136522,28729.6901996358,109375.961395044
"Government Final Consumption
Expenditure","P.3b",15094,23171.9888517611,32616.2047008325,35989.9119915317,42695.8750950427,55652.33697478,63689.0870608494,77007.4825664626,89527.4435418714,24336.9460716118,24384.
7648280038,24240.4862291342,25297.5589689309,98259.7560976807,26593.3225341388,26861.3831859924,27030.5661941075,27714.3396569197,108199.611571158
"Gross capital
formation","P.5",36399.044558,55609.5389690997,92830.0390858622,133518.050463385,172523.116020611,152947.14534688,142449.123027749,177621.474425169,194347.357152333,49488.7848033409,4
9657.1609781394,58089.4050290433,60871.3763188034,218106.851763655,53389.3706523124,58868.7621027634,67296.8526337788,77579.6276461965,257731.66028562
"Exports (Goods & Services)-
F.O.B","P.6",74122.332111,105496.630004,139210.733559638,174896,257467,182033,283832,442959.8,520182,141152,131890,134332,131751,539125,146457,134748,131592,116481,528682
"Imports (Goods & Services)-F.O.B","P.7",-30269,-48077,-79233,-103981,-117779,-103269,-108171,-159405.2,-199084.33,-52454,-52904,-54801,-54431,-214590,-59366,-58239,-59264,-56107,-232976
"*Figures for 2013 & 2014 are Preliminary estimates
Powered by © QALM"
Census data extracted...not usable yet..
Qatar Census
(Source: Doha News 2016)
South African Census Data
Open Refine http://openrefine.org/
Sublime Text
https://www.sublimetext.com/
There are many tools for software
developers and data scientists too.
Note: you still need the Human API to analyze and
make decisions for your business. Of course, if you
can afford it, then you can get your business
intelligence from KPMG, Gartner, Bloomberg,
McKinley or PWC. Until then….
Some tools to Clean Datasets
Learn more with Lillian and her
online courses.
Tools for Charts, Graphs and Infographics
http://tableau.com/
http://infogr.am/
http://piktochart.com/
https://www.canva.com/
More LMGTFY: http://www.creativebloq.com/design-tools/data-visualization-
712402
(source: TuktukDesign, Noun Project ccby)
Map tools
Mapbox: http://mapbox.com/
CartoDB: http://academy.cartodb.com/
Leaflet: http://leafletjs.com/
Google: https://www.google.com/mapmaker
ARCgis: https://www.arcgis.com/features/
Time mapper: http://timemapper.okfnlabs.org/
Also: if you are collecting your own location data, try Field Papers or
crowdsource map photos with Mapillary. (They just got 8M funding!)
(source: Mister Pixel, Noun Project, ccby)
QCRI Combining Data Sources: Real-Time Traffic
Monitoring
● Collection and classification of traffic
related tweets (script, research tool)
● Continuous Real-time querying of
Google Traffic API
● Qatar Traffic Profiling & Modeling
○ Geo: City, zone, district
○ Time: Hourly, daily, weekly,
and monthly
● Usage:
○ Detection of abnormal
behaviors
○ Predictions
○ Monthly Public reports
■ Commute status
■ Deadpoints
The best way to learn
is to find data and
make data information
products.
Try to recreate the
diagrams and track
back the data.
Track how other
startups use data.
Copy. Remix.
Social Entrepreneurship & Social Good
Impact of Data-Driven Business
You know your business. Data can give you a
leading edge. Be a Data-Driven Startup.
Some reading:
ODI Report: The Economic Impact of Open Data
ODI - Open Data Means Business
How to build a business from Open Data (1)
How to build a business from Open Data (2)
OpenMENA - 19 studies on Open Data
ABC: Always be Charging
How can you have a Data-Driven Career?
What is your Data Plan for your startup?
Can you use Data-Driven Journalism techniques to improve your
business?
What kind of data do you need to grow your business?
What type of training do you want/need?
Thank you
@heatherleson
@qatarcomputing

Contenu connexe

Similaire à Primer: Data-Driven Startups

cse6339-spring15-02.pptx
cse6339-spring15-02.pptxcse6339-spring15-02.pptx
cse6339-spring15-02.pptx
Paul832
 
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Fatemeh Ahmadi
 
EDF2012 Rufus Pollock - Open Data. Where we are where we are going
EDF2012  Rufus Pollock - Open Data. Where we are where we are goingEDF2012  Rufus Pollock - Open Data. Where we are where we are going
EDF2012 Rufus Pollock - Open Data. Where we are where we are going
European Data Forum
 
Will We Command Our Data? From the Petascale to the Personal
Will We Command Our Data?  From the Petascale to the PersonalWill We Command Our Data?  From the Petascale to the Personal
Will We Command Our Data? From the Petascale to the Personal
Richard Akerman
 

Similaire à Primer: Data-Driven Startups (20)

HLG Big Data project and Sandbox
HLG Big Data project and SandboxHLG Big Data project and Sandbox
HLG Big Data project and Sandbox
 
cse6339-spring15-02.pptx
cse6339-spring15-02.pptxcse6339-spring15-02.pptx
cse6339-spring15-02.pptx
 
Foresight Analytics
Foresight AnalyticsForesight Analytics
Foresight Analytics
 
A Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsA Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's Datasets
 
Bigdatacooltools
BigdatacooltoolsBigdatacooltools
Bigdatacooltools
 
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
 
Lecture week 5 -
Lecture week 5 -Lecture week 5 -
Lecture week 5 -
 
Foresight conversation
Foresight conversationForesight conversation
Foresight conversation
 
Bigdata ai
Bigdata aiBigdata ai
Bigdata ai
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Participatory Web
Participatory WebParticipatory Web
Participatory Web
 
Big data and the dark arts - Jisc Digital Media 2015
Big data and the dark arts - Jisc Digital Media 2015Big data and the dark arts - Jisc Digital Media 2015
Big data and the dark arts - Jisc Digital Media 2015
 
Datapreneurs
DatapreneursDatapreneurs
Datapreneurs
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
EDF2012 Rufus Pollock - Open Data. Where we are where we are going
EDF2012  Rufus Pollock - Open Data. Where we are where we are goingEDF2012  Rufus Pollock - Open Data. Where we are where we are going
EDF2012 Rufus Pollock - Open Data. Where we are where we are going
 
Hawke's Bay Open Data Conference - 2 May 2019
Hawke's Bay Open Data Conference - 2 May 2019Hawke's Bay Open Data Conference - 2 May 2019
Hawke's Bay Open Data Conference - 2 May 2019
 
CeRDI Research RUN Vietnam Agriculture Group
CeRDI Research RUN Vietnam Agriculture GroupCeRDI Research RUN Vietnam Agriculture Group
CeRDI Research RUN Vietnam Agriculture Group
 
Will We Command Our Data? From the Petascale to the Personal
Will We Command Our Data?  From the Petascale to the PersonalWill We Command Our Data?  From the Petascale to the Personal
Will We Command Our Data? From the Petascale to the Personal
 
Systemof insight
Systemof insightSystemof insight
Systemof insight
 
Big Data & Smart City Applications
Big Data & Smart City ApplicationsBig Data & Smart City Applications
Big Data & Smart City Applications
 

Plus de International Federation of Red Cross and Red Crescent Societies

Plus de International Federation of Red Cross and Red Crescent Societies (20)

Fragile communities in a data driven world
Fragile communities in a data driven world Fragile communities in a data driven world
Fragile communities in a data driven world
 
Co creating Data Literacy
Co creating Data Literacy Co creating Data Literacy
Co creating Data Literacy
 
When we say open...(updated)
When we say open...(updated)When we say open...(updated)
When we say open...(updated)
 
Introducing the Data Playbook (Beta)
Introducing the Data Playbook (Beta)Introducing the Data Playbook (Beta)
Introducing the Data Playbook (Beta)
 
When we say open...
When we say open...When we say open...
When we say open...
 
Data Literacy at IFRC 2017
Data Literacy at IFRC 2017Data Literacy at IFRC 2017
Data Literacy at IFRC 2017
 
Where Do We Go from Here?
Where Do We Go from Here?Where Do We Go from Here?
Where Do We Go from Here?
 
The Next Million
The Next MillionThe Next Million
The Next Million
 
Crowdsourcing with Data-Driven Innovation
Crowdsourcing with Data-Driven InnovationCrowdsourcing with Data-Driven Innovation
Crowdsourcing with Data-Driven Innovation
 
Building a Citizen Engaged Research Project
Building a Citizen Engaged Research ProjectBuilding a Citizen Engaged Research Project
Building a Citizen Engaged Research Project
 
Our Common Startup
Our Common StartupOur Common Startup
Our Common Startup
 
Reduce Risk with Digital Preparedness
Reduce Risk with Digital Preparedness  Reduce Risk with Digital Preparedness
Reduce Risk with Digital Preparedness
 
Empower Digital Skills for Good
Empower Digital Skills for Good Empower Digital Skills for Good
Empower Digital Skills for Good
 
Data-Driven Innovation in Qatar
Data-Driven Innovation in Qatar Data-Driven Innovation in Qatar
Data-Driven Innovation in Qatar
 
Aingel Accelerator
Aingel Accelerator Aingel Accelerator
Aingel Accelerator
 
Using Maps to Connect
Using Maps to ConnectUsing Maps to Connect
Using Maps to Connect
 
Micro Maps
Micro MapsMicro Maps
Micro Maps
 
Getting to know maps for social good
Getting to know maps for social goodGetting to know maps for social good
Getting to know maps for social good
 
Digital Humanitarians in the Sky
Digital Humanitarians in the SkyDigital Humanitarians in the Sky
Digital Humanitarians in the Sky
 
Introduction to Digital Humanitarians
Introduction to Digital Humanitarians   Introduction to Digital Humanitarians
Introduction to Digital Humanitarians
 

Dernier

+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
Health
 
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
ZurliaSoop
 
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdfEnabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
Smartinfologiks
 
Indian Call girl in Dubai 0508644382 Dubai Call girls
Indian Call girl in Dubai 0508644382 Dubai Call girlsIndian Call girl in Dubai 0508644382 Dubai Call girls
Indian Call girl in Dubai 0508644382 Dubai Call girls
Monica Sydney
 

Dernier (12)

+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
+971565801893>>Safe and original mtp kit for sale in Dubai>>+971565801893
 
Famedesired Project portfolio1 . Fullsail
Famedesired Project portfolio1 . FullsailFamedesired Project portfolio1 . Fullsail
Famedesired Project portfolio1 . Fullsail
 
NEON LIGHT CITY pitch deck for the new PC game
NEON LIGHT CITY pitch deck for the new PC gameNEON LIGHT CITY pitch deck for the new PC game
NEON LIGHT CITY pitch deck for the new PC game
 
How Multicultural Toys Helps in Child Development.pptx
How Multicultural Toys Helps in Child Development.pptxHow Multicultural Toys Helps in Child Development.pptx
How Multicultural Toys Helps in Child Development.pptx
 
Shareholders Agreement Template for Compulsorily Convertible Debt Funding- St...
Shareholders Agreement Template for Compulsorily Convertible Debt Funding- St...Shareholders Agreement Template for Compulsorily Convertible Debt Funding- St...
Shareholders Agreement Template for Compulsorily Convertible Debt Funding- St...
 
EV Electric Vehicle Startup Pitch Deck- StartupSprouts.in
EV Electric Vehicle Startup Pitch Deck- StartupSprouts.inEV Electric Vehicle Startup Pitch Deck- StartupSprouts.in
EV Electric Vehicle Startup Pitch Deck- StartupSprouts.in
 
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
Jual Obat Aborsi Bojonegoro ( Asli No.1 ) 085657271886 Obat Penggugur Kandung...
 
Dàni Velvet Personal Brand Exploration (1).pptx
Dàni Velvet Personal Brand Exploration (1).pptxDàni Velvet Personal Brand Exploration (1).pptx
Dàni Velvet Personal Brand Exploration (1).pptx
 
JAIPUR CALL GIRLS SERVICE REAL HOT SEXY 👯 CALL GIRLS IN JAIPUR BOOK YOUR DREA...
JAIPUR CALL GIRLS SERVICE REAL HOT SEXY 👯 CALL GIRLS IN JAIPUR BOOK YOUR DREA...JAIPUR CALL GIRLS SERVICE REAL HOT SEXY 👯 CALL GIRLS IN JAIPUR BOOK YOUR DREA...
JAIPUR CALL GIRLS SERVICE REAL HOT SEXY 👯 CALL GIRLS IN JAIPUR BOOK YOUR DREA...
 
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdfEnabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
Enabling Business Users to Interpret Data Through Self-Service Analytics (2).pdf
 
How to structure your pitch - B4i template
How to structure your pitch - B4i templateHow to structure your pitch - B4i template
How to structure your pitch - B4i template
 
Indian Call girl in Dubai 0508644382 Dubai Call girls
Indian Call girl in Dubai 0508644382 Dubai Call girlsIndian Call girl in Dubai 0508644382 Dubai Call girls
Indian Call girl in Dubai 0508644382 Dubai Call girls
 

Primer: Data-Driven Startups

  • 1. Primer: Data-Driven Startups Digital Incubation Centre, Ministry of Transportation and Communications Doha, Qatar Heather Leson March 9, 2016
  • 2.
  • 4.
  • 5.
  • 6. Cultural: Data about cultural works and artefacts — for example titles and authors — and generally collected and held by galleries, libraries, archives and museums. Science: Data that is produced as part of scientific research from astronomy to zoology. Finance: Data such as government accounts (expenditure and revenue) and information on financial markets (stocks, shares, bonds etc). Statistics: Data produced by statistical offices such as the census and key socioeconomic indicators. Weather: The many types of information used to understand and predict the weather and climate. Environment: Information related to the natural environment such presence and level of pollutants, the quality and rivers and seas. Transport: Data such as timetables, routes, on-time statistics. Types of Open Data (Source: okfn.org)
  • 7. Kasra and QCRI: Connecting Startups & Research
  • 8. Metis: Collaborating with CMU to get data working within the privacy/security guidelines Academic Planning Made Easier.
  • 9. Mumm: Connecting with the local Cairo data science community. Data for food.
  • 10. Exantium: Strategy firm connecting open data to government and business. Part of a global network.
  • 13. 2. How to: Host a Data Expedition
  • 14. Storyteller Role: Generate Ideas, interesting questions, help defining the questions and assist in the information products/story outputs. Scout Role: Scouts hunt down data from across the web. They can be non-technical or technical, depending on how difficult it is to obtain data (whether it is easily downloadable or needs to be scraped etc). Analyst Role: Analysts are the ones who crunch the data found by the scouts and test the hypotheses generated by the storytellers. “Engineers” (Optional) Role: create information outputs (varying degrees of technical from coding to using ‘off the shelf’ tools Designers Role: Beautify the outputs and make sure the story really comes through the data.
  • 15. 3. How to: Data Clinics to connect entrepreneurs, business and government
  • 18. DIY Data: QCRI Social Computing Groundtruth Data Collection Phones, photos and food consumption for Health Monitoring
  • 19. You are a Smart City: Create a local map dataset
  • 22. What are the questions you seek to answer? What is the license? Can you reuse/publish the data? Is the source credible? Is the data credible? Where did they get their data? How much time do I have to search? How am I organizing my research? Keen to learn more about verification? http://verificationhandbook.com/ (it is in Arabic too!) Consider
  • 23. Who is publishing about Qatar...on biodiversity? United States 7,440 occurrences, 97.77% geo- referenced. United Kingdom 832 occurrences, 8.29% geo-referenced. Sweden 620 occurrences, 0.32% geo-referenced. Netherlands 298 occurrences, 5.03% geo-referenced. Source: Global Biodiversity Information Facility
  • 24. What about data on tourism? Source: Knoema Data Atlas, which aggregates the World Development Indicators, 2015 $6, 616,000,000 USD International Tourism expenditures for travel items (Time for more boutique travel startups)
  • 25. World Bank UN Data UNESCO Institute of Statistics HDX WEF Forbes: Top 35 big data sources Visually: 30 places to find Open Data
  • 26. Location Data OpenStreetMap: Free, open Dataset Get data: http://planet.osm.org/ GADM: Administrative Boundaries Bing Imagery
  • 27. Ministry of Development Planning and Statistics In economic statistics: Quarterly and annual Gross Domestic Product -GDP (constant and current) by economic activity Monthly, quarterly and annual Consumer Price Index, Production Price Index-PPI, Foreign Trade Statistics (import and export), Building permits In social statistics: Labor force statistics (through a labor force sample survey) Marriage, health, birth, fertility, education, disability, mortality statistics (in coordination with other ministries) In environmental statistics: Monthly rainfall, Monthly and annual average concentrations of air pollutants, Capacities of urban wastewater treatment plants In population statistics: Population growth rate, Population sex ratio
  • 28. QALM portal (Qatar Information Exchange) QALM is an ambitious national project, developed by a number of government partners including: The General Secretariat for Development Planning, The Statistics Authority, The Supreme Council of Health, The Supreme Education Council, Supreme Council of Family Affairs, ictQATAR, Ministerial Cabinet and the Permanent Population Committee. http://www.qalm.gov.qa/ Data is available in multiple formats! To get data from the Ministry of Development. Check their website. If you are looking for other data, they are an email away. ICU@mdps.gov.qa
  • 31.
  • 32. "Expenditure Components Of GDP at Current Prices (Mn Qatari Riyal) Source - Ministry of Development Planning and Statistics " "",""," ",,,,,,,,,,,,,,,,,, "","","2004","2005","2006","2007","2008","2009","2010","2011","2012","2013",,,,,"2014",,,, "","","Total","Total","Total","Total","Total","Total","Total","Total","Total","Q1","Q2","Q3","Q4","Total","Q1","Q2","Q3","Q4","Total" "Gross Domestic product","B.1G",115512.376669,162091.018049205,221610.304141365,290151.574403828,419582.826273579,355986.474251774,455445,618089.239045503,692654.670488044,186654.189573065,177830.42 0532429,185433.336051801,189857.929208376,739776,193880.888003083,189653.51105388,193080.129441538,194397.657502752,771013.233251822 "Household Final Consumption Expenditure","P.3a",20166,25889.8602243444,36186.326795032,49728.6119489121,64675.8351579253,68622.9919301139,73645.7899114015,79905.6820538706,87682.19979384,24130.4586981125,24802.4 947262859,23572.4447936237,26368.9939206421,98874.3921386642,26807.1948166319,27414.3657651239,26424.7106136522,28729.6901996358,109375.961395044 "Government Final Consumption Expenditure","P.3b",15094,23171.9888517611,32616.2047008325,35989.9119915317,42695.8750950427,55652.33697478,63689.0870608494,77007.4825664626,89527.4435418714,24336.9460716118,24384. 7648280038,24240.4862291342,25297.5589689309,98259.7560976807,26593.3225341388,26861.3831859924,27030.5661941075,27714.3396569197,108199.611571158 "Gross capital formation","P.5",36399.044558,55609.5389690997,92830.0390858622,133518.050463385,172523.116020611,152947.14534688,142449.123027749,177621.474425169,194347.357152333,49488.7848033409,4 9657.1609781394,58089.4050290433,60871.3763188034,218106.851763655,53389.3706523124,58868.7621027634,67296.8526337788,77579.6276461965,257731.66028562 "Exports (Goods & Services)- F.O.B","P.6",74122.332111,105496.630004,139210.733559638,174896,257467,182033,283832,442959.8,520182,141152,131890,134332,131751,539125,146457,134748,131592,116481,528682 "Imports (Goods & Services)-F.O.B","P.7",-30269,-48077,-79233,-103981,-117779,-103269,-108171,-159405.2,-199084.33,-52454,-52904,-54801,-54431,-214590,-59366,-58239,-59264,-56107,-232976 "*Figures for 2013 & 2014 are Preliminary estimates Powered by © QALM" Census data extracted...not usable yet..
  • 35. Open Refine http://openrefine.org/ Sublime Text https://www.sublimetext.com/ There are many tools for software developers and data scientists too. Note: you still need the Human API to analyze and make decisions for your business. Of course, if you can afford it, then you can get your business intelligence from KPMG, Gartner, Bloomberg, McKinley or PWC. Until then…. Some tools to Clean Datasets Learn more with Lillian and her online courses.
  • 36. Tools for Charts, Graphs and Infographics http://tableau.com/ http://infogr.am/ http://piktochart.com/ https://www.canva.com/ More LMGTFY: http://www.creativebloq.com/design-tools/data-visualization- 712402 (source: TuktukDesign, Noun Project ccby)
  • 37. Map tools Mapbox: http://mapbox.com/ CartoDB: http://academy.cartodb.com/ Leaflet: http://leafletjs.com/ Google: https://www.google.com/mapmaker ARCgis: https://www.arcgis.com/features/ Time mapper: http://timemapper.okfnlabs.org/ Also: if you are collecting your own location data, try Field Papers or crowdsource map photos with Mapillary. (They just got 8M funding!) (source: Mister Pixel, Noun Project, ccby)
  • 38. QCRI Combining Data Sources: Real-Time Traffic Monitoring ● Collection and classification of traffic related tweets (script, research tool) ● Continuous Real-time querying of Google Traffic API ● Qatar Traffic Profiling & Modeling ○ Geo: City, zone, district ○ Time: Hourly, daily, weekly, and monthly ● Usage: ○ Detection of abnormal behaviors ○ Predictions ○ Monthly Public reports ■ Commute status ■ Deadpoints
  • 39. The best way to learn is to find data and make data information products. Try to recreate the diagrams and track back the data. Track how other startups use data. Copy. Remix.
  • 41. Impact of Data-Driven Business You know your business. Data can give you a leading edge. Be a Data-Driven Startup. Some reading: ODI Report: The Economic Impact of Open Data ODI - Open Data Means Business How to build a business from Open Data (1) How to build a business from Open Data (2) OpenMENA - 19 studies on Open Data
  • 42. ABC: Always be Charging How can you have a Data-Driven Career? What is your Data Plan for your startup? Can you use Data-Driven Journalism techniques to improve your business? What kind of data do you need to grow your business? What type of training do you want/need?

Notes de l'éditeur

  1. Data-Driven Startups to be held at the DIC, Qatar Ministry of Transportation and Communications http://www.ticketfun.me/index/event?eid=999 http://textontechs.com/2016/03/primer-on-data-driven-innovation-for-startups/
  2. Your startup is all about data. From your market segmentation analysis to your business intelligence to your customer management system and beyond. Understanding the tools and formats on how to use data and data skills makes you a business leader and a “Data Driven Startup”
  3. To show how data-driven startups can be successful, I’ll share some data basics followed by some local and regional examples of data startups.
  4. There are many types of data. I like to think of it in layers (mainly due to my love of maps). This diagram is to give you an picture into all the types of data and how they might interact to tell stories, do good and sell your startup outputs. Every startup will use a different combination of this.
  5. Open Data is available in some countries and regions. Qatar currently has an open data policy and it is listed in the National Strategy . http://opendatahandbook.org/. See some of the impact via this report - http://odimpact.org/ More from https://okfn.org/opendata/
  6. Kasra.co is an arabic online news site that targets Arabic language speakers worldwide, especially in MENA. Kasra leverages social media to assist driving traffic, mainly from Facebook. Kasra’s Facebook page has 1M followers. The News Analytics team at QCRI is working to help with social data analytics. (Team is lead by Jim Jansen, Principal Scientist) http://qcri.org.qa/our-people/bio?pid=235&par=acc&name=JimJansen 1. We are using online traffic data to assist in topic selection for their online articles 2. Goal is to understand what types of articles go viral 3. Research aim is to prediction the popularity of articles From Kasra.co http://goo.gl/H3mLyc More about Kasra - http://textontechs.com/2015/08/in-their-own-words-via-kasra/
  7. Metis is a local startup that focuses on connecting students to planning. This objective of this project is to develop student-centric academic planning software for universities and students, using elective based system, which are very flexible but imposes greater challenges for students completing on time http://www.menafn.com/1094627802/Qatar--New-tool-helps-university-students-plan-their-courses http://www.gulf-times.com/story/483430/New-tool-helps-university-students-plan-their-cour https://www.facebook.com/metiscmu/ From Sabih “Regarding data, we have relied primarily on statistics. We did our pilot for 2 weeks, collected data on student interaction and their behavior towards short-term and long-term degree planning. Even though the data itself was not statistically significant, we got good insights on what further data to collect in production mode and how this data can be input back to our recommendation system.”
  8. Waleed Abd El Rahman is creating a data-driven business. Making healthy nutrition available for everyone through spreading entrepreneurship. He is also connecting with local communities to help grow their business. Which brings up the important point. Let’s move beyond hackathons to ongoing sustainable growth for entrepreneurs. With the local community behind his business, he is growing his supporters and his ability to use talent to inspire. https://eg.linkedin.com/in/waleed-abd-el-rahman-1b9a6312 http://getmumm.com/
  9. Exantium is a leading UAE-based advisory firm focusing on the public sector transformation in the GCC and the Arab world, driven by cutting-edge innovations, strategic digital transformation initiatives and world-class informational policies. http://exantium.com/ Exantium did a recent Smart Government course. http://exantium.com/?p=609, and is focused on Smart Cities. They are also an Open Data Institute Node. ODI works to connect business, government and entrepreneurs to the power of data. http://dubai.theodi.org/ hey have an upcoming course - http://www.mbrsg.ae/HOME/EXECUTIVE-EDUCATION/Open-Enrolments-Programs/Open-Data.aspx?lang=en-US
  10. For the Data-Driven Innovation workshop, I wrote a blog post about what I think needs to happen to connect data-driven innovation for local entrepreneurs. http://ddi-mena.org/ http://textontechs.com/2016/02/hybrid-skills-needed-to-foster-change/
  11. If you are unsure of the data available, you can productively use the Data Expedition model to help seek and find all the data you might use to answer your questions Example from the amazing Kathmandu Living Labs https://twitter.com/KTMLivingLabs/status/706338515684995072 How to do this http://schoolofdata.org/data-expeditions/
  12. How to do some data projects together http://schoolofdata.org/data-expeditions/ Note all the free courses http://schoolofdata.org/learn/
  13. A data clinic is like a hackathon but you involve all the stakeholders to consider a project. Let’s say you have a dataset and you are trying to prove that this type of data will help business. A data clinic is a technique to work on showcasing your desire to use the data appropriately and also give the officials some insights into how the data might help business. An example from my friend Olu in Nigeria http://schoolofdata.org/tag/data-clinic/ It is always about acheiving buyin. More details “ A data clinic is a workshop where participants bring troublesome data and a data scientist/data journalist together to think about how to use the data and view it.
  14. Data has the power to connect us to our audiences. Ferras Mohssen of BQ Magazine advised that staff called all the embassies and collected population data on Nationalities in Qatar for 2013- 2014. They took professional photos and created this map information diagram. The group did this for other GCC countries (Kuwait, UAE, Bahrain). I use this poster daily to remind myself that local social innovation has such a diverse audience. http://www.bq-magazine.com/economy/2013/12/population-qatar
  15. Sometimes you need to collect your own data with trusted partners. QCRI worked with health professionals and two schools to get data insights into health monitoring. The proposed intervention targets Qatari nationals who are overweight or obese. It involves three phases (1) weight loss camps, (2) after-school clubs as supplement, and (3) maintenance through web and social/family support. Data could provide basis for efforts to stem the rise of obesity in Qatar through lifestyle changes. Things we’d like to infer from these images: - what kids *don’t* eat (e.g. leaving vegetables) and if this is personal (= different preferences) - how they eat (e.g. many kids leave the cutlery clean and unused, others make a huge mess) - track their calorie intake Using Crowdflower to label the images, Instagram, mobile data collection Partners: Qatar University, Imperial College and Leeds
  16. OpenStreetMap is a global map of the world - free and opensource. It counts on local communities to always improve the map with data. Imagine if we had a map of Doha to help businesses. This data is pure diy raw business intelligence. All you have to do is look at groups like Mapillary or Mapbox or Cartodb to see how people are using Maps and location data. Here is how - http://learnosm.org/en/
  17. The Data Pipeline really varies from project to project. There are tools, skills and activities common to some projects. I like to add ethical questions and more. See my article - http://textontechs.com/2014/09/infusing-ethics-into-data-projects/ and the Responsible Data Forum’s work on a project lifecycle - https://wiki.responsibledata.io/Data_in_the_project_lifecycle
  18. You are now on a data expedition. while you are doing this research, you should get ready to answer some of these questions. If you are really keen to learn more about verifying data, consider reading the Verification Handbook. http://verificationhandbook.com/ Also be responsible with your data - http://responsibledata.io/
  19. I found out on my data expedition that the Global Biodiversity Information Facility has free and open datasets (about 54 datasets about Qatar from a number of sources.) While maybe not useful for startup, it makes you wonder how these could be used for studying the SDGs. Or, if you are doing a tourism startup. more on that topic soon. Source http://www.gbif.org/country/QA/about/countries. No date provided on the data.
  20. It is always a good idea to ask questions about the sources and check the dates. Where are they getting this data? Is it predictative? Can it be reused? could we have it in another format? Source: http://knoema.com/atlas/Qatar/Expenditures-for-travel-items but the data is really from the World Development Indicators which is the World Bank. http://data.worldbank.org/data-catalog/world-development-indicators
  21. Here is a quick list of some data sources available http://blog.visual.ly/data-sources/ https://data.hdx.rwlabs.org/ http://data.uis.unesco.org/ http://data.worldbank.org/ http://www.forbes.com/sites/bernardmarr/2016/02/12/big-data-35-brilliant-and-free-data-sources-for-2016/#a10f86667961 http://data.un.org/ http://reports.weforum.org/global-competitiveness-report-2015-2016/economies/#economy=QAT
  22. Location data is key for many businesses. There are a few startups here who are using map data. I am just providing some free sources. How to download data: http://wiki.openstreetmap.org/wiki/Downloading_data.
  23. Publications - http://www.gsdp.gov.qa/portal/page/portal/gsdp_en/knowledge_center/Publications/Tab7 Statistics Calendar - http://www.mdps.gov.qa/portal/page/portal/gsdp_en/statistics_en/statistics_calender_en
  24. Data Journalism is simply using data to tell a compelling story. Well, startups do that every day with their investors, supporters and customers. You need to differentiate yourself. This is just another item to add to your toolkit, right by ‘how to stay financially viable’.
  25. The Qatar Census is full of usable business intelligence. The data is available on QALM, but let’s say you did not find it. How would you get access to the data? You just need to use it. PDF - Tools - http://tabula.technology/
  26. I loaded the 111 pages of census data into Tabula. The next step is put the csv into open refine or another too. (QALM also allows you to download into excel if you wish)
  27. Doha News either transcribed the data or used a tool to clean up the data and load it into Tableau. http://dohanews.co/what-are-the-fastest-growing-neighborhoods-in-qatar/ (Census data: http://www.gsdp.gov.qa/portal/page/portal/gsdp_en/knowledge_center/Publications/Tab7/Tab/Census%202015.pdf) Tableau http://public.tableau.com/profile/peter4596#!/vizhome/ChangeinQatarspopulation2010-2015/Dashboard2
  28. This is just another example on how data from a census can be used to help people see details about their communities. In this case, it was about education and age. At Open Data Day on March 5, 2016 a team in South Africa used census data to do this. They used a tool called plot.ly https://plot.ly/~collierab/457/count-vs-age/
  29. This is just a sampler of Data Visualization tools. You can find more all over the net like this great guide http://visualisingadvocacy.org/: Noun Project: http://nounproject.com/ more on vis tools and to decide https://blog.infogr.am/15-thought-leaders-define-what-is-data-visualization/
  30. Map tools - http://fieldpapers.org/
  31. My colleagues are working on Real-time Traffic monitoring as part of the Urban Informatics team. Where did they get the data? Google, Social media, Admin boundaries. Doha over time - http://earthshots.usgs.gov/earthshots/node/69#ad-image-0
  32. Source https://buffer-pictures.s3.amazonaws.com/7f2ca1c922943fa6bc4422c02be50bb0.63fae385d58e2f4a26b2f70c9e262488.png
  33. There are many ways to think about data skills for good. In fact this is how I learned some of these techniques and innovations. The Digital Humanitarian Network is a group of people and communities that do this for humanitarian activities. http://digitalhumanitarians.com/ I would like to point out DataKind and the Standby Task Force. Learning about how people use information for social good is about taking care of our future. It is also a good way to see how you can apply these learnings to social entrepreneurship.
  34. How to collect data to help your career - http://www.slideshare.net/heatherleson/using-your-voice-to-amplify-your-career-may-14-2015
  35. Thanks so much for your interest in QCRI. @qatarcomputing http://qcri.org.qa/