SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
Data 2: Interrogating,
    visualising, mashing



   Online Journalism
   City University
   Paul Bradshaw
Monday, 7 March 2011
Themes


   5 things you need to know about each
   Data journalism in action
   Walkthrough



Monday, 7 March 2011
Interrogating data




   .


Monday, 7 March 2011
Monday, 7 March 2011
5 things you need to know about
    interrogating data

   1. Data always needs cleaning up
   2. Treat the ‘source’ like a source
   3. Use the right ‘average’ and
   percentage
   4. Variation over time & space: context
   5. Spreadsheet tools are your friend -
   but always backup copies
Monday, 7 March 2011
Monday, 7 March 2011
“What the Independent have done
 is confuse the UK’s deficit with our
 debt [making] the debt problem
 look around eight times worse than
 it is. And it used the whole of its
 front page to do so.”

                        - James Ball
Monday, 7 March 2011
Monday, 7 March 2011
What is the data worth?


   Measurement doesn't answer anything if
   there's only one variable
   Statistical significance
   Sample size and selection
   Controls and the placebo effect
   Read up.
Monday, 7 March 2011
1. Variance is interesting.
 2. Variance is different for different
 variables and in different
 populations.
 3. The amount of variance is easily
 quantified.
                       - Philip Meyer, Precision Journalism


Monday, 7 March 2011
Getting data in the right form


   Data > Text to columns
   Find & replace
   Conditional formulas:
   =IF(condition, if met, if not)
   =COUNTIF(range, test)

Monday, 7 March 2011
Walkthrough: cleaning data in
    Google Refine

   Edit cells > common transforms
   Edit cells > split multi-valued cells
   Facet > text facet
   Export...


Monday, 7 March 2011
Visualising data




   .


Monday, 7 March 2011
5 things you need to know about
    visualising data

   1. Choose the chart for the purpose
   2. It can be used to spot a lead
   3. Good design is when there’s nothing
   more to take away
   4. It should be self-contained & have refs
   5. Be careful with scales and classes
Monday, 7 March 2011
or http://chartchooser.juiceanalytics.com/
Monday, 7 March 2011
Monday, 7 March 2011
Monday, 7 March 2011
What is wrong with this picture?

Monday, 7 March 2011
Monday, 7 March 2011
http://simplecomplexity.net/statistics-without-context/


Monday, 7 March 2011
http://junkcharts.typepad.com/junk_charts/trifecta-checkup/

Monday, 7 March 2011
Visualisation tools


   ManyEyes
   Tableau
   Wordle, Tagxedo
   BatchGeo
   Gephi
   Delicious.com/paulb/visualisation+tools
Monday, 7 March 2011
Walkthrough: visualising data
    with Google Gadgets

   .




Monday, 7 March 2011
Walkthrough: visualising data in
    ManyEyes

   .




Monday, 7 March 2011
Mashing data




   .


Monday, 7 March 2011
5 things you need to know about
    mashing data

   1. It is what a journalist does best
   2. Look for a point of connection: place?
   Person? Company? Date?
   3. What an API can do
   4. What APIs there are
   5. Mashups can be live, updated or
   static
Monday, 7 March 2011
Monday, 7 March 2011
Monday, 7 March 2011
Mashup tools


   Yahoo! Pipes
   OpenHeatMap
   Mapalist
   xFruits
   Scraperwiki
   Maptube
Monday, 7 March 2011
Walkthrough: making mashups
    with Yahoo! Pipes

   Inputs - Fetch Feed, CSV, Data, Page,
   YQL, Flickr, Form
   Operators - Filter, Sort, Unique, Union,
   Count, Split, Rename, Regex, Unique,
   Location extractor, URL Builder
   Outputs - Map, Gallery, List, XML, KML
Monday, 7 March 2011
Walkthrough: making mashups
    with OpenHeatMap

   Format the spreadsheet
   Publish it as CSV
   Copy link
   Paste it at OpenHeatMap
   Fix any problems

Monday, 7 March 2011
Walkthrough: grabbing geo data
    with Google Refine

   Edit column > Add column by fetching
   URLs
   Use GREL (Google Refine Expression
   Language)
   Search web for help & examples

Monday, 7 March 2011
Questions?




  .


Monday, 7 March 2011
Links


   OnlineJournalismClasses.tumblr.com
   Delicious.com/paulb/cityoj09
   Delicious.com/paulb/datajournalism
   Delicious.com/paulb/visualisation
   Delicious.com/paulb/statistics
   Delicious.com/paulb/mashups
Monday, 7 March 2011
Lab


  Before the lab: play with these
  techniques yourself, have problems,
  find solutions, raise questions. Install
  Google Refine and Tableau on your
  laptop to use.
  - Visualise, interrogate or mash data
Monday, 7 March 2011
Books


   Kaiser Fung - Numbers Rule Your World
   Ben Goldacre - Bad Science
   Donna Wong - The WSJ Guide to
   Information Graphics
   Brian Suda - A Practical Guide to
   Designing with Data
Monday, 7 March 2011

Contenu connexe

Similaire à Data Journalism 2: Interrogating, Visualising and Mashing

Data Journalism 2: cleaning, combining, communicating
Data Journalism 2: cleaning, combining, communicatingData Journalism 2: cleaning, combining, communicating
Data Journalism 2: cleaning, combining, communicatingPaul Bradshaw
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Olivier Dobberkau
 
Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Philip Bourne
 
Android Development Slides
Android Development SlidesAndroid Development Slides
Android Development SlidesVictor Miclovich
 
Choosing the right Content Management System
Choosing the right Content Management SystemChoosing the right Content Management System
Choosing the right Content Management SystemRachel Andrew
 
Data Driven Innovation
Data Driven InnovationData Driven Innovation
Data Driven Innovationideas.org
 
Data Driven Innovation
Data Driven InnovationData Driven Innovation
Data Driven InnovationSimon Grice
 
IAT334-Lec02-TaskAnalysis.pptx
IAT334-Lec02-TaskAnalysis.pptxIAT334-Lec02-TaskAnalysis.pptx
IAT334-Lec02-TaskAnalysis.pptxssuseraae9cd
 
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...Ronald Ashri
 
Reasoning over big data
Reasoning over big dataReasoning over big data
Reasoning over big dataOSTHUS
 
"The Reality of Digital Science"
"The Reality of Digital Science""The Reality of Digital Science"
"The Reality of Digital Science"Kaitlin Thaney
 
Koss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser appsKoss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser appsEvil Martians
 
Atlassian RoadTrip 2011 Slide Deck
Atlassian RoadTrip 2011 Slide DeckAtlassian RoadTrip 2011 Slide Deck
Atlassian RoadTrip 2011 Slide DeckAtlassian
 
Engineering Software Engineering Teams - SSE 2011
Engineering Software Engineering Teams - SSE 2011Engineering Software Engineering Teams - SSE 2011
Engineering Software Engineering Teams - SSE 2011Patrick Wagstrom
 
IAT334-Lec08-Experiment.pptx
IAT334-Lec08-Experiment.pptxIAT334-Lec08-Experiment.pptx
IAT334-Lec08-Experiment.pptxssuseraae9cd
 

Similaire à Data Journalism 2: Interrogating, Visualising and Mashing (20)

Data Journalism 2: cleaning, combining, communicating
Data Journalism 2: cleaning, combining, communicatingData Journalism 2: cleaning, combining, communicating
Data Journalism 2: cleaning, combining, communicating
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
 
Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020Open Data Driven Scholarly Communication in 2020
Open Data Driven Scholarly Communication in 2020
 
Android Development Slides
Android Development SlidesAndroid Development Slides
Android Development Slides
 
Messaging patterns
Messaging patternsMessaging patterns
Messaging patterns
 
Choosing the right Content Management System
Choosing the right Content Management SystemChoosing the right Content Management System
Choosing the right Content Management System
 
Data Driven Innovation
Data Driven InnovationData Driven Innovation
Data Driven Innovation
 
Data Driven Innovation
Data Driven InnovationData Driven Innovation
Data Driven Innovation
 
IAT334-Lec02-TaskAnalysis.pptx
IAT334-Lec02-TaskAnalysis.pptxIAT334-Lec02-TaskAnalysis.pptx
IAT334-Lec02-TaskAnalysis.pptx
 
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...
How to Make Entities and Influence Drupal - Emerging Patterns from Drupal Con...
 
Reasoning over big data
Reasoning over big dataReasoning over big data
Reasoning over big data
 
Mahout classifier tour
Mahout classifier tourMahout classifier tour
Mahout classifier tour
 
Ufi Keynote 10 Feb
Ufi Keynote 10 FebUfi Keynote 10 Feb
Ufi Keynote 10 Feb
 
"The Reality of Digital Science"
"The Reality of Digital Science""The Reality of Digital Science"
"The Reality of Digital Science"
 
ITP / SED Day 2
ITP / SED Day 2ITP / SED Day 2
ITP / SED Day 2
 
Koss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser appsKoss, How to make desktop caliber browser apps
Koss, How to make desktop caliber browser apps
 
STI Summit 2011 - Linked services
STI Summit 2011 - Linked servicesSTI Summit 2011 - Linked services
STI Summit 2011 - Linked services
 
Atlassian RoadTrip 2011 Slide Deck
Atlassian RoadTrip 2011 Slide DeckAtlassian RoadTrip 2011 Slide Deck
Atlassian RoadTrip 2011 Slide Deck
 
Engineering Software Engineering Teams - SSE 2011
Engineering Software Engineering Teams - SSE 2011Engineering Software Engineering Teams - SSE 2011
Engineering Software Engineering Teams - SSE 2011
 
IAT334-Lec08-Experiment.pptx
IAT334-Lec08-Experiment.pptxIAT334-Lec08-Experiment.pptx
IAT334-Lec08-Experiment.pptx
 

Plus de Paul Bradshaw

How to work with a bullshitting robot
How to work with a bullshitting robotHow to work with a bullshitting robot
How to work with a bullshitting robotPaul Bradshaw
 
How to generate a 100+ page website using parameterisation in R
How to generate a 100+ page website using parameterisation in RHow to generate a 100+ page website using parameterisation in R
How to generate a 100+ page website using parameterisation in RPaul Bradshaw
 
ChatGPT (and generative AI) in journalism
ChatGPT (and generative AI) in journalismChatGPT (and generative AI) in journalism
ChatGPT (and generative AI) in journalismPaul Bradshaw
 
Data journalism: history and roles
Data journalism: history and rolesData journalism: history and roles
Data journalism: history and rolesPaul Bradshaw
 
Working on data stories: different approaches
Working on data stories: different approachesWorking on data stories: different approaches
Working on data stories: different approachesPaul Bradshaw
 
Visual journalism: gifs, emoji, memes and other techniques
Visual journalism: gifs, emoji, memes and other techniquesVisual journalism: gifs, emoji, memes and other techniques
Visual journalism: gifs, emoji, memes and other techniquesPaul Bradshaw
 
Using narrative structures in shortform and longform journalism
Using narrative structures in shortform and longform journalismUsing narrative structures in shortform and longform journalism
Using narrative structures in shortform and longform journalismPaul Bradshaw
 
Narrative and multiplatform journalism (part 1)
Narrative and multiplatform journalism (part 1)Narrative and multiplatform journalism (part 1)
Narrative and multiplatform journalism (part 1)Paul Bradshaw
 
Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Paul Bradshaw
 
Data journalism on the air: 3 tips
Data journalism on the air: 3 tipsData journalism on the air: 3 tips
Data journalism on the air: 3 tipsPaul Bradshaw
 
7 angles for data stories
7 angles for data stories7 angles for data stories
7 angles for data storiesPaul Bradshaw
 
Uncertain times, stories of uncertainty
Uncertain times, stories of uncertaintyUncertain times, stories of uncertainty
Uncertain times, stories of uncertaintyPaul Bradshaw
 
Ergodic education (online teaching and interactivity)
Ergodic education (online teaching and interactivity)Ergodic education (online teaching and interactivity)
Ergodic education (online teaching and interactivity)Paul Bradshaw
 
Storytelling in the database era: uncertainty and science reporting
Storytelling in the database era: uncertainty and science reportingStorytelling in the database era: uncertainty and science reporting
Storytelling in the database era: uncertainty and science reportingPaul Bradshaw
 
Cognitive bias: a quick guide for journalists
Cognitive bias: a quick guide for journalistsCognitive bias: a quick guide for journalists
Cognitive bias: a quick guide for journalistsPaul Bradshaw
 
The 3 chords of data journalism
The 3 chords of data journalismThe 3 chords of data journalism
The 3 chords of data journalismPaul Bradshaw
 
Data journalism: what it is, how to use data for stories
Data journalism: what it is, how to use data for storiesData journalism: what it is, how to use data for stories
Data journalism: what it is, how to use data for storiesPaul Bradshaw
 
Teaching AI in data journalism
Teaching AI in data journalismTeaching AI in data journalism
Teaching AI in data journalismPaul Bradshaw
 
10 ways AI can be used for investigations
10 ways AI can be used for investigations10 ways AI can be used for investigations
10 ways AI can be used for investigationsPaul Bradshaw
 
Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Paul Bradshaw
 

Plus de Paul Bradshaw (20)

How to work with a bullshitting robot
How to work with a bullshitting robotHow to work with a bullshitting robot
How to work with a bullshitting robot
 
How to generate a 100+ page website using parameterisation in R
How to generate a 100+ page website using parameterisation in RHow to generate a 100+ page website using parameterisation in R
How to generate a 100+ page website using parameterisation in R
 
ChatGPT (and generative AI) in journalism
ChatGPT (and generative AI) in journalismChatGPT (and generative AI) in journalism
ChatGPT (and generative AI) in journalism
 
Data journalism: history and roles
Data journalism: history and rolesData journalism: history and roles
Data journalism: history and roles
 
Working on data stories: different approaches
Working on data stories: different approachesWorking on data stories: different approaches
Working on data stories: different approaches
 
Visual journalism: gifs, emoji, memes and other techniques
Visual journalism: gifs, emoji, memes and other techniquesVisual journalism: gifs, emoji, memes and other techniques
Visual journalism: gifs, emoji, memes and other techniques
 
Using narrative structures in shortform and longform journalism
Using narrative structures in shortform and longform journalismUsing narrative structures in shortform and longform journalism
Using narrative structures in shortform and longform journalism
 
Narrative and multiplatform journalism (part 1)
Narrative and multiplatform journalism (part 1)Narrative and multiplatform journalism (part 1)
Narrative and multiplatform journalism (part 1)
 
Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)Teaching data journalism (Abraji 2021)
Teaching data journalism (Abraji 2021)
 
Data journalism on the air: 3 tips
Data journalism on the air: 3 tipsData journalism on the air: 3 tips
Data journalism on the air: 3 tips
 
7 angles for data stories
7 angles for data stories7 angles for data stories
7 angles for data stories
 
Uncertain times, stories of uncertainty
Uncertain times, stories of uncertaintyUncertain times, stories of uncertainty
Uncertain times, stories of uncertainty
 
Ergodic education (online teaching and interactivity)
Ergodic education (online teaching and interactivity)Ergodic education (online teaching and interactivity)
Ergodic education (online teaching and interactivity)
 
Storytelling in the database era: uncertainty and science reporting
Storytelling in the database era: uncertainty and science reportingStorytelling in the database era: uncertainty and science reporting
Storytelling in the database era: uncertainty and science reporting
 
Cognitive bias: a quick guide for journalists
Cognitive bias: a quick guide for journalistsCognitive bias: a quick guide for journalists
Cognitive bias: a quick guide for journalists
 
The 3 chords of data journalism
The 3 chords of data journalismThe 3 chords of data journalism
The 3 chords of data journalism
 
Data journalism: what it is, how to use data for stories
Data journalism: what it is, how to use data for storiesData journalism: what it is, how to use data for stories
Data journalism: what it is, how to use data for stories
 
Teaching AI in data journalism
Teaching AI in data journalismTeaching AI in data journalism
Teaching AI in data journalism
 
10 ways AI can be used for investigations
10 ways AI can be used for investigations10 ways AI can be used for investigations
10 ways AI can be used for investigations
 
Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)
 

Dernier

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 

Dernier (20)

Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

Data Journalism 2: Interrogating, Visualising and Mashing

  • 1. Data 2: Interrogating, visualising, mashing Online Journalism City University Paul Bradshaw Monday, 7 March 2011
  • 2. Themes 5 things you need to know about each Data journalism in action Walkthrough Monday, 7 March 2011
  • 3. Interrogating data . Monday, 7 March 2011
  • 5. 5 things you need to know about interrogating data 1. Data always needs cleaning up 2. Treat the ‘source’ like a source 3. Use the right ‘average’ and percentage 4. Variation over time & space: context 5. Spreadsheet tools are your friend - but always backup copies Monday, 7 March 2011
  • 7. “What the Independent have done is confuse the UK’s deficit with our debt [making] the debt problem look around eight times worse than it is. And it used the whole of its front page to do so.” - James Ball Monday, 7 March 2011
  • 9. What is the data worth? Measurement doesn't answer anything if there's only one variable Statistical significance Sample size and selection Controls and the placebo effect Read up. Monday, 7 March 2011
  • 10. 1. Variance is interesting. 2. Variance is different for different variables and in different populations. 3. The amount of variance is easily quantified. - Philip Meyer, Precision Journalism Monday, 7 March 2011
  • 11. Getting data in the right form Data > Text to columns Find & replace Conditional formulas: =IF(condition, if met, if not) =COUNTIF(range, test) Monday, 7 March 2011
  • 12. Walkthrough: cleaning data in Google Refine Edit cells > common transforms Edit cells > split multi-valued cells Facet > text facet Export... Monday, 7 March 2011
  • 13. Visualising data . Monday, 7 March 2011
  • 14. 5 things you need to know about visualising data 1. Choose the chart for the purpose 2. It can be used to spot a lead 3. Good design is when there’s nothing more to take away 4. It should be self-contained & have refs 5. Be careful with scales and classes Monday, 7 March 2011
  • 18. What is wrong with this picture? Monday, 7 March 2011
  • 22. Visualisation tools ManyEyes Tableau Wordle, Tagxedo BatchGeo Gephi Delicious.com/paulb/visualisation+tools Monday, 7 March 2011
  • 23. Walkthrough: visualising data with Google Gadgets . Monday, 7 March 2011
  • 24. Walkthrough: visualising data in ManyEyes . Monday, 7 March 2011
  • 25. Mashing data . Monday, 7 March 2011
  • 26. 5 things you need to know about mashing data 1. It is what a journalist does best 2. Look for a point of connection: place? Person? Company? Date? 3. What an API can do 4. What APIs there are 5. Mashups can be live, updated or static Monday, 7 March 2011
  • 29. Mashup tools Yahoo! Pipes OpenHeatMap Mapalist xFruits Scraperwiki Maptube Monday, 7 March 2011
  • 30. Walkthrough: making mashups with Yahoo! Pipes Inputs - Fetch Feed, CSV, Data, Page, YQL, Flickr, Form Operators - Filter, Sort, Unique, Union, Count, Split, Rename, Regex, Unique, Location extractor, URL Builder Outputs - Map, Gallery, List, XML, KML Monday, 7 March 2011
  • 31. Walkthrough: making mashups with OpenHeatMap Format the spreadsheet Publish it as CSV Copy link Paste it at OpenHeatMap Fix any problems Monday, 7 March 2011
  • 32. Walkthrough: grabbing geo data with Google Refine Edit column > Add column by fetching URLs Use GREL (Google Refine Expression Language) Search web for help & examples Monday, 7 March 2011
  • 33. Questions? . Monday, 7 March 2011
  • 34. Links OnlineJournalismClasses.tumblr.com Delicious.com/paulb/cityoj09 Delicious.com/paulb/datajournalism Delicious.com/paulb/visualisation Delicious.com/paulb/statistics Delicious.com/paulb/mashups Monday, 7 March 2011
  • 35. Lab Before the lab: play with these techniques yourself, have problems, find solutions, raise questions. Install Google Refine and Tableau on your laptop to use. - Visualise, interrogate or mash data Monday, 7 March 2011
  • 36. Books Kaiser Fung - Numbers Rule Your World Ben Goldacre - Bad Science Donna Wong - The WSJ Guide to Information Graphics Brian Suda - A Practical Guide to Designing with Data Monday, 7 March 2011