Jaimi Dowdell presents "Data Journalism for Business Reporting" during the free, daylong workshop, "Be a Better Business Watchdog -- CAR for Business Journalists."
For more information about free training for business journalists, please visit businessjournalism.org.
2. Investigative Reporters and Editors, Inc. is a grassroots nonprofit organization
dedicated to improving the quality of investigative reporting.
IRE was formed in 1975 to create a forum in which journalists throughout the
world could help each other by sharing story ideas, newsgathering
techniques and news sources.
3. Training
Resources
Networking
Conferences
Data services
6. More and more, records are
kept electronically.
We can use them to…
7. Uncover basic information. (Simple lookups)
Test government procedures, regulations and
officials’ promises.
Enterprise reporting – do stories that no one
else is doing.
In addition: Journalism that has data elements translates
incredibly well online.
8. Examples
The basics of spreadsheets
Searchable databases available online
Finding and downloading data online
Other databases that aren’t so readily available
Quick online data visualization tools
11. Star Tribune (Minneapolis)
Pioneer Press (St. Paul, MN)
Data: National Inventory of
Bridges
Findings: Following the
collapse of an I-35 bridge
spanning the Mississippi River in
Minneapolis, journalists,
including Dan Browning and
MaryJo Webster, turned to the
National Bridge Inventory
database, available from IRE
and NICAR, to check the
bridge's inspection history. The
Start Tribune and The Pioneer
Press reported that inspection
data from 2005 showed that the
Minnesota Department of
Transportation deemed the
bridge “structurally deficient.”
The Pioneer Press also noted a
federal reporting finding that
Minnesota had 3 percent of its
bridges rated deficient in 2006.
12.
13. Data stories that analyze procedures,
regulations, public safety and test
government officials’ promises. Many
of these are also great enterprise
stories.
14.
15.
16.
17.
18. “ESPN's "Outside the Lines" reviewed
health department inspection reports for
food and beverage outlets at all 107
North American arenas and stadiums
that were home to Major League
Baseball, National Football League,
National Hockey League and National
Basketball Association teams in 2009. At
30 of the venues (28 percent), more than
half of the concession stands or
restaurants had been cited for at least
one "critical" or "major" health violation.
Such violations pose a risk for foodborne
illnesses that can make someone sick, or,
in extreme cases, become fatal.”
19.
20.
21. Milwaukee Journal
Sentinel
Data: State hunting and
corrections data
Findings: Analyzing
state data on hunting
licenses, Ben Poston of
the Milwaukee Journal
Sentinel found that
dozens of convicted
felons in Wisconsin were
issued gun-deer hunting
licenses last year despite
a state law that bans
them for life from
possessing firearms.
Felons with armed
robbery, rape and
weapons convictions all
bought gun-deer
licenses in Wisconsin in
2006. A state legislator
proposed law to close
the license loophole.
24. Herald Tribune (Sarasota, FL)
Data: The newspaper reviewed more than
19 million Florida real estate transactions
to determine how much of the real-estate
bust had its root in housing fraud.
Findings: The year-long investigation
found that more than 50,000 Florida
properties flipped under suspicious
circumstances from 2000 through 2008.
Those flips artificially drove up housing
prices and tax bills and contributed to the
crush of foreclosures that has gutted the
real estate market. All over the state,
professional property flippers made
billions in profits on the back of concocted
land deals.
25.
26.
27.
28.
29. The difference between a spreadsheet and a
database.
Why use other software?
Options?
30. Remember – any time a biz interacts with the
government, information becomes public.
Ask for the document and/or data
retention schedule.
Missouri, Ohio State University
Get a tour.
Grab every blank form – both internal and
external – in all agencies you cover.
Meet the document and data clerks and the IT
folks.
31. FOIA – Open records logs
State and federal statutes
Appendix and footnotes to audits and reports
Inspectors General
State auditors
GAO reports (Government Accountability
Office)
Canada – Auditor general
32. Major Information Systems – request a list from
government agency: example US Marshall’s list
here.
Something in a “chart” format may mean that a
database exists: example California West Nile
Virus here.
33. Online forms to submit information:
example Wisconsin Insurance Complaint
form here.
Actually read a web site – don’t let it force
you into predefined roles: example North
Carolina Department of Revenue here.
(site map, search)
Search for keywords related to data:
download, database, information
system, submit, searchable, inspection,
enforcement, Excel, etc.
34. OSHA workplace safety inspections
Federal contracts (FPDS) (ex. Border
protection)
Recalls (look at tabs for topics)
Economy at a Glance from the Bureau of Labor
Statistics
Local Area Unemployment Statistics from the
BLS
EPA environmental data searches
35. Fatality Analysis Reporting System data (FARS)
National sex offender registry (
Department of Justice)
Transportation data from BTS
Aircraft data (service difficulty reports, on-time
data, aircraft registry, etc.)
NTSB Aviation Accidents (and more)
FAA accidents/incidents and service difficulty
reports (SDRs)
Flight tracker from flightaware
On-time statistics
36. Nursing home, hospital and home health data
from Medicare
Firearms data from the ATF
Migration data from the IRS
Tax data from IRS (tax exempt orgs)
Mine Safety and Health Administration (
MSHA)
National Agricultural Statistics Service
37. Consumer Reports
(pdf report)
•On-time by airport (pg 7)
•Departure by time (pg 13)
•Chronically delayed flights (pg 15)
•Tarmac delays (pg 25)
•Mishandled baggage (pg 29)
•Injuries and deaths involving pets
(49)
From Bureau of Transportation statistics
(Downloadable to Excel)
•Flight delays
•Bag fees by airline
•Chronically delayed flights
•Cancelled flights
38. American FactFinder for quick data profiles.
Datasets from IRE– download profiles,
comparisons and more.
Get information on the Census from ASU’s
Walter Cronkite School of Journalism and IRE.
39. Fedstats – A to Z list of statistical data from the
government.
Data.gov – Searchable catalogs of government
data. (search for business, contracts,
inspections, etc.)
USAspending.gov or FedSpending.org –
Federal spending data including grants, loans,
contracts, etc. (Ex. Coca Cola)
40. State Government Databases wiki from the
American Library Association
BRB Publications links to public records sites
41. There is no magic site.
Half the battle is knowing what exists then
figuring out how to search.
Never hesitate to request information you find
in a more up-to-date format.
Some searchable databases do not include the
entire dataset.
Clues for finding data are everywhere.
42. Search by domain:
.gov – government sites (other types: .edu, .org,
.com, .net, etc.)
Example – search for “Oil spill” “2010” and limit to
site:.gov
Search by file type:
.xls for spreadsheets; .txt, .csv for text files; .mdb,
.dbf for databas files
Example – Same search as above but limit to
filetype:xls
Search for words within a URL:
Examples: ftp, download, data, inspection,
enforcement, 2010, etc.
43. In Excel format – example IRS Exempt
Organizations here.
From the web – example California West Nile
Virus here.
In text format – example from data.gov search
on “inspection” here.
PDF to Excel – try cometdocs.com
44. Our datasets are “cleaned.”
All have national data. Some are sliceable by
state.
They are typically large – meant to be handled
in a database manager and not a spreadsheet.
(We can work with you on that, though.)
Some examples…
45. HMDA, enacted in 1975, requires all
banks, savings and loans, savings
banks and credit unions with assets
of more than $33 million and offices
in metropolitan areas to report
mortgage applications.
Track subprime loans
Look for discrimination
Find changes in your area since the housing crash
Look for trends
46. Data contain loans made to
businesses and individuals
as disaster assistance. The
data identifies the
borrower, the disaster, the
amount and, for business
borrowers, whether the
loan was paid in full or
deemed uncollectible.
Which disasters have hit your community hardest
Are there any people/businesses who’ve received multiple loans?
What’s the biggest loan?
How much hasn’t been collected?
How much won’t be collected?
47. How frequently are bridges being inspected?
How old are they?
What is their government-assigned status: 1=Structurally Deficient;
2=Functionally Obsolete; 0=Not Deficient; N=Not Applicable
What are the sufficiency ratings of bridges in your area?
Doesn’t seem like a business story? Think again. Look at areas that have
grown quickly and have lots of development. Are dams/bridges that were
previously thought harmless now in an area of dense development? How
does that change things?
48. IRS Tax Exempt Orgs database
•Get a list of all of the nonprofits in your community
•Find the biggest ones
•Find the ones the generate the most revenue
•Find the ones that generate the least revenue
•Use this information to investigate individual
nonprofits
•Example – Atlanta spreadsheet
•Get individual 990s from Guidestar or
Foundation Center
50. Dozens of national
data sets for local
enterprise stories
Links to stories based
on each data set
Links to tip sheets on
using the data
Find more
information at
data.nicar.org/data
There are a lot of stories out there that don ’t comes in a long series and didn’t take months to report – but they still have a great impact because they used data to tell the story. This is a recent story from the Kansas City Star about municipal court judges. The paper wanted to look at a long-standing rumor that the judges really didn’t work that much. They found a clever way to measure it --- the city’s parking garage kept a database of the comings and goings of all employees.
There are a wealth of things you can do with sports data, however some of the best and most unique will be cases where you ’ll have to build your own database. At the local level some ideas include: salaries of area high school football coaches; revenues to athletic programs from booster clubs. While I was at USA TODAY we collected data on state championship teams across the US – to make it manageable we took a “sample” -- then assigned each school to an income quartile based on several income factors. In the end we were able to say that schools in the top income quartile win state championships at more than twice the rate of those in the bottom quartile. This is something that could be replicated (and has been by other news organizations) at a state or regional level too.
Example for retention sked: disaster preparedness drills: 259 Disaster Preparedness Drill Records This record series consists of the results of disaster preparedness exercises and supporting documents including scenarios, location of safety related drills, timetables, response times, probable outcomes, areas of difficulty, descriptions of how difficulties were resolved, and areas for improvement. The types of drills include, but are not limited to, fire, tornado, safety, hurricane, and SARA (Superfund Amendments and Reauthorization Act) chemical spills. Section 252.365(3)(b), F.S., requires state agencies to include in their disaster preparedness plans “schedules and procedures for periodic tests, training, and exercises.” Section 252.38, F.S., authorizes counties and municipalities to “develop an emergency management plan and program that is coordinated and consistent with the state comprehensive emergency management plan and program.” See also “DISASTER PREPAREDNESS PLANS,” “DIRECTIVES/POLICIES/PROCEDURES,” and “INSPECTION RECORDS: FIRE/SECURITY/SAFETY.” 2 calendar years provided reviews have been conducted.