#IJF14 Data journalism in a hostile political environment
1. Data Journalism in a hostile political
environment
LA NACION
Momi Peralta Ramos
@momiperalta
LA NACION DATA
www.lanacion.com.ar/data
2. About LA NACION
• Based in Buenos Aires,
Argentina
• Sunday print circulation:
+ 330.000
• www.lanacion.com
unique visitors/month:
+ 15MM
• 9 magazine titles
• Impremedia (90%): US hisp.
leading publishing company
3. LA NACION Data
It s LA NACION s initiative to develop data
journalism and contribute to opening data in
Argentina
6. Argentina s Official advertising funds distribution
2009 – 2013: Friends, politicians and… a stylist!
50% of this amount went
to 10 media groups …
the ones closer to
Government.
In the last period even a
hairdresser (stylist)
received more
advertising money than
the largest newspapers
in Argentina.
7.
8. Why DATA?
- Data is a new raw material for journalism
- Activate demand of public information
- Discover stories hidden in datasets
- Allow citizen´s collaboration
- It is the future of journalism
@momiperalta
CHALLENGE STATUS QUO
9. HOW...is this possible??
• EXCUSES:
– There is NO DATA or DATA is not credible
– We are not the US or the UK in terms or
transparency
– We DON’T have programmers in our newsroom
– We DON’T have skills in our newsroom to gather
or analize datasets
– We don’t… we don’t…
KILLING THIS SCKEPTICISM
ONE BY ONE
10. 1. NEVER STOP LEARNING!
• Learn free online in MOOCs , webinars, blogs, books
• Go to conferences or follow them online.
• ONA 2010 was our first inspiration into dataj, a pre-
conference workshop in ONA.
• Become a member. Subscribe to DDJ Lists.
12. 3. START CREATING DATASETS,
START SMALL
…BE
HUMBLE,
BECOME A
DATA
BUILDER
13. 4. THE TEAM. look around…
• The TEAM is your ENGINE, first HEART then BRAINS…
The perfect team…
DEVELOPER JOURNALIST
IMAGE from Scraperwiki
https://scraperwiki.com/
@momiperalta
DESIGNERDATA MINER
28. SENATE EXPENSES 2004 – 2013
@momiperalta
Processed
more than
34.000
scanned
image PDFs
to build a
structured
dataset,
published
front page
stories now
being
investigated
by justice
…….
32. Senate Expenses – Team Video
http://www.youtube.com/watch?v=qEZ2xMwPMWo&feature=youtu.be
33. La Plata City Major
Floodings
(April 2013)
• Collaborative Tools:
Google Spreadsheets
Google Maps
Google Fusion Tables
• Other tools: Excel, Tableau
Public.
34. Hypothesis: Gov was hiding real number of deaths to
diminish impact of its own responsabilities
• We got 150 copies of
handwritten death certificates in
La Plata for April (1st-15th).
• We made a database model,
typed each case details into a
spreadsheet, then ordered,
filtered, analysed…
@fcoel - @momiperalta
35. • Visualizations for time & place
helped us confirm that most
deaths happened between April
2nd and 4th (or were directly
related) and many were located
over water streams running
under the city and/or flooded
blocks.
36. Impact. Starting from 51 deaths …
One day after publishing: A judge
confirms 60 deaths due to major
floodings
45 days after: 78 deaths officially
confirmed
37. http://youtu.be/a56fWexw8uo <- Meet the team. Journalist,
dataminer, programmer, designer, data producer and me, multitasker.
Goodie for later!
Team explains how collaboration worked
38. Open Assets Declarations. News App in
collaboration with 3 transparency NGOs,
Collaborative Tools:
Google Spreadsheets
Trello
Document Cloud
Team: around 45
LN staff, NGOs staff
+ 30 volunteers
39. Before & After
Public servant declaration of assets.
In total we typed 15.000 rows x 28 cols
44. Opened +30 datasets and made “Dataset cheat sheets” to make them
accessible and ready for analysis with data mining techniques or data
visualization in our first DATAFEST 2012 and 2013.
• Organized by LA NACION and UNIVERSIDAD AUSTRAL Masters Degree in
Data Mining and Universidad Austral Communications Faculty.
45. Journalists explain
raw DATAsets
content and what
could be asked
Dataminers ,
developers and
statistitians help
solve this questions
adding value to the
datasets
47. Cheat Sheet : Subsidies for the Public Bus Transport System – Cleaned ,
normalized and open DATA
48. keep SELF MOTIVATED
IF LOCAL FRUSTRATION?
No resources, no Foi, no Data?, no mindset?!...
THEN GLOBAL INSPIRATION!
Same world regarding technology, talent,
and the explosion of open knowledge and
digital data.