1. Digital methods and tools
for data journalism
Hacking Journalism @SETUP, Utrecht, 23 March 2011
Anne Helmond, PhD candidate at the Digital Methods
Initiative, University of Amsterdam.
2. The Digital Methods Initiative is a contribution to
doing research into the "natively digital" where the
focus is on how methods may change, however
slightly or wholesale, owing to the technical
specificities of new media.
6. research question -> operationalization &
method: often a chain of tools
cablegate
7. The Response of the Source
Question: Do the sources acknowledge cablegate?
1. Get all links for US embassy websites:
http://www.usembassy.gov/
2. Compile list of embassies mentioned in
cables: http://wikileaks.ch/cablegate.html
3. Compare lists
4. Query mentioned embassies for:
wikileaks
"Julian Assange"
Assange
cablegate
5. Get number of cables per embassy
6. Visualize output
24. tool: issuegeographer
A2K Network Geography, April 2007. Data by the Issuecrawler. Visualization by the Issuegeographer. Annotated southern organizations and places.
29. kthxbai!
anne@digitalmethods.net
www.digitalmethods.net
tool wizard: Erik Borra
Editor's notes
Digital Methods is a term coined as a counter-point to virtual methods, which typically digitize existing methods and port them onto the Web. Digital Methods, contrariwise, seek to learn from the methods built into the dominant devices online, and repurpose them for social and cultural research. That is, the challenge is to study the info-web and the social web with the tools that organize them.
There is a general protocol to digital methods. At the outset stock is taken of the natively digital objects that are available (links, tags, threads, etc.) and how devices such as search engines make use of them. Can the device techniques be repurposed, for example by remixing the digital objects they take as inputs?
research question, methodology, tools
tools built on top of dominant devices
Wikileaks began on Sunday November 28th publishing 251,287 leaked United States embassy cables, the largest set of confidential documents ever to be released into the public domain. The documents will give people around the world an unprecedented insight into US Government foreign activities. The cables, which date from 1966 up until the end of February this year, contain confidential communications between 274 embassies in countries throughout the world and the State Department in Washington DC. 15,652 of the cables are classified Secret.
An embassy is a diplomatic mission of one country in another country. What was the embassy website's "public face" around the time of a particular cable? Correlate cable content with embassy content. Do embassies acknowledge {wikileaks, assange, cablegate}?
My favorite tool! Capture all internal links and/or outlinks from a page. Output options: internal links, outlinks, expand Wayback URLs.
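What such a link-capturing step does can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not the tool shown on the slide: the names `LinkCollector` and `split_links` are made up for this example, and "internal" is assumed to mean "same host as the page".

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def split_links(html, page_url):
    """Return (internal_links, outlinks) as sorted lists of absolute URLs.

    Assumption: a link is internal when its host equals the host of page_url.
    """
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlparse(page_url).netloc.lower()
    internal, outlinks = set(), set()
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        if parsed.netloc.lower() == page_host:
            internal.add(absolute)
        else:
            outlinks.add(absolute)
    return sorted(internal), sorted(outlinks)
```

The function takes HTML that has already been downloaded, so it can be run on saved pages as well as live ones.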
CLEAN DATA. To do: copy-paste to a spreadsheet and clean the data.
Do the same for the Wikileaks cablegate page. Put both lists in a spreadsheet.
1. Official embassies 2. Embassies mentioned on the Wikileaks website 3. Which appear on both?
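The comparison of the two lists is a set intersection. The sketch below assumes both lists have already been cleaned into plain embassy names; `normalize` and `compare_lists` are illustrative names, and the normalization (lowercasing, collapsing whitespace) is only a guess at what the cleaning step needs.

```python
def normalize(name):
    """Lowercase and collapse whitespace so spreadsheet variants of a name match."""
    return " ".join(name.lower().split())


def compare_lists(official, mentioned):
    """Return names on both lists, only on the official list, and only in the cables list."""
    official_set = {normalize(n) for n in official}
    mentioned_set = {normalize(n) for n in mentioned}
    return {
        "both": sorted(official_set & mentioned_set),
        "official_only": sorted(official_set - mentioned_set),
        "mentioned_only": sorted(mentioned_set - official_set),
    }
```

The `both` entry answers question 3; the two remaining entries show what each source has that the other lacks.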
4. Query mentioned US embassy websites for: wikileaks, "Julian Assange", Assange, cablegate
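One way to prepare this step is to generate a query per embassy site and keyword. The sketch below assumes the queries go to a search engine that supports a `site:` operator; the slides do not say which query tool was used, and `build_queries` is a name invented for this example.

```python
from urllib.parse import urlparse

# The four keywords listed on the slide.
KEYWORDS = ["wikileaks", '"Julian Assange"', "Assange", "cablegate"]


def build_queries(embassy_urls, keywords=KEYWORDS):
    """Return one 'site:<host> <keyword>' query string per embassy and keyword."""
    queries = []
    for url in embassy_urls:
        host = urlparse(url).netloc
        for keyword in keywords:
            queries.append(f"site:{host} {keyword}")
    return queries
```

The resulting strings can be pasted into a search box or fed to whatever query tool is at hand.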
Get the number of leaks per embassy using the LeakFeed API. Create a dynamically updating spreadsheet in Google Docs.
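Once cable records are available locally, the per-embassy count is a simple tally. The sketch below does not call the LeakFeed API; it assumes the records have already been fetched as a list of dictionaries, and the field name `origin` is a hypothetical placeholder for whichever field holds the embassy.

```python
from collections import Counter


def cables_per_embassy(records, key="origin"):
    """Count records per embassy, most frequent first.

    'origin' is a hypothetical field name; records lacking it are skipped.
    """
    counts = Counter(r[key] for r in records if r.get(key))
    return counts.most_common()
```

The returned list of (embassy, count) pairs can be written out as rows for a spreadsheet.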
Check which ones were acknowledged and why. Qualitative research.
Enter starting points. Very important: use an expert list or similar. This is Bloghelden.
wait
If we check all the blogs listed in the Loglijst for their response code, or put differently, check to see if they are still online and alive, we notice that many blogs have disappeared.
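A response-code check of this kind can be sketched with the Python standard library. The function names are illustrative, and the rule for "disappeared" (no response at all, or HTTP 404 or 410) is an assumption rather than the criterion used in the original study.

```python
import urllib.error
import urllib.request


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or None if no response was received."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # the server answered with an error status such as 404
    except (urllib.error.URLError, OSError):
        return None  # DNS failure, refused connection, or timeout


def check_blogs(urls, fetch=fetch_status):
    """Map each URL to its status code; fetch is injectable so it can be stubbed."""
    return {url: fetch(url) for url in urls}


def disappeared(statuses):
    """Return URLs that gave no response or a 404/410 status (assumed criterion)."""
    return sorted(u for u, code in statuses.items() if code in (None, 404, 410))
```

Some servers reject HEAD requests, so a status such as 405 may call for a follow-up GET before a blog is counted as gone.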
Geo-locates the organizations on an Issue Crawler map, using whois information, and visualizes the organizations' registered locations on a geographical map.
An underinterrogated aspect of Wikipedians' vigilance is the robots, or bots: software that often automatically monitors Wikipedia pages for changes. Thus, initially, we are interested in the overall picture. How many edits are made by humans, and how many by bots? Having noticed the great discrepancy between bot activity in English and the other language versions of Wikipedia, we became interested in the level of bot activity per language.
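The humans-versus-bots question reduces to a share per language version. The sketch below assumes the edits are already available as (language, is_bot) pairs; how a bot is identified (for example by a bot flag on the account) is left outside the example, and `bot_share_per_language` is an illustrative name.

```python
from collections import defaultdict


def bot_share_per_language(edits):
    """edits: iterable of (language, is_bot) pairs.

    Returns {language: fraction of edits made by bots}.
    """
    totals = defaultdict(int)
    bots = defaultdict(int)
    for language, is_bot in edits:
        totals[language] += 1
        if is_bot:
            bots[language] += 1
    return {language: bots[language] / totals[language] for language in totals}
```

Sorting the result by share gives a direct comparison between language versions.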
comparative maps
DEVICES REQUIRE CUSTOM TOOLS! Twitter goes back two weeks. How to create a story? For the people of Iran - #iranelection RT is an exercise in transforming the supposed banality of Twitter into a machine that recounts events on the ground and in social media. #iranelection RT is a collection of all the tweets that have been tagged #iranelection from the first one on 10 June up to 30 June 2009, some 650,000 in all. The most “retweeted” tweets (RTs) have been filtered and organised chronologically, as opposed to the reverse chronology that Twitter uses. In “reversed realtime” the most significant #iranelection retweets show the urgency and the emotion of those twenty days in June, when the tensions on the streets and the coverage in the media were at their height. The collection of tweets also shows how tweeters respond to what is happening online and on the ground. Tweets reporting important websites blocked are followed up by proxies being offered. Accounts of police using pepper spray are followed up by links to websites with first aid information.
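The filter-and-reorder step described above can be sketched as follows. This is an illustration, not the code behind #iranelection RT: it assumes tweets are (timestamp, text) pairs with sortable timestamps, that retweets use the old "RT @user" prefix, and that a retweet is placed at the time it was first seen.

```python
import re
from collections import Counter

# Old-style retweet: "RT @user: original text"
RT_PATTERN = re.compile(r"^RT @\w+:?\s*(.+)$", re.DOTALL)


def top_retweets_chronological(tweets, top_n=10):
    """tweets: list of (timestamp, text) pairs.

    Returns (first_seen, text, count) for the top_n most retweeted texts, oldest first.
    """
    counts = Counter()
    first_seen = {}
    for timestamp, text in tweets:
        match = RT_PATTERN.match(text.strip())
        if not match:
            continue  # not a retweet in the "RT @user" style
        original = match.group(1).strip()
        counts[original] += 1
        if original not in first_seen or timestamp < first_seen[original]:
            first_seen[original] = timestamp
    top = [text for text, _ in counts.most_common(top_n)]
    # Sort by first appearance: chronological rather than Twitter's reverse order.
    return sorted((first_seen[t], t, counts[t]) for t in top)
```

Grouping by exact text is crude, since edited retweets count as separate items; a real collection would need fuzzier matching.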
Whereas the host domain is available, the Persian section of BBC News is blocked.