CRESI seminar by Arnaldo Barreto & Ben Anderson from Department of Sociology, University of Essex on 18 February 2010.
Abstract:
In this talk we will outline some of the potential sociological research value in transactional (‘radical’) data and present for discussion some preliminary analysis of both internet usage log data from several Brazilian institutions and from UK households. We will also present some exploratory analysis of a large corpus of UK telephone call records from the late 1990s. In these cases we will offer few conclusions but rather hope to generate discussion of their potential value in sociological research and of the ethical dilemmas that surround their collection and (re)use.
9. Ethics/Law and Data BT 100,000 data BT/Essex Home OnLine Panel Data Brazil Internet Data Consent to collect? As part of commercial service provision – monitoring/research? Yes (but not the third party!) As part of commercial service provision – monitoring/research? Consent to link? Consent for future research? And by whom? Risk of ‘disclosure’
10. Ethics/Law and Data BT 100,000 data BT/Essex Home OnLine Panel Data Brazil Internet Data Consent to collect? As part of commercial service provision – monitoring/research? Yes (but not the third party!) As part of commercial service provision – monitoring/research? Consent to link? As above (to customer data – postcode, billing flags etc) but no ‘sensitive personal data’ Yes (to survey data) – contains ‘sensitive personal data’ N/A Consent for future research? And by whom? Risk of ‘disclosure’
11. Ethics/Law and Data BT 100,000 data BT/Essex Home OnLine Panel Data Brazil Internet Data Consent to collect? As part of commercial service provision – monitoring/research? Yes (but not the third party!) As part of commercial service provision – monitoring/research? Consent to link? As above (to customer data – postcode, billing flags etc) but no ‘sensitive personal data’ Yes (to survey data) – contains ‘sensitive personal data’ N/A Consent for future research? And by whom? ? Yes ? Risk of ‘disclosure’
12. Ethics/Law and Data But big effort for what return? BT 100,000 data BT/Essex Home OnLine Panel Data Brazil Internet Data Consent to collect? As part of commercial service provision – monitoring/research? Yes (but not the third party!) As part of commercial service provision – monitoring/research? Consent to link? As above (to customer data – postcode, billing flags etc) but no ‘sensitive personal data’ Yes (to survey data) – contains ‘sensitive personal data’ N/A Consent for future research? And by whom? ? Yes ? Risk of ‘disclosure’ Medium – in principle could locate postcode and ‘ask around’ but it’s 10 years old and no ‘sensitive personal data’ Medium – could search on phone number & locate using date of birth/household characteristics? -> security applied Low – in principle could link to specific PC in organisation and so to a user but requires internal organisation data
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29. Why discuss Internet use? Source: comScore World Metrix 2007 Country Internet Penetration Monthly Unique Users ('000) Average Daily Users ('000) Average Usage Days per User per Month Average Monthly hours per User Average Monthly Pages per User Austria 53% 3,721 1,485 12 16.3 1,906 Belgium 54% 4,728 2,447 15.5 20.6 2,399 Denmark 68% 3,045 1,493 14.7 22 3,058 Finland 65% 2,818 1,544 16.4 29.7 3,749 France 51% 25,388 14,531 17.2 26.1 2,768 Germany 46% 32,578 18,359 16.9 22.6 2,807 Ireland 42% 1,365 591 13 18.9 1,871 Italy 36% 18,086 7,783 12.9 17.7 1,862 Netherlands 83% 11,292 7,35 19.5 27 3,131 Norway 70% 2,62 1,288 14.7 27.4 3,08 Portugal 44% 3,882 1,731 13.4 23.3 2,454 Russia 11% 13,255 5,048 11.4 13.3 1,695 Spain 39% 13,628 8,828 19.4 30.6 2,675 Sweden 70% 5,259 2,895 16.5 31.7 4,019 Switzerland 58% 3,666 1,846 15.1 22.7 2,676 UK 62% 31,15 21,767 21 34.4 3,44 Europe* 40% 221,463 121,774 16.5 24.1 2,662 US* 66% 156,697 114,472 21.9 31.4 2,826
30.
31.
32.
33.
34.
35.
36.
37.
38.
39. Just one example in Brazil... Research about e-Gov with ISP authorization
40. Just one example in Brazil... Research about e-Gov with ISP authorization
41.
42.
43.
Notes de l'éditeur
Industrial not as in the study of industry but as an industry in itself. Knowing Capitalism
To introduce the data…
i.e. why would anyone bother trying to identify people? Lots of effort for what return?
We only have between 5% and 25% of households in each LSOA And the calls include those made to businesses etc etc (need to remove hubs) Eagle, Claxton et al -> similar work using UK call records from 2005, submitted to Science(?)
Panel households knowing each other is not surprising – clustered sample so neighbours. Also discovered people apparently calling themselves = the way particular calling packages were billed (e.g. call-cards for children/students to call home free – goes on home bill)!
Matched to some extent by the time-use data form the time-use survey on same sample
Longitudinal social network analysis…
Contributes to triangulation – greater ‘transparency’ – no lies, no forgetfulness, no social construction EXCEPT in the coding/inerpretation of the data during analysis (“are they doing what we think they are doing?”) sociology, anthropology, administration, public policies, psychology, economics, public health... Maybe the most real picture of 21 st Century society?