SlideShare une entreprise Scribd logo
1  sur  36
Big Data for the Social Sciences:
The Landscape forWeb Observatories
David De Roure, Strategic Adviser for Data Resources @dder
Overview
1. Big Data for research (UK perspective)
2. Social Media Data is distinctive
3. A series of shifts in how scholarship is conducted
4. And hence the context for Web Observatories
Big Data doesn’t respect
disciplinary boundaries
Digital Social Research
Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research
Challenges.Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
theODI.org
Mandy Chessell
The Big Picture
More people
Moremachines
Big Data
Big Compute
Conventional
Computation
“Big Social”
Social Networks
e-infrastructure
online
R&D
Big Data
Production
& Analytics
deeply
about
society
RCUK and Big Data
▶ ‘Big data is a term for a collection of datasets so
large and complex that it is beyond the ability
of typical database software tools to capture,
store, manage, and analyse them. ‘Big’ is not defined
as being larger than a certain number of ‘bytes’
because as technology advances over time, the size
of datasets that qualify as big data will also increase’
(RCUK)
Research benefits of new data
▶ Undertaking research on pressing policy-related issues
without the need for new data collection
• Food consumption, social background and obesity
• Energy consumption, housing type and climatic conditions
• Rural location, private/public transport alternatives and
incomes
• School attainment, higher education participation, subject
choices, student debt and later incomes
▶ New data such as social media enable us to ask big questions,
about big populations, and in real time – this is
transformative
Big Data Network
Phase 1 and 2
Research questions
– Social and political
movements
– Political participation and
trust
– Individual,
group/community and
national identities
– Personal, local, national and
global security (including
crime, law enforcement and
defence)
– Rural development and
‘UrbanTransformations’
– Crisis prevention,
preparedness, response,
management and
recovery
– Education
– Health and wellbeing
(including ageing)
– Environment and
sustainability
– Economic growth and
financial markets
(including employment
and the labour market)
E-infrastructureLeadershipCouncil
Mandy Chessell
F i r s t
Interdisciplinary and “in the wild” *
* “in it” versus “on it”
Nigel Shadbolt et al
Real life is and must be full of all kinds of social
constraint – the very processes from which society
arises. Computers can help if we use them to create
abstract social machines on the Web: processes in
which the people do the creative work and the
machine does the administration...The stage is set for
an evolutionary growth of new social engines.The
ability to create new forms of social process would be
given to the world at large, and development would be
rapid.
Berners-Lee, Weaving the Web, 1999 (pp. 172–175)
The Order of Social Machines
SOCIAM:TheTheory and Practice of Social Machines is funded by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant
number EPJ017728/1 and comprises the Universities of Southampton, Oxford and Edinburgh. See sociam.org
A revolutionary idea…
Open Science!
Join the W3C Community Group www.w3.org/community/rosc
Jun Zhao
www.researchobject.org
Web as
lens
Web as
artefact
Web Observatories
http://www.w3.org/community/webobservatory/
Big data elephant versus sense-making network?
The challenge is to foster the co-constituted socio-technical
system on the right i.e. a computationally-enabled sense-making
network of expertise, data, models, visualisations and narratives
Iain Buchan
PipWillcoxPipWillcox
PipWillcox
The Observatory Context
▶ New forms of data enable us answer old questions in
new ways and to address entirely new questions
– Especially about (new) social processes
▶ There are multiple shifts occurring:
– Academia and business
– Volumes and velocity of data
– Realtime analytics
– Computational infrastructure
– Dataflows vs datasets (and curation infrastructure)
– Correlation vs causation
– Increasing automation and ethical implications
– Machine-to-Machine in Internet of Things
david.deroure@oerc.ox.ac.uk
www.oerc.ox.ac.uk/people/dder
@dder
Slide and image credits: Fiona Armstrong, Christine Borgman, Iain
Buchan, Mandy Chessell, Neil Chue Hong, Cat De Roure, Kevin
Page, Nigel Shadbolt, Pip Willcox, Jun Zhao, Guardian newspaper
www.oerc.ox.ac.uk
david.deroure@oerc.ox.ac.uk
@dder

Contenu connexe

Tendances

Metrics for web-native science - PLOS ALM
Metrics for web-native science - PLOS ALMMetrics for web-native science - PLOS ALM
Metrics for web-native science - PLOS ALM
Kaitlin Thaney
 
Citizen science
Citizen scienceCitizen science
Citizen science
samar1407
 
Making the web work for science - UND
Making the web work for science - UNDMaking the web work for science - UND
Making the web work for science - UND
Kaitlin Thaney
 
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
Katina Michael
 

Tendances (20)

New Forms of Data for e-Research
New Forms of Data for e-ResearchNew Forms of Data for e-Research
New Forms of Data for e-Research
 
Intersection Scale and Social Machines
Intersection Scale and Social MachinesIntersection Scale and Social Machines
Intersection Scale and Social Machines
 
Social Machines GSS
Social Machines GSSSocial Machines GSS
Social Machines GSS
 
Short and Long of Data Driven Innovation
Short and Long of Data Driven InnovationShort and Long of Data Driven Innovation
Short and Long of Data Driven Innovation
 
Social Machines IIIT
Social Machines IIITSocial Machines IIIT
Social Machines IIIT
 
New Data `New Computation
New Data `New ComputationNew Data `New Computation
New Data `New Computation
 
Ethics of Automation
Ethics of AutomationEthics of Automation
Ethics of Automation
 
2066 and all that
2066 and all that2066 and all that
2066 and all that
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
 
Metrics for web-native science - PLOS ALM
Metrics for web-native science - PLOS ALMMetrics for web-native science - PLOS ALM
Metrics for web-native science - PLOS ALM
 
New and Emerging Forms of Data
New and Emerging Forms of DataNew and Emerging Forms of Data
New and Emerging Forms of Data
 
Citizen science
Citizen scienceCitizen science
Citizen science
 
Making the web work for science - UND
Making the web work for science - UNDMaking the web work for science - UND
Making the web work for science - UND
 
Byu ISYS presentation_seminar
Byu ISYS presentation_seminarByu ISYS presentation_seminar
Byu ISYS presentation_seminar
 
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
Artificial Intelligence & Machine Learning. Is it Planet Saving Tech?
 
Advances in Digital Scholarship Moot
Advances in Digital Scholarship MootAdvances in Digital Scholarship Moot
Advances in Digital Scholarship Moot
 
Deepak data project
Deepak data projectDeepak data project
Deepak data project
 
The Human Intranet
The Human Intranet The Human Intranet
The Human Intranet
 
Data and the City workshop 2015
Data and the City workshop 2015Data and the City workshop 2015
Data and the City workshop 2015
 
OKCon 2008 - Lessons from Environmental information
OKCon 2008 - Lessons from Environmental informationOKCon 2008 - Lessons from Environmental information
OKCon 2008 - Lessons from Environmental information
 

En vedette

En vedette (13)

Humanities in the Digital World
Humanities in the Digital WorldHumanities in the Digital World
Humanities in the Digital World
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research Objects
 
Music Objects to Social Machines
Music Objects to Social MachinesMusic Objects to Social Machines
Music Objects to Social Machines
 
Social Machines of Scholarly Collaboration
Social Machines of Scholarly CollaborationSocial Machines of Scholarly Collaboration
Social Machines of Scholarly Collaboration
 
Working out the plot: the role of Stories in Social Machines
Working out the plot: the role of Stories in Social MachinesWorking out the plot: the role of Stories in Social Machines
Working out the plot: the role of Stories in Social Machines
 
citizens scale scholarly social machines
citizens scale scholarly social machinescitizens scale scholarly social machines
citizens scale scholarly social machines
 
Post-Digital Society
Post-Digital SocietyPost-Digital Society
Post-Digital Society
 
Executable Music Documents
Executable Music DocumentsExecutable Music Documents
Executable Music Documents
 
Taking IT for Granted
Taking IT for GrantedTaking IT for Granted
Taking IT for Granted
 
The Long and the Short of it: a history of Social Machines
The Long and the Short of it:a history of Social MachinesThe Long and the Short of it:a history of Social Machines
The Long and the Short of it: a history of Social Machines
 
Intersection Scale and Social Machines 2016
Intersection Scale and Social Machines 2016Intersection Scale and Social Machines 2016
Intersection Scale and Social Machines 2016
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and Scholarship
 
Imperial College London - journey to open scholarship
Imperial College London - journey to open scholarshipImperial College London - journey to open scholarship
Imperial College London - journey to open scholarship
 

Similaire à Social Science Landscape for Web Observatories

WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
Ramine Tinati
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
SMCFrance
 
Civic Algorithms: A digital intermediation challenge
Civic Algorithms: A digital intermediation challengeCivic Algorithms: A digital intermediation challenge
Civic Algorithms: A digital intermediation challenge
University of Sydney
 

Similaire à Social Science Landscape for Web Observatories (20)

Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Social Machines Paradigm
Social Machines ParadigmSocial Machines Paradigm
Social Machines Paradigm
 
Future of Scholarly Communications
Future of Scholarly CommunicationsFuture of Scholarly Communications
Future of Scholarly Communications
 
Scholarly Social Machines Essay
Scholarly Social Machines EssayScholarly Social Machines Essay
Scholarly Social Machines Essay
 
Taking IT for Granted - David De Roure
Taking IT for Granted - David De RoureTaking IT for Granted - David De Roure
Taking IT for Granted - David De Roure
 
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...The wider environment of open scholarship – Jisc and CNI conference 10 July ...
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
 
Social Machines Democratization
Social Machines DemocratizationSocial Machines Democratization
Social Machines Democratization
 
e-Research and the Demise of the Scholarly Article
e-Research and the Demise of the Scholarly Articlee-Research and the Demise of the Scholarly Article
e-Research and the Demise of the Scholarly Article
 
Scholarly Social Machines
Scholarly Social MachinesScholarly Social Machines
Scholarly Social Machines
 
Social Technologies for Informaticians and Researchers
Social Technologies for Informaticians and ResearchersSocial Technologies for Informaticians and Researchers
Social Technologies for Informaticians and Researchers
 
Big Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic WebBig Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic Web
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 
Cook et al
Cook et alCook et al
Cook et al
 
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
The machine in the ghost: a socio-technical perspective...
The machine in the ghost: a socio-technical perspective...The machine in the ghost: a socio-technical perspective...
The machine in the ghost: a socio-technical perspective...
 
Sdi, communities and social media
Sdi, communities and social mediaSdi, communities and social media
Sdi, communities and social media
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
 
Computational Social Science
Computational Social ScienceComputational Social Science
Computational Social Science
 
Civic Algorithms: A digital intermediation challenge
Civic Algorithms: A digital intermediation challengeCivic Algorithms: A digital intermediation challenge
Civic Algorithms: A digital intermediation challenge
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Social Science Landscape for Web Observatories

  • 1. Big Data for the Social Sciences: The Landscape forWeb Observatories David De Roure, Strategic Adviser for Data Resources @dder
  • 2. Overview 1. Big Data for research (UK perspective) 2. Social Media Data is distinctive 3. A series of shifts in how scholarship is conducted 4. And hence the context for Web Observatories
  • 3. Big Data doesn’t respect disciplinary boundaries Digital Social Research
  • 4. Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research Challenges.Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
  • 6.
  • 7.
  • 8.
  • 10. The Big Picture More people Moremachines Big Data Big Compute Conventional Computation “Big Social” Social Networks e-infrastructure online R&D Big Data Production & Analytics deeply about society
  • 11. RCUK and Big Data ▶ ‘Big data is a term for a collection of datasets so large and complex that it is beyond the ability of typical database software tools to capture, store, manage, and analyse them. ‘Big’ is not defined as being larger than a certain number of ‘bytes’ because as technology advances over time, the size of datasets that qualify as big data will also increase’ (RCUK)
  • 12.
  • 13. Research benefits of new data ▶ Undertaking research on pressing policy-related issues without the need for new data collection • Food consumption, social background and obesity • Energy consumption, housing type and climatic conditions • Rural location, private/public transport alternatives and incomes • School attainment, higher education participation, subject choices, student debt and later incomes ▶ New data such as social media enable us to ask big questions, about big populations, and in real time – this is transformative
  • 16. Research questions – Social and political movements – Political participation and trust – Individual, group/community and national identities – Personal, local, national and global security (including crime, law enforcement and defence) – Rural development and ‘UrbanTransformations’ – Crisis prevention, preparedness, response, management and recovery – Education – Health and wellbeing (including ageing) – Environment and sustainability – Economic growth and financial markets (including employment and the labour market)
  • 17.
  • 18.
  • 20.
  • 22.
  • 23. F i r s t
  • 24. Interdisciplinary and “in the wild” * * “in it” versus “on it”
  • 26. Real life is and must be full of all kinds of social constraint – the very processes from which society arises. Computers can help if we use them to create abstract social machines on the Web: processes in which the people do the creative work and the machine does the administration...The stage is set for an evolutionary growth of new social engines.The ability to create new forms of social process would be given to the world at large, and development would be rapid. Berners-Lee, Weaving the Web, 1999 (pp. 172–175) The Order of Social Machines
  • 27. SOCIAM:TheTheory and Practice of Social Machines is funded by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant number EPJ017728/1 and comprises the Universities of Southampton, Oxford and Edinburgh. See sociam.org
  • 29. Join the W3C Community Group www.w3.org/community/rosc Jun Zhao www.researchobject.org
  • 30. Web as lens Web as artefact Web Observatories http://www.w3.org/community/webobservatory/
  • 31. Big data elephant versus sense-making network? The challenge is to foster the co-constituted socio-technical system on the right i.e. a computationally-enabled sense-making network of expertise, data, models, visualisations and narratives Iain Buchan
  • 34. The Observatory Context ▶ New forms of data enable us answer old questions in new ways and to address entirely new questions – Especially about (new) social processes ▶ There are multiple shifts occurring: – Academia and business – Volumes and velocity of data – Realtime analytics – Computational infrastructure – Dataflows vs datasets (and curation infrastructure) – Correlation vs causation – Increasing automation and ethical implications – Machine-to-Machine in Internet of Things
  • 35. david.deroure@oerc.ox.ac.uk www.oerc.ox.ac.uk/people/dder @dder Slide and image credits: Fiona Armstrong, Christine Borgman, Iain Buchan, Mandy Chessell, Neil Chue Hong, Cat De Roure, Kevin Page, Nigel Shadbolt, Pip Willcox, Jun Zhao, Guardian newspaper