Big data, bad data -- Closing keynote at the Open World Forum 2013

Rayna Stamboliyska
Rayna StamboliyskaInnovative data-driven strategist: actionable expertise for complex issues à Désidédata
Big Data – Bad Data
Rayna Stamboliyska, PhD
Paris Descartes University
RS Strategy
XXXL
Data
XXXL
Issues
300 years BC: "of
making many books
there is no end"
1935-1937,
Social Security
Act
1943-1945,
'Colossus' &
Enigma
1997: first mention of 'big data' in
Application-Controlled Demand Paging
for Out-of-Core Visualization (NASA)
“Is there anywhere
on earth exempt
from these swarms
of new books?”
...
What is it?No, Big Data isn't new... but
Maximizing computation power and
algorithmic accuracy
Drawing on a range of tools to analyse
and compare large datasets
Believing large datasets give greater
objectivity and accuracy
No, Data doesn't speak of itself
Numbers never lie... o rly?
* 51% of web traffic is non-human
* there are 30 million fake Twitter accounts
… poor bots on Thursday night :'(
Yes, Big Data discriminates
between social groups
Scientists are allowing their
assumptions about race to
shape their big-data genomics
research
If your past buying history
indicates you are more likely
to pay top euro for trousers,
your starting price the next
time you shop for clothes
online might be considerably
higher.
Big Data Fundamentalism?
There's a word for that: apophenia!
… Does big data change the erroneous 'correlation = causation'
dynamic?
Math majors, rejoice!
''With big data techniques, you can get much
closer to being able to predict causal relations.''
I beg to disagree
Your question and your data are not random
entities.
Big Data needs several steps:
●
of preparation
●
in interpretation
And...
If data are somehow subject to us, we are
also subject to data
1 sur 10

Recommandé

Gendered Quantified Self: my talk at FLOSSIE 2013 par
Gendered Quantified Self: my talk at FLOSSIE 2013Gendered Quantified Self: my talk at FLOSSIE 2013
Gendered Quantified Self: my talk at FLOSSIE 2013Rayna Stamboliyska
3.3K vues15 diapositives
Открытые данные для социально- экономического развития: Роль гражданского общ... par
Открытые данные для социально- экономического развития: Роль гражданского общ...Открытые данные для социально- экономического развития: Роль гражданского общ...
Открытые данные для социально- экономического развития: Роль гражданского общ...Rayna Stamboliyska
1.7K vues18 diapositives
#OpenDataKG: Open Data and the role of civil society par
#OpenDataKG: Open Data and the role of civil society#OpenDataKG: Open Data and the role of civil society
#OpenDataKG: Open Data and the role of civil societyRayna Stamboliyska
1.7K vues18 diapositives
Contes et légendes du RuNet : séminaire EHESS du 16 mars 2015 par
Contes et légendes du RuNet : séminaire EHESS du 16 mars 2015Contes et légendes du RuNet : séminaire EHESS du 16 mars 2015
Contes et légendes du RuNet : séminaire EHESS du 16 mars 2015Rayna Stamboliyska
5K vues50 diapositives
How Old Is Your Data? Don't Settle For Bad Data! par
How Old Is Your Data? Don't Settle For Bad Data!How Old Is Your Data? Don't Settle For Bad Data!
How Old Is Your Data? Don't Settle For Bad Data!removed_98c8d4827eb0208c4db118838b8f6010
3.6K vues12 diapositives
The role of data for economic prosperity in the Middle East and North Africa par
The role of data for economic prosperity in the Middle East and North AfricaThe role of data for economic prosperity in the Middle East and North Africa
The role of data for economic prosperity in the Middle East and North AfricaRayna Stamboliyska
412 vues25 diapositives

Contenu connexe

Similaire à Big data, bad data -- Closing keynote at the Open World Forum 2013

What is Data? par
What is Data?What is Data?
What is Data?DataWorks Summit/Hadoop Summit
838 vues24 diapositives
How to follow actors through their traces. Exploiting digital traceability par
How to follow actors through their traces. Exploiting digital traceabilityHow to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceabilityINRIA - ENS Lyon
7K vues50 diapositives
Data: Past, Present, and Future (Lecture 1, Spring 2018) par
Data: Past, Present, and Future (Lecture 1, Spring 2018)Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)chris wiggins
2.7K vues54 diapositives
"data: past, present, and future" day 1 lecture 2020-01-20 par
"data: past, present, and future" day 1 lecture 2020-01-20"data: past, present, and future" day 1 lecture 2020-01-20
"data: past, present, and future" day 1 lecture 2020-01-20chris wiggins
319 vues80 diapositives
AI Presentation Y65 Class Dinner par
AI Presentation Y65 Class DinnerAI Presentation Y65 Class Dinner
AI Presentation Y65 Class DinnerJean McKillop
79 vues30 diapositives
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag... par
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...apidays
80 vues26 diapositives

Similaire à Big data, bad data -- Closing keynote at the Open World Forum 2013(20)

How to follow actors through their traces. Exploiting digital traceability par INRIA - ENS Lyon
How to follow actors through their traces. Exploiting digital traceabilityHow to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceability
Data: Past, Present, and Future (Lecture 1, Spring 2018) par chris wiggins
Data: Past, Present, and Future (Lecture 1, Spring 2018)Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)
chris wiggins2.7K vues
"data: past, present, and future" day 1 lecture 2020-01-20 par chris wiggins
"data: past, present, and future" day 1 lecture 2020-01-20"data: past, present, and future" day 1 lecture 2020-01-20
"data: past, present, and future" day 1 lecture 2020-01-20
chris wiggins319 vues
AI Presentation Y65 Class Dinner par Jean McKillop
AI Presentation Y65 Class DinnerAI Presentation Y65 Class Dinner
AI Presentation Y65 Class Dinner
Jean McKillop79 vues
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag... par apidays
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...
APIdays Paris 2019 - Data, Cities, APIs, Humans by Daniel Sarasa Dunes, Zarag...
apidays80 vues
When data journalism meets science | Erice, June 10th, 2014 par Dataninja
When data journalism meets science | Erice, June 10th, 2014When data journalism meets science | Erice, June 10th, 2014
When data journalism meets science | Erice, June 10th, 2014
Dataninja2.7K vues
A Primer on Big Data taken by the book: "Big Data" by Schoenberger and Cukier par Mauro Meanti
A Primer on Big Data taken by the book: "Big Data" by Schoenberger and CukierA Primer on Big Data taken by the book: "Big Data" by Schoenberger and Cukier
A Primer on Big Data taken by the book: "Big Data" by Schoenberger and Cukier
Mauro Meanti1K vues
"data: past, present, and future" lecture 1 (intro) 1/22/19 par chris wiggins
"data: past, present, and future" lecture 1 (intro) 1/22/19"data: past, present, and future" lecture 1 (intro) 1/22/19
"data: past, present, and future" lecture 1 (intro) 1/22/19
chris wiggins487 vues
4 Things You Didn't Know About Big Data par Tyrone Systems
4 Things You Didn't Know About Big Data4 Things You Didn't Know About Big Data
4 Things You Didn't Know About Big Data
Tyrone Systems155 vues
A Thinking Person's Guide to Using Big Data for Development: Myths, Opportuni... par Junaid Qadir
A Thinking Person's Guide to Using Big Data for Development: Myths, Opportuni...A Thinking Person's Guide to Using Big Data for Development: Myths, Opportuni...
A Thinking Person's Guide to Using Big Data for Development: Myths, Opportuni...
Junaid Qadir74 vues
Ice summer school-5-7-2013 par Jun Hu
Ice summer school-5-7-2013Ice summer school-5-7-2013
Ice summer school-5-7-2013
Jun Hu641 vues
Parallel economics: from artificial intelligence to intelligent and smart eco... par Sunshine Zhang
Parallel economics: from artificial intelligence to intelligent and smart eco...Parallel economics: from artificial intelligence to intelligent and smart eco...
Parallel economics: from artificial intelligence to intelligent and smart eco...
Sunshine Zhang192 vues
Figures of the Many - Quantitative Concepts for Qualitative Thinking par Bernhard Rieder
Figures of the Many - Quantitative Concepts for Qualitative ThinkingFigures of the Many - Quantitative Concepts for Qualitative Thinking
Figures of the Many - Quantitative Concepts for Qualitative Thinking
Bernhard Rieder4.6K vues
Computing, cognition and the future of knowing,. by IBM par Virginia Fernandez
Computing, cognition and the future of knowing,. by IBMComputing, cognition and the future of knowing,. by IBM
Computing, cognition and the future of knowing,. by IBM
Learning to trust artificial intelligence systems accountability, compliance ... par Diego Alberto Tamayo
Learning to trust artificial intelligence systems accountability, compliance ...Learning to trust artificial intelligence systems accountability, compliance ...
Learning to trust artificial intelligence systems accountability, compliance ...
Moldova Open Data Journalism Workshop par Alexander Howard
Moldova Open Data Journalism WorkshopMoldova Open Data Journalism Workshop
Moldova Open Data Journalism Workshop
Alexander Howard13.6K vues

Plus de Rayna Stamboliyska

The NIS directive: Yet another expensive legality or an opportunity to improv... par
The NIS directive: Yet another expensive legality or an opportunity to improv...The NIS directive: Yet another expensive legality or an opportunity to improv...
The NIS directive: Yet another expensive legality or an opportunity to improv...Rayna Stamboliyska
408 vues12 diapositives
#CoRIIN2018 : Comment ne pas communiquer en temps de crise par
#CoRIIN2018 : Comment ne pas communiquer en temps de crise#CoRIIN2018 : Comment ne pas communiquer en temps de crise
#CoRIIN2018 : Comment ne pas communiquer en temps de criseRayna Stamboliyska
1.4K vues35 diapositives
Références bibliographiques "La face cachée d'Internet" par
Références bibliographiques "La face cachée d'Internet"Références bibliographiques "La face cachée d'Internet"
Références bibliographiques "La face cachée d'Internet"Rayna Stamboliyska
2.7K vues14 diapositives
La question de mémoire collective post-conflictuelle : une comparaison des di... par
La question de mémoire collective post-conflictuelle : une comparaison des di...La question de mémoire collective post-conflictuelle : une comparaison des di...
La question de mémoire collective post-conflictuelle : une comparaison des di...Rayna Stamboliyska
985 vues9 diapositives
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance) par
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)Rayna Stamboliyska
835 vues29 diapositives
Let's talk about policy! Politiques publiques pour l’ouverture des données sc... par
Let's talk about policy! Politiques publiques pour l’ouverture des données sc...Let's talk about policy! Politiques publiques pour l’ouverture des données sc...
Let's talk about policy! Politiques publiques pour l’ouverture des données sc...Rayna Stamboliyska
6.9K vues47 diapositives

Plus de Rayna Stamboliyska(18)

The NIS directive: Yet another expensive legality or an opportunity to improv... par Rayna Stamboliyska
The NIS directive: Yet another expensive legality or an opportunity to improv...The NIS directive: Yet another expensive legality or an opportunity to improv...
The NIS directive: Yet another expensive legality or an opportunity to improv...
#CoRIIN2018 : Comment ne pas communiquer en temps de crise par Rayna Stamboliyska
#CoRIIN2018 : Comment ne pas communiquer en temps de crise#CoRIIN2018 : Comment ne pas communiquer en temps de crise
#CoRIIN2018 : Comment ne pas communiquer en temps de crise
Rayna Stamboliyska1.4K vues
Références bibliographiques "La face cachée d'Internet" par Rayna Stamboliyska
Références bibliographiques "La face cachée d'Internet"Références bibliographiques "La face cachée d'Internet"
Références bibliographiques "La face cachée d'Internet"
Rayna Stamboliyska2.7K vues
La question de mémoire collective post-conflictuelle : une comparaison des di... par Rayna Stamboliyska
La question de mémoire collective post-conflictuelle : une comparaison des di...La question de mémoire collective post-conflictuelle : une comparaison des di...
La question de mémoire collective post-conflictuelle : une comparaison des di...
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance) par Rayna Stamboliyska
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)
ОТВОРЕНИ ДАННИ ЗА ДОБРО УПРАВЛЕНИЕ (Open Data for Good Governance)
Let's talk about policy! Politiques publiques pour l’ouverture des données sc... par Rayna Stamboliyska
Let's talk about policy! Politiques publiques pour l’ouverture des données sc...Let's talk about policy! Politiques publiques pour l’ouverture des données sc...
Let's talk about policy! Politiques publiques pour l’ouverture des données sc...
Rayna Stamboliyska6.9K vues
Cours pour la Licence "Sciences et Ingéniérie" ENSTA par Rayna Stamboliyska
Cours pour la Licence "Sciences et Ingéniérie" ENSTACours pour la Licence "Sciences et Ingéniérie" ENSTA
Cours pour la Licence "Sciences et Ingéniérie" ENSTA
Rayna Stamboliyska1.2K vues
Open Data in Science & Research -- Open World Forum 2013, Public Policies track par Rayna Stamboliyska
Open Data in Science & Research -- Open World Forum 2013, Public Policies trackOpen Data in Science & Research -- Open World Forum 2013, Public Policies track
Open Data in Science & Research -- Open World Forum 2013, Public Policies track
Rayna Stamboliyska5.1K vues
Knowledge Adventures for Kids: Masterclass presentation during the Social Med... par Rayna Stamboliyska
Knowledge Adventures for Kids: Masterclass presentation during the Social Med...Knowledge Adventures for Kids: Masterclass presentation during the Social Med...
Knowledge Adventures for Kids: Masterclass presentation during the Social Med...
NASA SpaceApps challenges: Paris Off-the-Grid restitution par Rayna Stamboliyska
NASA SpaceApps challenges: Paris Off-the-Grid restitutionNASA SpaceApps challenges: Paris Off-the-Grid restitution
NASA SpaceApps challenges: Paris Off-the-Grid restitution
Gender Equality Summit workshop at the Open World Forum 2010 par Rayna Stamboliyska
Gender Equality Summit workshop at the Open World Forum 2010Gender Equality Summit workshop at the Open World Forum 2010
Gender Equality Summit workshop at the Open World Forum 2010

Dernier

Initiating and Advancing Your Strategic GIS Governance Strategy par
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategySafe Software
176 vues68 diapositives
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... par
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...ShapeBlue
159 vues25 diapositives
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates par
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesShapeBlue
252 vues15 diapositives
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... par
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...ShapeBlue
184 vues12 diapositives
CryptoBotsAI par
CryptoBotsAICryptoBotsAI
CryptoBotsAIchandureddyvadala199
40 vues5 diapositives
Why and How CloudStack at weSystems - Stephan Bienek - weSystems par
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsShapeBlue
238 vues13 diapositives

Dernier(20)

Initiating and Advancing Your Strategic GIS Governance Strategy par Safe Software
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software176 vues
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... par ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue159 vues
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates par ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue252 vues
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... par ShapeBlue
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
ShapeBlue184 vues
Why and How CloudStack at weSystems - Stephan Bienek - weSystems par ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue238 vues
DRBD Deep Dive - Philipp Reisner - LINBIT par ShapeBlue
DRBD Deep Dive - Philipp Reisner - LINBITDRBD Deep Dive - Philipp Reisner - LINBIT
DRBD Deep Dive - Philipp Reisner - LINBIT
ShapeBlue180 vues
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online par ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue221 vues
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas... par Bernd Ruecker
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
iSAQB Software Architecture Gathering 2023: How Process Orchestration Increas...
Bernd Ruecker54 vues
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... par ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue139 vues
NTGapps NTG LowCode Platform par Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu423 vues
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... par ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue119 vues
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... par ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue173 vues
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue par ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue147 vues
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And... par ShapeBlue
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
ShapeBlue106 vues
Digital Personal Data Protection (DPDP) Practical Approach For CISOs par Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash158 vues
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... par ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue198 vues
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue par ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlueVNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
VNF Integration and Support in CloudStack - Wei Zhou - ShapeBlue
ShapeBlue203 vues
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue par ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
ShapeBlue135 vues
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... par The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...

Big data, bad data -- Closing keynote at the Open World Forum 2013

  • 1. Big Data – Bad Data Rayna Stamboliyska, PhD Paris Descartes University RS Strategy
  • 3. 300 years BC: "of making many books there is no end" 1935-1937, Social Security Act 1943-1945, 'Colossus' & Enigma 1997: first mention of 'big data' in Application-Controlled Demand Paging for Out-of-Core Visualization (NASA) “Is there anywhere on earth exempt from these swarms of new books?” ...
  • 4. What is it?No, Big Data isn't new... but Maximizing computation power and algorithmic accuracy Drawing on a range of tools to analyse and compare large datasets Believing large datasets give greater objectivity and accuracy
  • 5. No, Data doesn't speak of itself
  • 6. Numbers never lie... o rly? * 51% of web traffic is non-human * there are 30 million fake Twitter accounts … poor bots on Thursday night :'(
  • 7. Yes, Big Data discriminates between social groups Scientists are allowing their assumptions about race to shape their big-data genomics research If your past buying history indicates you are more likely to pay top euro for trousers, your starting price the next time you shop for clothes online might be considerably higher.
  • 8. Big Data Fundamentalism? There's a word for that: apophenia! … Does big data change the erroneous 'correlation = causation' dynamic?
  • 9. Math majors, rejoice! ''With big data techniques, you can get much closer to being able to predict causal relations.'' I beg to disagree Your question and your data are not random entities. Big Data needs several steps: ● of preparation ● in interpretation
  • 10. And... If data are somehow subject to us, we are also subject to data