SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
© 2014 Quid, Inc.1
Big Data Visualization Meetup @ CA Technologies
Ruggero Altair Tacchi, PhD
Data Scientist at Quid
March 27, 2014
BIG DATA INTELLIGENCE
ruggero@quid.com
© 2014 Quid, Inc.22
Quid is a Business Intelligence platform. Quid has created a full-stack
platform that brings together the world’s business data streams
ü  Company data
ü  News data
ü  Patent data
ü  Organizational data
ü  Scientific papers
…
…
…
© 2014 Quid, Inc.33
Q: Who owns the discussion about the future of the car?
© 2014 Quid, Inc.44 ruggero@quid.com
Q: Who owns the discussion about the future of the car?
© 2014 Quid, Inc.55 Source: xkcd 1334
© 2014 Quid, Inc.66 Source: xkcd 1334
Search is hard
© 2014 Quid, Inc.77
224
239
248
262
286
311
346
346
396
520
550
571
659
735
1,065
Chrysler
Chevrolet
Hyundai
Honda
Volkswagen
Nissan
Volvo
Apple
Mercedes
General
Motors
BMW
Audi
Toyota
Ford
Google
Top entities within future car news
network, by number of articles
23%
16%
14%
12%
12%
11%
9%
8%
8%
7%
6%
6%
5%
5%
5%
Share of voiceProportion of articles by theme
New car
features
Autonomous
driving
A: Google has 23% share of voice in the discussion on the
future of the car; traditional players focused on new features,
not capitalizing on interest in future / autonomous driving
© 2014 Quid, Inc.88
Q: Which media sources are most closely broadcasting
NASA’s press releases?
© 2014 Quid, Inc.99
-100 -80 -60 -40 -20 0 20 40 60 80 100
The New York Times
The Business Insider
CBC News
Aero-News.net
Popular Mechanics
CBS News
International Business Times
Everything Alabama
CNET
PerthNow
Business Wire
The Register
RedOrbit
Boston.com
Space Ref
Freshnews.com
EurekAlert
Softpedia
HeraldOnline.com
Reuters
Engadget
CNN
BBC News
Geekosystem.com
Slashgear
Gizmag
The Inquisitr
Big Pond News
Kurzweilai
Planetary Society Blog
ITWorld.com
The Verge
Ecoustics
More representative
Alignment Score
Less representative
A: The New York Times’ articles most closely reflect
narratives that NASA originates
© 2014 Quid, Inc.1010 Source: xkcd 173
© 2014 Quid, Inc.1111 Source: xkcd 173
Networks, not lists
© 2014 Quid, Inc.1212
Analyzing ~1,500 long-form articles about space, thematic clusters form
around NASA’s past & present, as wells as the future of commercial space
Present: NASA-led Space
Exploration (49%)
Future: Commercial
Space (26%)
Past: Traditional
NASA (25%)
Long-form news network, colored by conversation themes, major themes labeled
© 2014 Quid, Inc.1313
Long-form news network, NASA publications highlighted and clusters containing them labeled
NASA’s press release strategy has focused on the peripheral; it is largely
absent from the core conversation debating the future vs. the past of space
Source
NASA
Other
Vesta Asteroid
Moon
SpaceX & Dragon Capsule
Mars Curiosity
Hubble Telescope & Exoplanets
Space Shuttle
© 2014 Quid, Inc.1414
Quid Technology
Company A
Company B
< >
< >
< >
< machine vision >
< gaze coordinates >
< eye tracking >
Analyst has a
big-picture view
+
Ability to zoom
down to details
KEYWORDS
EXTRACTED
FROM TEXT
REPLICATED AT A
VERY LARGE SCALE
LINKS ESTABLISHED
USING SHARED
LANGUAGE
< eye tracking >
< gestures >
< eye control >
< Neural Connectivity >
< Autism >
< Mathematical >
< Divergent thinking >
NATURAL LANGUAGE PROCESSING
Neural Connectivity
Autism
Mathematical
Divergent thinking
Concepts
DIGITAL SIGNATURE
Significance
Document #1
Document 2
Document 3
Document 4
DIGITAL SIGNATURE
Document #3
Document #1
(similar)
© 2014 Quid, Inc.1919
From lists to networks, from networks to lists
© 2014 Quid, Inc.2020
Demo
© 2014 Quid, Inc.2121
For information contact us at
http://quid.com/contact
… and we’re hiring!
http://quid.com/careers
Danielle Ben-Gera Kartik Sundar Joey Hobbs Ruggero Altair Tacchi

Contenu connexe

Similaire à Extracting Intelligence from Big Data

Similaire à Extracting Intelligence from Big Data (20)

Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
Workshop - Build a Graph Solution
Workshop - Build a Graph SolutionWorkshop - Build a Graph Solution
Workshop - Build a Graph Solution
 
Os Boswell
Os BoswellOs Boswell
Os Boswell
 
Tucana HR Analytics Data Visualisation, April 2014 (London)
Tucana HR Analytics Data Visualisation, April 2014 (London)Tucana HR Analytics Data Visualisation, April 2014 (London)
Tucana HR Analytics Data Visualisation, April 2014 (London)
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Open Source Software Development by TLV Partners
Open Source Software Development by TLV PartnersOpen Source Software Development by TLV Partners
Open Source Software Development by TLV Partners
 
Open source presentation
Open source presentationOpen source presentation
Open source presentation
 
Take the Fastest Path to Node.Js Application Development with Bitnami & AWS L...
Take the Fastest Path to Node.Js Application Development with Bitnami & AWS L...Take the Fastest Path to Node.Js Application Development with Bitnami & AWS L...
Take the Fastest Path to Node.Js Application Development with Bitnami & AWS L...
 
Cloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersCloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for Developers
 
Case Study - Wikia Provides Federated Access To Data And Business Critical In...
Case Study - Wikia Provides Federated Access To Data And Business Critical In...Case Study - Wikia Provides Federated Access To Data And Business Critical In...
Case Study - Wikia Provides Federated Access To Data And Business Critical In...
 
Jian Liang (HiScene): AR for Industry in China: From Concepts to Real Applica...
Jian Liang (HiScene): AR for Industry in China: From Concepts to Real Applica...Jian Liang (HiScene): AR for Industry in China: From Concepts to Real Applica...
Jian Liang (HiScene): AR for Industry in China: From Concepts to Real Applica...
 
Engineering Simulation Meets the Cloud
Engineering Simulation Meets the CloudEngineering Simulation Meets the Cloud
Engineering Simulation Meets the Cloud
 
AstriCon 2014 keynote: Russell Bryant
AstriCon 2014 keynote: Russell BryantAstriCon 2014 keynote: Russell Bryant
AstriCon 2014 keynote: Russell Bryant
 
Introduction to developing modern web apps
Introduction to developing modern web appsIntroduction to developing modern web apps
Introduction to developing modern web apps
 
Cloud and Big Data Come Together in the Ocean Observatories Initiative to Giv...
Cloud and Big Data Come Together in the Ocean Observatories Initiative to Giv...Cloud and Big Data Come Together in the Ocean Observatories Initiative to Giv...
Cloud and Big Data Come Together in the Ocean Observatories Initiative to Giv...
 
Telecom Clouds crossing borders, Chet Golding, Zefflin Systems
Telecom Clouds crossing borders, Chet Golding, Zefflin SystemsTelecom Clouds crossing borders, Chet Golding, Zefflin Systems
Telecom Clouds crossing borders, Chet Golding, Zefflin Systems
 
[OpenStack Day in Korea 2015] Keynote 1 - OpenStack Mission Update
[OpenStack Day in Korea 2015] Keynote 1 - OpenStack Mission Update[OpenStack Day in Korea 2015] Keynote 1 - OpenStack Mission Update
[OpenStack Day in Korea 2015] Keynote 1 - OpenStack Mission Update
 
Where is Data Going? - RMDC Keynote
Where is Data Going? - RMDC KeynoteWhere is Data Going? - RMDC Keynote
Where is Data Going? - RMDC Keynote
 
2011 NASA Open Source Summit - Patrick Hogan
2011 NASA Open Source Summit - Patrick Hogan2011 NASA Open Source Summit - Patrick Hogan
2011 NASA Open Source Summit - Patrick Hogan
 
2014 Ceph NYLUG Talk
2014 Ceph NYLUG Talk2014 Ceph NYLUG Talk
2014 Ceph NYLUG Talk
 

Dernier

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Extracting Intelligence from Big Data

  • 1. © 2014 Quid, Inc.1 Big Data Visualization Meetup @ CA Technologies Ruggero Altair Tacchi, PhD Data Scientist at Quid March 27, 2014 BIG DATA INTELLIGENCE ruggero@quid.com
  • 2. © 2014 Quid, Inc.22 Quid is a Business Intelligence platform. Quid has created a full-stack platform that brings together the world’s business data streams ü  Company data ü  News data ü  Patent data ü  Organizational data ü  Scientific papers … … …
  • 3. © 2014 Quid, Inc.33 Q: Who owns the discussion about the future of the car?
  • 4. © 2014 Quid, Inc.44 ruggero@quid.com Q: Who owns the discussion about the future of the car?
  • 5. © 2014 Quid, Inc.55 Source: xkcd 1334
  • 6. © 2014 Quid, Inc.66 Source: xkcd 1334 Search is hard
  • 7. © 2014 Quid, Inc.77 224 239 248 262 286 311 346 346 396 520 550 571 659 735 1,065 Chrysler Chevrolet Hyundai Honda Volkswagen Nissan Volvo Apple Mercedes General Motors BMW Audi Toyota Ford Google Top entities within future car news network, by number of articles 23% 16% 14% 12% 12% 11% 9% 8% 8% 7% 6% 6% 5% 5% 5% Share of voiceProportion of articles by theme New car features Autonomous driving A: Google has 23% share of voice in the discussion on the future of the car; traditional players focused on new features, not capitalizing on interest in future / autonomous driving
  • 8. © 2014 Quid, Inc.88 Q: Which media sources are most closely broadcasting NASA’s press releases?
  • 9. © 2014 Quid, Inc.99 -100 -80 -60 -40 -20 0 20 40 60 80 100 The New York Times The Business Insider CBC News Aero-News.net Popular Mechanics CBS News International Business Times Everything Alabama CNET PerthNow Business Wire The Register RedOrbit Boston.com Space Ref Freshnews.com EurekAlert Softpedia HeraldOnline.com Reuters Engadget CNN BBC News Geekosystem.com Slashgear Gizmag The Inquisitr Big Pond News Kurzweilai Planetary Society Blog ITWorld.com The Verge Ecoustics More representative Alignment Score Less representative A: The New York Times’ articles most closely reflect narratives that NASA originates
  • 10. © 2014 Quid, Inc.1010 Source: xkcd 173
  • 11. © 2014 Quid, Inc.1111 Source: xkcd 173 Networks, not lists
  • 12. © 2014 Quid, Inc.1212 Analyzing ~1,500 long-form articles about space, thematic clusters form around NASA’s past & present, as wells as the future of commercial space Present: NASA-led Space Exploration (49%) Future: Commercial Space (26%) Past: Traditional NASA (25%) Long-form news network, colored by conversation themes, major themes labeled
  • 13. © 2014 Quid, Inc.1313 Long-form news network, NASA publications highlighted and clusters containing them labeled NASA’s press release strategy has focused on the peripheral; it is largely absent from the core conversation debating the future vs. the past of space Source NASA Other Vesta Asteroid Moon SpaceX & Dragon Capsule Mars Curiosity Hubble Telescope & Exoplanets Space Shuttle
  • 14. © 2014 Quid, Inc.1414 Quid Technology Company A Company B < > < > < > < machine vision > < gaze coordinates > < eye tracking > Analyst has a big-picture view + Ability to zoom down to details KEYWORDS EXTRACTED FROM TEXT REPLICATED AT A VERY LARGE SCALE LINKS ESTABLISHED USING SHARED LANGUAGE < eye tracking > < gestures > < eye control >
  • 15. < Neural Connectivity > < Autism > < Mathematical > < Divergent thinking > NATURAL LANGUAGE PROCESSING
  • 17. Document #1 Document 2 Document 3 Document 4 DIGITAL SIGNATURE
  • 19. © 2014 Quid, Inc.1919 From lists to networks, from networks to lists
  • 20. © 2014 Quid, Inc.2020 Demo
  • 21. © 2014 Quid, Inc.2121 For information contact us at http://quid.com/contact … and we’re hiring! http://quid.com/careers Danielle Ben-Gera Kartik Sundar Joey Hobbs Ruggero Altair Tacchi