Brad Hubbard
Product Manager, Developer Relations
DataSift
Five Things You Didn’t Know
DataSift Can Do
#DSwebinar
HUMAN DATA
INTELLIGENCE
FILTER TAG • ENRICH
STORE
Stream products will be covered today
To see PYLON (our aggregated, anonymized Facebook topic data), join our next live demo:
http://lp.datasift.com/20150701-Live-SE-Demo-Registration
DataSift is of Two Minds:
Indexed Data & Streaming
VEDO
Launched: 2011
Customers: 1K across 40 countries
Global offices: 4
• San Francisco
• New York
• London
• Reading, UK
Items processed per day: 2B
(These don’t count toward the 5 things)
Brave New Data World
of all digital data created
by consumers
emails a day
of US adults’ location is
known
increase in global
data by 2020
Thoughts • Emotions • Likes • Dislikes • Intentions • Ideas • Current Events • Geo • Occupation • Age • Topics • Gender
Sources of Human-Generated Data
BLOGS & NEWS INSIDE YOUR
BUSINESS
SOCIAL NETWORKS
The Complexity of Human Data
VOLUME
VARIETY
VELOCITY
Billions of users
Noisy
Generated in real time
per second
Post vs blog vs like
Terabytes per day
Ambiguous
Big spikes
Unstructured
Turn Human Data into Meaning
Unify Human Data
We apply structure to the chaotic world of human data
Facebook Tencent Weibo Sina Weibo Google+ YouTube Instagram
LexisNexis Wikipedia
Wordpress
Tumblr Intense Debate Disqus NewsCred Reddit
Topix Jive Twitter EDGAR News Video IMDB Yammer
Unifying data from across the web
Filtering Human Data
with CSDL
Filter: CSDL Data Processing Language
WRITE ONCE • USE MANY
Filters against generic objects or get source-specific
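
As a rough illustration of "write once, use many", the Python sketch below normalizes two source-specific payloads into one generic record and then runs the same filter over both. This is not CSDL and not DataSift's implementation. The `Interaction` class, the payload keys (`message`, `raw_message`), and the helper names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Interaction:
    """Hypothetical normalized record; the field names are illustrative only."""
    source: str
    content: str


def from_facebook(post: dict) -> Interaction:
    # Assumed payload shape: {"message": "..."}
    return Interaction(source="facebook", content=post["message"])


def from_disqus(comment: dict) -> Interaction:
    # Assumed payload shape: {"raw_message": "..."}
    return Interaction(source="disqus", content=comment["raw_message"])


Filter = Callable[[Interaction], bool]


def content_contains(word: str) -> Filter:
    """Generic filter: case-insensitive match on the normalized body text."""
    needle = word.lower()
    return lambda item: needle in item.content.lower()


def source_is(name: str) -> Filter:
    """Source-specific filter: restrict matches to a single source."""
    return lambda item: item.source == name


def all_of(*filters: Filter) -> Filter:
    """Combine filters so that an item must satisfy every one of them."""
    return lambda item: all(f(item) for f in filters)


def apply_filter(items: Iterable[Interaction], rule: Filter) -> List[Interaction]:
    return [item for item in items if rule(item)]


if __name__ == "__main__":
    items = [
        from_facebook({"message": "Loving my new Android phone"}),
        from_disqus({"raw_message": "Android tablets are underrated"}),
        from_disqus({"raw_message": "No opinion on phones"}),
    ]
    everywhere = apply_filter(items, content_contains("android"))
    disqus_only = apply_filter(
        items, all_of(content_contains("android"), source_is("disqus"))
    )
    print(len(everywhere), len(disqus_only))
```

The generic rule is written once against the normalized field, so adding a new source only requires another `from_...` adapter.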
Rules can contain millions of tag and filter criteria, no
need to limit yourself
INFINITE COMPLEXITY
Enrich Human Data
Identifies links in social posts and fetches header data
Allowing you to filter against link content
LINKS AUGMENTATION
LANGUAGE DETECTION
Write filters on a per-language basis, or limit
yourself to only certain languages
Location either disclosed by user or listed in profile
GENDER DETECTION
USING PROFILES AND NAME + LANGUAGE
SENTIMENT AND TOPICS
Likely positive • Neutral • Likely Negative
Topic detection (looking for nouns and disambiguating
them)
Categorization, Scoring
and Tagging
VEDO enables automatic
classification of Human Data
based on its meaning
Apply Data Science
OFF THE SHELF CLASSIFIERS
Enable automatic scoring and classification
CUSTOM TAXONOMIES
Hierarchical rules to match your business
CUSTOM SCORING SYSTEM
To expose meaning hidden deep within
unstructured, text-rich data
Delivery
Use Everywhere
CONSUME A JSON STREAM DIRECTLY
Send your data to any of these pre-built connectors
We handle the infrastructure and
send you the data you need
THANK YOU

Five Things You Didn't Know DataSift Can Do

Editor's notes

  1. First, a little about DataSift
  2. Human data is a particular challenge: not only is there a lot of it – but it’s complex, highly varied, and comes at you fast. It can also have
  3. We bring in data from a ton of different places. You’ve probably heard of most of these – and we’d be happy to dig into more detail on any of these later on if you’re curious, or you can find more on our website.
  4. A Facebook post looks different from a Disqus comment, but you might want to search for your company or product anywhere. Because we’ve already normalized the data, you can write simple filters that work across sources. You can write against generic targets, like “the main body text contains android”, or against more specific, nuanced targets, such as “the author’s account is at least 90 days old”.
  5. Once we have the data in a standardized format we enrich it with a lot of really useful stuff. Just like the raw content and other information can be filtered on, so can all the enhanced data we add.
  6. “This is cool! http://bit.ly/AsdFa” Shortened URLs and tracking URLs are incredibly common in social data. We not only traverse these redirects to their final destination, but also fetch the page header information and metadata and append it to the source object. This means you can filter not only on posts which contain “Android”, but also on posts with links which contain “Android” in the title, description, or keywords. We do this at line speed, across every social post on the planet, as it happens. This is an extremely powerful tool and the value it can provide is considerable. So much of the social landscape is dominated by discussions of a shared link, and without that content, you can miss the conversation entirely.
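
To make the link augmentation in note 6 concrete, here is a minimal Python sketch using only the standard library. It assumes a post is a plain dict with a `content` field and appends a `links` list to it. The function names, the record shape, and the pluggable `fetch` parameter are hypothetical; this says nothing about how DataSift implements the feature or how it reaches line speed.

```python
import re
from html.parser import HTMLParser
from typing import Callable, Dict, Tuple
from urllib.request import urlopen

# Naive URL matcher; may also capture trailing punctuation.
URL_PATTERN = re.compile(r"https?://\S+")
LINK_FIELDS = ("title", "description", "keywords")


class HeadMetadataParser(HTMLParser):
    """Collects <title> plus the description and keywords <meta> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.meta: Dict[str, str] = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attributes = dict(attrs)
            name = (attributes.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.meta[name] = attributes.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.meta["title"] = self.meta.get("title", "") + data


def fetch_over_http(url: str) -> Tuple[str, str]:
    """Return (final_url, html); urlopen follows HTTP redirects by default."""
    with urlopen(url, timeout=10) as response:
        html = response.read().decode("utf-8", errors="replace")
        return response.geturl(), html


def augment_links(post: dict,
                  fetch: Callable[[str], Tuple[str, str]] = fetch_over_http) -> dict:
    """Resolve each URL in the post and attach the target page's metadata."""
    links = []
    for url in URL_PATTERN.findall(post["content"]):
        final_url, html = fetch(url)
        parser = HeadMetadataParser()
        parser.feed(html)
        links.append({"url": final_url, **parser.meta})
    return {**post, "links": links}


def link_metadata_contains(post: dict, word: str) -> bool:
    """Filter on the fetched title, description, or keywords of any link."""
    needle = word.lower()
    return any(
        needle in link.get(field, "").lower()
        for link in post.get("links", [])
        for field in LINK_FIELDS
    )
```

Passing `fetch` as a parameter keeps the sketch testable without network access: a stub that returns a fixed `(final_url, html)` pair can stand in for the real HTTP call.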