SlideShare a Scribd company logo
1 of 45
Download to read offline
Choosing the Right Data Management Architecture

for Cognitive Computing

Adrian Bowles, PhD

Founder, STORM Insights, Inc.

Lead Analyst, AI, Aragon Research

info@storminsights.com
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
OCTOBER 12, 2017
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
AGENDA - CHOOSING THE RIGHT DATA MANAGEMENT ARCHITECTURE FOR COGNITIVE COMPUTING
The Role of Data In AI & CC

What do we need to manage?

Application, Data, and Algorithm Attributes that Influence Architecture

Database Options

Open Source Infrastructure

Prebuilt Knowledge

Getting Started: Basic Principles
Model
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
COGNITIVE COMPUTING FUNDAMENTALS: MODELS & ASSUMPTIONS
Model
The Corpus, Assumptions, Algorithms
Used to
Generate & Score Hypotheses
or
Calculate The Strength of a Relationship
Principles that control the
development and representation
of natural intelligence in the
neocortex provide a guide to the
implementation of machine
intelligence.(Numenta
Hierarchical Temporal Memory)
A function applied to a string
representing data or a concept
results in a value or vector
meaningful for comparison.
A Model is an Abstract Representation of Reality
Essential Data for
Cognitive Computing
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
MODELS WILL MAKE OR BREAK YOUR APPLICATION
Your Model The Real World
“When the map and the terrain disagree, believe the terrain.”
Gause and Weinberg (Exploring Requirements)
Systems
Controls
Learn
Plan Reason
Understand
Model
Data Mgmt
Human
Machine
Input Output
Gestures
Emotions
Language
Narrative Generation
Visualization
Reports
Haptics
Sensors
(IOT)
Systems
Controls
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
COGNITIVE SYSTEMS: COMMUNICATIONS & CONTROL
Perception
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
WHERE YOU ARE DICTATES WHAT YOU NEED
Ingest Analyze Maintain/Manage
When everything is connected…

New sources of data emerge

New sources of value emerge

Old assumptions must be challenged
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
THE IMPACT OF THE IOT
CHOICES HAVE CONSEQUENCES
How You Think About a Domain…
…influences your choice of maps and models…

rules and representations…and required operations.
HOW YOU ORGANIZE CONSTRAINS HOW YOU WORK - DESIGN WORKFLOW FIRST
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
START WITH A TAXONOMY
A taxonomy represents the formal structure of classes or types of objects within a domain. 

•Generally hierarchical and provide names for each class in the domain. 

•May also capture the membership properties of each object in relation to the other objects. 

•The rules of a specific taxonomy are used to classify or categorize any object in the domain, so
they must be complete, consistent, and unambiguous. This rigor in specification should ensure that
any newly discovered object must fit into one, and only one, category or object class.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
ONTOLOGIES
An ontology formalizes and specifies the names, definitions,
and attributes of entities within a domain. For practical
purposes, an accepted ontology defines the domain.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
RDF - Resource Description Framework - A directed, labeled graph.
RDFS - RDF Specifications Suite Recommendations (Language for representing RDF
vocabularies)
SPARQL - A Semantic Protocol & Query Language for RDF Data
OWL - The Web Ontology Language is a Semantic We
language designed to represent knowledge about things
and relationships between things on the Web.
An OWL Document is an Ontology.
https://www.w3.org/2013/data/
THE SEMANTIC WEB - ALL DATA SHOULD BE ASSOCIATED WITH SEMANTIC ATTRIBUTES (MEANING)
BASICS OF THE W3C SEMANTIC WEB ONTOLOGY STACK
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
CRITICAL QUESTIONS…
What data do we need?

What data will be produced?

Where does the data get created?

Where does the data get analyzed/refined?

How do we present/output the data?
And for each data category & data lifecycle phase,

What does it look like?

How much is there?
Architectural
Influences
Copyright (c) 2014-2017 by STORM Insights Inc. All Rights Reserved.
DEEP STRUCTURE REQUIRES STRONGER METHODS FOR ANALYSIS
Perception: obvious
structure is easy to
process…
but most of the
interesting stuff isn’t
obvious to a
computer.
Issue:
Do we store or
generate all
intermediate forms?
STATIC
DIVERTED OR
SAMPLED
STREAMINGIN MOTION
STOP AND FRISK
STORED
DATA - SLOTH KILLS
To understand (analyze) data…
Divert the flow?
Pool the data?
Evaluate everything without changing the flow?
Sample? (catch and release?)
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
COMPLEXITY VS MOBILITY
CCTV SmartPhone
Traffic
Counter Fitbit
Data
Complexity
Stationary Mobile
Low
High
Weather
Station
Telematic
Device
DATA ATTRIBUTES DICTATE ARCHITECTURE CHOICES
Speed
Streaming
Structure/Complexity
Surface_Shallow Dense_Deep
Static
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
DATA LOCATION INFLUENCES ARCHITECTURE CHOICES
Speed
Streaming
Location
Sensor Gateway Cloud Data Center
Static
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
ALGORITHM ATTRIBUTES DICTATE ARCHITECTURE CHOICES
Parallelism
Embarrassing
Computational Complexity
n
Sequential
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
(Parallelism and computational
complexity are not actually
orthogonal…)
p(polynomial)
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
DATABASE OPTIONS
What Do You Want/Need to Store?

How much? How complex? How fast?
What Do You Want/Need to DO With What You Store?
Options Include…
Files, tables, trees, queues, stacks, lists…

Hierarchical

RDBMS

Object DBMS

NoSQL

Graph
How You Think About a Domain…
…influences your choice of maps and models…

rules and representations…and required operations.
Data Management
Options
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
EVOLUTION OF DATA MANAGEMENT SOLUTIONS
Images courtesy of Wikipedia
Today:
Delta Airlines processes 5,000,000 business events per day
Pratt & Whitney jet engine: 5,000 sensors producing 10GB/s/per engine.
Formula 1 car sensors produce about 1.2GB/s
and we need to predict the future…
Perform Operations on Data at Rest
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
GRAPH DATABASES FOR GRAPH DATA!
Why choose a graph database?
Speed to delivery when the data is naturally modeled as a graph

Simplifies multi-hop queries

Visualization? Baked-in
Do you need an on-premise solution, or to manage your own database?
You Probably Already Think In Graphs if…
You watch detective shows
You remember relationships between people
You took a biology class
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Wikipedia contributors. "Taxonomy (biology)." Wikipedia,
The Free Encyclopedia. Wikipedia, The Free Encyclopedia,
11 May. 2016. Web. 12 May. 2016.
GRAPHS 101
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Typical crazy wall whiteboard - from Fargo.
A screen from IBM I2 Coplink
GRAPHS 101
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
GRAPHS 101
Family Tree
LinkedIn Tree
GRAPHS SHOULD BE PART OF YOUR TOOLKIT
A graph is a structure with vertices and edges.
a
e
dc
b
Old Post Road
Cross Highway
Compo
Shinbone Alley
Elk Road
Old Post Road Paved

Old Post Road 11 miles

Elk Road Dirt

Elk Road 2 miles

Cross Highway toll road

Cross Highway 250 miles

Main Street 1 mile

Shinbone Alley .5 miles

a bus stop

b gas station

b Shell

c Elementary school

d House

e Office building
May be labeled, edges may be directed, all may
be stored/processed by properties
represented as key/value pairs.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
GRAPHS HAVE RELEVANT MATHEMATICAL PROPERTIES
e.g. If you represent a graph as a matrix M, then values in Mn
represent the number of paths of length n in the original graph.
a
e
dc
b
a b c d e
a 1
b 1
c 1
d 1
e 1
M =
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
OVERVIEW OF THE GRAPH DATABASE MARKET
Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11
Property
graph
RDF
RDF - Resource Description Framework, W3C specs for
metadata modeling, now used in knowledge management
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
OPEN SOURCE FOR GRAPH DATA
Apache TinkerPop, TinkerPop, Apache, Apache feather logo, and Apache TinkerPop project logo are
either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.
Apache TinkerPop™ is a graph computing framework for both
graph databases (OLTP) and graph analytic systems (OLAP).
“A graph is a structure composed of vertices and edges. Both vertices and edges
can have an arbitrary number of key/value-pairs called properties. Vertices denote
discrete objects such as a person, a place, or an event. Edges denote relationships
between vertices. For instance, a person may know another person, have been
involved in an event, and/or was recently at a particular place. Properties express
non-relational information about the vertices and edges. Example properties include
a vertex having a name, an age and an edge having a timestamp and/or a weight.
Together, the aforementioned graph is known as a property graph and it is the
foundational data structure of Apache TinkerPop.”
Apache TinkerPop™ is an open source, vendor-agnostic, graph computing
framework distributed under the commercial friendly Apache2 license. When a data
system is TinkerPop-enabled, its users are able to model their domain as a graph
and analyze that graph using the Gremlin graph traversal language.
OPEN SOURCE PROJECTS
Apache Spark
Registered trademarks or trademarks of The Apache Software Foundation
UIMA
Hadoop
Open Source
for Infrastructure
RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS
Apache Storm: “a free and open source distributed realtime
computation system. Storm makes it easy to reliably process
unbounded streams of data, doing for realtime processing what
Hadoop did for batch processing.”
Apache Spark Streaming: “Spark Streaming brings Apache
Spark's language-integrated API to stream processing, letting you
write streaming jobs the same way you write batch jobs.”
Registered trademarks or trademarks of The Apache Software Foundation
RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS
Apache Flink: “open-source stream processing framework for
distributed, high-performing, always-available, and accurate data
streaming applications.”
Apache Samza: “a distributed stream processing framework. It
uses Apache Kafka for messaging, and Apache Hadoop YARN to
provide fault tolerance, processor isolation, security, and resource
management.”
Apache Apex: “Enterprise-grade unified stream and batch
processing engine.”
Registered trademarks or trademarks of The Apache Software Foundation
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
USE PRE-BUILT KNOWLEDGE RESOURCES
Off The Shelf
Knowledge
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
OPENCYC
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
OFF THE SHELF KNOWLEDGE - NEED TO ASSOCIATE/RECOGNIZE/UNDERSTAND TO
ORGANIZE/REPRESENT
Wordnet(R) Princeton
University "About WordNet." 

Princeton University. 2010.
<http://
wordnet.princeton.edu>
Do you have or can you capture streaming data that can increase your value proposition?
Data about your product that can improve performance, reliability, predictability…
Can you create value from new analysis of open data?
Adding your own data/algorithms to open data creates value.
Start by evaluating the emerging open source de facto standards.
Choose an infrastructure that allows you to evaluate live streaming data in the context of
relevant historical data.
It’s All About the Data
GETTING STARTED…
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Basic
Principles
Today:
Delta Airlines processes
5,000,000 business events per day
Pratt & Whitney jet engine:
5,000 sensors producing
10GB/s/per engine.
Formula 1 car sensors produce
about 1.2GB/s
and we need to predict the future…
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
AS THE SCOPE CHANGES, SO MUST THE SOLUTIONS
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
PRODUCTION ARCHITECTURE VS TRAINING ARCHITECTURE: CHALLENGE YOUR ASSUMPTIONS
In Production, 

May Scale UP or DOWN.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
The SourceFog Cloud
Data CenterGateway
SHOULD YOU MOVE THE COMPUTATION TO THE DATA, OR DATA TO THE PROCESSOR?
STREAMING
ANALYTICS
DESCRIPTIVE
DATA
PREDICTIVE
STREAMING ANALYTICS: MOVE THE PROCESS TO THE DATA
STREAMING ANALYTICS: STATISTICAL ANALYSIS OF DATA IN MOTION
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED INTELLIGENCE & ANALYTICS
Sensors/
Devices
Train the Deep Learning Model
Data Center

Cloud

Cluster

Network
Compress & Run

The DL Model
DataSources
Store
Process/Transform
Observe
Key
Data Flows on the Edges, Queries Everywhere
Sampling vs Monitoring Everything…
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
STREAMING DATA ARCHITECTURE
PRIMUS INTER PARES
Cloud First!
Mobile First!
AI First!
Data First!!!
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
6 RECOMMENDATIONS
Define Your Application Requirements in Terms of Data

Streaming? Plan for it

Process/Analyze As Close to the Source as Possible

Move Intelligence To The Edge (Fog)

Parallelism in Algorithms? Exploit it with hardware

Start With Open Source for Infrastructure
adrian@storminsights.com
Twitter @ajbowles
Skype ajbowles
If you would like to connect on LinkedIn,
please let me know that you that you
registered for the Smart Data webinar series.
NEXT WEEK…
October 18 Enterprise Analytics Online


1PM Eastern: 

ModernAI From Machine Learning to Cognitive Computing
KEEP IN TOUCH
Upcoming SmartData Webinar Dates & Topics
Nov. 9 See Me Feel Me, Touch Me, Heal Me:

The Rise of the Cognitive Interface

Dec. 14 The Road to Autonomous Applications

Jan. 11 AI At The Edge:

Pushing Intelligence to Fog Computing Nodes

More Related Content

What's hot

The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data LakeCaserta
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkCaserta
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipDATAVERSITY
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Caserta
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesLars E Martinsson
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on HadoopCaserta
 
Big Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesBig Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesSlideTeam
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDATAVERSITY
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
NLB Analytics Overview
NLB Analytics OverviewNLB Analytics Overview
NLB Analytics OverviewKevin Dingle
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Mastering Data Modeling for NoSQL Platforms
Mastering Data Modeling for NoSQL PlatformsMastering Data Modeling for NoSQL Platforms
Mastering Data Modeling for NoSQL PlatformsDATAVERSITY
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeDATAVERSITY
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Caserta
 
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017Caserta
 
Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingCaserta
 
The Missed Promise of Hadoop and New and Emerging Technologies
The Missed Promise of Hadoop and New and Emerging TechnologiesThe Missed Promise of Hadoop and New and Emerging Technologies
The Missed Promise of Hadoop and New and Emerging TechnologiesDATAVERSITY
 

What's hot (20)

The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
Advanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and StewardshipAdvanced Analytics Governance - Effective Model Management and Stewardship
Advanced Analytics Governance - Effective Model Management and Stewardship
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
Big Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation SlidesBig Data Analytics Architecture PowerPoint Presentation Slides
Big Data Analytics Architecture PowerPoint Presentation Slides
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
NLB Analytics Overview
NLB Analytics OverviewNLB Analytics Overview
NLB Analytics Overview
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Mastering Data Modeling for NoSQL Platforms
Mastering Data Modeling for NoSQL PlatformsMastering Data Modeling for NoSQL Platforms
Mastering Data Modeling for NoSQL Platforms
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data Lake
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017
 
Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven Marketing
 
The Missed Promise of Hadoop and New and Emerging Technologies
The Missed Promise of Hadoop and New and Emerging TechnologiesThe Missed Promise of Hadoop and New and Emerging Technologies
The Missed Promise of Hadoop and New and Emerging Technologies
 

Similar to Smart Data Webinar: Choosing the Right Data Management Architecture for Cognitive Computing

Streaming Analytics for IoT-Oriented Applications
Streaming Analytics for IoT-Oriented ApplicationsStreaming Analytics for IoT-Oriented Applications
Streaming Analytics for IoT-Oriented ApplicationsDATAVERSITY
 
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...DATAVERSITY
 
Smart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceSmart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceDATAVERSITY
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateDATAVERSITY
 
GPSBUS201-GPS Demystifying Artificial Intelligence
GPSBUS201-GPS Demystifying Artificial IntelligenceGPSBUS201-GPS Demystifying Artificial Intelligence
GPSBUS201-GPS Demystifying Artificial IntelligenceAmazon Web Services
 
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...DATAVERSITY
 
Applying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesApplying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesData Ninja API
 
SmartData Slides: Machine Learning - From Discovery to Understanding
SmartData Slides: Machine Learning - From Discovery to UnderstandingSmartData Slides: Machine Learning - From Discovery to Understanding
SmartData Slides: Machine Learning - From Discovery to UnderstandingDATAVERSITY
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data ScienceDataWorks Summit
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachAndre Freitas
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) SkillsOscar Corcho
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezBig Data Spain
 
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017Amazon Web Services
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...Alex Liu
 
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...Amazon Web Services
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 
IIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data ScienceIIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data Scienceds4good
 
Decoding Data Science
Decoding Data ScienceDecoding Data Science
Decoding Data ScienceMatt Fornito
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...Codemotion Tel Aviv
 

Similar to Smart Data Webinar: Choosing the Right Data Management Architecture for Cognitive Computing (20)

Streaming Analytics for IoT-Oriented Applications
Streaming Analytics for IoT-Oriented ApplicationsStreaming Analytics for IoT-Oriented Applications
Streaming Analytics for IoT-Oriented Applications
 
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
 
Smart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a ServiceSmart Data Webinar: Knowledge as a Service
Smart Data Webinar: Knowledge as a Service
 
Smart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning UpdateSmart Data Webinar: Machine Learning Update
Smart Data Webinar: Machine Learning Update
 
GPSBUS201-GPS Demystifying Artificial Intelligence
GPSBUS201-GPS Demystifying Artificial IntelligenceGPSBUS201-GPS Demystifying Artificial Intelligence
GPSBUS201-GPS Demystifying Artificial Intelligence
 
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
Smart Data Slides: Modern AI and Cognitive Computing - Boundaries and Opportu...
 
Applying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesApplying large scale text analytics with graph databases
Applying large scale text analytics with graph databases
 
SmartData Slides: Machine Learning - From Discovery to Understanding
SmartData Slides: Machine Learning - From Discovery to UnderstandingSmartData Slides: Machine Learning - From Discovery to Understanding
SmartData Slides: Machine Learning - From Discovery to Understanding
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
 
Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycle
 
(Big) Data (Science) Skills
(Big) Data (Science) Skills(Big) Data (Science) Skills
(Big) Data (Science) Skills
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier Dominguez
 
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
The IoT Offering Explained in Plain English - IOT201 - re:Invent 2017
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
 
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...
NEW LAUNCH! Graph-based Approaches for Cyber Investigative Analytics Using GP...
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
IIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data ScienceIIPGH Webinar 1: Getting Started With Data Science
IIPGH Webinar 1: Getting Started With Data Science
 
Decoding Data Science
Decoding Data ScienceDecoding Data Science
Decoding Data Science
 
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Recently uploaded

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Recently uploaded (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Smart Data Webinar: Choosing the Right Data Management Architecture for Cognitive Computing

  • 1. Choosing the Right Data Management Architecture for Cognitive Computing Adrian Bowles, PhD Founder, STORM Insights, Inc. Lead Analyst, AI, Aragon Research info@storminsights.com Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. OCTOBER 12, 2017
  • 2. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. AGENDA - CHOOSING THE RIGHT DATA MANAGEMENT ARCHITECTURE FOR COGNITIVE COMPUTING The Role of Data In AI & CC What do we need to manage? Application, Data, and Algorithm Attributes that Influence Architecture Database Options Open Source Infrastructure Prebuilt Knowledge Getting Started: Basic Principles
  • 3. Model Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. COGNITIVE COMPUTING FUNDAMENTALS: MODELS & ASSUMPTIONS Model The Corpus, Assumptions, Algorithms Used to Generate & Score Hypotheses or Calculate The Strength of a Relationship Principles that control the development and representation of natural intelligence in the neocortex provide a guide to the implementation of machine intelligence.(Numenta Hierarchical Temporal Memory) A function applied to a string representing data or a concept results in a value or vector meaningful for comparison. A Model is an Abstract Representation of Reality Essential Data for Cognitive Computing
  • 4. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. MODELS WILL MAKE OR BREAK YOUR APPLICATION Your Model The Real World “When the map and the terrain disagree, believe the terrain.” Gause and Weinberg (Exploring Requirements)
  • 5. Systems Controls Learn Plan Reason Understand Model Data Mgmt Human Machine Input Output Gestures Emotions Language Narrative Generation Visualization Reports Haptics Sensors (IOT) Systems Controls Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. COGNITIVE SYSTEMS: COMMUNICATIONS & CONTROL Perception
  • 6. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. WHERE YOU ARE DICTATES WHAT YOU NEED Ingest Analyze Maintain/Manage
  • 7. When everything is connected… New sources of data emerge New sources of value emerge Old assumptions must be challenged Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. THE IMPACT OF THE IOT
  • 8. CHOICES HAVE CONSEQUENCES How You Think About a Domain… …influences your choice of maps and models… rules and representations…and required operations.
  • 9. HOW YOU ORGANIZE CONSTRAINS HOW YOU WORK - DESIGN WORKFLOW FIRST Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 10. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. START WITH A TAXONOMY A taxonomy represents the formal structure of classes or types of objects within a domain. •Generally hierarchical and provide names for each class in the domain. •May also capture the membership properties of each object in relation to the other objects. •The rules of a specific taxonomy are used to classify or categorize any object in the domain, so they must be complete, consistent, and unambiguous. This rigor in specification should ensure that any newly discovered object must fit into one, and only one, category or object class.
  • 11. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. ONTOLOGIES An ontology formalizes and specifies the names, definitions, and attributes of entities within a domain. For practical purposes, an accepted ontology defines the domain.
  • 12. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. RDF - Resource Description Framework - A directed, labeled graph. RDFS - RDF Specifications Suite Recommendations (Language for representing RDF vocabularies) SPARQL - A Semantic Protocol & Query Language for RDF Data OWL - The Web Ontology Language is a Semantic We language designed to represent knowledge about things and relationships between things on the Web. An OWL Document is an Ontology. https://www.w3.org/2013/data/ THE SEMANTIC WEB - ALL DATA SHOULD BE ASSOCIATED WITH SEMANTIC ATTRIBUTES (MEANING) BASICS OF THE W3C SEMANTIC WEB ONTOLOGY STACK
  • 13. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. CRITICAL QUESTIONS… What data do we need? What data will be produced? Where does the data get created? Where does the data get analyzed/refined? How do we present/output the data? And for each data category & data lifecycle phase, What does it look like? How much is there? Architectural Influences
  • 14. Copyright (c) 2014-2017 by STORM Insights Inc. All Rights Reserved. DEEP STRUCTURE REQUIRES STRONGER METHODS FOR ANALYSIS Perception: obvious structure is easy to process… but most of the interesting stuff isn’t obvious to a computer. Issue: Do we store or generate all intermediate forms?
  • 15. STATIC DIVERTED OR SAMPLED STREAMINGIN MOTION STOP AND FRISK STORED DATA - SLOTH KILLS To understand (analyze) data… Divert the flow? Pool the data? Evaluate everything without changing the flow? Sample? (catch and release?) Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 16. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. COMPLEXITY VS MOBILITY CCTV SmartPhone Traffic Counter Fitbit Data Complexity Stationary Mobile Low High Weather Station Telematic Device
  • 17. DATA ATTRIBUTES DICTATE ARCHITECTURE CHOICES Speed Streaming Structure/Complexity Surface_Shallow Dense_Deep Static Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 18. DATA LOCATION INFLUENCES ARCHITECTURE CHOICES Speed Streaming Location Sensor Gateway Cloud Data Center Static Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 19. ALGORITHM ATTRIBUTES DICTATE ARCHITECTURE CHOICES Parallelism Embarrassing Computational Complexity n Sequential Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. (Parallelism and computational complexity are not actually orthogonal…) p(polynomial)
  • 20. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. DATABASE OPTIONS What Do You Want/Need to Store? How much? How complex? How fast? What Do You Want/Need to DO With What You Store? Options Include… Files, tables, trees, queues, stacks, lists… Hierarchical RDBMS Object DBMS NoSQL Graph How You Think About a Domain… …influences your choice of maps and models… rules and representations…and required operations. Data Management Options
  • 21. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. EVOLUTION OF DATA MANAGEMENT SOLUTIONS Images courtesy of Wikipedia Today: Delta Airlines processes 5,000,000 business events per day Pratt & Whitney jet engine: 5,000 sensors producing 10GB/s/per engine. Formula 1 car sensors produce about 1.2GB/s and we need to predict the future… Perform Operations on Data at Rest
  • 22. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. GRAPH DATABASES FOR GRAPH DATA! Why choose a graph database? Speed to delivery when the data is naturally modeled as a graph Simplifies multi-hop queries Visualization? Baked-in Do you need an on-premise solution, or to manage your own database? You Probably Already Think In Graphs if… You watch detective shows You remember relationships between people You took a biology class
  • 23. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. Wikipedia contributors. "Taxonomy (biology)." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 May. 2016. Web. 12 May. 2016. GRAPHS 101
  • 24. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. Typical crazy wall whiteboard - from Fargo. A screen from IBM I2 Coplink GRAPHS 101
  • 25. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. GRAPHS 101 Family Tree LinkedIn Tree
  • 26. GRAPHS SHOULD BE PART OF YOUR TOOLKIT A graph is a structure with vertices and edges. a e dc b Old Post Road Cross Highway Compo Shinbone Alley Elk Road Old Post Road Paved Old Post Road 11 miles Elk Road Dirt Elk Road 2 miles Cross Highway toll road Cross Highway 250 miles Main Street 1 mile Shinbone Alley .5 miles a bus stop b gas station b Shell c Elementary school d House e Office building May be labeled, edges may be directed, all may be stored/processed by properties represented as key/value pairs.
  • 27. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. GRAPHS HAVE RELEVANT MATHEMATICAL PROPERTIES e.g. If you represent a graph as a matrix M, then values in Mn represent the number of paths of length n in the original graph. a e dc b a b c d e a 1 b 1 c 1 d 1 e 1 M =
  • 28. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. OVERVIEW OF THE GRAPH DATABASE MARKET Wikipedia contributors. "Graph database." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 11 Property graph RDF RDF - Resource Description Framework, W3C specs for metadata modeling, now used in knowledge management
  • 29. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. OPEN SOURCE FOR GRAPH DATA Apache TinkerPop, TinkerPop, Apache, Apache feather logo, and Apache TinkerPop project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries. Apache TinkerPop™ is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP). “A graph is a structure composed of vertices and edges. Both vertices and edges can have an arbitrary number of key/value-pairs called properties. Vertices denote discrete objects such as a person, a place, or an event. Edges denote relationships between vertices. For instance, a person may know another person, have been involved in an event, and/or was recently at a particular place. Properties express non-relational information about the vertices and edges. Example properties include a vertex having a name, an age and an edge having a timestamp and/or a weight. Together, the aforementioned graph is known as a property graph and it is the foundational data structure of Apache TinkerPop.” Apache TinkerPop™ is an open source, vendor-agnostic, graph computing framework distributed under the commercial friendly Apache2 license. When a data system is TinkerPop-enabled, its users are able to model their domain as a graph and analyze that graph using the Gremlin graph traversal language.
  • 30. OPEN SOURCE PROJECTS Apache Spark Registered trademarks or trademarks of The Apache Software Foundation UIMA Hadoop Open Source for Infrastructure
  • 31. RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS Apache Storm: “a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.” Apache Spark Streaming: “Spark Streaming brings Apache Spark's language-integrated API to stream processing, letting you write streaming jobs the same way you write batch jobs.” Registered trademarks or trademarks of The Apache Software Foundation
  • 32. RELEVANT APACHE SOFTWARE FOUNDATION OPEN SOURCE PROJECTS Apache Flink: “open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.” Apache Samza: “a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.” Apache Apex: “Enterprise-grade unified stream and batch processing engine.” Registered trademarks or trademarks of The Apache Software Foundation
  • 33. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. USE PRE-BUILT KNOWLEDGE RESOURCES Off The Shelf Knowledge
  • 34. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. OPENCYC
  • 35. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. OFF THE SHELF KNOWLEDGE - NEED TO ASSOCIATE/RECOGNIZE/UNDERSTAND TO ORGANIZE/REPRESENT Wordnet(R) Princeton University "About WordNet." Princeton University. 2010. <http:// wordnet.princeton.edu>
  • 36. Do you have or can you capture streaming data that can increase your value proposition? Data about your product that can improve performance, reliability, predictability… Can you create value from new analysis of open data? Adding your own data/algorithms to open data creates value. Start by evaluating the emerging open source de facto standards. Choose an infrastructure that allows you to evaluate live streaming data in the context of relevant historical data. It’s All About the Data GETTING STARTED… Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. Basic Principles
  • 37. Today: Delta Airlines processes 5,000,000 business events per day Pratt & Whitney jet engine: 5,000 sensors producing 10GB/s/per engine. Formula 1 car sensors produce about 1.2GB/s and we need to predict the future… Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. AS THE SCOPE CHANGES, SO MUST THE SOLUTIONS
  • 38. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. PRODUCTION ARCHITECTURE VS TRAINING ARCHITECTURE: CHALLENGE YOUR ASSUMPTIONS In Production, May Scale UP or DOWN.
  • 39. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. The SourceFog Cloud Data CenterGateway SHOULD YOU MOVE THE COMPUTATION TO THE DATA, OR DATA TO THE PROCESSOR?
  • 40. STREAMING ANALYTICS DESCRIPTIVE DATA PREDICTIVE STREAMING ANALYTICS: MOVE THE PROCESS TO THE DATA STREAMING ANALYTICS: STATISTICAL ANALYSIS OF DATA IN MOTION Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 41. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 3-TIER IOT ARCHITECTURE ENABLES DISTRIBUTED INTELLIGENCE & ANALYTICS Sensors/ Devices Train the Deep Learning Model Data Center Cloud Cluster Network Compress & Run The DL Model
  • 42. DataSources Store Process/Transform Observe Key Data Flows on the Edges, Queries Everywhere Sampling vs Monitoring Everything… Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. STREAMING DATA ARCHITECTURE
  • 43. PRIMUS INTER PARES Cloud First! Mobile First! AI First! Data First!!! Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved.
  • 44. Copyright (c) 2017 by STORM Insights Inc. All Rights Reserved. 6 RECOMMENDATIONS Define Your Application Requirements in Terms of Data
 Streaming? Plan for it Process/Analyze As Close to the Source as Possible
 Move Intelligence To The Edge (Fog) Parallelism in Algorithms? Exploit it with hardware
 Start With Open Source for Infrastructure
  • 45. adrian@storminsights.com Twitter @ajbowles Skype ajbowles If you would like to connect on LinkedIn, please let me know that you that you registered for the Smart Data webinar series. NEXT WEEK… October 18 Enterprise Analytics Online 
 1PM Eastern: ModernAI From Machine Learning to Cognitive Computing KEEP IN TOUCH Upcoming SmartData Webinar Dates & Topics Nov. 9 See Me Feel Me, Touch Me, Heal Me:
 The Rise of the Cognitive Interface
 Dec. 14 The Road to Autonomous Applications
 Jan. 11 AI At The Edge: Pushing Intelligence to Fog Computing Nodes