SlideShare a Scribd company logo
1 of 27
Download to read offline
Why Big Data is Really
About Small Data:
The Big Data Paradox
Judith Hurwitz
President & CEO, Hurwitz & Associates
Agenda
§  What is so big about Big Data?
§  What is a data scientist
§  Data at rest, data in motion
§  Is Big Analytics more important?
§  Rethinking data modeling in a big data world
§  A couple of examples
§  What you should think about
§  Questions?
Meet the Speaker
§  Judith Hurwitz
§ 

President and CEO of Hurwitz & Associates, Inc., a strategy consulting and research firm
focused on distributed computing technologies. A pioneer in anticipating technology
innovation and adoption, Judith advocates for a pragmatic adoption of an architectural and
business approach to the emerging market for cloud computing, service orientation, and
service management. She has served as a trusted advisor to many industry leaders over the
years. Judith has helped these companies make the transition to a new business model
focused on the business value of emerging platforms. Judith is an accomplished author and
most recently co-author of Big Data for Dummies.
Our Team’s Latest Book

4
What is so big about big data?
§  Definition of Big Data
§ 
§ 
§ 
§ 

Volume – How much data
Variety – Various types of data (structured, unstructured)
Velocity – Speed that data moves from one location to another
Veracity – Accuracy (Do the results of a big data analysis make
sense?)

§  Big Data is not new
§  So, why now?

§  Impacting the way you collect, store, manage, analyze,
and visualize data
What is the Purpose of Big Data?

§  Gather, store, manage, and manipulate
vast amounts of data at the right speed, at
the right time to get the right results
§  Gather enough data so that you can find
patterns
§  Put those patterns to work to gain insights
in context

6
Examples of Big Data
§  Analyze multiple data sources to detect and protect
against insider trading, money laundering, credit card
theft
§  Monitoring market feeds
§  Managing risk models
§  Log files
§  Spatial data from sensors
§  Medical device data – data from sensors connected to
medical equipment
§  GPS data
§  Unstructured data in emails, text messages, call center
notes
7
Why do we need to think about Big Data?

§  What big data means
to business
§  More data for better
decision making
§  Integration of data
across business units
and silos
§  Detecting risks in real
time

§  Focus on putting
information in context
with supporting
business decisions
§  Improving the
customer experience
by leveraging
customer feedback
from many different
sources
8
From Big to Small
•  Big data is only the first
step in the journey
•  Big data requires that
you reduce the amount
of data to a subset so
that your organization
can take a deeper look
•  Once this subset of
data is cleansed and
verified, it can help
analyze, predict, and
prepare to address the
future
9
The Role of a Data Scientist?
§  Combining computing science, math, statistics, and
business (domain) knowledge
§  Looking for answers when you don’t know the question
you want to ask
§  Asking new types of questions: finding nuggets of
actionable information in huge volumes of data
§  Making analytics consumable: real-time analysis to help
the business take the right action at the right time
§  Predictive analytics: What is the next best action?

10
Representation Technology Stack

Interfaces$and$feeds$from/to$internal$applica@ons$

Interfaces$and$feeds$from/to$the$Internet$

Big$Data$Tech$Stack$

Big$Data$Applica@ons$
Repor@ng$&$Visualiza@on$
Analy@cs$(Tradi@onal$and$Advanced)$
Analy@cal$Data$Warehouses$and$Data$Marts$
“Organizing”$Databases$and$Tools$
Opera@onal$Databases$(Structured,$Unstructured,$SemiMstructured)$
Security$Infrastructure$
Redundant$Physical$Infrastructure$

11
Where Most of This Began

Data	
  Warehouse

Data
Mart

Transactional
System
(Production	
  Data)

12
Then It Got “Better”

Data	
  Warehouse

Data
Mart

Data	
  Warehouse

Data
Mart

Transactional
System
(Production	
  Data)

Transactional
System
(Production	
  Data)

13
Then It Got “More Better”

Operational	
  
System

LOB
Data
Mart

Operational	
  
System

Data	
  Warehouse

LOB
Data
Mart

LOB
Data
Mart
Transactional
System(s)

14
And Better Still

Operational	
  
System

Operational	
  
System

LOB
Data
Mart

Staging
Area

Data	
  Warehouse

LOB
Data
Mart

LOB
Data
Mart
Transactional
System(s)

15
Oops. Data at rest vs. data in motion

Operational	
  
System

Operational	
  
System

Staging
Area

????

Transactional
System(s)

16
Data At Rest, Data In Motion
§  Data in motion is no longer a bad thing
§  Trend is combining “traditional” with
streaming
§  Instant analysis isn’t fast enough
§  It’s all about real-time

§  What data to keep?
Is Big Analytics More Important?
§  In a word

YES

§  We are looking for answers to questions we haven’t
asked yet
§  Patterns, patterns, patterns
§  But…
§  Current generation analytics engines can be overwhelmed
§  Results may be too difficult to understand even with visualization
§  You may be looking in the wrong place or at the wrong things
Is Hadoop the New EDW?
§  No one type of Big Data platform is optimal for all
requirements
§  Hadoop is changing the economics of storing and
analyzing large volumes and variety of data
§  Results of Hadoop analytics needs to be understood in
context
§  Increasing importance of hybrid big data architectures –
combine Hadoop with your systems of record
§  Hadoop for specific roles
§  Exploratory data-science sandboxes
§  Staging platform for unstructured data

19
Rethinking Data Modeling
§  Traditional data models assume:
§  Relational data
§  Clean data
§  A few clearly identifiable data sources

§  Next generation data model – the rules have changed
§ 
§ 
§ 
§ 

Some relational data, some NoSQL
Some of the data is dirty
Lots of data sources coming from many different places
Some of the data you will keep and some you will not

§  Design your data model to account for new world of
large and varied data sources

20
Big Data Use Cases
§  “Voice of the Customer”, 360-degree view of customer
§  Strengthen brand and increase customer loyalty
§  Improve operational analytics
§  Target and reduce fraud and improve security
§  Use sensors to provide real-time information about rivers
and oceans to predict impact of environmental changes

21
Correlating Varied Data Sources in Finance
§  Financial services is highly competitive and highly regulated.
Financial services needs to create innovative customer experience
while protecting IP. Companies need to anticipate the next best
action.
§  What type of data is needed?
§ 
§ 
§ 
§ 
§ 
§ 
§ 
§ 

Transaction data
Threat data
Log data
Customer survey data
Customer support data
Customer social media data
Partner data
News and event data, ……

§  Need to be able to correlate all types of structured and unstructured
data to predict the future and provide opportunities for growth and
expansion
22
Advanced Security Analytics to Predict and Protect
§  Government agency needed more visibility into all
system traffic
§  Concern about the unknown – needed to look for and
protect from malicious activity
§  Used advanced security analytics to correlate data
across seemingly unrelated events
§  Real-time
§  Analyze variety data sources- emails, documents, social
media data, business process data, DNS transactions
§  Analyze massive amounts structured and unstructured
data
23
Matching Capabilities to Business Problems

§  Text Analytics
§  Next Best Action
§  Data in Motion
§  Adding business
process and rules
§  Anamoly Detection
§  Data Visualization

§  Correlation between
customer service,
comments in the
market, customer
management
§  Putting a lot of data
types together to
determine best
actions
§  Detecting Fraud
24
How Do You Manage Big Data?

§  Big data is not clean –
it is massive and
much is unstructured
§  Resulting patterns
from big data
analytics needs to be
culled, cleaned and
matched to enterprise
data

§  Culled data now must
be analyzed in
context with your
systems of record
§  Apply data
visualization and best
practices to determine
how to apply data to
actions

25
You need to think about the following:
§  Where are the sources of
the data that could be
important?
§  How often do you need
access to particular types
of data?
§  How long and how much
data do you need to
keep?
§  Can you trust the data
and its sources?

§  Use Big Data analytics to
overcome conventional
wisdom and conventional
thinking.
§  If you already know the
questions to ask you
aren’t moving forward.

26
Q&A

§  Thank you!
§  Contact info:
§  Judith Hurwitz: judith.hurwitz@hurwitz.com

27

More Related Content

What's hot

Becoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italyBecoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italySAS Italy
 
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you Intellectyx Inc
 
DMTI Spatial Location Hub Analytics: big data, analytics, visualization
DMTI Spatial Location Hub Analytics: big data, analytics, visualizationDMTI Spatial Location Hub Analytics: big data, analytics, visualization
DMTI Spatial Location Hub Analytics: big data, analytics, visualizationDMTI Spatial
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on HadoopCaserta
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"MDS ap
 
Big, small or just complex data?
Big, small or just complex data?Big, small or just complex data?
Big, small or just complex data?panoratio
 
Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Businessazuyo.com
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
Deliver Data Governance with a “Yes”
Deliver Data Governance with a “Yes”Deliver Data Governance with a “Yes”
Deliver Data Governance with a “Yes”Jean-Michel Franco
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationDoug Denton
 
Building a Data Driven Organization
Building a Data Driven OrganizationBuilding a Data Driven Organization
Building a Data Driven OrganizationIT Weekend
 
Analytics, Business Intelligence, and Data Science - What's the Progression?
Analytics, Business Intelligence, and Data Science - What's the Progression?Analytics, Business Intelligence, and Data Science - What's the Progression?
Analytics, Business Intelligence, and Data Science - What's the Progression?DATAVERSITY
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperRobertson Executive Search
 
CIO Review - Treselle Systems
CIO Review - Treselle SystemsCIO Review - Treselle Systems
CIO Review - Treselle SystemsTharun Sairam
 

What's hot (17)

Big data
Big dataBig data
Big data
 
Becoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italyBecoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italy
 
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you
Whitepaper: Thriving in the Big Data era Manage Data before Data Manages you
 
DMTI Spatial Location Hub Analytics: big data, analytics, visualization
DMTI Spatial Location Hub Analytics: big data, analytics, visualizationDMTI Spatial Location Hub Analytics: big data, analytics, visualization
DMTI Spatial Location Hub Analytics: big data, analytics, visualization
 
Big Data SurVey - IOUG - 2013 - 594292
Big Data SurVey - IOUG - 2013 - 594292Big Data SurVey - IOUG - 2013 - 594292
Big Data SurVey - IOUG - 2013 - 594292
 
Data Analyics
Data AnalyicsData Analyics
Data Analyics
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Big, small or just complex data?
Big, small or just complex data?Big, small or just complex data?
Big, small or just complex data?
 
Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Business
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Deliver Data Governance with a “Yes”
Deliver Data Governance with a “Yes”Deliver Data Governance with a “Yes”
Deliver Data Governance with a “Yes”
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
 
Building a Data Driven Organization
Building a Data Driven OrganizationBuilding a Data Driven Organization
Building a Data Driven Organization
 
Analytics, Business Intelligence, and Data Science - What's the Progression?
Analytics, Business Intelligence, and Data Science - What's the Progression?Analytics, Business Intelligence, and Data Science - What's the Progression?
Analytics, Business Intelligence, and Data Science - What's the Progression?
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
 
CIO Review - Treselle Systems
CIO Review - Treselle SystemsCIO Review - Treselle Systems
CIO Review - Treselle Systems
 

Similar to Why Big Data is Really about Small Data

02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBigDataExpo
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data TipsQubole
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to knowV2Soft
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsDenodo
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfPridesys IT Ltd.
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyNeo4j
 
Data science and its potential to change business as we know it. The Roadmap ...
Data science and its potential to change business as we know it. The Roadmap ...Data science and its potential to change business as we know it. The Roadmap ...
Data science and its potential to change business as we know it. The Roadmap ...InnoTech
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 
Analytics for actuaries cia
Analytics for actuaries ciaAnalytics for actuaries cia
Analytics for actuaries ciaKevin Pledge
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfPridesys IT Ltd.
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperExperian
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data miningEmran Hossain
 
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Kevin Pledge
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...Experfy
 
Big Data for the Next Big Idea in Financial Services (Whitepaper)
Big Data for the Next Big Idea in Financial Services (Whitepaper)Big Data for the Next Big Idea in Financial Services (Whitepaper)
Big Data for the Next Big Idea in Financial Services (Whitepaper)NAFCU Services Corporation
 

Similar to Why Big Data is Really about Small Data (20)

Thilga
ThilgaThilga
Thilga
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
 
Big Data
Big DataBig Data
Big Data
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data Tips
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to know
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation Analytics
 
Bigdata (1) converted
Bigdata (1) convertedBigdata (1) converted
Bigdata (1) converted
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
Data science and its potential to change business as we know it. The Roadmap ...
Data science and its potential to change business as we know it. The Roadmap ...Data science and its potential to change business as we know it. The Roadmap ...
Data science and its potential to change business as we know it. The Roadmap ...
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Analytics for actuaries cia
Analytics for actuaries ciaAnalytics for actuaries cia
Analytics for actuaries cia
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White Paper
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
 
Big Data for the Next Big Idea in Financial Services (Whitepaper)
Big Data for the Next Big Idea in Financial Services (Whitepaper)Big Data for the Next Big Idea in Financial Services (Whitepaper)
Big Data for the Next Big Idea in Financial Services (Whitepaper)
 

Recently uploaded

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Why Big Data is Really about Small Data

  • 1. Why Big Data is Really About Small Data: The Big Data Paradox Judith Hurwitz President & CEO, Hurwitz & Associates
  • 2. Agenda §  What is so big about Big Data? §  What is a data scientist §  Data at rest, data in motion §  Is Big Analytics more important? §  Rethinking data modeling in a big data world §  A couple of examples §  What you should think about §  Questions?
  • 3. Meet the Speaker §  Judith Hurwitz §  President and CEO of Hurwitz & Associates, Inc., a strategy consulting and research firm focused on distributed computing technologies. A pioneer in anticipating technology innovation and adoption, Judith advocates for a pragmatic adoption of an architectural and business approach to the emerging market for cloud computing, service orientation, and service management. She has served as a trusted advisor to many industry leaders over the years. Judith has helped these companies make the transition to a new business model focused on the business value of emerging platforms. Judith is an accomplished author and most recently co-author of Big Data for Dummies.
  • 5. What is so big about big data? §  Definition of Big Data §  §  §  §  Volume – How much data Variety – Various types of data (structured, unstructured) Velocity – Speed that data moves from one location to another Veracity – Accuracy (Do the results of a big data analysis make sense?) §  Big Data is not new §  So, why now? §  Impacting the way you collect, store, manage, analyze, and visualize data
  • 6. What is the Purpose of Big Data? §  Gather, store, manage, and manipulate vast amounts of data at the right speed, at the right time to get the right results §  Gather enough data so that you can find patterns §  Put those patterns to work to gain insights in context 6
  • 7. Examples of Big Data §  Analyze multiple data sources to detect and protect against insider trading, money laundering, credit card theft §  Monitoring market feeds §  Managing risk models §  Log files §  Spatial data from sensors §  Medical device data – data from sensors connected to medical equipment §  GPS data §  Unstructured data in emails, text messages, call center notes 7
  • 8. Why do we need to think about Big Data? §  What big data means to business §  More data for better decision making §  Integration of data across business units and silos §  Detecting risks in real time §  Focus on putting information in context with supporting business decisions §  Improving the customer experience by leveraging customer feedback from many different sources 8
  • 9. From Big to Small •  Big data is only the first step in the journey •  Big data requires that you reduce the amount of data to a subset so that your organization can take a deeper look •  Once this subset of data is cleansed and verified, it can help analyze, predict, and prepare to address the future 9
  • 10. The Role of a Data Scientist? §  Combining computing science, math, statistics, and business (domain) knowledge §  Looking for answers when you don’t know the question you want to ask §  Asking new types of questions: finding nuggets of actionable information in huge volumes of data §  Making analytics consumable: real-time analysis to help the business take the right action at the right time §  Predictive analytics: What is the next best action? 10
  • 12. Where Most of This Began Data  Warehouse Data Mart Transactional System (Production  Data) 12
  • 13. Then It Got “Better” Data  Warehouse Data Mart Data  Warehouse Data Mart Transactional System (Production  Data) Transactional System (Production  Data) 13
  • 14. Then It Got “More Better” Operational   System LOB Data Mart Operational   System Data  Warehouse LOB Data Mart LOB Data Mart Transactional System(s) 14
  • 15. And Better Still Operational   System Operational   System LOB Data Mart Staging Area Data  Warehouse LOB Data Mart LOB Data Mart Transactional System(s) 15
  • 16. Oops. Data at rest vs. data in motion Operational   System Operational   System Staging Area ???? Transactional System(s) 16
  • 17. Data At Rest, Data In Motion §  Data in motion is no longer a bad thing §  Trend is combining “traditional” with streaming §  Instant analysis isn’t fast enough §  It’s all about real-time §  What data to keep?
  • 18. Is Big Analytics More Important? §  In a word YES §  We are looking for answers to questions we haven’t asked yet §  Patterns, patterns, patterns §  But… §  Current generation analytics engines can be overwhelmed §  Results may be too difficult to understand even with visualization §  You may be looking in the wrong place or at the wrong things
  • 19. Is Hadoop the New EDW? §  No one type of Big Data platform is optimal for all requirements §  Hadoop is changing the economics of storing and analyzing large volumes and variety of data §  Results of Hadoop analytics needs to be understood in context §  Increasing importance of hybrid big data architectures – combine Hadoop with your systems of record §  Hadoop for specific roles §  Exploratory data-science sandboxes §  Staging platform for unstructured data 19
  • 20. Rethinking Data Modeling §  Traditional data models assume: §  Relational data §  Clean data §  A few clearly identifiable data sources §  Next generation data model – the rules have changed §  §  §  §  Some relational data, some NoSQL Some of the data is dirty Lots of data sources coming from many different places Some of the data you will keep and some you will not §  Design your data model to account for new world of large and varied data sources 20
  • 21. Big Data Use Cases §  “Voice of the Customer”, 360-degree view of customer §  Strengthen brand and increase customer loyalty §  Improve operational analytics §  Target and reduce fraud and improve security §  Use sensors to provide real-time information about rivers and oceans to predict impact of environmental changes 21
  • 22. Correlating Varied Data Sources in Finance §  Financial services is highly competitive and highly regulated. Financial services needs to create innovative customer experience while protecting IP. Companies need to anticipate the next best action. §  What type of data is needed? §  §  §  §  §  §  §  §  Transaction data Threat data Log data Customer survey data Customer support data Customer social media data Partner data News and event data, …… §  Need to be able to correlate all types of structured and unstructured data to predict the future and provide opportunities for growth and expansion 22
  • 23. Advanced Security Analytics to Predict and Protect §  Government agency needed more visibility into all system traffic §  Concern about the unknown – needed to look for and protect from malicious activity §  Used advanced security analytics to correlate data across seemingly unrelated events §  Real-time §  Analyze variety data sources- emails, documents, social media data, business process data, DNS transactions §  Analyze massive amounts structured and unstructured data 23
  • 24. Matching Capabilities to Business Problems §  Text Analytics §  Next Best Action §  Data in Motion §  Adding business process and rules §  Anamoly Detection §  Data Visualization §  Correlation between customer service, comments in the market, customer management §  Putting a lot of data types together to determine best actions §  Detecting Fraud 24
  • 25. How Do You Manage Big Data? §  Big data is not clean – it is massive and much is unstructured §  Resulting patterns from big data analytics needs to be culled, cleaned and matched to enterprise data §  Culled data now must be analyzed in context with your systems of record §  Apply data visualization and best practices to determine how to apply data to actions 25
  • 26. You need to think about the following: §  Where are the sources of the data that could be important? §  How often do you need access to particular types of data? §  How long and how much data do you need to keep? §  Can you trust the data and its sources? §  Use Big Data analytics to overcome conventional wisdom and conventional thinking. §  If you already know the questions to ask you aren’t moving forward. 26
  • 27. Q&A §  Thank you! §  Contact info: §  Judith Hurwitz: judith.hurwitz@hurwitz.com 27