SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Big	Data:	Beyond	Hype	to	Insight	
Srinath	Perera		(@srinath_perera)	
VP	–	Research	
WSO2
Success Stories
•  Money	Ball	(	Baseball	draDing)		
•  Nate	Silver	predicted	outcomes	in	49	of	the	50	
states	in	the	2008	U.S.	PresidenQal	elecQon	
•  Cancer	detecQon	from	Biopsy	cells	(	Big	Data	
find	12	paUerns	while	we	only	knew	9),	
hUp://go.ted.com/CseS		
•  Bristol-Myers	Squibb	reduced	the	Qme	it	takes	
to	run	clinical	trial	simulaQons	by	98%	
•  Xerox	used	big	data	to	reduce	the	aUriQon	rate	
in	its	call	centers	by	20%.	
•  Kroger	Loyalty	programs	(	growth	in	45	
consecuQve	quarters)
Premise	of	Big	Data	
If you collect data about your business, and feed it to a Big Data
system, you will find useful insights that will provide competitive
advantage
–  (e.g. Analysis of data sets can find new correlations to "spot business
trends, prevent diseases, combat crime and so on”. [Wikipedia])
Underline assumption is that way we
operate, and organizations are
inefficient.
Big	Data	as	a	
Way	to	OpQmize	
•  Assumptions: Once you
identify your sickness, you are
halfway cured
•  You must know what is worth
Optimizing
premature	
opQmizaQon	is	
the	root	of	all	
evil
“Big Data Washing”
You can tick yes, but unlikely to make a difference
How to Big Data Wash
your System in 24 hours?
•  Publish	collect	the	data	you	can	with	
minimal	effort	
•  Do	lot	of	simple	aggregaQons	
•  Figure	out	what	data	combinaQons	makes	
predest	pictures		
•  Throw	in	some	machine	learning	
algorithms,	predict	something	but	don’t	
compare	
•  Create	a	cool	dashboard	and	do	a	cool	
demo,	and	say	that	you	are	just	scratching	
the	surface!!
Are Insights are
automatic?
•  I wish
•  Only if we have right data
•  Only if we look at the right place
•  Only if such insights are there
•  Only if we found the insights
Value Preposition
Big	Data	Tools		
•  KPIs	
•  AnalyQcs	(	Batch,	Real-Qme,	
InteracQve,	PredicaQve)		
•  VisualizaQons,	Dashboards		
•  Alerts		
•  Sensors	(	and	other	data	
collecQon	plumbing)
KPIs and their Role
•  KPIs	(Key	Performance	Indicators)	are	numbers	
that	can	give	you	an	idea	about	performance	
of	something		
–  E.g.	Countries	have	them	(	GDP,	Per	Capita	
Income,	HDI	index	etc)		
•  Examples	
–  Company	Revenue		
–  LifeQme	value	of	a	customer		
–  Revenue	per	Square	foot	(	in	retail	industry)	
•  Idea	is	to	define	them	and	monitor	them.	But	
defining	them	is	hard	work!!	
•  ODen	one	indicator	tells	half	the	story,	and	you	
need	several	that	cover	different	angles
What is a Dashboard?
•  Think	a	car	dashboard		
•  It	give	you	idea	about	
overall	system	in	a	glance		
•  It	is	boring	when	all	is	
good,	and	grab	aUenQon	
when	something	is	wrong		
•  ODen	have	support	for	
drill	down	and	find	root	
cause
Alerts
•  NoQficaQons	(	sent	via	email,	
SMS,	Pager	etc.)		
•  Goal	is	to	give	you	peace	of	mind	
(	not	having	to	check	all	the	Qme)		
•  They	should	be	specific		
•  They	should	be	infrequent		
•  They	should	have	very	low	false	
posiQves		
•  Let	users	control	sensiQvity
You need a Human in the Loop
Systems that digest your data, take decisions, and run the system by itself, they can only
be used with limited applications Yet
(e.g. Algorithmic trading, Showing Advertisements, or War)
Decisions, Actions, and
Drill down
•  Operators	need	to	see	the	data	in	
context,	and	drill	down	into	detail	to	
understand	the	root	cause		
•  Typical	model	is	to	start	from	an	alert	
or	dashboard,	see	data	in	context	
(other	transacQons	around	same	
Qme,	what	does	same	user	did	before	
and	aDer	etc.)	and	then	let	the	user	
drill	down	
•  For	example,	
hUp://wso2.com/videos/wso2-fraud-
detecQon-soluQon
Role of Realtime Analytics
•  Use	to	detect	something	very	fast!	
Within	few	milliseconds	to	few	
seconds.		
•  Very	powerful	in	detecQng	
condiQons	over	Qme	(e.g.	ball	
possession	in	a	football	game)	
•  Alerts	are	done	through	RealQme	
analyQcs
Role of Predictive Analytics
•  PredicQve	analyQcs	learn	a	problem	from	
examples	
–  E.g.	learn	to	drive		
•  Two	main	cases	are		
–  PredicQng	next	value	or	values	(e.g.	electricity	load	
predicQon)		
–  PredicQng	category	(e.g.	SPAM	or	not	for	a	email)	
•  Used	to	grouping,	to	generate	alerts,	or	to	
augment	visualizaQons		
•  Need	lot	of	experQse	to	create	correct	models		
and	use	them.
Big Data Pipeline
Doing	it	Once	is	Cheap,	
Sedng	up	a	system	to	do	it	
conQnuously	is	Expensive			
Do	your	scenarios	ad-hoc	first	(hire	some	experQse	if	you	must),	
before	sedng	up	a	system	that	does	it	every	day
Keeping it running is Even Harder
●  Incorporate	ConQnuous	data		
o  Integrate	data	conQnuously		
o  We	get	feedback	about	effecQveness	
of	decisions	(e.g.	Accuracy	of	Fraud)	
●  Track	and	update	models	
o  Trends	change	
o  Generate	models	in	batch	mode	and	
update
Templates for Big Data Projects
•  Use existing Dataset: I already have a data set, and list of
potential problems, and figure out how to fix it.
•  **Fix a known Problem: Find a problem, collect data about it,
analyze, visualize, build a model and improve. Then build a
dashboard to monitor.
•  Improve Overall Process: Instrument processes ( start with
most crucial), find KPIs, analyze and visualize the processes, and
improve
•  Find Correlations: Collect all available data, data mine the data
or visualize, find interesting correlations.
Actionable Insights
are the Key!!
•  Insights	are	about	significant	event	that	
warrant	aUenQon	(	e.g.	more	than	two	
technical	issues	would	lead	customer	to	
churn)		
•  Decision	makers	can	idenQfy	the	
context	associated	with	the	insight	
(	e.g.	operators	can	see	though	history	
of	customers	who	qualify)		
•  Decision	makers	can	do	something	
about	the	insight	(	e.g.	can	work	with	
customers	to	reassures	and	fix)
Think Deeply about Who is using your
System and How?
Summary
•  Big	Data	provide	a	way	to	OpQmize		
•  Tools		
–  KPIs	
–  AnalyQcs	(	Batch,	Real-Qme,	InteracQve,	PredicaQve)		
–  VisualizaQons,	Dashboards		
–  Alerts		
–  Sensors	(	and	other	data	collecQon	plumbing)		
•  Start	small		
•  Try	out	with	data	sets	before	setup	a	system		
•  Find	a	high	impact	problem	and	make	it	work	
end	to	end		
•  Pay	aUenQon	to	user	Experience
Thank	You!	
#WSO2ConEU	
Share	your	feedback	for	this	session	
wso2con.com/app

Contenu connexe

Plus de WSO2

Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
How to Create a Service in Choreo
How to Create a Service in ChoreoHow to Create a Service in Choreo
How to Create a Service in ChoreoWSO2
 
Ballerina Tech Talk - May 2023
Ballerina Tech Talk - May 2023Ballerina Tech Talk - May 2023
Ballerina Tech Talk - May 2023WSO2
 
Platform Strategy to Deliver Digital Experiences on Azure
Platform Strategy to Deliver Digital Experiences on AzurePlatform Strategy to Deliver Digital Experiences on Azure
Platform Strategy to Deliver Digital Experiences on AzureWSO2
 
GartnerITSymSessionSlides.pdf
GartnerITSymSessionSlides.pdfGartnerITSymSessionSlides.pdf
GartnerITSymSessionSlides.pdfWSO2
 
[Webinar] How to Create an API in Minutes
[Webinar] How to Create an API in Minutes[Webinar] How to Create an API in Minutes
[Webinar] How to Create an API in MinutesWSO2
 
Modernizing the Student Journey with Ethos Identity
Modernizing the Student Journey with Ethos IdentityModernizing the Student Journey with Ethos Identity
Modernizing the Student Journey with Ethos IdentityWSO2
 
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...WSO2
 
CIO Summit Berlin 2022.pptx.pdf
CIO Summit Berlin 2022.pptx.pdfCIO Summit Berlin 2022.pptx.pdf
CIO Summit Berlin 2022.pptx.pdfWSO2
 
Delivering New Digital Experiences Fast - Introducing Choreo
Delivering New Digital Experiences Fast - Introducing ChoreoDelivering New Digital Experiences Fast - Introducing Choreo
Delivering New Digital Experiences Fast - Introducing ChoreoWSO2
 
Fueling the Digital Experience Economy with Connected Products
Fueling the Digital Experience Economy with Connected ProductsFueling the Digital Experience Economy with Connected Products
Fueling the Digital Experience Economy with Connected ProductsWSO2
 
A Reference Methodology for Agile Digital Businesses
 A Reference Methodology for Agile Digital Businesses A Reference Methodology for Agile Digital Businesses
A Reference Methodology for Agile Digital BusinessesWSO2
 
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)WSO2
 
Lessons from the pandemic - From a single use case to true transformation
 Lessons from the pandemic - From a single use case to true transformation Lessons from the pandemic - From a single use case to true transformation
Lessons from the pandemic - From a single use case to true transformationWSO2
 
Adding Liveliness to Banking Experiences
Adding Liveliness to Banking ExperiencesAdding Liveliness to Banking Experiences
Adding Liveliness to Banking ExperiencesWSO2
 
Building a Future-ready Bank
Building a Future-ready BankBuilding a Future-ready Bank
Building a Future-ready BankWSO2
 
WSO2 API Manager Community Call - November 2021
WSO2 API Manager Community Call - November 2021WSO2 API Manager Community Call - November 2021
WSO2 API Manager Community Call - November 2021WSO2
 
[API World ] - Managing Asynchronous APIs
[API World ] - Managing Asynchronous APIs[API World ] - Managing Asynchronous APIs
[API World ] - Managing Asynchronous APIsWSO2
 
[API World 2021 ] - Understanding Cloud Native Deployment
[API World 2021 ] - Understanding Cloud Native Deployment[API World 2021 ] - Understanding Cloud Native Deployment
[API World 2021 ] - Understanding Cloud Native DeploymentWSO2
 
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”[API Word 2021] - Quantum Duality of “API as a Business and a Technology”
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”WSO2
 

Plus de WSO2 (20)

Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
How to Create a Service in Choreo
How to Create a Service in ChoreoHow to Create a Service in Choreo
How to Create a Service in Choreo
 
Ballerina Tech Talk - May 2023
Ballerina Tech Talk - May 2023Ballerina Tech Talk - May 2023
Ballerina Tech Talk - May 2023
 
Platform Strategy to Deliver Digital Experiences on Azure
Platform Strategy to Deliver Digital Experiences on AzurePlatform Strategy to Deliver Digital Experiences on Azure
Platform Strategy to Deliver Digital Experiences on Azure
 
GartnerITSymSessionSlides.pdf
GartnerITSymSessionSlides.pdfGartnerITSymSessionSlides.pdf
GartnerITSymSessionSlides.pdf
 
[Webinar] How to Create an API in Minutes
[Webinar] How to Create an API in Minutes[Webinar] How to Create an API in Minutes
[Webinar] How to Create an API in Minutes
 
Modernizing the Student Journey with Ethos Identity
Modernizing the Student Journey with Ethos IdentityModernizing the Student Journey with Ethos Identity
Modernizing the Student Journey with Ethos Identity
 
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...
Choreo - Build unique digital experiences on WSO2's platform, secured by Etho...
 
CIO Summit Berlin 2022.pptx.pdf
CIO Summit Berlin 2022.pptx.pdfCIO Summit Berlin 2022.pptx.pdf
CIO Summit Berlin 2022.pptx.pdf
 
Delivering New Digital Experiences Fast - Introducing Choreo
Delivering New Digital Experiences Fast - Introducing ChoreoDelivering New Digital Experiences Fast - Introducing Choreo
Delivering New Digital Experiences Fast - Introducing Choreo
 
Fueling the Digital Experience Economy with Connected Products
Fueling the Digital Experience Economy with Connected ProductsFueling the Digital Experience Economy with Connected Products
Fueling the Digital Experience Economy with Connected Products
 
A Reference Methodology for Agile Digital Businesses
 A Reference Methodology for Agile Digital Businesses A Reference Methodology for Agile Digital Businesses
A Reference Methodology for Agile Digital Businesses
 
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)
Workflows in WSO2 API Manager - WSO2 API Manager Community Call (12/15/2021)
 
Lessons from the pandemic - From a single use case to true transformation
 Lessons from the pandemic - From a single use case to true transformation Lessons from the pandemic - From a single use case to true transformation
Lessons from the pandemic - From a single use case to true transformation
 
Adding Liveliness to Banking Experiences
Adding Liveliness to Banking ExperiencesAdding Liveliness to Banking Experiences
Adding Liveliness to Banking Experiences
 
Building a Future-ready Bank
Building a Future-ready BankBuilding a Future-ready Bank
Building a Future-ready Bank
 
WSO2 API Manager Community Call - November 2021
WSO2 API Manager Community Call - November 2021WSO2 API Manager Community Call - November 2021
WSO2 API Manager Community Call - November 2021
 
[API World ] - Managing Asynchronous APIs
[API World ] - Managing Asynchronous APIs[API World ] - Managing Asynchronous APIs
[API World ] - Managing Asynchronous APIs
 
[API World 2021 ] - Understanding Cloud Native Deployment
[API World 2021 ] - Understanding Cloud Native Deployment[API World 2021 ] - Understanding Cloud Native Deployment
[API World 2021 ] - Understanding Cloud Native Deployment
 
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”[API Word 2021] - Quantum Duality of “API as a Business and a Technology”
[API Word 2021] - Quantum Duality of “API as a Business and a Technology”
 

Dernier

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Dernier (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

WSO2Con EU 2016: Big Data & Analytics: From Hype to Insight

  • 2. Success Stories •  Money Ball ( Baseball draDing) •  Nate Silver predicted outcomes in 49 of the 50 states in the 2008 U.S. PresidenQal elecQon •  Cancer detecQon from Biopsy cells ( Big Data find 12 paUerns while we only knew 9), hUp://go.ted.com/CseS •  Bristol-Myers Squibb reduced the Qme it takes to run clinical trial simulaQons by 98% •  Xerox used big data to reduce the aUriQon rate in its call centers by 20%. •  Kroger Loyalty programs ( growth in 45 consecuQve quarters)
  • 3. Premise of Big Data If you collect data about your business, and feed it to a Big Data system, you will find useful insights that will provide competitive advantage –  (e.g. Analysis of data sets can find new correlations to "spot business trends, prevent diseases, combat crime and so on”. [Wikipedia]) Underline assumption is that way we operate, and organizations are inefficient.
  • 4. Big Data as a Way to OpQmize •  Assumptions: Once you identify your sickness, you are halfway cured •  You must know what is worth Optimizing premature opQmizaQon is the root of all evil
  • 5. “Big Data Washing” You can tick yes, but unlikely to make a difference
  • 6. How to Big Data Wash your System in 24 hours? •  Publish collect the data you can with minimal effort •  Do lot of simple aggregaQons •  Figure out what data combinaQons makes predest pictures •  Throw in some machine learning algorithms, predict something but don’t compare •  Create a cool dashboard and do a cool demo, and say that you are just scratching the surface!!
  • 7. Are Insights are automatic? •  I wish •  Only if we have right data •  Only if we look at the right place •  Only if such insights are there •  Only if we found the insights
  • 9. Big Data Tools •  KPIs •  AnalyQcs ( Batch, Real-Qme, InteracQve, PredicaQve) •  VisualizaQons, Dashboards •  Alerts •  Sensors ( and other data collecQon plumbing)
  • 10. KPIs and their Role •  KPIs (Key Performance Indicators) are numbers that can give you an idea about performance of something –  E.g. Countries have them ( GDP, Per Capita Income, HDI index etc) •  Examples –  Company Revenue –  LifeQme value of a customer –  Revenue per Square foot ( in retail industry) •  Idea is to define them and monitor them. But defining them is hard work!! •  ODen one indicator tells half the story, and you need several that cover different angles
  • 11. What is a Dashboard? •  Think a car dashboard •  It give you idea about overall system in a glance •  It is boring when all is good, and grab aUenQon when something is wrong •  ODen have support for drill down and find root cause
  • 12. Alerts •  NoQficaQons ( sent via email, SMS, Pager etc.) •  Goal is to give you peace of mind ( not having to check all the Qme) •  They should be specific •  They should be infrequent •  They should have very low false posiQves •  Let users control sensiQvity
  • 13. You need a Human in the Loop Systems that digest your data, take decisions, and run the system by itself, they can only be used with limited applications Yet (e.g. Algorithmic trading, Showing Advertisements, or War)
  • 14. Decisions, Actions, and Drill down •  Operators need to see the data in context, and drill down into detail to understand the root cause •  Typical model is to start from an alert or dashboard, see data in context (other transacQons around same Qme, what does same user did before and aDer etc.) and then let the user drill down •  For example, hUp://wso2.com/videos/wso2-fraud- detecQon-soluQon
  • 15. Role of Realtime Analytics •  Use to detect something very fast! Within few milliseconds to few seconds. •  Very powerful in detecQng condiQons over Qme (e.g. ball possession in a football game) •  Alerts are done through RealQme analyQcs
  • 16. Role of Predictive Analytics •  PredicQve analyQcs learn a problem from examples –  E.g. learn to drive •  Two main cases are –  PredicQng next value or values (e.g. electricity load predicQon) –  PredicQng category (e.g. SPAM or not for a email) •  Used to grouping, to generate alerts, or to augment visualizaQons •  Need lot of experQse to create correct models and use them.
  • 19. Keeping it running is Even Harder ●  Incorporate ConQnuous data o  Integrate data conQnuously o  We get feedback about effecQveness of decisions (e.g. Accuracy of Fraud) ●  Track and update models o  Trends change o  Generate models in batch mode and update
  • 20. Templates for Big Data Projects •  Use existing Dataset: I already have a data set, and list of potential problems, and figure out how to fix it. •  **Fix a known Problem: Find a problem, collect data about it, analyze, visualize, build a model and improve. Then build a dashboard to monitor. •  Improve Overall Process: Instrument processes ( start with most crucial), find KPIs, analyze and visualize the processes, and improve •  Find Correlations: Collect all available data, data mine the data or visualize, find interesting correlations.
  • 21. Actionable Insights are the Key!! •  Insights are about significant event that warrant aUenQon ( e.g. more than two technical issues would lead customer to churn) •  Decision makers can idenQfy the context associated with the insight ( e.g. operators can see though history of customers who qualify) •  Decision makers can do something about the insight ( e.g. can work with customers to reassures and fix)
  • 22. Think Deeply about Who is using your System and How?
  • 23. Summary •  Big Data provide a way to OpQmize •  Tools –  KPIs –  AnalyQcs ( Batch, Real-Qme, InteracQve, PredicaQve) –  VisualizaQons, Dashboards –  Alerts –  Sensors ( and other data collecQon plumbing) •  Start small •  Try out with data sets before setup a system •  Find a high impact problem and make it work end to end •  Pay aUenQon to user Experience