SlideShare une entreprise Scribd logo
1  sur  38
BIG DATA
Arnon Rotem-Gal-Oz
Director of Technology Research, Amdocs
The blind men and the elephant. Poem by John Godfrey Saxe (Cartoon originally copyrighted by the authors; G.
Renee Guzlas, artists http://www.nature.com/ki/journal/v62/n5/fig_tab/4493262f1.html
1880 US
Census
Hollerith
Tabulating
Machine
Hollerith photos by Martin Wichary :
http://www.flickr.com/photos/mwichary/4358926764/in/photostream/
ource: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/
Big data happens when the data you
have to process is bigger than what
you can process in the given time with
current technologies
Myth: Big data = keep all data
Source: Big Data Public Private Forum : http://www.big-
project.eu/sites/default/files/D2.2.1_First%20draft%20of%20Technical%20white%20papers_FINAL_v1.01_
0.pdf
Source: Big Data Public Private Forum : http://www.big-
project.eu/sites/default/files/D2.2.1_First%20draft%20of%20Technical%20white%20papers_FINAL_v1.01_
0.pdf
Some Telco
Numbers
Source: Wikipedia
http://upload.wikimedia.org/wikipedia/commons/5/50/Telephone_operators,_1952.jpg
So, what do we do
with all this data?
Wikipedia http://upload.wikimedia.org/wikipedia/commons/0/06/UPS_Truck.jpg
It’s the insights, stupid*
* With apologies to Bill Clinton
ource: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/
Big data analytics is when sample = N
• Big data happens when the data you have to process
is bigger than what you can process in the given time
with current technologies
“My daughter got this in the
mail!, She’s still in high school,
and you’re sending her coupons
for baby clothes and cribs? Are
you trying to encourage her to
get pregnant?”
Source: Forbes http://www.forbes.com/sites/kashmirhill/2012/02/16/how-target-figured-out-a-teen-girl-was-pregnant-before-her-
father-did/
We need to
watch out that
Analytics won’t
get too creepy
When people hear
big data they think
fast data
Source: Steve Jones Cap Gemini
http://www.no.capgemini.com/node/778541
Subscribers
Collect
& Filter
Correlate
(simplified) Network proactive care flow
Account
Event Store
Identify &
Predict
Network
Failures
Reimburse
VIPs
Prioritize
technicians
Identify
impact on
high valued
Accounts
ource: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/
Big data is when we can handle data
fast enough to make a difference
• Big data happens when the data you have to process
is bigger than what you can process in the given time
with current technologies
• Big data analytics is when sample = N
Technology space
The Elephant in the room
Hadoop Stack
Map/R
educe
HDFS
HBase
Pig
Hive
Zoo
Keeper
Oozie Mahout
Giraph
Schema on read
Move data to computation
Maybe we should rethink
moving data to
computation…
Source : http://my-inner-voice.blogspot.co.il/2012/06/haddop-101-paper-by-miha-ahronovitz-and.html
Map/reduce
Source: http://www.bodhtree.com/blog/2012/10/18/ever-wondered-what-happens-between-map-and-reduce/
Customer Segmentation
First
name
Last
name
ARPU Age Device Country …
Mr. Smith 100 22 iPhone 5s,White USA
John Doe 87 42 Samsung Galaxy S5,Gold France
Lady In Red 105 21 Samsung Note 3, White UK
…
Uluru, Australia by Stuart Edwards (cc) http://en.wikipedia.org/wiki/Uluru#mediaviewer/File:Uluru_Panorama.jpg
K-Means
ARPU
Age
Source : http://pypr.sourceforge.net/kmeans.html
K=3ARPU
Age
ARPU
Age
Source : http://pypr.sourceforge.net/kmeans.html
New paradigms
Map/R
educe
HDFS
HBase
Pig
Hive
Zoo
Keeper
Oozie Mahout
Giraph
New Paradigms
Map/R
educe
HDFS
HBase
Pig Hive
Zoo
Keeper
Oozie Mahout
YARN
Giraph
New Paradigms
Map/R
educe
HDFS
HBase
Pig Hive
Zoo
Keeper
Oozie Mahout
YARN
Giraph
Spark
Storm
Slider
Flink
Impala
Tez
Presto
Amdocs Analytics & Data Management
Heritage
2013
• Proactive Care
• TerraScale
• Network optimization
• Real time
analytics platform
• Single product catalog
• BSS–OSS
Integration
• CRM-Billing
Integration
OSS
Analytics Platform,
16 Analytics Patents
• aLDM logical data
model
• Policy control
Network Analytics
CRM
2000 2008
AcquisitionsPortfolio
34
Information Security Level 2 – Sensitive
© 2014 – Proprietary and Confidential Information of Amdocs
Touchpoints & Applications
CRM Self Service E-MailPCRF SMS OtherWi-Fi OffloadCampaign Mng. • • • • • • •
Operational
Envelope &
Platform
Administration
• Security
Management
• Configuration
Management
• Services
Inventory
• Performance
Management
• Fault
Management
• LoggerCollect &
Ingest
Transform
& Enrich
Aggregate
& Correlate
Drive
Insight
Close the
Loop
Machine
Learn &
Score
Application-Ready Data and Analytics/ML Insights
Entities and Profiles
Detailed Data
OSS
Probes SocialRAN Inventory Usage &
Charging
CRM
Real-Time & Batch Connectors
Insight Platform
Marketing
Analytical
Application
Framework:
Dashboards &
Visualisation
Decisioning
Engine
Dynamic Micro
Segmentation
Network Care Operations
ource: Silicon Angle http://siliconangle.com/blog/2013/11/13/how-big-is-big-data-really/
• Big data happens when the data you have to process
is bigger than what you can process in the given time
with current technologies
• Big data analytics is when sample = N
• Big data is when we can handle data fast enough to
make a difference
Additional takeaways
• CSPs have always been in the big data
business – they just didn’t know it
• Big data is not a panacea
• Hadoop is shaping up as the big data OS
– Though there are alternatives arriving from the
cloud arena (mesos, kubernetes)
What we
covered here
is not even
the tip of the
iceberg
Source: wikimedia http://commons.wikimedia.org/wiki/File:Iceberg.jpg
Arnon Rotem-Gal-Oz
Director of Technology Research, Amdocs
arnonrot@amdocs.com / arnon@rgoarchitects.com

Contenu connexe

En vedette

introduction to data processing using Hadoop and Pig
introduction to data processing using Hadoop and Pigintroduction to data processing using Hadoop and Pig
introduction to data processing using Hadoop and PigRicardo Varela
 
Pig, Making Hadoop Easy
Pig, Making Hadoop EasyPig, Making Hadoop Easy
Pig, Making Hadoop EasyNick Dimiduk
 
Hive Quick Start Tutorial
Hive Quick Start TutorialHive Quick Start Tutorial
Hive Quick Start TutorialCarl Steinbach
 
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and HadoopFacebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and Hadooproyans
 
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)Kevin Weil
 
Introduction To Map Reduce
Introduction To Map ReduceIntroduction To Map Reduce
Introduction To Map Reducerantav
 
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on HadoopHIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on HadoopZheng Shao
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 

En vedette (15)

introduction to data processing using Hadoop and Pig
introduction to data processing using Hadoop and Pigintroduction to data processing using Hadoop and Pig
introduction to data processing using Hadoop and Pig
 
Pig, Making Hadoop Easy
Pig, Making Hadoop EasyPig, Making Hadoop Easy
Pig, Making Hadoop Easy
 
Hive Quick Start Tutorial
Hive Quick Start TutorialHive Quick Start Tutorial
Hive Quick Start Tutorial
 
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and HadoopFacebooks Petabyte Scale Data Warehouse using Hive and Hadoop
Facebooks Petabyte Scale Data Warehouse using Hive and Hadoop
 
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
 
What is big data?
What is big data?What is big data?
What is big data?
 
Big data security
Big data securityBig data security
Big data security
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
Introduction To Map Reduce
Introduction To Map ReduceIntroduction To Map Reduce
Introduction To Map Reduce
 
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on HadoopHIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 

Similaire à Big data Overview

Semantics, Deep Learning, and the Transformation of Business
Semantics, Deep Learning, and the Transformation of BusinessSemantics, Deep Learning, and the Transformation of Business
Semantics, Deep Learning, and the Transformation of BusinessSteve Omohundro
 
Strata Conference NYC 2013 Full Version
Strata Conference NYC 2013 Full VersionStrata Conference NYC 2013 Full Version
Strata Conference NYC 2013 Full VersionTaewook Eom
 
Minnesota ARLD Day 2011
Minnesota ARLD Day 2011Minnesota ARLD Day 2011
Minnesota ARLD Day 2011Jason Griffey
 
The Edge Group Quito Lima - july 2014
The Edge Group   Quito Lima - july 2014The Edge Group   Quito Lima - july 2014
The Edge Group Quito Lima - july 2014Jose A Torres
 
10fpresentation(bsj)final
10fpresentation(bsj)final10fpresentation(bsj)final
10fpresentation(bsj)finalJohn Jung
 
Safe use of cloud - alternative cloud
Safe use of cloud - alternative cloudSafe use of cloud - alternative cloud
Safe use of cloud - alternative cloudTomppa Järvinen
 
Evolution of AI - Why is my computer still so dumb?
Evolution of AI - Why is my computer still so dumb?Evolution of AI - Why is my computer still so dumb?
Evolution of AI - Why is my computer still so dumb?Olivia Klose
 
TFS Talk by Hackathorn 20100527 v2
TFS Talk by Hackathorn 20100527 v2TFS Talk by Hackathorn 20100527 v2
TFS Talk by Hackathorn 20100527 v2Richard Hackathorn
 
Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Paul Bradshaw
 
Andrew Hessel - The Internet of Living Things
Andrew Hessel - The Internet of Living ThingsAndrew Hessel - The Internet of Living Things
Andrew Hessel - The Internet of Living ThingsMobile Monday Amsterdam
 
Georgia library association 2011
Georgia library association 2011Georgia library association 2011
Georgia library association 2011Jason Griffey
 
OakX:Data+The Power of Visual Storytelling Anca Mosoiu/Techliminal
OakX:Data+The Power of Visual Storytelling  Anca Mosoiu/TechliminalOakX:Data+The Power of Visual Storytelling  Anca Mosoiu/Techliminal
OakX:Data+The Power of Visual Storytelling Anca Mosoiu/TechliminalOak X
 
Into the next dimension
Into the next dimensionInto the next dimension
Into the next dimensionEd Charbeneau
 
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Sören Auer
 
CONFidence 2014: Davi Ottenheimer Protecting big data at scale
CONFidence 2014: Davi Ottenheimer Protecting big data at scaleCONFidence 2014: Davi Ottenheimer Protecting big data at scale
CONFidence 2014: Davi Ottenheimer Protecting big data at scalePROIDEA
 

Similaire à Big data Overview (20)

Semantics, Deep Learning, and the Transformation of Business
Semantics, Deep Learning, and the Transformation of BusinessSemantics, Deep Learning, and the Transformation of Business
Semantics, Deep Learning, and the Transformation of Business
 
Strata Conference NYC 2013 Full Version
Strata Conference NYC 2013 Full VersionStrata Conference NYC 2013 Full Version
Strata Conference NYC 2013 Full Version
 
Minnesota ARLD Day 2011
Minnesota ARLD Day 2011Minnesota ARLD Day 2011
Minnesota ARLD Day 2011
 
The Edge Group Quito Lima - july 2014
The Edge Group   Quito Lima - july 2014The Edge Group   Quito Lima - july 2014
The Edge Group Quito Lima - july 2014
 
10fpresentation(bsj)final
10fpresentation(bsj)final10fpresentation(bsj)final
10fpresentation(bsj)final
 
Stephen downes 2012 learning in a digital age, the reality and the myth
Stephen downes 2012 learning in a digital age, the reality and the mythStephen downes 2012 learning in a digital age, the reality and the myth
Stephen downes 2012 learning in a digital age, the reality and the myth
 
Safe use of cloud - alternative cloud
Safe use of cloud - alternative cloudSafe use of cloud - alternative cloud
Safe use of cloud - alternative cloud
 
Copyright for the Digital Arts and Humanities
Copyright for the Digital Arts and HumanitiesCopyright for the Digital Arts and Humanities
Copyright for the Digital Arts and Humanities
 
Big Data Curation And Its Application
Big Data Curation And Its ApplicationBig Data Curation And Its Application
Big Data Curation And Its Application
 
Evolution of AI - Why is my computer still so dumb?
Evolution of AI - Why is my computer still so dumb?Evolution of AI - Why is my computer still so dumb?
Evolution of AI - Why is my computer still so dumb?
 
Teams indian river_2.12.2009_v1.0
Teams indian river_2.12.2009_v1.0Teams indian river_2.12.2009_v1.0
Teams indian river_2.12.2009_v1.0
 
Ixd12 Frantic recap
Ixd12 Frantic recapIxd12 Frantic recap
Ixd12 Frantic recap
 
TFS Talk by Hackathorn 20100527 v2
TFS Talk by Hackathorn 20100527 v2TFS Talk by Hackathorn 20100527 v2
TFS Talk by Hackathorn 20100527 v2
 
Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)Open Data Utopia? (SciCAR 19)
Open Data Utopia? (SciCAR 19)
 
Andrew Hessel - The Internet of Living Things
Andrew Hessel - The Internet of Living ThingsAndrew Hessel - The Internet of Living Things
Andrew Hessel - The Internet of Living Things
 
Georgia library association 2011
Georgia library association 2011Georgia library association 2011
Georgia library association 2011
 
OakX:Data+The Power of Visual Storytelling Anca Mosoiu/Techliminal
OakX:Data+The Power of Visual Storytelling  Anca Mosoiu/TechliminalOakX:Data+The Power of Visual Storytelling  Anca Mosoiu/Techliminal
OakX:Data+The Power of Visual Storytelling Anca Mosoiu/Techliminal
 
Into the next dimension
Into the next dimensionInto the next dimension
Into the next dimension
 
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
 
CONFidence 2014: Davi Ottenheimer Protecting big data at scale
CONFidence 2014: Davi Ottenheimer Protecting big data at scaleCONFidence 2014: Davi Ottenheimer Protecting big data at scale
CONFidence 2014: Davi Ottenheimer Protecting big data at scale
 

Plus de Arnon Rotem-Gal-Oz

Plus de Arnon Rotem-Gal-Oz (20)

Taking ML to production - a journey
Taking ML to production - a journeyTaking ML to production - a journey
Taking ML to production - a journey
 
Apache spark
Apache sparkApache spark
Apache spark
 
Fallacies of Distributed Computing
Fallacies of Distributed Computing Fallacies of Distributed Computing
Fallacies of Distributed Computing
 
Docker & Kubernetes intro
Docker & Kubernetes introDocker & Kubernetes intro
Docker & Kubernetes intro
 
Docker Intro
Docker IntroDocker Intro
Docker Intro
 
Data security @ the personal level
Data security @ the personal levelData security @ the personal level
Data security @ the personal level
 
Microservices - it's déjà vu all over again
Microservices  - it's déjà vu all over againMicroservices  - it's déjà vu all over again
Microservices - it's déjà vu all over again
 
Big data in the cloud - welcome to cost oriented design
Big data in the cloud - welcome to cost oriented designBig data in the cloud - welcome to cost oriented design
Big data in the cloud - welcome to cost oriented design
 
Distilling insights @ AppsFlyer
Distilling insights @ AppsFlyerDistilling insights @ AppsFlyer
Distilling insights @ AppsFlyer
 
Distilling Insights @ Appsflyer (Data Architecture)
Distilling Insights @ Appsflyer (Data Architecture)Distilling Insights @ Appsflyer (Data Architecture)
Distilling Insights @ Appsflyer (Data Architecture)
 
Hadoop YARN overview
Hadoop YARN overviewHadoop YARN overview
Hadoop YARN overview
 
SAF
SAFSAF
SAF
 
REST presentation
REST presentationREST presentation
REST presentation
 
SOA & Big Data
SOA & Big DataSOA & Big Data
SOA & Big Data
 
Why the JVM?
Why the JVM?Why the JVM?
Why the JVM?
 
Building reliable systems from unreliable components
Building reliable systems from unreliable componentsBuilding reliable systems from unreliable components
Building reliable systems from unreliable components
 
Azure migration
Azure migrationAzure migration
Azure migration
 
Things to think about while architecting azure solutions
Things to think about while architecting azure solutionsThings to think about while architecting azure solutions
Things to think about while architecting azure solutions
 
Soa
Soa Soa
Soa
 
Rest
RestRest
Rest
 

Dernier

Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 

Dernier (20)

Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 

Big data Overview

Notes de l'éditeur

  1. Poem by John Godfrey Saxe It was six men of Indostan To learning much inclined, Who went to see the Elephant (Though all of them were blind), That each by observation Might satisfy his mind.
  2. 50 Mil. People 7+ years of manual summations read a blog post by Gil Press that stated that the first big data problem was in the 1880s (yes you read that right). In the late 1800s the processing of the US census was beginning to take so long that it was getting close to 10 years. Crossing this mark is meaningful as the census runs every 10 years and as birth rates are getting higher the outlook wasn’t very good. In 1886 Herman Hollerith started a business (that year later was merged with other companies to form IBM) to sell a tabulating machine that holds census data on punch cards. Indeed the 1890 census took less than 2 years to complete and handled both larger population (62 million people) and more data points than the 1880 census.
  3. https://www.census.gov/history/www/census_then_now/notable_alumni/herman_hollerith.html << year instead of almost 10 years 62 Million people 1890 census
  4. https://www.census.gov/history/www/census_then_now/notable_alumni/herman_hollerith.html << year instead of almost 10 years 62 Million people 1890 census
  5. Large Telco – 200M subscribers Orders data few GB Charge Events – 100TB per month Network 800TB - day
  6. So we pile up all this data – but what are we piling it for? 1992 Bill Clinton campaign – It’s the economy, stupid http://upload.wikimedia.org/wikipedia/commons/0/06/UPS_Truck.jpgz
  7. Now get this: In 2007 alone, this helped us: * shave nearly 30 million miles off already streamlined delivery routes. * save 3 million gallons of gas, and * reduce CO2 emissions by 32,000 metric tons¿the equivalent of removing 5,300 passenger cars from the road for an entire year.
  8. Now get this: In 2007 alone, this helped us: * shave nearly 30 million miles off already streamlined delivery routes. * save 3 million gallons of gas, and * reduce CO2 emissions by 32,000 metric tons¿the equivalent of removing 5,300 passenger cars from the road for an entire year.
  9. Retail are the leaders in using analytics Amazon is famous for that but they are not alone
  10. hat Target discovered fairly quickly is that it creeped people out that the company knew about their pregnancies in advance. “If we send someone a catalog and say, ‘Congratulations on your first child!’ and they’ve never told us they’re pregnant, that’s going to make some people uncomfortable,” Pole told me. “We are very conservative about compliance with all privacy laws. But even if you’re following the law, you can do things where people get queasy.” Bold is mine. That’s a quote for our times. So Target got sneakier about sending the coupons. The company can create personalized booklets; instead of sending people with high pregnancy scores books o’ coupons solely for diapers, rattles, strollers, and the “Go the F*** to Sleep” book, they more subtly spread them about: “Then we started mixing in all these ads for things we knew pregnant women would never buy, so the baby ads looked random. We’d put an ad for a lawn mower next to diapers. We’d put a coupon for wineglasses next to infant clothes. That way, it looked like all the products were chosen by chance. “And we found out that as long as a pregnant woman thinks she hasn’t been spied on, she’ll use the coupons. She just assumes that everyone else on her block got the same mailer for diapers and cribs. As long as we don’t spook her, it works.”
  11. hat Target discovered fairly quickly is that it creeped people out that the company knew about their pregnancies in advance. “If we send someone a catalog and say, ‘Congratulations on your first child!’ and they’ve never told us they’re pregnant, that’s going to make some people uncomfortable,” Pole told me. “We are very conservative about compliance with all privacy laws. But even if you’re following the law, you can do things where people get queasy.” Bold is mine. That’s a quote for our times. So Target got sneakier about sending the coupons. The company can create personalized booklets; instead of sending people with high pregnancy scores books o’ coupons solely for diapers, rattles, strollers, and the “Go the F*** to Sleep” book, they more subtly spread them about: “Then we started mixing in all these ads for things we knew pregnant women would never buy, so the baby ads looked random. We’d put an ad for a lawn mower next to diapers. We’d put a coupon for wineglasses next to infant clothes. That way, it looked like all the products were chosen by chance. “And we found out that as long as a pregnant woman thinks she hasn’t been spied on, she’ll use the coupons. She just assumes that everyone else on her block got the same mailer for diapers and cribs. As long as we don’t spook her, it works.” http://www.geektime.co.il/okcupid-experiments-on-users/ <-Facebook, okCupid
  12. Data from Actix (or other network sources) –20/30M subscribers would generate ~ 250K messages per second Monitor for anomalies like dropped calls Correlate with data from CRM (identify customer, account) Analyze for impact on VIPs Analyze for problems in the netwrok Automated action Change SLAs Notify customers (sorry note, small freebie etc) <1 -5 seconds away from the problem <-can have real time impact on satisfaction (should avoid falling into the creepiness problem mentioned with Target use case (we know what you’re doing!!)
  13. Fraud analysis at big telco – where insights arrive ong after the fraud ended Multiple connections with same IP from different locations Buying unlimited data and letting “reselling” it for Skype etc.
  14. Think of it as defining a view on a table but the underlying data can be Poly structured and unstructured data
  15. CRM data Map – identify subscriber, account Group by (account) Reduce update account profile
  16. Average revenue per user - ARPU
  17. SQL on Hadoop Streaming “Enterprise Grade”
  18. SQL on Hadoop Streaming “Enterprise Grade”
  19. Fraud analysis at big telco – where insights arrive ong after the fraud ended Multiple connections with same IP from different locations Buying unlimited data and letting “reselling” it for Skype etc.
  20. Volume Velocity (variety, ver