SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Unleashing the power of telco
analytics using
Open Big Data Data Exploration &
Visualization Technologies
Ognjen Antonic, DWH & BI Manager
Telemach Slovenia, Member of United Group
ognjen.antonic@telemach.si
www.telemach.si www.the-united-group.com
2
About Telemach Slovenia
- Leading alternative telecom operator in Slovenia
- Acquired Tusmobil, 3rd largest and fastest growing mobile
operator in Slovenia during April 2015, now fully
integrated operation of Telemach
- 70% cable TV market share
- 30% IP telephony market share
- 24% fixed broadband internet access market share
- 14% mobile market share
3
About United Group
- Largest alternative telecom provider in the region of
former Yugoslavia
- Main operating companies:
- SBB (leading cable operator in Serbia)
- Telemach Slovenia (leading cable operator and fastest
growing mobile operator in Slovenia)
- Telemach Bosnia
- Telemach Montenegro
- Main business segments:
- Telecommunication platforms
- Media (Content & Advertising)
4
About United Group
5
About United Group
• Pay TV
• Broadband
• Fixed telephony
• Mobile telephony
• CuTV
• VOD
• D3Go
• HD
• Premium content (HBO,...)
Products and Services, Business & Residential
• Content rights (Premier
League, UEFA Europa
League, UEFA
Champions League,
NBA, WTA, Primera
Division, ATP World
Tour, Euroleague
Basketball…)
6
About United Group
7
The Quest for Big Data – How it all started
- Mobile Fraud
- International Roaming Fraud – Own Subs Abroad
- International Roaming Fraud – Foreign Visitors in our
Network
- Interconnect Fraud
- Multiple different Attack Vectors used
- High velocity of occurrences
8
The Quest for Big Data – Requirements
- Answering fundamental questions
- Who?
- Where?
- When?
- How?
- Giving Self Service Analytical Ability of find answers
through Data Discovery & Visualization
9
The Quest for Big Data – Challenges
- IT Challenges
- Small Team
- Limited Budget
- Reusing current IT infrastructure in-place to the
maximum level
- Leveraging and combining current multiple data
sources
10
The Quest for Big Data – Challenges
- Old IT Systems Challenges
- Fully Based on conventional relational database
technologies for data layer – data warehouse
- Reporting and analysis based on conventional
Business Intelligence tools
- Not designed for real-time or near-real-time capture
and processing of large amounts of data for analytical
purposes
- Data discovery & visualization not primary focus of
conventional Business Intelligence tools
11
Big Data Solution Requirements
- Easy to deploy, develop and manage
- Should scale both horizontally & vertically
- CPU/processing power wise
- Data storage wise
- Should cover whole stack
- Data Storage & Retreival
- Data Integration
- Data Presentation
- Actively Developed with Good Documentation
- Should be Free or Low-Cost Licence wise
12
Big Data Solutions Evaluated
- Splunk
- Very good commercial solution, covering full stack,
easy to deploy and manage
- Free only up to 500 MB data per day
- Gets very expensive with larger amounts of data
processed daily
- Apache Hadoop
- Free licence-wise, offering extreme scalability
- Not covering full stack in pre-integrated way, Data
Presentation layer poor in features for self service
- Difficult to deploy and manage
- Better suited for companies with larger IT teams
13
Settling for the ELK Stack
- Settled for ELK Stack, fully integrated with:
- Elasticsearch – Data Storage Layer
- Logstash - Data Integration Layer
- Kibana – Data Presentation Layer
- Open Source Based & Free Licensing
- Actively Developed with Good Documentation
- Scalable and easy to deploy and manage
- Enterprise Level Commercial Support Available
14
Technology behind Elasticsearch
- NoSQL Data Storage, Indexing and Data Retreival
Engine, derivative of Apache Lucene Open Source
Project, written fully in Java
- Built on following fundamental technological enablers:
- Data Sharding and Massively Parallel Processing over
cluster of inexpensive servers using Shared Nothing
Data Architecture
- Data Compression & In-Memory Storage
- Probabilistic Computation Models (possibility to trade
speed for accuracy or vice versa)
- Lots of different free & open source plugins and addons
available
15
Developing & Deploying our First Solution
- International Roaming Fraud Dashboard
- Highest Risk Area – Our customers abroad
- Giving ability to Fraud department to visualize what is
going on without IT interventions
- Based on NRTRDE data received from roaming
partners and our own risk scoring engine development
- Combining NRTRDE and DWH data
16
Developing & Deploying our First Solution
- Effort Spent on our First Solution – Fully In-House
- Server deployment and configuration - 1 man day
- ELK stack installation, configuration and tuning – 2
man days
- Solution Development
- Data Integration – 3 man days
- Dashboard Development – 2 man days
- Total initial effort spent including learning new
technology:
- 8 man days
- 4 core virtual server with 32 GB RAM and 300 GB
disk storage
17
Gaining Visual Insight
- Dashboards enabled visual insight and discovery what is
going on, including Geo-Visualizations
- Ability to filter and drilldown to lowest level
18
Gaining Visual Insight
19
Gaining Visual Insight
20
Gaining Visual Insight
21
Gaining Visual Insight
22
Building upon Initial Success Story
- Based on Initial Success Story we continued with rolling
out:
- Visiting Roamers Fraud Dashboard
- Interconnect Fraud Dashboard
- Circuit Switched Mobile Traffic Dashboard
- Packet Based Mobile Traffic Dashboard
- Online Charging System Real-Time Performance
Dashboard
- Real-Time Data Feeds to Elasticsearch directly
- Complex Near-Real Time Data Feeds using conventional
DWH as Data Integration and Data Distribution point
towards Elasticearch
23
Building upon Initial Success Story
24
Building upon Initial Success Story
25
Building upon Initial Success Story
26
Building upon Initial Success Story
27
Building upon Initial Success Story
28
Open Source Big Data – IT Benefits
- Open Source Big Data Solution Enabled IT to:
- To complement our existing DWH & BI Infrastructure
including data in those systems in Cost Efficient
Manner and Short Timeframe
- To provide to our end users large amounts of data for
self-service Data Visualization & Discovery not before
available
- Ability to provide Real-Time Analytics for data sources
with real-time data feeds
29
Open Source Big Data - Business Benefits
- Open Source Big Data Data Discovery & Visualization
Solution Enabled our Business Users:
- To quickly visualize and discover what and where is
going on in a self-service manner
- To dramatically cut decision times and take appropriate
actions resulting in:
- Reduced business risks
- Decreased service/network downtimes
- Increased customer satisfaction
30
Questions?

Contenu connexe

Tendances

"Who Moved my Data? - Why tracking changes and sources of data is critical to...
"Who Moved my Data? - Why tracking changes and sources of data is critical to..."Who Moved my Data? - Why tracking changes and sources of data is critical to...
"Who Moved my Data? - Why tracking changes and sources of data is critical to...Cask Data
 
Realtime Analytical Query Processing and Predictive Model Building on High Di...
Realtime Analytical Query Processing and Predictive Model Building on High Di...Realtime Analytical Query Processing and Predictive Model Building on High Di...
Realtime Analytical Query Processing and Predictive Model Building on High Di...Spark Summit
 
Querying Druid in SQL with Superset
Querying Druid in SQL with SupersetQuerying Druid in SQL with Superset
Querying Druid in SQL with SupersetDataWorks Summit
 
Free Servers to Build Big Data System on: Bing’s Approach
Free Servers to Build Big Data System on: Bing’s ApproachFree Servers to Build Big Data System on: Bing’s Approach
Free Servers to Build Big Data System on: Bing’s ApproachDataWorks Summit
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on HadoopTyler Mitchell
 
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...Data Con LA
 
Building Data Pipelines with Spark and StreamSets
Building Data Pipelines with Spark and StreamSetsBuilding Data Pipelines with Spark and StreamSets
Building Data Pipelines with Spark and StreamSetsPat Patterson
 
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseData Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseDataWorks Summit
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...Spark Summit
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Data Con LA
 
GPU Acceleration for Financial Services
GPU Acceleration for Financial ServicesGPU Acceleration for Financial Services
GPU Acceleration for Financial ServicesKinetica
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the CloudDataWorks Summit
 
Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016StampedeCon
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...DataStax
 
Scaling Deep Learning on Hadoop at LinkedIn
Scaling Deep Learning on Hadoop at LinkedInScaling Deep Learning on Hadoop at LinkedIn
Scaling Deep Learning on Hadoop at LinkedInDataWorks Summit
 
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure DatabricksETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure DatabricksDatabricks
 
Spark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with SparkSpark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with SparkMatt Ingenthron
 
How To Achieve Real-Time Analytics On A Data Lake Using GPUs
How To Achieve Real-Time Analytics On A Data Lake Using GPUsHow To Achieve Real-Time Analytics On A Data Lake Using GPUs
How To Achieve Real-Time Analytics On A Data Lake Using GPUsKinetica
 

Tendances (20)

What's new in SQL on Hadoop and Beyond
What's new in SQL on Hadoop and BeyondWhat's new in SQL on Hadoop and Beyond
What's new in SQL on Hadoop and Beyond
 
"Who Moved my Data? - Why tracking changes and sources of data is critical to...
"Who Moved my Data? - Why tracking changes and sources of data is critical to..."Who Moved my Data? - Why tracking changes and sources of data is critical to...
"Who Moved my Data? - Why tracking changes and sources of data is critical to...
 
Realtime Analytical Query Processing and Predictive Model Building on High Di...
Realtime Analytical Query Processing and Predictive Model Building on High Di...Realtime Analytical Query Processing and Predictive Model Building on High Di...
Realtime Analytical Query Processing and Predictive Model Building on High Di...
 
Querying Druid in SQL with Superset
Querying Druid in SQL with SupersetQuerying Druid in SQL with Superset
Querying Druid in SQL with Superset
 
Free Servers to Build Big Data System on: Bing’s Approach
Free Servers to Build Big Data System on: Bing’s ApproachFree Servers to Build Big Data System on: Bing’s Approach
Free Servers to Build Big Data System on: Bing’s Approach
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on Hadoop
 
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...
Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawre...
 
Building Data Pipelines with Spark and StreamSets
Building Data Pipelines with Spark and StreamSetsBuilding Data Pipelines with Spark and StreamSets
Building Data Pipelines with Spark and StreamSets
 
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseData Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
 
GPU Acceleration for Financial Services
GPU Acceleration for Financial ServicesGPU Acceleration for Financial Services
GPU Acceleration for Financial Services
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the Cloud
 
Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
 
Scaling Deep Learning on Hadoop at LinkedIn
Scaling Deep Learning on Hadoop at LinkedInScaling Deep Learning on Hadoop at LinkedIn
Scaling Deep Learning on Hadoop at LinkedIn
 
ETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure DatabricksETL Made Easy with Azure Data Factory and Azure Databricks
ETL Made Easy with Azure Data Factory and Azure Databricks
 
Spark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with SparkSpark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with Spark
 
How To Achieve Real-Time Analytics On A Data Lake Using GPUs
How To Achieve Real-Time Analytics On A Data Lake Using GPUsHow To Achieve Real-Time Analytics On A Data Lake Using GPUs
How To Achieve Real-Time Analytics On A Data Lake Using GPUs
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 

En vedette

Big Telco Real-Time Network Analytics
Big Telco Real-Time Network AnalyticsBig Telco Real-Time Network Analytics
Big Telco Real-Time Network AnalyticsYousun Jeong
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter
 
Telco Cloud - An evolution approach 2016
Telco Cloud - An evolution approach 2016Telco Cloud - An evolution approach 2016
Telco Cloud - An evolution approach 2016Fernando Herrera
 
Telco analytics at scale
Telco analytics at scaleTelco analytics at scale
Telco analytics at scaledatamantra
 
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...DataStax Academy
 
The Modern Telco Network: Defining The Telco Cloud
The Modern Telco Network: Defining The Telco CloudThe Modern Telco Network: Defining The Telco Cloud
The Modern Telco Network: Defining The Telco CloudMarco Rodrigues
 
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...Spark Summit
 
Simplifying Big Data Analytics with Apache Spark
Simplifying Big Data Analytics with Apache SparkSimplifying Big Data Analytics with Apache Spark
Simplifying Big Data Analytics with Apache SparkDatabricks
 
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...Spark Summit
 
Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in SparkPaco Nathan
 

En vedette (11)

Big Telco Real-Time Network Analytics
Big Telco Real-Time Network AnalyticsBig Telco Real-Time Network Analytics
Big Telco Real-Time Network Analytics
 
RCA-Mobile-Networks-Summary-20150421
RCA-Mobile-Networks-Summary-20150421RCA-Mobile-Networks-Summary-20150421
RCA-Mobile-Networks-Summary-20150421
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in Telco
 
Telco Cloud - An evolution approach 2016
Telco Cloud - An evolution approach 2016Telco Cloud - An evolution approach 2016
Telco Cloud - An evolution approach 2016
 
Telco analytics at scale
Telco analytics at scaleTelco analytics at scale
Telco analytics at scale
 
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...
Cassandra Community Webinar: Apache Spark Analytics at The Weather Channel - ...
 
The Modern Telco Network: Defining The Telco Cloud
The Modern Telco Network: Defining The Telco CloudThe Modern Telco Network: Defining The Telco Cloud
The Modern Telco Network: Defining The Telco Cloud
 
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...
Predictive Analytics for IoT Network Capacity Planning: Spark Summit East tal...
 
Simplifying Big Data Analytics with Apache Spark
Simplifying Big Data Analytics with Apache SparkSimplifying Big Data Analytics with Apache Spark
Simplifying Big Data Analytics with Apache Spark
 
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...
IoT and the Autonomous Vehicle in the Clouds: Simultaneous Localization and M...
 
Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in Spark
 

Similaire à 03-NOV-1510-Ognjen-Antonic-Telemach-stream-1

Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Precisely
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?OVHcloud
 
How First to Value Beats First to Market: Case Studies of Fast Data Success
How First to Value Beats First to Market: Case Studies of Fast Data SuccessHow First to Value Beats First to Market: Case Studies of Fast Data Success
How First to Value Beats First to Market: Case Studies of Fast Data SuccessVoltDB
 
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...Jochem van Grondelle
 
How Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsHow Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsDATAVERSITY
 
Top 10 Enterprise Use Cases for NoSQL
Top 10 Enterprise Use Cases for NoSQLTop 10 Enterprise Use Cases for NoSQL
Top 10 Enterprise Use Cases for NoSQLDATAVERSITY
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsStephan Reimann
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersDataWorks Summit
 
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdf
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdfth1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdf
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdfTarekHassan840678
 
Monetizing Big Data at Telecom Service Providers
Monetizing Big Data at Telecom Service ProvidersMonetizing Big Data at Telecom Service Providers
Monetizing Big Data at Telecom Service ProvidersDataWorks Summit
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bankChungsik Yun
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTKiththi Perera
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardKiththi Perera
 
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...Istituto nazionale di statistica
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches DataWorks Summit
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetBig Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetSAP Technology
 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceWilfried Hoge
 
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & KafkaMohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & KafkaFlink Forward
 
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWSAmazon Web Services
 

Similaire à 03-NOV-1510-Ognjen-Antonic-Telemach-stream-1 (20)

Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?How to scale your PaaS with OVH infrastructure?
How to scale your PaaS with OVH infrastructure?
 
How First to Value Beats First to Market: Case Studies of Fast Data Success
How First to Value Beats First to Market: Case Studies of Fast Data SuccessHow First to Value Beats First to Market: Case Studies of Fast Data Success
How First to Value Beats First to Market: Case Studies of Fast Data Success
 
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
 
How Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical ApplicationsHow Enterprises are Using NoSQL for Mission-Critical Applications
How Enterprises are Using NoSQL for Mission-Critical Applications
 
Top 10 Enterprise Use Cases for NoSQL
Top 10 Enterprise Use Cases for NoSQLTop 10 Enterprise Use Cases for NoSQL
Top 10 Enterprise Use Cases for NoSQL
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of Things
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdf
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdfth1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdf
th1330-1410effectenbeurszaal4-3v2-140424180955-phpapp01 (1).pdf
 
Monetizing Big Data at Telecom Service Providers
Monetizing Big Data at Telecom Service ProvidersMonetizing Big Data at Telecom Service Providers
Monetizing Big Data at Telecom Service Providers
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
 
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetBig Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experience
 
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & KafkaMohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
 
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS
(ARC346) Scaling To 25 Billion Daily Requests Within 3 Months On AWS
 

03-NOV-1510-Ognjen-Antonic-Telemach-stream-1

  • 1. Unleashing the power of telco analytics using Open Big Data Data Exploration & Visualization Technologies Ognjen Antonic, DWH & BI Manager Telemach Slovenia, Member of United Group ognjen.antonic@telemach.si www.telemach.si www.the-united-group.com
  • 2. 2 About Telemach Slovenia - Leading alternative telecom operator in Slovenia - Acquired Tusmobil, 3rd largest and fastest growing mobile operator in Slovenia during April 2015, now fully integrated operation of Telemach - 70% cable TV market share - 30% IP telephony market share - 24% fixed broadband internet access market share - 14% mobile market share
  • 3. 3 About United Group - Largest alternative telecom provider in the region of former Yugoslavia - Main operating companies: - SBB (leading cable operator in Serbia) - Telemach Slovenia (leading cable operator and fastest growing mobile operator in Slovenia) - Telemach Bosnia - Telemach Montenegro - Main business segments: - Telecommunication platforms - Media (Content & Advertising)
  • 5. 5 About United Group • Pay TV • Broadband • Fixed telephony • Mobile telephony • CuTV • VOD • D3Go • HD • Premium content (HBO,...) Products and Services, Business & Residential • Content rights (Premier League, UEFA Europa League, UEFA Champions League, NBA, WTA, Primera Division, ATP World Tour, Euroleague Basketball…)
  • 7. 7 The Quest for Big Data – How it all started - Mobile Fraud - International Roaming Fraud – Own Subs Abroad - International Roaming Fraud – Foreign Visitors in our Network - Interconnect Fraud - Multiple different Attack Vectors used - High velocity of occurrences
  • 8. 8 The Quest for Big Data – Requirements - Answering fundamental questions - Who? - Where? - When? - How? - Giving Self Service Analytical Ability of find answers through Data Discovery & Visualization
  • 9. 9 The Quest for Big Data – Challenges - IT Challenges - Small Team - Limited Budget - Reusing current IT infrastructure in-place to the maximum level - Leveraging and combining current multiple data sources
  • 10. 10 The Quest for Big Data – Challenges - Old IT Systems Challenges - Fully Based on conventional relational database technologies for data layer – data warehouse - Reporting and analysis based on conventional Business Intelligence tools - Not designed for real-time or near-real-time capture and processing of large amounts of data for analytical purposes - Data discovery & visualization not primary focus of conventional Business Intelligence tools
  • 11. 11 Big Data Solution Requirements - Easy to deploy, develop and manage - Should scale both horizontally & vertically - CPU/processing power wise - Data storage wise - Should cover whole stack - Data Storage & Retreival - Data Integration - Data Presentation - Actively Developed with Good Documentation - Should be Free or Low-Cost Licence wise
  • 12. 12 Big Data Solutions Evaluated - Splunk - Very good commercial solution, covering full stack, easy to deploy and manage - Free only up to 500 MB data per day - Gets very expensive with larger amounts of data processed daily - Apache Hadoop - Free licence-wise, offering extreme scalability - Not covering full stack in pre-integrated way, Data Presentation layer poor in features for self service - Difficult to deploy and manage - Better suited for companies with larger IT teams
  • 13. 13 Settling for the ELK Stack - Settled for ELK Stack, fully integrated with: - Elasticsearch – Data Storage Layer - Logstash - Data Integration Layer - Kibana – Data Presentation Layer - Open Source Based & Free Licensing - Actively Developed with Good Documentation - Scalable and easy to deploy and manage - Enterprise Level Commercial Support Available
  • 14. 14 Technology behind Elasticsearch - NoSQL Data Storage, Indexing and Data Retreival Engine, derivative of Apache Lucene Open Source Project, written fully in Java - Built on following fundamental technological enablers: - Data Sharding and Massively Parallel Processing over cluster of inexpensive servers using Shared Nothing Data Architecture - Data Compression & In-Memory Storage - Probabilistic Computation Models (possibility to trade speed for accuracy or vice versa) - Lots of different free & open source plugins and addons available
  • 15. 15 Developing & Deploying our First Solution - International Roaming Fraud Dashboard - Highest Risk Area – Our customers abroad - Giving ability to Fraud department to visualize what is going on without IT interventions - Based on NRTRDE data received from roaming partners and our own risk scoring engine development - Combining NRTRDE and DWH data
  • 16. 16 Developing & Deploying our First Solution - Effort Spent on our First Solution – Fully In-House - Server deployment and configuration - 1 man day - ELK stack installation, configuration and tuning – 2 man days - Solution Development - Data Integration – 3 man days - Dashboard Development – 2 man days - Total initial effort spent including learning new technology: - 8 man days - 4 core virtual server with 32 GB RAM and 300 GB disk storage
  • 17. 17 Gaining Visual Insight - Dashboards enabled visual insight and discovery what is going on, including Geo-Visualizations - Ability to filter and drilldown to lowest level
  • 22. 22 Building upon Initial Success Story - Based on Initial Success Story we continued with rolling out: - Visiting Roamers Fraud Dashboard - Interconnect Fraud Dashboard - Circuit Switched Mobile Traffic Dashboard - Packet Based Mobile Traffic Dashboard - Online Charging System Real-Time Performance Dashboard - Real-Time Data Feeds to Elasticsearch directly - Complex Near-Real Time Data Feeds using conventional DWH as Data Integration and Data Distribution point towards Elasticearch
  • 23. 23 Building upon Initial Success Story
  • 24. 24 Building upon Initial Success Story
  • 25. 25 Building upon Initial Success Story
  • 26. 26 Building upon Initial Success Story
  • 27. 27 Building upon Initial Success Story
  • 28. 28 Open Source Big Data – IT Benefits - Open Source Big Data Solution Enabled IT to: - To complement our existing DWH & BI Infrastructure including data in those systems in Cost Efficient Manner and Short Timeframe - To provide to our end users large amounts of data for self-service Data Visualization & Discovery not before available - Ability to provide Real-Time Analytics for data sources with real-time data feeds
  • 29. 29 Open Source Big Data - Business Benefits - Open Source Big Data Data Discovery & Visualization Solution Enabled our Business Users: - To quickly visualize and discover what and where is going on in a self-service manner - To dramatically cut decision times and take appropriate actions resulting in: - Reduced business risks - Decreased service/network downtimes - Increased customer satisfaction