SlideShare une entreprise Scribd logo
1  sur  22
© 2012 University of Moratuwa




                        Big Data CDR
                          Analyzer
           “The Next Generation Mobile Promotions”



Project Supervisors-                  •   080201N – M.K.P.R. Jayawardhana
Mr. Thilina Anjitha – hSenid          •   080254D – P.K.A.M. Kumara
Dr.Shahani Markus Weerawarana
                                      •   080331L – W.D.A.I. Paranawithana
                                      •   080357V – T.D.K. Perera
© 2012 University of Moratuwa




                OVERVIEW
 Background
 Current Situation
 Scope and Assumptions
 Kanthaka – big data CDR Analyzer System
 Technology Comparison
     - Map Reduce
     - NoSQL Databases
 Architecture
 Risks and Possible Remedies
 References
© 2012 University of Moratuwa



Background
Mobile Promotions
© 2012 University of Moratuwa




    CURRENT SITUATION
•   Promotions based only on their network
    usage
•   Use only active call switch for triggering
    promotions
•   No way of analyzing and processing high
    volume CDR records
•   No efficient CDR analyzing method
•   No access to historical data
•   Complex rules not supported                         &@$*#
© 2012 University of Moratuwa




                                                TO RESCUE

   Selecting eligible users for both commercial
    organizations based and network usage
    based promotions.
     Eg- giving 20% discount for pizza lovers within age group 16-40 who
        have called pizza hut more than 5 times a month

 High volume CDR analysis.
 Near real time selection of eligible users for
  promotions.
© 2012 University of Moratuwa




   CDR Analyzer system which
     can   process 30 million records per day

     can   produce results within 30 seconds

     provides   a GUI to define dynamic rules

     can   be used to offer real-time sales
     promotions for mobile subscribers
© 2012 University of Moratuwa




This location information retrieving from Location Based System(LBS) can
be replaced with any other information retrieving such as subscriber age
from the Customer Relationship Management system to support attractive
promotions.
© 2012 University of Moratuwa



SCOPE AND ASSUMPTIONS
SCOPE




  30 M                         30 M
  Multiple Rules               Multiple Rules
  Offer Promotion              Select eligibilities for
                                 promotion only

  Real system operation           Operation expect by Kanthaka
© 2012 University of Moratuwa




ASSUMPTIONS

 CDR records can be only in .CSV format.
 Event type can be in different types like
  SMS, Voice call, MMS, USSD, Top-up,
  GPRS, LBS.
 CDR can be received as batches to the
  system asynchronously.
 Only 6 attributes out of many attributes will
  be considered during processing.
© 2012 University of Moratuwa




TECHNOLOGY COMPARISON
© 2012 University of Moratuwa
© 2012 University of Moratuwa




YCSB BENCHMARKS




   With more big users, active mailing lists, most
    promising technologies (secondary index,
    counters) best to try out is Cassandra.
© 2012 University of Moratuwa
© 2012 University of Moratuwa




TECHNOLOGY SELECTION
TECHNOLOGIES LEFT BEHIND            TECHNOLOGIES SELECTED


   Complex Event                      NoSQL DB - Cassandra
    Processing engines(CEP)
       No persistency
   Rules Engine
       More layers  More
        latency
   Hadoop - latency
   NoSQL DB- Hbase,
    MongoDB, Hive
© 2012 University of Moratuwa




BRIEF ARCHITECTURE OF ‘KANTHAKA’




Promotion definition
                                                             Cassandra Cluster




                       Pre-processing unit
© 2012 University of Moratuwa




TEST RESULTS IN SINGLE NODE
© 2012 University of Moratuwa




TEST RESULTS IN TWO NODE- CLUSTER
© 2012 University of Moratuwa




CLUSTER BETTER IN HIGH LOADS
© 2012 University of Moratuwa




RISKS AND POSSIBLE REMEDIES

 NoSQL databases
  High performance More memory
 Use an external cluster with descent memory
 Concurrency Issues Handling
  Low speed  Locking database
  Use shadow copy
 Handling sudden peaks
  Should have an auto balancing mechanism ready
© 2012 University of Moratuwa




FINAL DELIVERABLES

 Big Data CDR Analyzer system

 Research Paper

 Final Report
© 2012 University of Moratuwa



REFERENCES

   B. F. Cooper, A. Silberstein, E. Tam, R.
    Ramakrishnan, and R. Sears,
    “Benchmarking cloud serving systems with
    YCSB,” 2010, pp. 143–154.

Visit us at Kanthaka
© 2012 University of Moratuwa




          Thank you



                                Manoj



Dhanika                                         Amila




          Pushpalanka

Contenu connexe

En vedette

C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
DataStax Academy
 
MongoDB Interface for Asterisk PBX
MongoDB Interface for Asterisk PBXMongoDB Interface for Asterisk PBX
MongoDB Interface for Asterisk PBX
Sokratis Galiatsis
 
Application of Data Mining and Machine Learning techniques for Fraud Detectio...
Application of Data Mining and Machine Learning techniques for Fraud Detectio...Application of Data Mining and Machine Learning techniques for Fraud Detectio...
Application of Data Mining and Machine Learning techniques for Fraud Detectio...
Christian Adom
 
Fraud Detection using Data Mining Project
Fraud Detection using Data Mining ProjectFraud Detection using Data Mining Project
Fraud Detection using Data Mining Project
Albert Kennedy III
 

En vedette (20)

CDR-Stats : VoIP Analytics Solution for Asterisk and FreeSWITCH with MongoDB
CDR-Stats : VoIP Analytics Solution for Asterisk and FreeSWITCH with MongoDBCDR-Stats : VoIP Analytics Solution for Asterisk and FreeSWITCH with MongoDB
CDR-Stats : VoIP Analytics Solution for Asterisk and FreeSWITCH with MongoDB
 
telecom analytics ppt
telecom analytics ppttelecom analytics ppt
telecom analytics ppt
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
 
Predictive Analytics in Telecommunication
Predictive Analytics in TelecommunicationPredictive Analytics in Telecommunication
Predictive Analytics in Telecommunication
 
Deep Learning for Fraud Detection
Deep Learning for Fraud DetectionDeep Learning for Fraud Detection
Deep Learning for Fraud Detection
 
PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014
 
C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
C* Summit EU 2013: Capitalizing on Data in Telecommunications: The Cassandra ...
 
Keyword query routing
Keyword query routingKeyword query routing
Keyword query routing
 
Cassandra Cluster Manager (CCM)
Cassandra Cluster Manager (CCM)Cassandra Cluster Manager (CCM)
Cassandra Cluster Manager (CCM)
 
Ativ1 4 rafaelaam
Ativ1 4 rafaelaamAtiv1 4 rafaelaam
Ativ1 4 rafaelaam
 
MongoDB Interface for Asterisk PBX
MongoDB Interface for Asterisk PBXMongoDB Interface for Asterisk PBX
MongoDB Interface for Asterisk PBX
 
Data Science Strategy
Data Science StrategyData Science Strategy
Data Science Strategy
 
Application of Data Mining and Machine Learning techniques for Fraud Detectio...
Application of Data Mining and Machine Learning techniques for Fraud Detectio...Application of Data Mining and Machine Learning techniques for Fraud Detectio...
Application of Data Mining and Machine Learning techniques for Fraud Detectio...
 
A data mining framework for fraud detection in telecom based on MapReduce (Pr...
A data mining framework for fraud detection in telecom based on MapReduce (Pr...A data mining framework for fraud detection in telecom based on MapReduce (Pr...
A data mining framework for fraud detection in telecom based on MapReduce (Pr...
 
Fraud Detection using Data Mining Project
Fraud Detection using Data Mining ProjectFraud Detection using Data Mining Project
Fraud Detection using Data Mining Project
 
Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning
 
Big Data Telecom
Big Data TelecomBig Data Telecom
Big Data Telecom
 
Frauds in telecom sector
Frauds in telecom sectorFrauds in telecom sector
Frauds in telecom sector
 
Data Mining in telecommunication industry
Data Mining in telecommunication industryData Mining in telecommunication industry
Data Mining in telecommunication industry
 
Data mining in Telecommunications
Data mining in TelecommunicationsData mining in Telecommunications
Data mining in Telecommunications
 

Similaire à Big Data CDR Analyzer - Kanthaka

Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
birdsking
 
Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...
IJECEIAES
 
Final Year Project Titles 2013 2014
Final Year Project Titles 2013 2014Final Year Project Titles 2013 2014
Final Year Project Titles 2013 2014
sybiantech
 
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdfCA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
princeharit48
 
Synopsis on ulip upated
Synopsis on ulip upatedSynopsis on ulip upated
Synopsis on ulip upated
Prem Kumar
 

Similaire à Big Data CDR Analyzer - Kanthaka (20)

Forecast 2014: Welcome to the ODCA University - School is Way Cool!
Forecast 2014: Welcome to the ODCA University - School is Way Cool!Forecast 2014: Welcome to the ODCA University - School is Way Cool!
Forecast 2014: Welcome to the ODCA University - School is Way Cool!
 
mu-DDRL_A_QoS-Aware_Distributed_Deep_Reinforcement_Learning_Technique_for_Ser...
mu-DDRL_A_QoS-Aware_Distributed_Deep_Reinforcement_Learning_Technique_for_Ser...mu-DDRL_A_QoS-Aware_Distributed_Deep_Reinforcement_Learning_Technique_for_Ser...
mu-DDRL_A_QoS-Aware_Distributed_Deep_Reinforcement_Learning_Technique_for_Ser...
 
MDG & RDM field reports Aaron Zornes - 2012 Singapore- print v1
MDG & RDM field reports   Aaron Zornes - 2012 Singapore- print v1MDG & RDM field reports   Aaron Zornes - 2012 Singapore- print v1
MDG & RDM field reports Aaron Zornes - 2012 Singapore- print v1
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containers
 
7th cloud computing & big data 2013 Summit - 2013
7th cloud computing & big data 2013 Summit - 2013 7th cloud computing & big data 2013 Summit - 2013
7th cloud computing & big data 2013 Summit - 2013
 
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
Ieee projects-2014-bulk-ieee-projects-2015-title-list-for-me-be-mphil-final-y...
 
Umu seminar 02-2019
Umu seminar 02-2019Umu seminar 02-2019
Umu seminar 02-2019
 
Adopting Cloud Testing for Continuous Delivery, with the premier global provi...
Adopting Cloud Testing for Continuous Delivery, with the premier global provi...Adopting Cloud Testing for Continuous Delivery, with the premier global provi...
Adopting Cloud Testing for Continuous Delivery, with the premier global provi...
 
Smart storage Solutions for Big Data in Smart Cities
Smart storage Solutions for Big Data in Smart Cities Smart storage Solutions for Big Data in Smart Cities
Smart storage Solutions for Big Data in Smart Cities
 
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, RomaniaDICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
 
Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...
 
Itc571 Project Presentation
Itc571 Project PresentationItc571 Project Presentation
Itc571 Project Presentation
 
The Benefits of a Seamless IRT and EDC Integration in Clinical Trial Execution
The Benefits of a Seamless IRT and EDC Integration in Clinical Trial ExecutionThe Benefits of a Seamless IRT and EDC Integration in Clinical Trial Execution
The Benefits of a Seamless IRT and EDC Integration in Clinical Trial Execution
 
SDN-Based Enhancements to QoS and Data Quality in Multi-Tenanted Data Center ...
SDN-Based Enhancements to QoS and Data Quality in Multi-Tenanted Data Center ...SDN-Based Enhancements to QoS and Data Quality in Multi-Tenanted Data Center ...
SDN-Based Enhancements to QoS and Data Quality in Multi-Tenanted Data Center ...
 
Final Year Project Titles 2013 2014
Final Year Project Titles 2013 2014Final Year Project Titles 2013 2014
Final Year Project Titles 2013 2014
 
Forecast deploy1 100_ak2
Forecast deploy1 100_ak2Forecast deploy1 100_ak2
Forecast deploy1 100_ak2
 
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdfCA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
CA1038 - A Secure Data Sharing Strategy for Mobile Cloud Platform.pdf
 
Synopsis on ulip upated
Synopsis on ulip upatedSynopsis on ulip upated
Synopsis on ulip upated
 
Total interpretive structural modelling on enablers of cloud computing
Total interpretive structural modelling on enablers of cloud computingTotal interpretive structural modelling on enablers of cloud computing
Total interpretive structural modelling on enablers of cloud computing
 
Presentation_final.pdf
Presentation_final.pdfPresentation_final.pdf
Presentation_final.pdf
 

Plus de Pushpalanka Jayawardhana

Plus de Pushpalanka Jayawardhana (10)

Authorization for workloads in a dynamically scaling heterogeneous system
Authorization for workloads in a  dynamically scaling heterogeneous systemAuthorization for workloads in a  dynamically scaling heterogeneous system
Authorization for workloads in a dynamically scaling heterogeneous system
 
The role of IAM in OpenBanking and where do we stand
The role of IAM in OpenBanking and where do we stand The role of IAM in OpenBanking and where do we stand
The role of IAM in OpenBanking and where do we stand
 
Frictionless Adaption of PSD2 with WSO2
Frictionless Adaption of PSD2 with WSO2Frictionless Adaption of PSD2 with WSO2
Frictionless Adaption of PSD2 with WSO2
 
Identity mediation for enterprise identity bus
Identity mediation for enterprise identity busIdentity mediation for enterprise identity bus
Identity mediation for enterprise identity bus
 
Threads and Concurrency Identifying Performance Deviations in Thread Pools
Threads and Concurrency Identifying Performance Deviations in Thread PoolsThreads and Concurrency Identifying Performance Deviations in Thread Pools
Threads and Concurrency Identifying Performance Deviations in Thread Pools
 
Approximate Protocol for Privacy Preserving Associate Rule Mining
Approximate Protocol for Privacy Preserving Associate Rule MiningApproximate Protocol for Privacy Preserving Associate Rule Mining
Approximate Protocol for Privacy Preserving Associate Rule Mining
 
Leveraging federation capabilities of identity server for api gateway
Leveraging federation capabilities  of identity server for api gatewayLeveraging federation capabilities  of identity server for api gateway
Leveraging federation capabilities of identity server for api gateway
 
Feedback queuing models for time shared systems
Feedback queuing models for time shared systemsFeedback queuing models for time shared systems
Feedback queuing models for time shared systems
 
Experience at WSO2 as an Intern
Experience at WSO2 as an InternExperience at WSO2 as an Intern
Experience at WSO2 as an Intern
 
Cosmology in general
Cosmology in generalCosmology in general
Cosmology in general
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Big Data CDR Analyzer - Kanthaka

  • 1. © 2012 University of Moratuwa Big Data CDR Analyzer “The Next Generation Mobile Promotions” Project Supervisors- • 080201N – M.K.P.R. Jayawardhana Mr. Thilina Anjitha – hSenid • 080254D – P.K.A.M. Kumara Dr.Shahani Markus Weerawarana • 080331L – W.D.A.I. Paranawithana • 080357V – T.D.K. Perera
  • 2. © 2012 University of Moratuwa OVERVIEW  Background  Current Situation  Scope and Assumptions  Kanthaka – big data CDR Analyzer System  Technology Comparison - Map Reduce - NoSQL Databases  Architecture  Risks and Possible Remedies  References
  • 3. © 2012 University of Moratuwa Background Mobile Promotions
  • 4. © 2012 University of Moratuwa CURRENT SITUATION • Promotions based only on their network usage • Use only active call switch for triggering promotions • No way of analyzing and processing high volume CDR records • No efficient CDR analyzing method • No access to historical data • Complex rules not supported &@$*#
  • 5. © 2012 University of Moratuwa TO RESCUE  Selecting eligible users for both commercial organizations based and network usage based promotions. Eg- giving 20% discount for pizza lovers within age group 16-40 who have called pizza hut more than 5 times a month  High volume CDR analysis.  Near real time selection of eligible users for promotions.
  • 6. © 2012 University of Moratuwa  CDR Analyzer system which  can process 30 million records per day  can produce results within 30 seconds  provides a GUI to define dynamic rules  can be used to offer real-time sales promotions for mobile subscribers
  • 7. © 2012 University of Moratuwa This location information retrieving from Location Based System(LBS) can be replaced with any other information retrieving such as subscriber age from the Customer Relationship Management system to support attractive promotions.
  • 8. © 2012 University of Moratuwa SCOPE AND ASSUMPTIONS SCOPE  30 M  30 M  Multiple Rules  Multiple Rules  Offer Promotion  Select eligibilities for promotion only Real system operation Operation expect by Kanthaka
  • 9. © 2012 University of Moratuwa ASSUMPTIONS  CDR records can be only in .CSV format.  Event type can be in different types like SMS, Voice call, MMS, USSD, Top-up, GPRS, LBS.  CDR can be received as batches to the system asynchronously.  Only 6 attributes out of many attributes will be considered during processing.
  • 10. © 2012 University of Moratuwa TECHNOLOGY COMPARISON
  • 11. © 2012 University of Moratuwa
  • 12. © 2012 University of Moratuwa YCSB BENCHMARKS  With more big users, active mailing lists, most promising technologies (secondary index, counters) best to try out is Cassandra.
  • 13. © 2012 University of Moratuwa
  • 14. © 2012 University of Moratuwa TECHNOLOGY SELECTION TECHNOLOGIES LEFT BEHIND TECHNOLOGIES SELECTED  Complex Event  NoSQL DB - Cassandra Processing engines(CEP)  No persistency  Rules Engine  More layers  More latency  Hadoop - latency  NoSQL DB- Hbase, MongoDB, Hive
  • 15. © 2012 University of Moratuwa BRIEF ARCHITECTURE OF ‘KANTHAKA’ Promotion definition Cassandra Cluster Pre-processing unit
  • 16. © 2012 University of Moratuwa TEST RESULTS IN SINGLE NODE
  • 17. © 2012 University of Moratuwa TEST RESULTS IN TWO NODE- CLUSTER
  • 18. © 2012 University of Moratuwa CLUSTER BETTER IN HIGH LOADS
  • 19. © 2012 University of Moratuwa RISKS AND POSSIBLE REMEDIES  NoSQL databases High performance More memory  Use an external cluster with descent memory  Concurrency Issues Handling Low speed  Locking database Use shadow copy  Handling sudden peaks Should have an auto balancing mechanism ready
  • 20. © 2012 University of Moratuwa FINAL DELIVERABLES Big Data CDR Analyzer system Research Paper Final Report
  • 21. © 2012 University of Moratuwa REFERENCES  B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, “Benchmarking cloud serving systems with YCSB,” 2010, pp. 143–154. Visit us at Kanthaka
  • 22. © 2012 University of Moratuwa Thank you Manoj Dhanika Amila Pushpalanka