SlideShare une entreprise Scribd logo
Predictive Analytics & Business Insights – 2015 , Chicago
Mudit Mangal
Project Lead, Data Analytics, Supply Chain
Sears Holdings Corporation
06/11/2015
Agenda
— WHAT IS HAPPENING
— WHAT IS DATA ANALYTICS AND ITS CHALLENGES
— WHY DATA FOUNDATION
— HOW TO APPROACH
— WHY DATA GOVERNANCE
— WHAT ARE SECURITY ISSUES WITH DATA
— USE CASE
— WHAT IS IN FUTURE
— QUSESTIONS
New Data Frontiers
— Fueled by growing demand for anytime anywhere access to
information, technology is disrupting all areas of enterprise, driving
myriad opportunities and challenges.
— Enormous opportunities exist for enterprises to take advantage of
connected devices enabled by the “Internet of Things” to capture vast
amounts of information.
— Digital transformation is changing business models –pricing
strategies, processes, relationship between businesses and customers.
— Declining PC usage and increasing mobile device adoption is
driving a “mobile first” world.
— However, the evolution of the digital enterprise also presents
significant challenges, including new competition, changing
customer engagement and business models, unprecedented
transparency, privacy concerns and cybersecurity threats.
Digital Frontiers
Understanding Analytics
360-degree view
360-degree view of all the data is important to know what’s happening in
a marketplace—the combination of structured information, human
interactions, and machine-to-machine data.
360-degree view in practice
— In IT operations management, a 360-degree view allows to see logs,
performance ,it also lets you see what the test team said to the app dev
team, what the customers said to the help desk, and what the app
support team said to the help desk.
— In security management, a 360-degree view lets you see security alerts
from the network, applications, and infrastructure, while human
interaction data allows you to see security threats in emails.
— In retail, a 360-degree view allows you to analyze sales in stores and
online, as well as understand consumers’ expectations ,social
sentiment regarding the store. What do people think about your
service compared to that of your competitors?
By its definition : “Data that was previously ignored because of
technology limitations” examples includes unstructured data that
companies have struggled to analyze in the past, documents, social
data, customer surveys, web logs, and a lot of ‘dark’ structured data.
Dark Data
Data Torturing
Are you still torturing your data to get what you want?
Data Foundation
— Data is the foundation of all information solutions, BI and analytical
decisions and choosing the right technology is important.
— The data foundation encompasses the integration of data from
multiple, disparate sources into a trusted, understandable form for
use in analytics and making data as an enterprise asset.
— The escalating volume, variety, and velocity of information that is
being generated today present with many critical challenges.
— However, this overabundance of information can be an important asset
to those organizations that choose to capitalize on it.
Components of data foundation
Benefits of robust data foundation
A robust data foundation provides an organization with tremendous
benefits in terms of efficiency and effectiveness in decision making.
— One-stop shopping for data: Most significant uses of time in
decision making is getting the data in usable format. A robust data
foundation changes the 80 percent time spent on gathering to 80
percent time spent on analyzing the data.
— Single version of the truth: Getting different answers to same
questions is frustrating experience for decision makers. A robust data
foundation provide single version of truth on which everyone can rely.
— Drives common understanding across the enterprise: One of the
key objectives of data foundation is to integrate data from disparate
sources. A robust data foundation provides the structure and
enforcement of these, resulting everyone in organization working on
same page.
Consequences of the lack of a robust
data foundation
— Multiple answers to same questions
— Making less optimal business decisions
— Wasted time finding, collecting, summarizing data for use in analytics
Getting Started: Building Out
— Analyzing data often requires a transformational approach to many
critical IT processes.
— Ask yourself what data points do I need, how I am going to get them,
and what am I going to do with them once I have them ?
— Try building a Distributed platform that is small, low cost, fluent in all
forms of data and analytics. E.g. data in motion.
— Next, identify a low impact use case for implementation.
— Your application should be a good candidate for Distributed Data
Computing.
— If so, a successful POC will be assured.
From Data Lakes To Data Swamps
“By its definition, a data lake accepts any data, without governance.
Without metadata and a mechanism to maintain it, the data lake risks
turning into a data swamp and leads to hardest problem of data quality.”
Big Data without Governance
— Dumping data into Big Data Lake without repeatable processes and
data governance will create messy, uncontrollable data environment.
— Insights harvested from ungoverned data lake is not reliable and
trustworthy, so cannot make business decisions confidently.
— In an industry where data is the most valuable asset, data integrity is
essential. If the data is compromised, it can have vast consequences.
— Data must be physically safe. Whether data is stored internally or
within the cloud, Disaster recovery, security and other actions must be
taken to ensure the physical integrity of data.
— Humans make mistakes. Maintaining data integrity is difficult when
humans enter free-form text into software systems.
Governance Disciplines
Evolving Data Governance
Security Risk for Big Data
— As cyber threats continue to multiply, it is becoming harder to
safeguard data, intellectual property, and personal information.
— Greater use of the internet, smartphones and tablets in combination
with bring-your-own-device policies has made organizations’ data
more accessible and vulnerable.
— More data implies higher risk of exposure.
— New data types may give rise to new security breach scenarios.
Data Lake Security Solutions
Evolving Data Security
— Apache Knox : Perimeter/ Network Security
— Apache Ranger : Data Protection, Authorization, Audit tracking
— Apache Sentry : Authorization
Retail Use Case
Let’s face it: when it comes to giving business users the information they
need, retail is as tough. With multiple stores, myriad, ever-changing
products and constant transactions, every day is a new challenge.
— Most retailers already have systems in place to provide business users
with information. The question is, how to do better?
— Web-based retail analytic applications extend existing business
reporting and analytic solutions. Helps in understanding Customers.
— Sears has a very intensive big data program to drive customer loyalty,
Sears is doing amazing things with technology and Competes On Big
Data.
Predicting Future
Future of Data
What kind of data will be Big Data in the future?
— Structured data - This is the data companies store today: sales
transactions, maintenance details. Since Big Data technology allows us
to store more data and analyze it much faster, there will be increase in
amount of details stored ,in the time period for which data is kept.
— Human interaction data - Unstructured data refers to the data of
human interactions: emails, phone conversations, video, pictures,
documents, social media interactions on Twitter, Facebook and other
communities. This type of data represents 90 percent of useful data.
— Machine to machine data - By 2020 there will have been a massive
increase in the number of connected smart devices. Cooktops,
shopping carts, home thermostats, cars, bicycles, and refrigerators will
run applications that connect to the emerging Internet of Things.
These devices will generate huge amounts of data. Collecting and
analyzing this data will lead to new insights.
Conclusion
— Reporting and Analytics can be transformational for an organization.
However, having the proper data foundation that provides trusted,
well-integrated and well-managed data is essential to realize the
desired reporting and analytical capabilities.
— Mapping out a strategy and plan to establish the data foundation is
time well spent and will provide a return many times over.
Questions?
www.linkedin.com/in/muditmangal

Contenu connexe

Tendances

Change management success for data governance
Change management success for data governanceChange management success for data governance
Change management success for data governance
Reid Elliott
 

Tendances (20)

Data Analytics Business Intelligence
Data Analytics Business IntelligenceData Analytics Business Intelligence
Data Analytics Business Intelligence
 
Product Management for AI
Product Management for AIProduct Management for AI
Product Management for AI
 
Business analytics presentation
Business analytics presentationBusiness analytics presentation
Business analytics presentation
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse Requirements
 
Business Intelligence concepts
Business Intelligence conceptsBusiness Intelligence concepts
Business Intelligence concepts
 
Big Data in e-Commerce
Big Data in e-CommerceBig Data in e-Commerce
Big Data in e-Commerce
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
Business Intelligence Presentation (1/2)
Business Intelligence Presentation (1/2)Business Intelligence Presentation (1/2)
Business Intelligence Presentation (1/2)
 
Business Analytics
 Business Analytics  Business Analytics
Business Analytics
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Change management success for data governance
Change management success for data governanceChange management success for data governance
Change management success for data governance
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
 
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applications
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Strategic Business Requirements for Master Data Management Systems
Strategic Business Requirements for Master Data Management SystemsStrategic Business Requirements for Master Data Management Systems
Strategic Business Requirements for Master Data Management Systems
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
New Analytic Uses of Master Data Management in the Enterprise
New Analytic Uses of Master Data Management in the EnterpriseNew Analytic Uses of Master Data Management in the Enterprise
New Analytic Uses of Master Data Management in the Enterprise
 

Similaire à Data foundation for analytics excellence

Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
Vin Malhotra
 
Practical analytics john enoch white paper
Practical analytics john enoch white paperPractical analytics john enoch white paper
Practical analytics john enoch white paper
John Enoch
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
IBM Software India
 

Similaire à Data foundation for analytics excellence (20)

Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
Transforming Big Data into business value
Transforming Big Data into business valueTransforming Big Data into business value
Transforming Big Data into business value
 
Big data assignment
Big data assignmentBig data assignment
Big data assignment
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - Whitepaper
 
A data powered future
A data powered futureA data powered future
A data powered future
 
6 Reasons to Use Data Analytics
6 Reasons to Use Data Analytics6 Reasons to Use Data Analytics
6 Reasons to Use Data Analytics
 
Bidata
BidataBidata
Bidata
 
new.pptx
new.pptxnew.pptx
new.pptx
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Big data's impact on online marketing
Big data's impact on online marketingBig data's impact on online marketing
Big data's impact on online marketing
 
Practical analytics john enoch white paper
Practical analytics john enoch white paperPractical analytics john enoch white paper
Practical analytics john enoch white paper
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
 
Top Trends & Predictions That Will Drive Data Science in 2022.pdf
Top Trends & Predictions That Will Drive Data Science in 2022.pdfTop Trends & Predictions That Will Drive Data Science in 2022.pdf
Top Trends & Predictions That Will Drive Data Science in 2022.pdf
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Rising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docxRising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docx
 

Dernier

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Dernier (20)

Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Buy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptxBuy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptx
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 

Data foundation for analytics excellence

  • 1. Predictive Analytics & Business Insights – 2015 , Chicago Mudit Mangal Project Lead, Data Analytics, Supply Chain Sears Holdings Corporation 06/11/2015
  • 2. Agenda — WHAT IS HAPPENING — WHAT IS DATA ANALYTICS AND ITS CHALLENGES — WHY DATA FOUNDATION — HOW TO APPROACH — WHY DATA GOVERNANCE — WHAT ARE SECURITY ISSUES WITH DATA — USE CASE — WHAT IS IN FUTURE — QUSESTIONS
  • 3. New Data Frontiers — Fueled by growing demand for anytime anywhere access to information, technology is disrupting all areas of enterprise, driving myriad opportunities and challenges. — Enormous opportunities exist for enterprises to take advantage of connected devices enabled by the “Internet of Things” to capture vast amounts of information. — Digital transformation is changing business models –pricing strategies, processes, relationship between businesses and customers. — Declining PC usage and increasing mobile device adoption is driving a “mobile first” world. — However, the evolution of the digital enterprise also presents significant challenges, including new competition, changing customer engagement and business models, unprecedented transparency, privacy concerns and cybersecurity threats.
  • 6. 360-degree view 360-degree view of all the data is important to know what’s happening in a marketplace—the combination of structured information, human interactions, and machine-to-machine data.
  • 7. 360-degree view in practice — In IT operations management, a 360-degree view allows to see logs, performance ,it also lets you see what the test team said to the app dev team, what the customers said to the help desk, and what the app support team said to the help desk. — In security management, a 360-degree view lets you see security alerts from the network, applications, and infrastructure, while human interaction data allows you to see security threats in emails. — In retail, a 360-degree view allows you to analyze sales in stores and online, as well as understand consumers’ expectations ,social sentiment regarding the store. What do people think about your service compared to that of your competitors?
  • 8. By its definition : “Data that was previously ignored because of technology limitations” examples includes unstructured data that companies have struggled to analyze in the past, documents, social data, customer surveys, web logs, and a lot of ‘dark’ structured data. Dark Data
  • 9. Data Torturing Are you still torturing your data to get what you want?
  • 10. Data Foundation — Data is the foundation of all information solutions, BI and analytical decisions and choosing the right technology is important. — The data foundation encompasses the integration of data from multiple, disparate sources into a trusted, understandable form for use in analytics and making data as an enterprise asset. — The escalating volume, variety, and velocity of information that is being generated today present with many critical challenges. — However, this overabundance of information can be an important asset to those organizations that choose to capitalize on it.
  • 11. Components of data foundation
  • 12. Benefits of robust data foundation A robust data foundation provides an organization with tremendous benefits in terms of efficiency and effectiveness in decision making. — One-stop shopping for data: Most significant uses of time in decision making is getting the data in usable format. A robust data foundation changes the 80 percent time spent on gathering to 80 percent time spent on analyzing the data. — Single version of the truth: Getting different answers to same questions is frustrating experience for decision makers. A robust data foundation provide single version of truth on which everyone can rely. — Drives common understanding across the enterprise: One of the key objectives of data foundation is to integrate data from disparate sources. A robust data foundation provides the structure and enforcement of these, resulting everyone in organization working on same page.
  • 13. Consequences of the lack of a robust data foundation — Multiple answers to same questions — Making less optimal business decisions — Wasted time finding, collecting, summarizing data for use in analytics
  • 14. Getting Started: Building Out — Analyzing data often requires a transformational approach to many critical IT processes. — Ask yourself what data points do I need, how I am going to get them, and what am I going to do with them once I have them ? — Try building a Distributed platform that is small, low cost, fluent in all forms of data and analytics. E.g. data in motion. — Next, identify a low impact use case for implementation. — Your application should be a good candidate for Distributed Data Computing. — If so, a successful POC will be assured.
  • 15. From Data Lakes To Data Swamps “By its definition, a data lake accepts any data, without governance. Without metadata and a mechanism to maintain it, the data lake risks turning into a data swamp and leads to hardest problem of data quality.”
  • 16. Big Data without Governance — Dumping data into Big Data Lake without repeatable processes and data governance will create messy, uncontrollable data environment. — Insights harvested from ungoverned data lake is not reliable and trustworthy, so cannot make business decisions confidently. — In an industry where data is the most valuable asset, data integrity is essential. If the data is compromised, it can have vast consequences. — Data must be physically safe. Whether data is stored internally or within the cloud, Disaster recovery, security and other actions must be taken to ensure the physical integrity of data. — Humans make mistakes. Maintaining data integrity is difficult when humans enter free-form text into software systems.
  • 19. Security Risk for Big Data — As cyber threats continue to multiply, it is becoming harder to safeguard data, intellectual property, and personal information. — Greater use of the internet, smartphones and tablets in combination with bring-your-own-device policies has made organizations’ data more accessible and vulnerable. — More data implies higher risk of exposure. — New data types may give rise to new security breach scenarios.
  • 20. Data Lake Security Solutions
  • 21. Evolving Data Security — Apache Knox : Perimeter/ Network Security — Apache Ranger : Data Protection, Authorization, Audit tracking — Apache Sentry : Authorization
  • 22. Retail Use Case Let’s face it: when it comes to giving business users the information they need, retail is as tough. With multiple stores, myriad, ever-changing products and constant transactions, every day is a new challenge. — Most retailers already have systems in place to provide business users with information. The question is, how to do better? — Web-based retail analytic applications extend existing business reporting and analytic solutions. Helps in understanding Customers. — Sears has a very intensive big data program to drive customer loyalty, Sears is doing amazing things with technology and Competes On Big Data.
  • 24. Future of Data What kind of data will be Big Data in the future? — Structured data - This is the data companies store today: sales transactions, maintenance details. Since Big Data technology allows us to store more data and analyze it much faster, there will be increase in amount of details stored ,in the time period for which data is kept. — Human interaction data - Unstructured data refers to the data of human interactions: emails, phone conversations, video, pictures, documents, social media interactions on Twitter, Facebook and other communities. This type of data represents 90 percent of useful data. — Machine to machine data - By 2020 there will have been a massive increase in the number of connected smart devices. Cooktops, shopping carts, home thermostats, cars, bicycles, and refrigerators will run applications that connect to the emerging Internet of Things. These devices will generate huge amounts of data. Collecting and analyzing this data will lead to new insights.
  • 25. Conclusion — Reporting and Analytics can be transformational for an organization. However, having the proper data foundation that provides trusted, well-integrated and well-managed data is essential to realize the desired reporting and analytical capabilities. — Mapping out a strategy and plan to establish the data foundation is time well spent and will provide a return many times over.