SlideShare une entreprise Scribd logo
1  sur  36
Big Data
Trịnh Phong Nhã
Võ Hoàng Trôvi
Võ Đình Chinh
GVGD: TS. Nguyễn Đức Thái
Memory storage…
Computer Memory: 640K Ought to
be Enough for Anyone
How much data?
7 billion people
Google processes 100 PB/day; 3 million servers
Facebook has 300 PB + 500 TB/day; 35% of
world’s photos
YouTube 1000 PB video storage; 4 billion
views/day
Twitter processes 124 billion tweets/year
SMS messages – 6.1T per year
US Cell Calls – 2.2T minutes per year
US Credit cards - 1.4B Cards; 20B
transactions/year
3
Contents
4. Big Data Security
3. SQL vs NoSQL
2. Big Data Technology Today
1. Big Data Overview
5. Big data trends
6. Demo with MongoDB & Ref docs
1. Big Data Overview (tt)
“Big data is not a single technology
but a combination of old and new
tech-nologies that helps companies
gain actionable insight”.
(“Big Data For DummiesPublished by John Wiley & Sons,
Inc. ” book reference)
1. Big Data Overview (tt)
Characteristics of Big Data
Sources of Big Data
ERP
RFID
Website
Network Switches
Social Media
Examining Big Data Types
Structured Data
Structured Data(…)
Computer- or machine-generated:
Machine-generated data generally
refers to data that is created by a
machine without human intervention.
(Sensor data, Web log data, Point-of-
sale data, Financial data…)
Human-generated: This is data that
humans, in interaction with
computers, supply (Input data, Click-
stream data, Gaming-related data…)
Examining Big Data Types
Unstructured Data
Unstructured Data(…)
Unstructured data is everywhere
Machine-generated unstructured
data: Satellite images, Scientific
data, Photographs and video, Radar
or sonar data…
 Human-generated unstructured
data:Text internal to your company,
Social media data, Mobile data…
Managing different data types
Managing different data types
Integrating data types into a big data
environment need:
Connectors: enable you to pull data
in from various big data sources
Metadata is the definitions,
mappings, and other characteristics
used to describe how to find, access,
and use a company’s data (and
software) components
Analysis
• Querying
• Statistic
• Modeling
• Data Mining
• Text analytics
Analysis &
Processing
Processing
• Data storage
• Data transfer
• Data monitoring
What will we do with Big Data?
Quiz….?
How to store and
handle Big Data?
2. Big Data Technology Today
Storage…NoSQL Database
2.Big Data Technology Today(tt)
Processing
2.Big Data Technology Today(tt)
 The Apache Hadoop software library is a
framework that allows for the distributed
processing of large data sets across clusters of
computers using simple programming models.
2.Big Data Technology Today(tt)
Instead of treating
memory as a cache,
why not treat it as a
primary data store?
 Facebook keeps 80% of its
data in Memory (Stanford
research)
 RAM is 100-1000x faster
than Disk (Random seek)
• Disk - 5 -10ms
• RAM – x0.001msec
20
Events
FACEBOOK
FACEBOOK
FACEBOOK
Memory Grid
Data Grid
Data Grid
Data Grid
2.Big Data Technology Today(tt)
Transfer data:
2.Big Data Technology Today(tt)
Open-source software framework from
Apache Hadoop
 Google MapReduce
 GFS (Google File System)
 HDFS
 Map/Reduce
3. SQL vs NoSQL
Data
storage
File
SQL
DBMS
NoSQL
3. SQL vs NoSQL (…)
A relational database is a set of tables
containing data fitted into predefined
categories.
Each table contains one or more data
categories in columns.
Each row contains a unique instance of
data for the categories defined by the
columns.
3. SQL vs NoSQL (…)
Key-value stores. As the name implies, a
key-value store is a system that stores
values indexed for retrieval by keys.
Some of the market
leaders:
Riak
Amazon Dynamo
Voldermort
3. SQL vs NoSQL (…)
Column-oriented databases. column-
oriented databases contain one extendable
column of closely related data
Some of the market
leaders:
HBase
Cassandra
3. SQL vs NoSQL (…)
Document-based stores. These databases
store and organize data as collections of
documents, rather than as structured tables
with uniform sized fields for each record
Some of the
market
leaders:
MongoDB
CouchDB
SimpleDB
3. SQL vs NoSQL (…)
SQL 2008 Data
storage capacity
3. SQL vs NoSQL (…)
GridFS stores files in two
collections:
 chunks stores the binary chunks. For
details, see The chunks Collection.
 files stores the file’s metadata. For
details, see The files Collection.
3. SQL vs NoSQL (…)
BSON Types
The chunks Collection
The files Collection
3. SQL vs NoSQL (…)
4. Big Data Security
• Secure computations in distributed
programming frameworks
• Security best practices for non-relational
data stores
• Secure data storage and transactions logs
• Cryptographically enforced access control
and secure communication
• Granular access control
• Real-time security/compliance monitoring
4. Big Data Security (…)
Technical Recommendations for
sercurity
• Use Kerberos for node authentication
• Use file layer encryption
• Data anonymization
• Use key management
• Deployment validation
• Use secure communication
• Tokenization
• Cloud database controls
5. Big data trends
• Big data – of the people, by the
people, for the people
• Big data and social computing
• Cloud computing
• In memmory computing
• Mobile Applications and HTML5
• Internet and big data
6. Demo with MongoDB & Ref docs
Ref docs:
 Judith Hurwitz, Alan Nugent, Dr. Fern Halper,
and Marcia Kaufman: Big Data For Dummies.
John Wiley & Sons, Inc. 2013.
 “Technology Trends for 2013” prepared by
Kaushal Amin, Chief Technology Officer, KMS
Technology – Atlanta, GA, USA
 Website: http://hadoop.apache.org/
Demo with MongoDB
Big data presentation

Contenu connexe

Tendances

Bigdata
BigdataBigdata
Big Data Analytics MIS presentation
Big Data Analytics MIS presentationBig Data Analytics MIS presentation
Big Data Analytics MIS presentation
AASTHA PANDEY
 

Tendances (20)

Bigdata
BigdataBigdata
Bigdata
 
Structuring Big Data
Structuring Big DataStructuring Big Data
Structuring Big Data
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Future of Data Strategy
Future of Data StrategyFuture of Data Strategy
Future of Data Strategy
 
Overview of Bigdata Analytics
Overview of Bigdata Analytics Overview of Bigdata Analytics
Overview of Bigdata Analytics
 
Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015
 
Big Data Analytics MIS presentation
Big Data Analytics MIS presentationBig Data Analytics MIS presentation
Big Data Analytics MIS presentation
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introduction
 
Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7
 
Big Data Overview 2013-2014
Big Data Overview 2013-2014Big Data Overview 2013-2014
Big Data Overview 2013-2014
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Creating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital TransformationCreating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital Transformation
 
Enterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum ComputingEnterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum Computing
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 

Similaire à Big data presentation

INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
Attila Barta
 
Big data & hadoop framework
Big data & hadoop frameworkBig data & hadoop framework
Big data & hadoop framework
Tu Pham
 

Similaire à Big data presentation (20)

Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data Mining
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by Sunny
 
Data mining Introduction
Data mining IntroductionData mining Introduction
Data mining Introduction
 
Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data & hadoop framework
Big data & hadoop frameworkBig data & hadoop framework
Big data & hadoop framework
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
A Gentle Introduction to Big Data
A Gentle Introduction to Big DataA Gentle Introduction to Big Data
A Gentle Introduction to Big Data
 
Hadoop HDFS.ppt
Hadoop HDFS.pptHadoop HDFS.ppt
Hadoop HDFS.ppt
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big Data
 
Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and Applications
 
NoSQL Basics - a quick tour
NoSQL Basics - a quick tourNoSQL Basics - a quick tour
NoSQL Basics - a quick tour
 
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureBig Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with Azure
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
 

Plus de Chinh Vo Wili (6)

BYOD -Bring your own device
BYOD -Bring your own deviceBYOD -Bring your own device
BYOD -Bring your own device
 
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
 
De cuong on thi mon triet hoc
De cuong on thi mon triet hocDe cuong on thi mon triet hoc
De cuong on thi mon triet hoc
 
File bao cao Wifi Robot
File bao cao Wifi RobotFile bao cao Wifi Robot
File bao cao Wifi Robot
 
Thesis final - Wifi Robot
Thesis final - Wifi RobotThesis final - Wifi Robot
Thesis final - Wifi Robot
 
Atmel avr
Atmel avrAtmel avr
Atmel avr
 

Dernier

%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Dernier (20)

Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 

Big data presentation

  • 1. Big Data Trịnh Phong Nhã Võ Hoàng Trôvi Võ Đình Chinh GVGD: TS. Nguyễn Đức Thái
  • 2. Memory storage… Computer Memory: 640K Ought to be Enough for Anyone
  • 3. How much data? 7 billion people Google processes 100 PB/day; 3 million servers Facebook has 300 PB + 500 TB/day; 35% of world’s photos YouTube 1000 PB video storage; 4 billion views/day Twitter processes 124 billion tweets/year SMS messages – 6.1T per year US Cell Calls – 2.2T minutes per year US Credit cards - 1.4B Cards; 20B transactions/year 3
  • 4. Contents 4. Big Data Security 3. SQL vs NoSQL 2. Big Data Technology Today 1. Big Data Overview 5. Big data trends 6. Demo with MongoDB & Ref docs
  • 5. 1. Big Data Overview (tt) “Big data is not a single technology but a combination of old and new tech-nologies that helps companies gain actionable insight”. (“Big Data For DummiesPublished by John Wiley & Sons, Inc. ” book reference)
  • 6. 1. Big Data Overview (tt)
  • 8. Sources of Big Data ERP RFID Website Network Switches Social Media
  • 9. Examining Big Data Types Structured Data
  • 10. Structured Data(…) Computer- or machine-generated: Machine-generated data generally refers to data that is created by a machine without human intervention. (Sensor data, Web log data, Point-of- sale data, Financial data…) Human-generated: This is data that humans, in interaction with computers, supply (Input data, Click- stream data, Gaming-related data…)
  • 11. Examining Big Data Types Unstructured Data
  • 12. Unstructured Data(…) Unstructured data is everywhere Machine-generated unstructured data: Satellite images, Scientific data, Photographs and video, Radar or sonar data…  Human-generated unstructured data:Text internal to your company, Social media data, Mobile data…
  • 14. Managing different data types Integrating data types into a big data environment need: Connectors: enable you to pull data in from various big data sources Metadata is the definitions, mappings, and other characteristics used to describe how to find, access, and use a company’s data (and software) components
  • 15. Analysis • Querying • Statistic • Modeling • Data Mining • Text analytics Analysis & Processing Processing • Data storage • Data transfer • Data monitoring What will we do with Big Data?
  • 16. Quiz….? How to store and handle Big Data?
  • 17. 2. Big Data Technology Today Storage…NoSQL Database
  • 18. 2.Big Data Technology Today(tt) Processing
  • 19. 2.Big Data Technology Today(tt)  The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.
  • 20. 2.Big Data Technology Today(tt) Instead of treating memory as a cache, why not treat it as a primary data store?  Facebook keeps 80% of its data in Memory (Stanford research)  RAM is 100-1000x faster than Disk (Random seek) • Disk - 5 -10ms • RAM – x0.001msec 20 Events FACEBOOK FACEBOOK FACEBOOK Memory Grid Data Grid Data Grid Data Grid
  • 21. 2.Big Data Technology Today(tt) Transfer data:
  • 22. 2.Big Data Technology Today(tt) Open-source software framework from Apache Hadoop  Google MapReduce  GFS (Google File System)  HDFS  Map/Reduce
  • 23. 3. SQL vs NoSQL Data storage File SQL DBMS NoSQL
  • 24. 3. SQL vs NoSQL (…) A relational database is a set of tables containing data fitted into predefined categories. Each table contains one or more data categories in columns. Each row contains a unique instance of data for the categories defined by the columns.
  • 25. 3. SQL vs NoSQL (…) Key-value stores. As the name implies, a key-value store is a system that stores values indexed for retrieval by keys. Some of the market leaders: Riak Amazon Dynamo Voldermort
  • 26. 3. SQL vs NoSQL (…) Column-oriented databases. column- oriented databases contain one extendable column of closely related data Some of the market leaders: HBase Cassandra
  • 27. 3. SQL vs NoSQL (…) Document-based stores. These databases store and organize data as collections of documents, rather than as structured tables with uniform sized fields for each record Some of the market leaders: MongoDB CouchDB SimpleDB
  • 28. 3. SQL vs NoSQL (…) SQL 2008 Data storage capacity
  • 29. 3. SQL vs NoSQL (…) GridFS stores files in two collections:  chunks stores the binary chunks. For details, see The chunks Collection.  files stores the file’s metadata. For details, see The files Collection.
  • 30. 3. SQL vs NoSQL (…) BSON Types The chunks Collection The files Collection
  • 31. 3. SQL vs NoSQL (…)
  • 32. 4. Big Data Security • Secure computations in distributed programming frameworks • Security best practices for non-relational data stores • Secure data storage and transactions logs • Cryptographically enforced access control and secure communication • Granular access control • Real-time security/compliance monitoring
  • 33. 4. Big Data Security (…) Technical Recommendations for sercurity • Use Kerberos for node authentication • Use file layer encryption • Data anonymization • Use key management • Deployment validation • Use secure communication • Tokenization • Cloud database controls
  • 34. 5. Big data trends • Big data – of the people, by the people, for the people • Big data and social computing • Cloud computing • In memmory computing • Mobile Applications and HTML5 • Internet and big data
  • 35. 6. Demo with MongoDB & Ref docs Ref docs:  Judith Hurwitz, Alan Nugent, Dr. Fern Halper, and Marcia Kaufman: Big Data For Dummies. John Wiley & Sons, Inc. 2013.  “Technology Trends for 2013” prepared by Kaushal Amin, Chief Technology Officer, KMS Technology – Atlanta, GA, USA  Website: http://hadoop.apache.org/ Demo with MongoDB