SlideShare une entreprise Scribd logo
1  sur  36
Big Data
Trịnh Phong Nhã
Võ Hoàng Trôvi
Võ Đình Chinh
GVGD: TS. Nguyễn Đức Thái
Memory storage…
Computer Memory: 640K Ought to
be Enough for Anyone
How much data?
7 billion people
Google processes 100 PB/day; 3 million servers
Facebook has 300 PB + 500 TB/day; 35% of
world’s photos
YouTube 1000 PB video storage; 4 billion
views/day
Twitter processes 124 billion tweets/year
SMS messages – 6.1T per year
US Cell Calls – 2.2T minutes per year
US Credit cards - 1.4B Cards; 20B
transactions/year
3
Contents
4. Big Data Security
3. SQL vs NoSQL
2. Big Data Technology Today
1. Big Data Overview
5. Big data trends
6. Demo with MongoDB & Ref docs
1. Big Data Overview (tt)
“Big data is not a single technology
but a combination of old and new
tech-nologies that helps companies
gain actionable insight”.
(“Big Data For DummiesPublished by John Wiley & Sons,
Inc. ” book reference)
1. Big Data Overview (tt)
Characteristics of Big Data
Sources of Big Data
ERP
RFID
Website
Network Switches
Social Media
Examining Big Data Types
Structured Data
Structured Data(…)
Computer- or machine-generated:
Machine-generated data generally
refers to data that is created by a
machine without human intervention.
(Sensor data, Web log data, Point-of-
sale data, Financial data…)
Human-generated: This is data that
humans, in interaction with
computers, supply (Input data, Click-
stream data, Gaming-related data…)
Examining Big Data Types
Unstructured Data
Unstructured Data(…)
Unstructured data is everywhere
Machine-generated unstructured
data: Satellite images, Scientific
data, Photographs and video, Radar
or sonar data…
 Human-generated unstructured
data:Text internal to your company,
Social media data, Mobile data…
Managing different data types
Managing different data types
Integrating data types into a big data
environment need:
Connectors: enable you to pull data
in from various big data sources
Metadata is the definitions,
mappings, and other characteristics
used to describe how to find, access,
and use a company’s data (and
software) components
Analysis
• Querying
• Statistic
• Modeling
• Data Mining
• Text analytics
Analysis &
Processing
Processing
• Data storage
• Data transfer
• Data monitoring
What will we do with Big Data?
Quiz….?
How to store and
handle Big Data?
2. Big Data Technology Today
Storage…NoSQL Database
2.Big Data Technology Today(tt)
Processing
2.Big Data Technology Today(tt)
 The Apache Hadoop software library is a
framework that allows for the distributed
processing of large data sets across clusters of
computers using simple programming models.
2.Big Data Technology Today(tt)
Instead of treating
memory as a cache,
why not treat it as a
primary data store?
 Facebook keeps 80% of its
data in Memory (Stanford
research)
 RAM is 100-1000x faster
than Disk (Random seek)
• Disk - 5 -10ms
• RAM – x0.001msec
20
Events
FACEBOOK
FACEBOOK
FACEBOOK
Memory Grid
Data Grid
Data Grid
Data Grid
2.Big Data Technology Today(tt)
Transfer data:
2.Big Data Technology Today(tt)
Open-source software framework from
Apache Hadoop
 Google MapReduce
 GFS (Google File System)
 HDFS
 Map/Reduce
3. SQL vs NoSQL
Data
storage
File
SQL
DBMS
NoSQL
3. SQL vs NoSQL (…)
A relational database is a set of tables
containing data fitted into predefined
categories.
Each table contains one or more data
categories in columns.
Each row contains a unique instance of
data for the categories defined by the
columns.
3. SQL vs NoSQL (…)
Key-value stores. As the name implies, a
key-value store is a system that stores
values indexed for retrieval by keys.
Some of the market
leaders:
Riak
Amazon Dynamo
Voldermort
3. SQL vs NoSQL (…)
Column-oriented databases. column-
oriented databases contain one extendable
column of closely related data
Some of the market
leaders:
HBase
Cassandra
3. SQL vs NoSQL (…)
Document-based stores. These databases
store and organize data as collections of
documents, rather than as structured tables
with uniform sized fields for each record
Some of the
market
leaders:
MongoDB
CouchDB
SimpleDB
3. SQL vs NoSQL (…)
SQL 2008 Data
storage capacity
3. SQL vs NoSQL (…)
GridFS stores files in two
collections:
 chunks stores the binary chunks. For
details, see The chunks Collection.
 files stores the file’s metadata. For
details, see The files Collection.
3. SQL vs NoSQL (…)
BSON Types
The chunks Collection
The files Collection
3. SQL vs NoSQL (…)
4. Big Data Security
• Secure computations in distributed
programming frameworks
• Security best practices for non-relational
data stores
• Secure data storage and transactions logs
• Cryptographically enforced access control
and secure communication
• Granular access control
• Real-time security/compliance monitoring
4. Big Data Security (…)
Technical Recommendations for
sercurity
• Use Kerberos for node authentication
• Use file layer encryption
• Data anonymization
• Use key management
• Deployment validation
• Use secure communication
• Tokenization
• Cloud database controls
5. Big data trends
• Big data – of the people, by the
people, for the people
• Big data and social computing
• Cloud computing
• In memmory computing
• Mobile Applications and HTML5
• Internet and big data
6. Demo with MongoDB & Ref docs
Ref docs:
 Judith Hurwitz, Alan Nugent, Dr. Fern Halper,
and Marcia Kaufman: Big Data For Dummies.
John Wiley & Sons, Inc. 2013.
 “Technology Trends for 2013” prepared by
Kaushal Amin, Chief Technology Officer, KMS
Technology – Atlanta, GA, USA
 Website: http://hadoop.apache.org/
Demo with MongoDB
Big data presentation

Contenu connexe

Tendances

Structuring Big Data
Structuring Big DataStructuring Big Data
Structuring Big DataFujitsu UK
 
Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0Denodo
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
Future of Data Strategy
Future of Data StrategyFuture of Data Strategy
Future of Data StrategyDenodo
 
Big Data Analytics MIS presentation
Big Data Analytics MIS presentationBig Data Analytics MIS presentation
Big Data Analytics MIS presentationAASTHA PANDEY
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introductionDenodo
 
Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7Denodo
 
Big Data Overview 2013-2014
Big Data Overview 2013-2014Big Data Overview 2013-2014
Big Data Overview 2013-2014KMS Technology
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data AnalyticsTUSHAR GARG
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroDenodo
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
Creating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital TransformationCreating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital TransformationMongoDB
 
Enterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum ComputingEnterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum ComputingKnowledgent
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview pptVIKAS KATARE
 

Tendances (20)

Bigdata
BigdataBigdata
Bigdata
 
Structuring Big Data
Structuring Big DataStructuring Big Data
Structuring Big Data
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0Technical Demonstration - Denodo Platform 7.0
Technical Demonstration - Denodo Platform 7.0
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Future of Data Strategy
Future of Data StrategyFuture of Data Strategy
Future of Data Strategy
 
Overview of Bigdata Analytics
Overview of Bigdata Analytics Overview of Bigdata Analytics
Overview of Bigdata Analytics
 
Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015
 
Big Data Analytics MIS presentation
Big Data Analytics MIS presentationBig Data Analytics MIS presentation
Big Data Analytics MIS presentation
 
Data virtualization an introduction
Data virtualization an introductionData virtualization an introduction
Data virtualization an introduction
 
Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7Take your Data Management Practice to the Next Level with Denodo 7
Take your Data Management Practice to the Next Level with Denodo 7
 
Big Data Overview 2013-2014
Big Data Overview 2013-2014Big Data Overview 2013-2014
Big Data Overview 2013-2014
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Creating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital TransformationCreating a Modern Data Architecture for Digital Transformation
Creating a Modern Data Architecture for Digital Transformation
 
Enterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum ComputingEnterprise Architecture in the Era of Big Data and Quantum Computing
Enterprise Architecture in the Era of Big Data and Quantum Computing
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 

Similaire à Big data presentation

Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data MiningRanak Ghosh
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAmdocs
 
INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_publicAttila Barta
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by SunnyDignitasDigital1
 
Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Trieu Nguyen
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big DataFrank Kienle
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataVipin Batra
 
Big data & hadoop framework
Big data & hadoop frameworkBig data & hadoop framework
Big data & hadoop frameworkTu Pham
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Scienceijtsrd
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...i_scienceEU
 
A Gentle Introduction to Big Data
A Gentle Introduction to Big DataA Gentle Introduction to Big Data
A Gentle Introduction to Big DataMehmet Ali Akyol
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big DataArjen de Vries
 
Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsIJMER
 
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureBig Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureChristos Charmatzis
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseStavros Papadopoulos
 

Similaire à Big data presentation (20)

Data Warehouse and Data Mining
Data Warehouse and Data MiningData Warehouse and Data Mining
Data Warehouse and Data Mining
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
INF2190_W1_2016_public
INF2190_W1_2016_publicINF2190_W1_2016_public
INF2190_W1_2016_public
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by Sunny
 
Data mining Introduction
Data mining IntroductionData mining Introduction
Data mining Introduction
 
Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)Concepts, use cases and principles to build big data systems (1)
Concepts, use cases and principles to build big data systems (1)
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data & hadoop framework
Big data & hadoop frameworkBig data & hadoop framework
Big data & hadoop framework
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
A Gentle Introduction to Big Data
A Gentle Introduction to Big DataA Gentle Introduction to Big Data
A Gentle Introduction to Big Data
 
Hadoop HDFS.ppt
Hadoop HDFS.pptHadoop HDFS.ppt
Hadoop HDFS.ppt
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big Data
 
Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and Applications
 
NoSQL Basics - a quick tour
NoSQL Basics - a quick tourNoSQL Basics - a quick tour
NoSQL Basics - a quick tour
 
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureBig Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with Azure
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
 

Plus de Chinh Vo Wili

BYOD -Bring your own device
BYOD -Bring your own deviceBYOD -Bring your own device
BYOD -Bring your own deviceChinh Vo Wili
 
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...Chinh Vo Wili
 
De cuong on thi mon triet hoc
De cuong on thi mon triet hocDe cuong on thi mon triet hoc
De cuong on thi mon triet hocChinh Vo Wili
 
File bao cao Wifi Robot
File bao cao Wifi RobotFile bao cao Wifi Robot
File bao cao Wifi RobotChinh Vo Wili
 
Thesis final - Wifi Robot
Thesis final - Wifi RobotThesis final - Wifi Robot
Thesis final - Wifi RobotChinh Vo Wili
 

Plus de Chinh Vo Wili (6)

BYOD -Bring your own device
BYOD -Bring your own deviceBYOD -Bring your own device
BYOD -Bring your own device
 
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...Tieu luan triet hoc -  Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
Tieu luan triet hoc - Phan tich tu tuong nhan sinh quan trong mot so đieu ra...
 
De cuong on thi mon triet hoc
De cuong on thi mon triet hocDe cuong on thi mon triet hoc
De cuong on thi mon triet hoc
 
File bao cao Wifi Robot
File bao cao Wifi RobotFile bao cao Wifi Robot
File bao cao Wifi Robot
 
Thesis final - Wifi Robot
Thesis final - Wifi RobotThesis final - Wifi Robot
Thesis final - Wifi Robot
 
Atmel avr
Atmel avrAtmel avr
Atmel avr
 

Dernier

Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 

Dernier (20)

Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 

Big data presentation

  • 1. Big Data Trịnh Phong Nhã Võ Hoàng Trôvi Võ Đình Chinh GVGD: TS. Nguyễn Đức Thái
  • 2. Memory storage… Computer Memory: 640K Ought to be Enough for Anyone
  • 3. How much data? 7 billion people Google processes 100 PB/day; 3 million servers Facebook has 300 PB + 500 TB/day; 35% of world’s photos YouTube 1000 PB video storage; 4 billion views/day Twitter processes 124 billion tweets/year SMS messages – 6.1T per year US Cell Calls – 2.2T minutes per year US Credit cards - 1.4B Cards; 20B transactions/year 3
  • 4. Contents 4. Big Data Security 3. SQL vs NoSQL 2. Big Data Technology Today 1. Big Data Overview 5. Big data trends 6. Demo with MongoDB & Ref docs
  • 5. 1. Big Data Overview (tt) “Big data is not a single technology but a combination of old and new tech-nologies that helps companies gain actionable insight”. (“Big Data For DummiesPublished by John Wiley & Sons, Inc. ” book reference)
  • 6. 1. Big Data Overview (tt)
  • 8. Sources of Big Data ERP RFID Website Network Switches Social Media
  • 9. Examining Big Data Types Structured Data
  • 10. Structured Data(…) Computer- or machine-generated: Machine-generated data generally refers to data that is created by a machine without human intervention. (Sensor data, Web log data, Point-of- sale data, Financial data…) Human-generated: This is data that humans, in interaction with computers, supply (Input data, Click- stream data, Gaming-related data…)
  • 11. Examining Big Data Types Unstructured Data
  • 12. Unstructured Data(…) Unstructured data is everywhere Machine-generated unstructured data: Satellite images, Scientific data, Photographs and video, Radar or sonar data…  Human-generated unstructured data:Text internal to your company, Social media data, Mobile data…
  • 14. Managing different data types Integrating data types into a big data environment need: Connectors: enable you to pull data in from various big data sources Metadata is the definitions, mappings, and other characteristics used to describe how to find, access, and use a company’s data (and software) components
  • 15. Analysis • Querying • Statistic • Modeling • Data Mining • Text analytics Analysis & Processing Processing • Data storage • Data transfer • Data monitoring What will we do with Big Data?
  • 16. Quiz….? How to store and handle Big Data?
  • 17. 2. Big Data Technology Today Storage…NoSQL Database
  • 18. 2.Big Data Technology Today(tt) Processing
  • 19. 2.Big Data Technology Today(tt)  The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.
  • 20. 2.Big Data Technology Today(tt) Instead of treating memory as a cache, why not treat it as a primary data store?  Facebook keeps 80% of its data in Memory (Stanford research)  RAM is 100-1000x faster than Disk (Random seek) • Disk - 5 -10ms • RAM – x0.001msec 20 Events FACEBOOK FACEBOOK FACEBOOK Memory Grid Data Grid Data Grid Data Grid
  • 21. 2.Big Data Technology Today(tt) Transfer data:
  • 22. 2.Big Data Technology Today(tt) Open-source software framework from Apache Hadoop  Google MapReduce  GFS (Google File System)  HDFS  Map/Reduce
  • 23. 3. SQL vs NoSQL Data storage File SQL DBMS NoSQL
  • 24. 3. SQL vs NoSQL (…) A relational database is a set of tables containing data fitted into predefined categories. Each table contains one or more data categories in columns. Each row contains a unique instance of data for the categories defined by the columns.
  • 25. 3. SQL vs NoSQL (…) Key-value stores. As the name implies, a key-value store is a system that stores values indexed for retrieval by keys. Some of the market leaders: Riak Amazon Dynamo Voldermort
  • 26. 3. SQL vs NoSQL (…) Column-oriented databases. column- oriented databases contain one extendable column of closely related data Some of the market leaders: HBase Cassandra
  • 27. 3. SQL vs NoSQL (…) Document-based stores. These databases store and organize data as collections of documents, rather than as structured tables with uniform sized fields for each record Some of the market leaders: MongoDB CouchDB SimpleDB
  • 28. 3. SQL vs NoSQL (…) SQL 2008 Data storage capacity
  • 29. 3. SQL vs NoSQL (…) GridFS stores files in two collections:  chunks stores the binary chunks. For details, see The chunks Collection.  files stores the file’s metadata. For details, see The files Collection.
  • 30. 3. SQL vs NoSQL (…) BSON Types The chunks Collection The files Collection
  • 31. 3. SQL vs NoSQL (…)
  • 32. 4. Big Data Security • Secure computations in distributed programming frameworks • Security best practices for non-relational data stores • Secure data storage and transactions logs • Cryptographically enforced access control and secure communication • Granular access control • Real-time security/compliance monitoring
  • 33. 4. Big Data Security (…) Technical Recommendations for sercurity • Use Kerberos for node authentication • Use file layer encryption • Data anonymization • Use key management • Deployment validation • Use secure communication • Tokenization • Cloud database controls
  • 34. 5. Big data trends • Big data – of the people, by the people, for the people • Big data and social computing • Cloud computing • In memmory computing • Mobile Applications and HTML5 • Internet and big data
  • 35. 6. Demo with MongoDB & Ref docs Ref docs:  Judith Hurwitz, Alan Nugent, Dr. Fern Halper, and Marcia Kaufman: Big Data For Dummies. John Wiley & Sons, Inc. 2013.  “Technology Trends for 2013” prepared by Kaushal Amin, Chief Technology Officer, KMS Technology – Atlanta, GA, USA  Website: http://hadoop.apache.org/ Demo with MongoDB