Motivation for big data

•Télécharger en tant que PPTX, PDF•

2 j'aime•4,059 vues

Arockiaraj Durairaj

A very basic introduction to big data.

Technologie

MOTIVATION FOR BIG DATA
Arockiaraj Durairaj

WHAT IS BIG DATA?
 Terra bytes(1024 GB) of data to be processed(or
analyzed).
 Giga toTerra bytes of new data generated daily

IMPLICATIONS OF BIG DATA
 Data will be spread across multiple machines.
 Data will be in different formats
 Structured
 .CSV, rdbms
 Log files

 Unstructured data
 Data extracted from web pages, email content

ISSUES
 Moving data to databases is expensive
 Daily terra bytes of data to be uploaded which is
cumbersome
 How to handle data errors?

POSSIBLE SOLUTION
 Analyze the data in the format they are
 I.e. A text file need not be uploaded into database to
analyze it.
 Thus data need not be uploaded into any system.

HOW TO ANALYZE DATA?
 The data has to be read by your code to analyze
the data.
 If the code is in different machine than the data
again huge data transfer will happen during
analysis
 This happens for every analysis

POSSIBLE SOLUTION
 Do not move the data out of the box.
 Instead move the code to the box where data
resides. The size of the code is very less when
compared to the data.
 Thus network contention problem is solved.

MAP REDUCE FRAMEWORK
 Map reduce framework implements the solution that
we saw in the previous slide

HDFS
 HDFS is very similar to a file system, except that
files are replicated to multiple machines for
availability and scalability

Contenu connexe

Tendances

Big DataSeminar Links

Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj

Chapter 1 big dataProf .Pragati Khade

Big data-pptNazir Ahmed

Big data pptOECLIB Odisha Electronics Control Library

Big Data pptVivek Gautam

introduction to NOSQL Databasenehabsairam

NPTEL BIG DATA FULL PPT BOOK WITH ASSIGNMENT SOLUTION RAJIV MISHRA IIT PATNA...SayantanRoy14

Big data by Mithlesh sadhMithlesh Sadh

NOSQL- Presentation on NoSQLRamakant Soni

Multimedia Database Avnish Patel

Presentation on Big DataMaruf Abdullah (Rion)

BIG DATA and USE CASESBhaskara Reddy Sannapureddy

Big_data_ppt Sadhana Singh

Data warehousingJuhi Mahajan

NOSQL Databases types and UsesSuvradeep Rudra

Data warehouse,data mining & Big DataRavinder Kamboj

Big datavaleri kopaleishvili

Data warehousingMohammed Bindrees , PhD

Big Data - Applications and Technologies OverviewSivashankar Ganapathy

Tendances (20)

Big Data

Big data PPT prepared by Hritika Raj (Shivalik college of engg.)

Chapter 1 big data

Big data-ppt

Big data ppt

Big Data ppt

introduction to NOSQL Database

NPTEL BIG DATA FULL PPT BOOK WITH ASSIGNMENT SOLUTION RAJIV MISHRA IIT PATNA...

Big data by Mithlesh sadh

NOSQL- Presentation on NoSQL

Multimedia Database

Presentation on Big Data

BIG DATA and USE CASES

Big_data_ppt

Data warehousing

NOSQL Databases types and Uses

Data warehouse,data mining & Big Data

Big data

Data warehousing

Big Data - Applications and Technologies Overview

En vedette

Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Geoffrey Fox

What is Big Data?Bernard Marr

Aamod_ChandraAamod Chandra

Identity Fraud Protection Using Big Data Analytics - StampedeCon 2015StampedeCon

Ten Commandments for Tackling Fraud: The Role of Big Data and Predictive Anal...CA Technologies

The Great Unknown - How can operators leverage big data to prevent future rev...cVidya Networks

Masters thesis - Fraud & Big DataStephanie Canovas

"The Impact of Data Traffic Explosion and LTE on Revenue Assurance and Risk" cVidya Networks

Webinar: Using Big Data Technology in Fraud PreventionNetGuardians

How to Leverage Big Data to Help Finding Fraud Patterns & Revenue AssurancecVidya Networks

PRODUCT DEVELOPMENT PROCESSgouravranjan27

Online Fraud Detection Using Big Data Analytics WebinarDatameer

DB9711ICT Admin

Growth motivation and positive psychologyJames Neill

DB9715ICT Admin

Hadoop BIG Data - Fraud Detection with Real-Time Analyticshkbhadraa

Medical University of South Carolina: Using Big Data and Predictive Analytics...Seeling Cheung

Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...Seeling Cheung

Web Mining guestb73ec6

Big data pptThirunavukkarasu Ps

En vedette (20)

Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...

What is Big Data?

Aamod_Chandra

Identity Fraud Protection Using Big Data Analytics - StampedeCon 2015

Ten Commandments for Tackling Fraud: The Role of Big Data and Predictive Anal...

The Great Unknown - How can operators leverage big data to prevent future rev...

Masters thesis - Fraud & Big Data

"The Impact of Data Traffic Explosion and LTE on Revenue Assurance and Risk"

Webinar: Using Big Data Technology in Fraud Prevention

How to Leverage Big Data to Help Finding Fraud Patterns & Revenue Assurance

PRODUCT DEVELOPMENT PROCESS

Online Fraud Detection Using Big Data Analytics Webinar

DB9711

Growth motivation and positive psychology

DB9715

Hadoop BIG Data - Fraud Detection with Real-Time Analytics

Medical University of South Carolina: Using Big Data and Predictive Analytics...

Fiducia & GAD IT AG: From Fraud Detection to Big Data Platform: Bringing Hado...

Web Mining

Big data ppt

Similaire à Motivation for big data

Big Data and HadoopMr. Ankit

Vikram Andem Big Data Strategy @ IATA Technology Roadmap IT Strategy Group

عصر کلان داده، چرا و چگونه؟datastack

HadoopMayuri Gupta

Big Data: An OverviewC. Scyphers

Hadoop introduction , Why and What is Hadoop ?sudhakara st

Final deckSteve Watt

Big data Hadoop presentation Shivanee garg

Big Data and HadoopFlavio Vit

Hadoop Online training by KeylabsSiva Sankar

Big data processing with apache sparksarith divakar

Big Data - Need of Converged Data PlatformGeekNightHyderabad

big data and hadoopahmed alshikh

Big data analytics: Technology's bleeding edgeBhavya Gulati

Big data and hadoop overvewKunal Khanna

A gentle introduction to the world of BigData and HadoopStefano Paluello

BIG DATAShashank Shetty

Hadoop by kamran khanKamranKhan587

Lecture 5 - Big Data and Hadoop Intro.pptalmaraniabwmalk

IJARCCE_49Mr.Sameer Kumar Das

Similaire à Motivation for big data (20)

Big Data and Hadoop

Vikram Andem Big Data Strategy @ IATA Technology Roadmap

عصر کلان داده، چرا و چگونه؟

Hadoop

Big Data: An Overview

Hadoop introduction , Why and What is Hadoop ?

Final deck

Big data Hadoop presentation

Big Data and Hadoop

Hadoop Online training by Keylabs

Big data processing with apache spark

Big Data - Need of Converged Data Platform

big data and hadoop

Big data analytics: Technology's bleeding edge

Big data and hadoop overvew

A gentle introduction to the world of BigData and Hadoop

BIG DATA

Hadoop by kamran khan

Lecture 5 - Big Data and Hadoop Intro.ppt

IJARCCE_49

Dernier

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Developing An App To Navigate The Roads of BrazilV3cube

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Dernier (20)

08448380779 Call Girls In Civil Lines Women Seeking Men

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

08448380779 Call Girls In Friends Colony Women Seeking Men

Data Cloud, More than a CDP by Matt Robison

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Handwritten Text Recognition for manuscripts and early printed texts

Developing An App To Navigate The Roads of Brazil

How to Troubleshoot Apps for the Modern Connected Worker

Boost PC performance: How more available memory can improve productivity

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Exploring the Future Potential of AI-Enabled Smartphone Processors

Motivation for big data

1. MOTIVATION FOR BIG DATA Arockiaraj Durairaj

2. WHAT IS BIG DATA?  Terra bytes(1024 GB) of data to be processed(or analyzed).  Giga toTerra bytes of new data generated daily

3. IMPLICATIONS OF BIG DATA  Data will be spread across multiple machines.  Data will be in different formats  Structured  .CSV, rdbms  Log files  Unstructured data  Data extracted from web pages, email content

4. ISSUES  Moving data to databases is expensive  Daily terra bytes of data to be uploaded which is cumbersome  How to handle data errors?

5. POSSIBLE SOLUTION  Analyze the data in the format they are  I.e. A text file need not be uploaded into database to analyze it.  Thus data need not be uploaded into any system.

6. HOW TO ANALYZE DATA?  The data has to be read by your code to analyze the data.  If the code is in different machine than the data again huge data transfer will happen during analysis  This happens for every analysis

7. POSSIBLE SOLUTION  Do not move the data out of the box.  Instead move the code to the box where data resides. The size of the code is very less when compared to the data.  Thus network contention problem is solved.

8. MAP REDUCE FRAMEWORK  Map reduce framework implements the solution that we saw in the previous slide

9. HDFS  HDFS is very similar to a file system, except that files are replicated to multiple machines for availability and scalability

10. THANKS

Motivation for big data

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

En vedette

En vedette (20)

Similaire à Motivation for big data

Similaire à Motivation for big data (20)

Dernier

Dernier (20)

Motivation for big data