SlideShare une entreprise Scribd logo
1  sur  10
Data Mining with Big
data
By: Pouya Otarod
Spring 2014
What is …… ?
• Data Mining
‣ computational process of discovering patterns in
large data sets
• Big Data
‣ it is the term for a collection of data sets so large
and complex that it becomes difficult to process
‣ data has exponential growth, both structured and
unstructured
How much Data does
exist?
• 2.5 quintillion bytes of data are created
EVERY DAY
• IBM: 90 percent of the data in the world today were
produced with past two years
• Forms of Data????
Big Data Examples
• October 4th, 2012, the first presidential debate
• Flicker and its photos
Problem…!
• Data has grown tremendously
• This large amount of data is beyond the of software
tools to manage
• Exploring the large volume of data and extracting
useful information and knowledge is a challenge,
and sometimes, it is almost infeasible
HACE Theorem
• Heterogeneous, Autonomous, Complex, Evolving
• Big data starts with large volume, heterogeneous,
autonomous sources with distributed and
decentralized control, and seeks to explore
complex and evolving relationships among data
• These are characteristics of Big Data
• This is theorem to model Big Data characteristics
• Huge Data with heterogeneous and diverse
dimensionality
‣ represent huge volume of data
• Autonomous sources with distributed and
decentralized control
‣ main characteristics of Big Data
• Complex and evolving relationships
Data Mining Challenges with Big
Data
• Big Data Mining Platform
• Dig Data Semantics and Application Knowledge
I. Information Sharing and Data Privacy
II. Domain and Application Knowledge
• Big Data Mining Algorithm
I. Local Learning and Model Fusion for Multiple
Information Sources
II. mining from Sparse, Uncertain, and Incomplete Data
III. Mining Complex and Dynamic Data
Thanks for you
attentions !

Contenu connexe

Tendances

Big data
Big dataBig data
Big data
hsn99
 

Tendances (20)

Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Big Data
Big DataBig Data
Big Data
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data
Big dataBig data
Big data
 
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital ForensicsBig Data in Distributed Analytics,Cybersecurity And Digital Forensics
Big Data in Distributed Analytics,Cybersecurity And Digital Forensics
 
Big data
Big dataBig data
Big data
 
Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 
Big data
Big dataBig data
Big data
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Big data
Big dataBig data
Big data
 
Data mining & big data presentation 01
Data mining & big data presentation 01Data mining & big data presentation 01
Data mining & big data presentation 01
 
Big Data
Big DataBig Data
Big Data
 
Big data
Big dataBig data
Big data
 
Introduction to Big Data & Big Data 1.0 System
Introduction to Big Data & Big Data 1.0 SystemIntroduction to Big Data & Big Data 1.0 System
Introduction to Big Data & Big Data 1.0 System
 
Data Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research OpportunitiesData Mining and Big Data Challenges and Research Opportunities
Data Mining and Big Data Challenges and Research Opportunities
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
 

Similaire à Data mining with big data

Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
Thinkful
 
Business Analytics and Data mining.pdf
Business Analytics and Data mining.pdfBusiness Analytics and Data mining.pdf
Business Analytics and Data mining.pdf
ssuser0413ec
 
Big Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptxBig Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptx
varun453331
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
HASHEMHASH
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
Thinkful
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
Thinkful
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
Manish Chopra
 

Similaire à Data mining with big data (20)

Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
BigData.pptx
BigData.pptxBigData.pptx
BigData.pptx
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Business Analytics and Data mining.pdf
Business Analytics and Data mining.pdfBusiness Analytics and Data mining.pdf
Business Analytics and Data mining.pdf
 
Big data
Big dataBig data
Big data
 
Big data Mining
Big data MiningBig data Mining
Big data Mining
 
Privacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social WebPrivacy, Ethics, and Future Uses of the Social Web
Privacy, Ethics, and Future Uses of the Social Web
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Big Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptxBig Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptx
 
Big data
Big dataBig data
Big data
 
Bigdata
BigdataBigdata
Bigdata
 
Introduction to big data for the EA course at Solvay MBA
Introduction to big data for the EA course at Solvay MBAIntroduction to big data for the EA course at Solvay MBA
Introduction to big data for the EA course at Solvay MBA
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Big Data & the importance of Data Science
Big Data & the importance of Data ScienceBig Data & the importance of Data Science
Big Data & the importance of Data Science
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 

Data mining with big data

  • 1. Data Mining with Big data By: Pouya Otarod Spring 2014
  • 2. What is …… ? • Data Mining ‣ computational process of discovering patterns in large data sets • Big Data ‣ it is the term for a collection of data sets so large and complex that it becomes difficult to process ‣ data has exponential growth, both structured and unstructured
  • 3. How much Data does exist? • 2.5 quintillion bytes of data are created EVERY DAY • IBM: 90 percent of the data in the world today were produced with past two years • Forms of Data????
  • 4. Big Data Examples • October 4th, 2012, the first presidential debate • Flicker and its photos
  • 5. Problem…! • Data has grown tremendously • This large amount of data is beyond the of software tools to manage • Exploring the large volume of data and extracting useful information and knowledge is a challenge, and sometimes, it is almost infeasible
  • 6. HACE Theorem • Heterogeneous, Autonomous, Complex, Evolving • Big data starts with large volume, heterogeneous, autonomous sources with distributed and decentralized control, and seeks to explore complex and evolving relationships among data • These are characteristics of Big Data • This is theorem to model Big Data characteristics
  • 7.
  • 8. • Huge Data with heterogeneous and diverse dimensionality ‣ represent huge volume of data • Autonomous sources with distributed and decentralized control ‣ main characteristics of Big Data • Complex and evolving relationships
  • 9. Data Mining Challenges with Big Data • Big Data Mining Platform • Dig Data Semantics and Application Knowledge I. Information Sharing and Data Privacy II. Domain and Application Knowledge • Big Data Mining Algorithm I. Local Learning and Model Fusion for Multiple Information Sources II. mining from Sparse, Uncertain, and Incomplete Data III. Mining Complex and Dynamic Data