Amarjit Kaur
Samiksha Sharma
• What is data mining?
• Why data mining?
• Data mining as a necessity
• Evolution of databases
• Origin of data mining
• Data mining: a KDD process
• Applications
• Management areas
• Examples
• Techniques
• Extraction of implicit, previously unknown and potentially useful information from data
• Exploration and analysis, by automatic and semi-automatic means, of large quantities of data in order to discover meaningful patterns
• Lots of data is being collected and warehoused
  • Web data, e-commerce
  • Purchases at department stores and grocery stores
  • Bank and credit card transactions
• Computers have become cheaper and more powerful
• Competitive pressure is strong
  • Provide better, customized services for an edge (e.g. in customer relationship management)
DATA MINING AS A NECESSITY
DATA EXPLOSION PROBLEM
• Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories
• We are drowning in data, but starving for knowledge!
• Solution: data mining
  • Extraction of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases
• 1960s: Data collection, database creation, IMS and network DBMS
• 1970s: Relational data model, relational DBMS implementation
• 1980s: RDBMS, advanced data models (extended relational, OO, deductive) and application-oriented DBMS (spatial, scientific, engineering, etc.)
• 1990s-2000s: Data mining and data warehousing, multimedia databases, and web databases
• The term “data mining” was introduced in the 1990s. Its roots can be traced back along three family lines:
  • Classical statistics
  • Artificial intelligence
  • Machine learning
• Statistics is the foundation of most of the technologies on which data mining is built, e.g. regression analysis, standard deviation, etc. All of these are used to study data and data relationships.
• Artificial intelligence, which is built on heuristics as opposed to statistics, attempts to apply human-thought-like processing to statistical problems. Some AI concepts were adopted in commercial products, e.g. query optimization modules in RDBMS.
• Machine learning is the union of statistics and AI.
• Data mining therefore uses AI and statistical approaches together. It blends AI heuristics with advanced statistical analysis to study data and find previously hidden trends or patterns within a company's data, using fundamental statistical concepts and adding more advanced AI algorithms to achieve the goal.
• Database analysis and decision support
  • Market analysis and management
    • Target marketing, customer relationship management, market basket analysis, cross-selling, market segmentation
  • Risk analysis and management
    • Forecasting, customer retention, improved underwriting, quality control, competitive analysis
  • Fraud detection and management
• Other applications
  • Text mining (newsgroups, email, documents) and web analysis
  • Intelligent query answering
• Sports
  • IBM Advanced Scout analyzed NBA game statistics to gain a competitive advantage for the New York Knicks and Miami Heat
• Astronomy
  • JPL and the Palomar Observatory discovered 22 quasars with the help of data mining
• Internet Web Surf-Aid
  • IBM Surf-Aid applies data mining algorithms to web access logs for market-related pages to discover customer preference and behaviour pages, analyze the effectiveness of web marketing, improve web site organization, etc.
• Cross-market analysis
  • Associations and correlations between product sales
  • Prediction based on the association information
• Customer profiling
  • Data mining can tell you what types of customers buy what products
• Identifying customer requirements
  • Identifying the best products for different customers
  • Use prediction to find what factors will attract new customers
• Provides summary information
  • Various multidimensional summary reports
  • Statistical summary information
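The association idea behind cross-market analysis can be illustrated with a minimal sketch. It counts how often pairs of items are bought together and reports support and confidence for each rule. The baskets, item names, the `pair_rules` helper and the `min_support` threshold are all hypothetical, chosen only for illustration; real market basket analysis typically uses an algorithm such as Apriori over much larger data.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy baskets; item names are illustrative only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]

def pair_rules(baskets, min_support=0.4):
    """Return (antecedent, consequent, support, confidence) for item pairs."""
    n = len(baskets)
    item_counts = Counter(item for b in baskets for item in b)
    pair_counts = Counter(
        pair for b in baskets for pair in combinations(sorted(b), 2)
    )
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of baskets containing both items
        if support < min_support:
            continue
        # confidence(a -> b) = count(a and b) / count(a)
        rules.append((a, b, support, count / item_counts[a]))
        rules.append((b, a, support, count / item_counts[b]))
    # Highest confidence first; names break ties so the order is stable.
    return sorted(rules, key=lambda r: (-r[3], r[0], r[1]))

rules = pair_rules(baskets)
for a, b, support, confidence in rules:
    print(f"{a} -> {b}: support={support:.2f}, confidence={confidence:.2f}")
```

A rule such as "butter -> bread" with high confidence suggests that customers who buy the first item usually buy the second, which is the kind of association that cross-selling relies on.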
• Finance planning and asset evaluation
  i. Cash flow analysis and prediction
  ii. Contingent claim analysis to evaluate assets
  iii. Cross-sectional and time series analysis
• Resource planning
  i. Summarize and compare the resources and spending
• Competition
  i. Monitor competitors and market directions
  ii. Set pricing strategy in the market
  iii. Group customers into classes
• Applications
  • Widely used in health care, retail, credit card services, telecommunications, etc.
• Approach
  • Use historical data to build models of fraudulent behaviour and use data mining to help identify similar instances
• Examples
  • Auto insurance: detect a group of people who stage accidents to collect on insurance
  • Money laundering: detect suspicious money transactions
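The approach described above (build a model from historical data, then look for similar instances) can be sketched as a nearest-centroid comparison. Everything here is an assumption made for illustration: the two features, the sample records, and the helper names `build_model` and `looks_fraudulent`. A production system would scale the features and use a properly validated classifier.

```python
from math import dist
from statistics import mean

# Hypothetical historical records: (amount, transactions_in_last_hour, is_fraud).
history = [
    (25.0, 1, False),
    (40.0, 2, False),
    (18.0, 1, False),
    (60.0, 1, False),
    (900.0, 8, True),
    (1200.0, 10, True),
    (750.0, 6, True),
]

def centroid(rows):
    """Mean feature vector of a list of (amount, count) rows."""
    return tuple(mean(col) for col in zip(*rows))

def build_model(history):
    """Summarize past fraudulent and legitimate records by their centroids."""
    fraud = [r[:2] for r in history if r[2]]
    legit = [r[:2] for r in history if not r[2]]
    return {"fraud": centroid(fraud), "legit": centroid(legit)}

def looks_fraudulent(model, record):
    """Flag a record that lies closer to the fraud centroid than to the legitimate one."""
    return dist(record, model["fraud"]) < dist(record, model["legit"])

model = build_model(history)
print(looks_fraudulent(model, (1000.0, 9)))  # resembles past fraud cases
print(looks_fraudulent(model, (30.0, 1)))    # resembles ordinary purchases
```

Because the features are not scaled, the amount dominates the distance in this sketch; that is acceptable for a toy example but not for real data.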
Techniques
CLASSIFICATION
ASSOCIATION
SEQUENCE
CLUSTER
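As one illustration of the techniques listed, clustering groups records without predefined labels. The sketch below is a plain k-means on one-dimensional values; the `kmeans_1d` function, the spending figures and the starting centers are hypothetical and chosen only to keep the example short.

```python
from statistics import mean

def kmeans_1d(values, centers, iterations=10):
    """Plain k-means on numbers, starting from the given centers."""
    centers = list(centers)
    clusters = []
    for _ in range(iterations):
        # Assignment step: put each value in the cluster of its nearest center.
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Hypothetical monthly spend per customer.
spend = [12, 15, 14, 80, 85, 90, 300, 310]
centers, clusters = kmeans_1d(spend, centers=[0, 100, 400])
print(centers)
print(clusters)
```

The three resulting groups could be read as low, medium and high spenders, which ties back to the earlier point about grouping customers into classes.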
Information Technology Data Mining
