SlideShare une entreprise Scribd logo
1  sur  13
DATA MINING
› 05 Angela Mary Binoy
› 06 Annies Minu SathiyaSeelan
Index
›What is Data Mining?
›Architecture.
›KDD process.
What is Data Mining?
•Data mining refers to extracting or “mining” knowledge from
large amounts of data.
•Data mining field brings together techniques from learning ,
pattern recognition , statistics , databases and visualization to
deal with the issues of information extraction from large data
bases.
•Data mining field finds its application in market analysis and
management like for e.g. customer relationship management
, cross selling, market segmentation.
ARCHITECTURE OF DATA MINING
Architecture of a typical data mining system may have the following major
components:
1) Database , Data warehouse , World Wide Web:
- This is one or set of databases, data warehouses, spreadsheets or other kind
of information repositories. Data cleaning and data integration techniques
may be performed.
2) Databases or Data warehouse Server:
- It is responsible for fetching the relevant data, based on the user’s
requirement needed for data mining.
3) Knowledge base:
- This is domain knowledge that is used to guide the search , and gives
interesting and hidden patterns from data. Such knowledge can include concept
hierarchies, used to organize attribute or attribute values into different levels of
abstraction.
-Knowledge such as user beliefs, which can be used to asses a pattern’s
interestingness based on it’s unexpectedness may also be included
-Other example are constraints, threshold & metadata.
4) Data Mining Engine:
- This is essential to the data mining system & ideally consists of a set of
functional modules for tasks such as characterization, association & correlation
analysis, classification, prediction, cluster analysis, outlier analysis & evolution
analysis.
5) Pattern Evaluation Module:
- It is integrated with the mining module and it gives the search
of only the interesting patterns.
6) Graphical User Interface:
- Used to communicate between users and the data mining
system, allowing the users to interact with the system by
specifying a data mining query or task, & performing exploratory
data mining based on the intermediate data mining results.
-This component allows the user to browse database or data
warehouse schemas or data structures, evaluate mined patterns,
& visualize the patterns in different forms.
Knowledge Discovery Data(KDD)
•The unifying goal of the KDD process is to extract knowledge from
Data in the context of large databases .
•It consists of an iterative sequence of the following steps:
1) Data Cleaning:
-To remove noise and inconsistent data.
2) Data Integration:
-Combining multiple data sources.
3) Data Selection:
-Data relevant to the analysis task are retrieved from the
database.
4) Data Transformation:
- Data are transformed into forms appropriate for mining by
performing summary or aggregation operations, for instance.
5) Data Mining:
- An essential process where intelligent methods are applied in
order to extract data patterns.
6) Pattern Evalution:
-To identify the truly interesting patterns representing knowledge
base on some interestingness measures.
7) Knowledge Presentation:
- Visualization and knowledge representation techniques are used
to present the mined knowledge to the user.
Steps 1 to 4 are different forms of data preprocessing, where the
data are prepared for mining.
-The data mining step may interact with the user or knowledge
base.
-The interesting patterns are represented to the user & may be
stored as a new knowledge in the knowledge base.
-Data mining is only step which is more essential because it
uncovers hidden patterns for evaluation.
› KDD and Data Mining are not same thing.
› KDD is the overall process of discovering useful
knowledge from data whereas Data Mining is only one
step in the KDD process.
› KDD is the nontrivial process of identifying valid ,
potentially useful and ultimately understandable
patterns in data and Data Mining is an application of
specific algorithms for extracting patterns for data.
How does KDD defer from Data Mining:
THANK YOU!

Contenu connexe

Tendances

Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
Slideshare
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
pcherukumalla
 

Tendances (20)

introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
web mining
web miningweb mining
web mining
 
3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
 
data mining
data miningdata mining
data mining
 
Data mining
Data miningData mining
Data mining
 
Outlier Detection
Outlier DetectionOutlier Detection
Outlier Detection
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
 
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 5 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
 
1.2 steps and functionalities
1.2 steps and functionalities1.2 steps and functionalities
1.2 steps and functionalities
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
 
Data Mining
Data MiningData Mining
Data Mining
 
Web mining
Web mining Web mining
Web mining
 
Data Mining: Data processing
Data Mining: Data processingData Mining: Data processing
Data Mining: Data processing
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 

Similaire à Data mining

Similaire à Data mining (20)

DM-Unit-1-Part 1-R.pdf
DM-Unit-1-Part 1-R.pdfDM-Unit-1-Part 1-R.pdf
DM-Unit-1-Part 1-R.pdf
 
Dma unit 1
Dma unit   1Dma unit   1
Dma unit 1
 
dwdm unit 1.ppt
dwdm unit 1.pptdwdm unit 1.ppt
dwdm unit 1.ppt
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
Data Mining and Knowledge
Data Mining and KnowledgeData Mining and Knowledge
Data Mining and Knowledge
 
Data mining basic concept and Data warehousing
Data mining basic concept and Data warehousingData mining basic concept and Data warehousing
Data mining basic concept and Data warehousing
 
2 introductory slides
2 introductory slides2 introductory slides
2 introductory slides
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
Data mining and business intelligence
Data mining and business intelligenceData mining and business intelligence
Data mining and business intelligence
 
6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining
 
17 cs002
17 cs00217 cs002
17 cs002
 
Business Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptxBusiness Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptx
 
Datamininglecture
DatamininglectureDatamininglecture
Datamininglecture
 
BAS 250 Lecture 1
BAS 250 Lecture 1BAS 250 Lecture 1
BAS 250 Lecture 1
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
 
Data mining
Data miningData mining
Data mining
 
A review on data mining
A  review on data miningA  review on data mining
A review on data mining
 

Dernier

FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Dernier (20)

Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 

Data mining

  • 1. DATA MINING › 05 Angela Mary Binoy › 06 Annies Minu SathiyaSeelan
  • 2. Index ›What is Data Mining? ›Architecture. ›KDD process.
  • 3. What is Data Mining? •Data mining refers to extracting or “mining” knowledge from large amounts of data. •Data mining field brings together techniques from learning , pattern recognition , statistics , databases and visualization to deal with the issues of information extraction from large data bases. •Data mining field finds its application in market analysis and management like for e.g. customer relationship management , cross selling, market segmentation.
  • 4. ARCHITECTURE OF DATA MINING Architecture of a typical data mining system may have the following major components: 1) Database , Data warehouse , World Wide Web: - This is one or set of databases, data warehouses, spreadsheets or other kind of information repositories. Data cleaning and data integration techniques may be performed. 2) Databases or Data warehouse Server: - It is responsible for fetching the relevant data, based on the user’s requirement needed for data mining.
  • 5.
  • 6. 3) Knowledge base: - This is domain knowledge that is used to guide the search , and gives interesting and hidden patterns from data. Such knowledge can include concept hierarchies, used to organize attribute or attribute values into different levels of abstraction. -Knowledge such as user beliefs, which can be used to asses a pattern’s interestingness based on it’s unexpectedness may also be included -Other example are constraints, threshold & metadata. 4) Data Mining Engine: - This is essential to the data mining system & ideally consists of a set of functional modules for tasks such as characterization, association & correlation analysis, classification, prediction, cluster analysis, outlier analysis & evolution analysis.
  • 7. 5) Pattern Evaluation Module: - It is integrated with the mining module and it gives the search of only the interesting patterns. 6) Graphical User Interface: - Used to communicate between users and the data mining system, allowing the users to interact with the system by specifying a data mining query or task, & performing exploratory data mining based on the intermediate data mining results. -This component allows the user to browse database or data warehouse schemas or data structures, evaluate mined patterns, & visualize the patterns in different forms.
  • 8. Knowledge Discovery Data(KDD) •The unifying goal of the KDD process is to extract knowledge from Data in the context of large databases . •It consists of an iterative sequence of the following steps: 1) Data Cleaning: -To remove noise and inconsistent data. 2) Data Integration: -Combining multiple data sources. 3) Data Selection: -Data relevant to the analysis task are retrieved from the database.
  • 9.
  • 10. 4) Data Transformation: - Data are transformed into forms appropriate for mining by performing summary or aggregation operations, for instance. 5) Data Mining: - An essential process where intelligent methods are applied in order to extract data patterns. 6) Pattern Evalution: -To identify the truly interesting patterns representing knowledge base on some interestingness measures. 7) Knowledge Presentation: - Visualization and knowledge representation techniques are used to present the mined knowledge to the user.
  • 11. Steps 1 to 4 are different forms of data preprocessing, where the data are prepared for mining. -The data mining step may interact with the user or knowledge base. -The interesting patterns are represented to the user & may be stored as a new knowledge in the knowledge base. -Data mining is only step which is more essential because it uncovers hidden patterns for evaluation.
  • 12. › KDD and Data Mining are not same thing. › KDD is the overall process of discovering useful knowledge from data whereas Data Mining is only one step in the KDD process. › KDD is the nontrivial process of identifying valid , potentially useful and ultimately understandable patterns in data and Data Mining is an application of specific algorithms for extracting patterns for data. How does KDD defer from Data Mining: