SlideShare une entreprise Scribd logo
1  sur  12
Introduction to Data Mining
• What is Data Mining?
• Related technologies
• Data Mining techniques
• Data Mining Goals
• Stages of data mining process
• Knowledge representation methods
• Applications
What is Data Mining?
• The process of extracting information to identify patterns, trends,
and useful data that would allow the business to take the data-
driven decision from huge sets of data is called Data Mining.
• Data mining is the act of automatically searching for large stores
of information to find trends and patterns that go beyond simple
analysis procedures.
• Data Mining is a process used by organizations to extract
specific data from huge databases to solve business problems.
It primarily turns raw data into useful information.
• Data mining utilizes complex mathematical algorithms for data
segments and evaluates the probability of future events. Data
Mining is also called Knowledge Discovery of Data (KDD).
Related Technologies
Data mining is related to many concepts. We briefly
introduce each concept and indicate how it is related to
data mining.
• Machine Learning
• DBMS
• OLAP
• Statistics
Machine Learning
• Machine learning is the area of AI that examines how to write programs that
can learn.
• In data mining, machine learning is often used for prediction or classification.
• Applications that typically use machine learning techniques include speech
recognition, training moving robots, classification of astronomical structures,
and game playing.
• When machine learning is applied to data mining tasks, a model is used to
represent the data (such as a graphical structure like a neural network or a
decision tree).
• During the learning process, a sample of the database is used to train the
system to properly perform the desired task.
• Then the system is applied to the general database to actually perform the
task.
Machine Learning
• Machine learning algorithms are divided into two types:
1. Unsupervised Learning
2. Supervised Learning
1. Unsupervised Machine Learning:
Unsupervised learning does not depend on trained data sets to predict the
results, but it utilizes direct techniques such as clustering and association in
order to predict the results.
2. Supervised Machine Learning:
Supervised learning is a learning process in which we teach or train the
machine using data which is well leveled implies that some data is already
marked with the correct responses. After that, the machine is provided with
the new sets of data so that the supervised learning algorithm analyzes the
training data and gives an accurate result.
OLAP
• OLAP stands for On-Line Analytic Processing.
• OLAP systems are targeted to provide more complex query
results than traditional OLTP or database systems.
• OLAP is performed on data warehouses or data marts. The
primary goal of OLAP is to support ad hoc querying needed to
support DSS.
• The multidimensional view of data is fundamental to OLAP
applications.
• OLAP tools can be classified as ROLAP or MOLAP.
• ROLAP- Relational OLAP
• MOLAP- Multidimensional OLAP
OLAP operations
OLAP operations
There are several types of OLAP operations supported by OLAP tools:
• A simple query may look at a single cell within the cube [Figure (a)] .
• Slice: Look at a subcube to get more specific information. This is performed
by selecting on one dimension. As seen in Figure (c), this is looking at a
portion of the cube.
• Dice: Look at a subcube by selecting on two or more dimensions. This can be
performed by a slice on one dimension and then rotating the cube to select
on a second dimension. In Figure (d)
• Roll up (dimension reduction, aggregation): Roll up allows the user to ask
questions that move up an aggregation hierarchy. Figure (b) represents a roll
up from (a).
• Drill down: Figure (a) represents a drill down from (b). These functions allow a
user to get more detailed fact information by navigating lower in the
aggregation hierarchy.
• Visualization: Visualization allows the OLAP users to actually "see" results of
an operation.
DBMS
• A database is a collection of data usually associated with some
organization or enterprise.
• Schema
– e.g. (ID,Name,Address,Salary,JobNo) may be the schema for a
personnel database.
• A database management system (DBMS) is the software used to access a
database.
• Data model is used to describe the data, attributes, and relationships
among them.
– ER Model.
DBMS
• Transaction
• Query:
SELECT Name
FROM T
WHERE Salary > 100000
• A major difference between data mining queries and those of database
systems is the output .
• Basic database queries always output either a subset of the database or
aggregates of the data. A data mining query outputs a KDD object.
Statistics
• Simple statistical concepts as determining a data distribution and calculating
a mean and a variance can be viewed as data mining techniques.
• Statistical inference: Generalizing a model created from a sample of the
data to the entire dataset.
• Exploratory Data Analysis:
– Data can actually drive the creation of the model
– Opposite of traditional statistical view.
• Statistics research has produced many of the proposed data mining
algorithms.
• The difference between the data mining and statistics is data mining is
targeted to business users not to the statistician.
Goals of Data Mining?
• Data mining is one of the most useful techniques that help
entrepreneurs, researchers, and individuals to extract valuable
information from huge sets of data.
• Data mining Store and manage the data in a multidimensional
database system.
• Data mining Provide data access to business analysts and
information technology professionals.
• Data mining Analyze the data by application software.
• Data mining Present the data in a useful format, such as a
graph or table.

Contenu connexe

Similaire à Lecture2 (1).ppt

Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfSaketBansal9
 
MS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql ServerMS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql Serversqlserver content
 
MS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql ServerMS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql ServerDataminingTools Inc
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data AnalyticsUtkarsh Sharma
 
Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onwordSulman Ahmed
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data miningSamrat Tayade
 
Data mining basic concept and Data warehousing
Data mining basic concept and Data warehousingData mining basic concept and Data warehousing
Data mining basic concept and Data warehousingNivaTripathy1
 
overview of_data_processing
overview of_data_processingoverview of_data_processing
overview of_data_processingFEG
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data MiningValerii Klymchuk
 
Unit 3 part ii Data mining
Unit 3 part ii Data miningUnit 3 part ii Data mining
Unit 3 part ii Data miningDhilsath Fathima
 
001 More introduction to big data analytics
001   More introduction to big data analytics001   More introduction to big data analytics
001 More introduction to big data analyticsDendej Sawarnkatat
 
Data Mining Implementation process.pptx
Data Mining Implementation process.pptxData Mining Implementation process.pptx
Data Mining Implementation process.pptxLithal Fragrance
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processingSamraiz Tejani
 

Similaire à Lecture2 (1).ppt (20)

Machinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdfMachinr Learning and artificial_Lect1.pdf
Machinr Learning and artificial_Lect1.pdf
 
Complete unit ii notes
Complete unit ii notesComplete unit ii notes
Complete unit ii notes
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
 
MS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql ServerMS SQL SERVER: Introduction To Datamining Suing Sql Server
MS SQL SERVER: Introduction To Datamining Suing Sql Server
 
MS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql ServerMS Sql Server: Introduction To Datamining Suing Sql Server
MS Sql Server: Introduction To Datamining Suing Sql Server
 
Ch_2.pdf
Ch_2.pdfCh_2.pdf
Ch_2.pdf
 
Data Mining-2023 (2).ppt
Data Mining-2023 (2).pptData Mining-2023 (2).ppt
Data Mining-2023 (2).ppt
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
Ch~2.pdf
Ch~2.pdfCh~2.pdf
Ch~2.pdf
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Data mining Basics and complete description onword
Data mining Basics and complete description onwordData mining Basics and complete description onword
Data mining Basics and complete description onword
 
Introduction to data mining
Introduction to data miningIntroduction to data mining
Introduction to data mining
 
Data mining basic concept and Data warehousing
Data mining basic concept and Data warehousingData mining basic concept and Data warehousing
Data mining basic concept and Data warehousing
 
overview of_data_processing
overview of_data_processingoverview of_data_processing
overview of_data_processing
 
01 Introduction to Data Mining
01 Introduction to Data Mining01 Introduction to Data Mining
01 Introduction to Data Mining
 
Unit 3 part ii Data mining
Unit 3 part ii Data miningUnit 3 part ii Data mining
Unit 3 part ii Data mining
 
001 More introduction to big data analytics
001   More introduction to big data analytics001   More introduction to big data analytics
001 More introduction to big data analytics
 
Data Mining Implementation process.pptx
Data Mining Implementation process.pptxData Mining Implementation process.pptx
Data Mining Implementation process.pptx
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Online analytical processing
Online analytical processingOnline analytical processing
Online analytical processing
 

Plus de Minakshee Patil

Plus de Minakshee Patil (7)

Lecture2 (9).ppt
Lecture2 (9).pptLecture2 (9).ppt
Lecture2 (9).ppt
 
oracle.pptx
oracle.pptxoracle.pptx
oracle.pptx
 
Lecture1.ppt
Lecture1.pptLecture1.ppt
Lecture1.ppt
 
Unit 1.ppt
Unit 1.pptUnit 1.ppt
Unit 1.ppt
 
Hierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptxHierarchical clustering algorithm.pptx
Hierarchical clustering algorithm.pptx
 
Lecture3 (3).ppt
Lecture3 (3).pptLecture3 (3).ppt
Lecture3 (3).ppt
 
Lecture4.ppt
Lecture4.pptLecture4.ppt
Lecture4.ppt
 

Dernier

chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 

Dernier (20)

chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 

Lecture2 (1).ppt

  • 1. Introduction to Data Mining • What is Data Mining? • Related technologies • Data Mining techniques • Data Mining Goals • Stages of data mining process • Knowledge representation methods • Applications
  • 2. What is Data Mining? • The process of extracting information to identify patterns, trends, and useful data that would allow the business to take the data- driven decision from huge sets of data is called Data Mining. • Data mining is the act of automatically searching for large stores of information to find trends and patterns that go beyond simple analysis procedures. • Data Mining is a process used by organizations to extract specific data from huge databases to solve business problems. It primarily turns raw data into useful information. • Data mining utilizes complex mathematical algorithms for data segments and evaluates the probability of future events. Data Mining is also called Knowledge Discovery of Data (KDD).
  • 3. Related Technologies Data mining is related to many concepts. We briefly introduce each concept and indicate how it is related to data mining. • Machine Learning • DBMS • OLAP • Statistics
  • 4. Machine Learning • Machine learning is the area of AI that examines how to write programs that can learn. • In data mining, machine learning is often used for prediction or classification. • Applications that typically use machine learning techniques include speech recognition, training moving robots, classification of astronomical structures, and game playing. • When machine learning is applied to data mining tasks, a model is used to represent the data (such as a graphical structure like a neural network or a decision tree). • During the learning process, a sample of the database is used to train the system to properly perform the desired task. • Then the system is applied to the general database to actually perform the task.
  • 5. Machine Learning • Machine learning algorithms are divided into two types: 1. Unsupervised Learning 2. Supervised Learning 1. Unsupervised Machine Learning: Unsupervised learning does not depend on trained data sets to predict the results, but it utilizes direct techniques such as clustering and association in order to predict the results. 2. Supervised Machine Learning: Supervised learning is a learning process in which we teach or train the machine using data which is well leveled implies that some data is already marked with the correct responses. After that, the machine is provided with the new sets of data so that the supervised learning algorithm analyzes the training data and gives an accurate result.
  • 6. OLAP • OLAP stands for On-Line Analytic Processing. • OLAP systems are targeted to provide more complex query results than traditional OLTP or database systems. • OLAP is performed on data warehouses or data marts. The primary goal of OLAP is to support ad hoc querying needed to support DSS. • The multidimensional view of data is fundamental to OLAP applications. • OLAP tools can be classified as ROLAP or MOLAP. • ROLAP- Relational OLAP • MOLAP- Multidimensional OLAP
  • 8. OLAP operations There are several types of OLAP operations supported by OLAP tools: • A simple query may look at a single cell within the cube [Figure (a)] . • Slice: Look at a subcube to get more specific information. This is performed by selecting on one dimension. As seen in Figure (c), this is looking at a portion of the cube. • Dice: Look at a subcube by selecting on two or more dimensions. This can be performed by a slice on one dimension and then rotating the cube to select on a second dimension. In Figure (d) • Roll up (dimension reduction, aggregation): Roll up allows the user to ask questions that move up an aggregation hierarchy. Figure (b) represents a roll up from (a). • Drill down: Figure (a) represents a drill down from (b). These functions allow a user to get more detailed fact information by navigating lower in the aggregation hierarchy. • Visualization: Visualization allows the OLAP users to actually "see" results of an operation.
  • 9. DBMS • A database is a collection of data usually associated with some organization or enterprise. • Schema – e.g. (ID,Name,Address,Salary,JobNo) may be the schema for a personnel database. • A database management system (DBMS) is the software used to access a database. • Data model is used to describe the data, attributes, and relationships among them. – ER Model.
  • 10. DBMS • Transaction • Query: SELECT Name FROM T WHERE Salary > 100000 • A major difference between data mining queries and those of database systems is the output . • Basic database queries always output either a subset of the database or aggregates of the data. A data mining query outputs a KDD object.
  • 11. Statistics • Simple statistical concepts as determining a data distribution and calculating a mean and a variance can be viewed as data mining techniques. • Statistical inference: Generalizing a model created from a sample of the data to the entire dataset. • Exploratory Data Analysis: – Data can actually drive the creation of the model – Opposite of traditional statistical view. • Statistics research has produced many of the proposed data mining algorithms. • The difference between the data mining and statistics is data mining is targeted to business users not to the statistician.
  • 12. Goals of Data Mining? • Data mining is one of the most useful techniques that help entrepreneurs, researchers, and individuals to extract valuable information from huge sets of data. • Data mining Store and manage the data in a multidimensional database system. • Data mining Provide data access to business analysts and information technology professionals. • Data mining Analyze the data by application software. • Data mining Present the data in a useful format, such as a graph or table.