SlideShare une entreprise Scribd logo
1  sur  1
DATA WAREHOUSING AND DATA MINING - Presentation Transcript DATA WAREHOUSING AND DATA MINING PRESENTED BY :- ANIL SHARMA B-TECH(IT)MBA-A REG NO : 3470070100 PANKAJ JARIAL BTECH(IT)MBA-A REG NO : 3470070086 DATA WAREHOUSING Data warehousing is combining data from multiple sources into one comprehensive and easily manipulated database. The primary aim for data warehousing is to provide businesses with analytics results from data mining, OLAP, Scorecarding and reporting. NEED FOR DATA WAREHOUSING Information is now considered as a key for all the works. Those who gather, analyze, understand, and act upon information are winners. Information have no limits, it is very hard to collect information from various sources, so we need an data warehouse from where we can get all the information. TODAYS BUISNESS INFORMATION Retrieving data Analyzing data Extracting data Loading data Transforming data Managing data DATA WAREHOUSING INCLUDES:- DATA WAREHOUSE ARCHITECTURE Data warehousing is designed to provide an architecture that will make cooperate data accessible and useful to users. There is no right or wrong architecture. The worthiness of the architecture can be judge by its use, and concept behind it . Data Warehouses can be architected in many different ways, depending on the specific needs of a business.  Typical Data Warehousing Environment An operational data store (ODS) is basically a database that is used for being an temporary storage area for a datawarehouse. Its primary purpose is for handling data which are progressively in use. Operational data store contains data which are constantly updated through the course of the business operations. ETL (Extract, Transform, Load) is used to copy data from:- ODS to data warehouse staging area. Data warehouse staging area to data warehouse . Data warehouse to data mart . ETL extracts data, transforms values of inconsistent data, cleanses "bad" data, filters data and loads data into a target database.   The Data Warehouse Staging Area is temporary location where data from source systems is copied.  It increases the speed of data warehouse architecture. It is very essential since data is increasing day by day. The purpose of the Data Warehouse is to integrate corporate data. The amount of data in the Data Warehouse is massive.  Data is stored at a very deep level of detail. This allows data to be grouped in unimaginable ways. Data Warehouses does not contain all the data in the organization ,It's purpose is to provide base that are needed by the organization for strategic and tactical decision making.   ETL extract data from the Data Warehouse and send to one or more Data Marts for use of users. Data marts are represented as shortcut to a data warehouse ,to save time. It is just an partition of data present in data warehouse. Each Data Mart can contain different combinations of tables, columns and rows from the Enterprise Data Warehouse.  REASONS FOR CREATING AN DATA MART Easy access to frequently needed data. Creates collective view by a group of users. Improves user response time. Ease of creation. Lower cost than implementing a full Data warehouse DATA MINING The non-trivial extraction of implicit, previously unknown, and potentially useful information from large databases. – Extremely large datasets – Useful knowledge that can improve processes – Cannot be done manually Where Has it Come From ? Motivation Databases today are huge: – More than 1,000,000 entities/records/rows – From 10 to 10,000 fields/attributes/variables – Giga-bytes and tera-bytes Databases a growing at an unprecendented rate The corporate world is a cut-throat world – Decisions must be made rapidly – Decisions must be made with maximum knowledge How does data mining work? Extract, transform, and load transaction data onto the data warehouse system. Store and manage the data in a multidimensional database system. Provide data access to business analysts and information technology professionals. Analyze the data by application software. Present the data in a useful format, such as a graph or table DATA MINING MEASURES Accuracy Clarity Dirty Data Scalability Speed Validation Typical Applications of Data Mining ADVANTAGES OF DATA MINING Engineering and Technology Medical Science Business Combating Terrorism Games Research and Development Engineering and Technology In Electrical Power Engineering - used for condition monitoring of high voltage electrical equipment - vibration monitoring and analysis of transformer on-load tap-changers Education - to concentrate their knowledge Medical Science Data mining has been widely used in area of bioinformatics , genetics DNA sequences and variability in disease susceptibility which is very important to help improve the diagnosis, prevention and treatment of the diseases BUSINESS In Customer Relationship Management applications It Translate data from customer to merchant Accurately Distribute Business Processes Powerful Tool For Marketing Combating terrorism Concept used by Interpol against terrorists for searching their records by Multistate Anti-Terrorism Information Exchange In the Secure Flight program , Computer Assisted Passenger Pre screening System , Semantic Enhancement Games for certain combinatorial games, also called table bases (e.g. for 3x3-chess) It includes extraction of human-usable strategies Berlekamp in dots-and-boxes and Joh Nunn in chess endgames are notable examples Research And Development Helps to Develop the search algorithms It offers huge libraries of graphing and visualisation softwares The users can easily create the models optimally List of the top eight data-mining software vendors in 2008 Angoss Software Infor CRM Epiphany Portrait Software SAS G-Stat SPSS ThinkAnalytics Unica Viscovery THANK YOU

Contenu connexe

Tendances

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
sumit621
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Eyad Manna
 
Dataware housing
Dataware housingDataware housing
Dataware housing
work
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Sukirti Garg
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Edureka!
 
Data mining
Data miningData mining
Data mining
Silicon
 

Tendances (20)

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
 
Data Mining: A Short Survey
Data Mining: A Short SurveyData Mining: A Short Survey
Data Mining: A Short Survey
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
Data mining
Data miningData mining
Data mining
 
Data Ware Housing And Data Mining
Data Ware Housing And Data MiningData Ware Housing And Data Mining
Data Ware Housing And Data Mining
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
Data mining
Data miningData mining
Data mining
 
Intro to big data and applications - day 2
Intro to big data and applications - day 2Intro to big data and applications - day 2
Intro to big data and applications - day 2
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Datawarehouse olap olam
Datawarehouse olap olamDatawarehouse olap olam
Datawarehouse olap olam
 
Data mining
Data miningData mining
Data mining
 
Thilga
ThilgaThilga
Thilga
 

Similaire à Data Warehousing And Data Mining Presentation Transcript

Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
work
 
Data warehouse
Data warehouseData warehouse
Data warehouse
MR Z
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
gulab sharma
 

Similaire à Data Warehousing And Data Mining Presentation Transcript (20)

DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
Data mining and data warehousing
Data mining and data warehousingData mining and data warehousing
Data mining and data warehousing
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Abstract
AbstractAbstract
Abstract
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
 
CTP Data Warehouse
CTP Data WarehouseCTP Data Warehouse
CTP Data Warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Data Mining
Data MiningData Mining
Data Mining
 
Data warehousing unit 1
Data warehousing unit 1Data warehousing unit 1
Data warehousing unit 1
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 
Data Science
Data ScienceData Science
Data Science
 
DataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.pptDataWarehousingandAbInitioConcepts.ppt
DataWarehousingandAbInitioConcepts.ppt
 
BVRM 402 IMS UNIT V
BVRM 402 IMS UNIT VBVRM 402 IMS UNIT V
BVRM 402 IMS UNIT V
 
BVRM 402 IMS Database Concept.pptx
BVRM 402 IMS Database Concept.pptxBVRM 402 IMS Database Concept.pptx
BVRM 402 IMS Database Concept.pptx
 
us it recruiter
us it recruiterus it recruiter
us it recruiter
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Data Warehousing And Data Mining Presentation Transcript

  • 1. DATA WAREHOUSING AND DATA MINING - Presentation Transcript DATA WAREHOUSING AND DATA MINING PRESENTED BY :- ANIL SHARMA B-TECH(IT)MBA-A REG NO : 3470070100 PANKAJ JARIAL BTECH(IT)MBA-A REG NO : 3470070086 DATA WAREHOUSING Data warehousing is combining data from multiple sources into one comprehensive and easily manipulated database. The primary aim for data warehousing is to provide businesses with analytics results from data mining, OLAP, Scorecarding and reporting. NEED FOR DATA WAREHOUSING Information is now considered as a key for all the works. Those who gather, analyze, understand, and act upon information are winners. Information have no limits, it is very hard to collect information from various sources, so we need an data warehouse from where we can get all the information. TODAYS BUISNESS INFORMATION Retrieving data Analyzing data Extracting data Loading data Transforming data Managing data DATA WAREHOUSING INCLUDES:- DATA WAREHOUSE ARCHITECTURE Data warehousing is designed to provide an architecture that will make cooperate data accessible and useful to users. There is no right or wrong architecture. The worthiness of the architecture can be judge by its use, and concept behind it . Data Warehouses can be architected in many different ways, depending on the specific needs of a business.  Typical Data Warehousing Environment An operational data store (ODS) is basically a database that is used for being an temporary storage area for a datawarehouse. Its primary purpose is for handling data which are progressively in use. Operational data store contains data which are constantly updated through the course of the business operations. ETL (Extract, Transform, Load) is used to copy data from:- ODS to data warehouse staging area. Data warehouse staging area to data warehouse . Data warehouse to data mart . ETL extracts data, transforms values of inconsistent data, cleanses "bad" data, filters data and loads data into a target database.   The Data Warehouse Staging Area is temporary location where data from source systems is copied.  It increases the speed of data warehouse architecture. It is very essential since data is increasing day by day. The purpose of the Data Warehouse is to integrate corporate data. The amount of data in the Data Warehouse is massive.  Data is stored at a very deep level of detail. This allows data to be grouped in unimaginable ways. Data Warehouses does not contain all the data in the organization ,It's purpose is to provide base that are needed by the organization for strategic and tactical decision making.   ETL extract data from the Data Warehouse and send to one or more Data Marts for use of users. Data marts are represented as shortcut to a data warehouse ,to save time. It is just an partition of data present in data warehouse. Each Data Mart can contain different combinations of tables, columns and rows from the Enterprise Data Warehouse.  REASONS FOR CREATING AN DATA MART Easy access to frequently needed data. Creates collective view by a group of users. Improves user response time. Ease of creation. Lower cost than implementing a full Data warehouse DATA MINING The non-trivial extraction of implicit, previously unknown, and potentially useful information from large databases. – Extremely large datasets – Useful knowledge that can improve processes – Cannot be done manually Where Has it Come From ? Motivation Databases today are huge: – More than 1,000,000 entities/records/rows – From 10 to 10,000 fields/attributes/variables – Giga-bytes and tera-bytes Databases a growing at an unprecendented rate The corporate world is a cut-throat world – Decisions must be made rapidly – Decisions must be made with maximum knowledge How does data mining work? Extract, transform, and load transaction data onto the data warehouse system. Store and manage the data in a multidimensional database system. Provide data access to business analysts and information technology professionals. Analyze the data by application software. Present the data in a useful format, such as a graph or table DATA MINING MEASURES Accuracy Clarity Dirty Data Scalability Speed Validation Typical Applications of Data Mining ADVANTAGES OF DATA MINING Engineering and Technology Medical Science Business Combating Terrorism Games Research and Development Engineering and Technology In Electrical Power Engineering - used for condition monitoring of high voltage electrical equipment - vibration monitoring and analysis of transformer on-load tap-changers Education - to concentrate their knowledge Medical Science Data mining has been widely used in area of bioinformatics , genetics DNA sequences and variability in disease susceptibility which is very important to help improve the diagnosis, prevention and treatment of the diseases BUSINESS In Customer Relationship Management applications It Translate data from customer to merchant Accurately Distribute Business Processes Powerful Tool For Marketing Combating terrorism Concept used by Interpol against terrorists for searching their records by Multistate Anti-Terrorism Information Exchange In the Secure Flight program , Computer Assisted Passenger Pre screening System , Semantic Enhancement Games for certain combinatorial games, also called table bases (e.g. for 3x3-chess) It includes extraction of human-usable strategies Berlekamp in dots-and-boxes and Joh Nunn in chess endgames are notable examples Research And Development Helps to Develop the search algorithms It offers huge libraries of graphing and visualisation softwares The users can easily create the models optimally List of the top eight data-mining software vendors in 2008 Angoss Software Infor CRM Epiphany Portrait Software SAS G-Stat SPSS ThinkAnalytics Unica Viscovery THANK YOU