SlideShare une entreprise Scribd logo
1  sur  15
Data Warehouse
Done by : Safaa Taamsah
Alzaytonah University
201527036
Definition
•In computing, a data warehouse (DW or DWH),
also known as an enterprise data
warehouse (EDW), is a system used
for reporting and data analysis, and is considered a
core component of business intelligence. DWs are
central repositories of integrated data from one or
more disparate sources. They store current and
historical data in one single place that are used for
creating analytical reports for workers throughout the
enterprise.
The Concept of Data Warehouse
• The concept of data warehousing dates back to the late 1980s
• when IBM researchers Barry Devlin and Paul Murphy developed
the "business data warehouse". In essence, the data warehousing
concept was intended to provide an architectural model for the
flow of data from operational systems to decision support
system. The concept attempted to address the various problems
associated with this flow, mainly the high costs associated with it.
The Concept of Data Warehouse
• A data warehouse is a relational database that is
designed for query and analysis rather than for
transaction processing. It usually contains historical data
derived from transaction data, but it can include data
from other sources. It separates analysis workload from
transaction workload and enables an organization to
consolidate data from several sources.
• In addition to a relational database, a data warehouse
environment includes an extraction, transportation,
transformation, and loading (ETL) solution, an online
analytical processing (OLAP) engine, client analysis
tools, and other applications that manage the process of
gathering data and delivering it to business users.
Characteristics of a data warehouse
A common way of introducing data warehousing is to
refer to the characteristics of a data warehouse as set
forth by :
Subject Oriented
Integrated
Nonvolatile
Time Variant
• Subject Oriented
• Data warehouses are designed to help you analyze data. For example, to learn more
about your company's sales data, you can build a warehouse that concentrates on sales.
Using this warehouse, you can answer questions like "Who was our best customer for
this item last year?" This ability to define a data warehouse by subject matter, sales in
this case, makes the data warehouse subject oriented.
• Integrated
• Integration is closely related to subject orientation. Data warehouses must put data
from disparate sources into a consistent format. They must resolve such problems as
naming conflicts and inconsistencies among units of measure. When they achieve this,
they are said to be integrated.
• Nonvolatile
• Nonvolatile means that, once entered into the warehouse, data should not change. This
is logical because the purpose of a warehouse is to enable you to analyze what has
occurred.
• Time Variant
• In order to discover trends in business, analysts need large amounts of data. A data
warehouse's focus on change over time is what is meant by the term time variant.
Data Mart
• A data mart is the access layer of the data warehouse environment that is
used to get data out to the users. The data mart is a subset of the data
warehouse and is usually oriented to a specific business line or team.
• A data mart is basically a condensed and more focused version of a data
warehouse that reflects the regulations and process specifications of each
business unit within an organization. Each data mart is dedicated to a specific
business function or region. This subset of data may span across many or all
of an enterprise’s functional subject areas. It is common for multiple data
marts to be used in order to serve the needs of each individual business unit
(different data marts can be used to obtain specific information for various
enterprise departments, such as accounting, marketing, sales, etc.)
Types of Data Stored in a Data
Warehouse
• Historical Data
A data warehouse typically contains several years of historical data.
The amount of data that you decide to make available depends on
available disk space and the types of analysis that you want to
support. This data can come from your transactional database
archives or other sources.
• Metadata is "data [information] that provides information about
other data". Three distinct types of metadata exist: descriptive
metadata, structural metadata, and administrative metadata.
Descriptive metadata describes a resource for purposes such as
discovery and identification. It can include elements such as title,
abstract, author, and keywords.
Structural metadata is metadata about containers of data and
indicates how compound objects are put together, for example,
how pages are ordered to form chapters. It describes the types,
versions, relationships and other characteristics of digital
materials.
Administrative metadata provides information to help manage a
resource, such as when and how it was created, file type and
other technical information, and who can access it.
• Derived data
A derived data element is a data element derived from other data
elements using a mathematical, logical, or other type of
transformation, e.g. arithmetic formula, composition, aggregation
• Raw data
Also known as primary data, is data (e.g., numbers, instrument
readings, figures, etc.) collected from a source.
Business Intelligence and Data Warehousing
 One ultimate use of the data gathered and processed in the data life cycle is
for business intelligence.
 Business intelligence generally involves the creation or use of a data
warehouse and/or data mart for storage of data, and the use of front-end
analytical tools such as Oracle’s Sales Analyzer and Financial Analyzer or
Micro Strategy’s Web.
 Such tools can be employed by end users to access data, ask queries, request
ad hoc (special) reports, examine scenarios, create CRM activities, devise
pricing strategies, and much more.
 More advanced applications of business intelligence include outputs such as:
• financial modeling
• budgeting
• resource allocation
• and competitive intelligence.
Data Warehouse Applications
oRetail Industry: Forecasting, Market research,
Merchandising etc.
oManufacturing and distribution : Sales history/trends,
Market demand projects etc.
oBanks : Spot market trends, Marketing, Credit cards etc.
oInsurance Companies : Property and casualty fraud etc.
oHealth Care Providers : Fraud detection, Patient
matching etc.
Data Warehouse Applications
o Government Agencies : Auditing tax records, information sharing
across different agencies etc.
o Internet Companies : Analyzing shopping behavior, CRM etc.
o Telecommunications : Telemarketing, Product development etc.
o Sports : Analyzing strategies, Winning player combinations etc.
Data Warehouse Sizes
Terabyte (10^12) - Walmart (24 TB)
Petabyte (10^15) - Geographic Information Systems
Exabyte (10^18) - National Medical Association
Zettabyte (10^21) - Weather Images
Zottabyte (10^24) - Intelligence Agency (Video)

Contenu connexe

Tendances

Tendances (20)

Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata Strategies
 
Data warehouse : Order Management
Data warehouse : Order ManagementData warehouse : Order Management
Data warehouse : Order Management
 
Data governance - An Insight
Data governance - An InsightData governance - An Insight
Data governance - An Insight
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
 
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large Enterprises
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
 
Data Stewards – Defining and Assigning
Data Stewards – Defining and AssigningData Stewards – Defining and Assigning
Data Stewards – Defining and Assigning
 
Enterprise Data Management Framework Overview
Enterprise Data Management Framework OverviewEnterprise Data Management Framework Overview
Enterprise Data Management Framework Overview
 
Etl design document
Etl design documentEtl design document
Etl design document
 
Data Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureData Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and Future
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data Governance
 
Mdm: why, when, how
Mdm: why, when, howMdm: why, when, how
Mdm: why, when, how
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
 
Reference master data management
Reference master data managementReference master data management
Reference master data management
 
Sap Intro
Sap IntroSap Intro
Sap Intro
 
A Comparative Study of Data Management Maturity Models
A Comparative Study of Data Management Maturity ModelsA Comparative Study of Data Management Maturity Models
A Comparative Study of Data Management Maturity Models
 
Data Governance — Aligning Technical and Business Approaches
Data Governance — Aligning Technical and Business ApproachesData Governance — Aligning Technical and Business Approaches
Data Governance — Aligning Technical and Business Approaches
 
Data Governance
Data GovernanceData Governance
Data Governance
 

Similaire à Data warehouse

Data warehouse
Data warehouseData warehouse
Data warehouse
MR Z
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
Shanthi Mukkavilli
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
ParnalSatle
 

Similaire à Data warehouse (20)

Oracle sql plsql & dw
Oracle sql plsql & dwOracle sql plsql & dw
Oracle sql plsql & dw
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
Cognos datawarehouse
Cognos datawarehouseCognos datawarehouse
Cognos datawarehouse
 
Data warehousing.pptx
Data warehousing.pptxData warehousing.pptx
Data warehousing.pptx
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptx
 
Data Mining & Data Warehousing
Data Mining & Data WarehousingData Mining & Data Warehousing
Data Mining & Data Warehousing
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
data warehousing
data warehousingdata warehousing
data warehousing
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Unit 1
Unit 1Unit 1
Unit 1
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptx
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 

Dernier

Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
daisycvs
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
lizamodels9
 
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabiunwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
Abortion pills in Kuwait Cytotec pills in Kuwait
 
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Anamikakaur10
 

Dernier (20)

Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
👉Chandigarh Call Girls 👉9878799926👉Just Call👉Chandigarh Call Girl In Chandiga...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League City
 
Falcon Invoice Discounting: Empowering Your Business Growth
Falcon Invoice Discounting: Empowering Your Business GrowthFalcon Invoice Discounting: Empowering Your Business Growth
Falcon Invoice Discounting: Empowering Your Business Growth
 
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
 
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
 
Falcon Invoice Discounting platform in india
Falcon Invoice Discounting platform in indiaFalcon Invoice Discounting platform in india
Falcon Invoice Discounting platform in india
 
Eluru Call Girls Service ☎ ️93326-06886 ❤️‍🔥 Enjoy 24/7 Escort Service
Eluru Call Girls Service ☎ ️93326-06886 ❤️‍🔥 Enjoy 24/7 Escort ServiceEluru Call Girls Service ☎ ️93326-06886 ❤️‍🔥 Enjoy 24/7 Escort Service
Eluru Call Girls Service ☎ ️93326-06886 ❤️‍🔥 Enjoy 24/7 Escort Service
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabiunwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
unwanted pregnancy Kit [+918133066128] Abortion Pills IN Dubai UAE Abudhabi
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors Data
 
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
 
Phases of Negotiation .pptx
 Phases of Negotiation .pptx Phases of Negotiation .pptx
Phases of Negotiation .pptx
 
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRLBAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
BAGALUR CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
PHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation FinalPHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation Final
 

Data warehouse

  • 1. Data Warehouse Done by : Safaa Taamsah Alzaytonah University 201527036
  • 2. Definition •In computing, a data warehouse (DW or DWH), also known as an enterprise data warehouse (EDW), is a system used for reporting and data analysis, and is considered a core component of business intelligence. DWs are central repositories of integrated data from one or more disparate sources. They store current and historical data in one single place that are used for creating analytical reports for workers throughout the enterprise.
  • 3. The Concept of Data Warehouse • The concept of data warehousing dates back to the late 1980s • when IBM researchers Barry Devlin and Paul Murphy developed the "business data warehouse". In essence, the data warehousing concept was intended to provide an architectural model for the flow of data from operational systems to decision support system. The concept attempted to address the various problems associated with this flow, mainly the high costs associated with it.
  • 4. The Concept of Data Warehouse • A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but it can include data from other sources. It separates analysis workload from transaction workload and enables an organization to consolidate data from several sources. • In addition to a relational database, a data warehouse environment includes an extraction, transportation, transformation, and loading (ETL) solution, an online analytical processing (OLAP) engine, client analysis tools, and other applications that manage the process of gathering data and delivering it to business users.
  • 5. Characteristics of a data warehouse A common way of introducing data warehousing is to refer to the characteristics of a data warehouse as set forth by : Subject Oriented Integrated Nonvolatile Time Variant
  • 6. • Subject Oriented • Data warehouses are designed to help you analyze data. For example, to learn more about your company's sales data, you can build a warehouse that concentrates on sales. Using this warehouse, you can answer questions like "Who was our best customer for this item last year?" This ability to define a data warehouse by subject matter, sales in this case, makes the data warehouse subject oriented. • Integrated • Integration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. They must resolve such problems as naming conflicts and inconsistencies among units of measure. When they achieve this, they are said to be integrated. • Nonvolatile • Nonvolatile means that, once entered into the warehouse, data should not change. This is logical because the purpose of a warehouse is to enable you to analyze what has occurred. • Time Variant • In order to discover trends in business, analysts need large amounts of data. A data warehouse's focus on change over time is what is meant by the term time variant.
  • 7. Data Mart • A data mart is the access layer of the data warehouse environment that is used to get data out to the users. The data mart is a subset of the data warehouse and is usually oriented to a specific business line or team. • A data mart is basically a condensed and more focused version of a data warehouse that reflects the regulations and process specifications of each business unit within an organization. Each data mart is dedicated to a specific business function or region. This subset of data may span across many or all of an enterprise’s functional subject areas. It is common for multiple data marts to be used in order to serve the needs of each individual business unit (different data marts can be used to obtain specific information for various enterprise departments, such as accounting, marketing, sales, etc.)
  • 8. Types of Data Stored in a Data Warehouse • Historical Data A data warehouse typically contains several years of historical data. The amount of data that you decide to make available depends on available disk space and the types of analysis that you want to support. This data can come from your transactional database archives or other sources.
  • 9. • Metadata is "data [information] that provides information about other data". Three distinct types of metadata exist: descriptive metadata, structural metadata, and administrative metadata. Descriptive metadata describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords. Structural metadata is metadata about containers of data and indicates how compound objects are put together, for example, how pages are ordered to form chapters. It describes the types, versions, relationships and other characteristics of digital materials. Administrative metadata provides information to help manage a resource, such as when and how it was created, file type and other technical information, and who can access it.
  • 10. • Derived data A derived data element is a data element derived from other data elements using a mathematical, logical, or other type of transformation, e.g. arithmetic formula, composition, aggregation • Raw data Also known as primary data, is data (e.g., numbers, instrument readings, figures, etc.) collected from a source.
  • 11.
  • 12. Business Intelligence and Data Warehousing  One ultimate use of the data gathered and processed in the data life cycle is for business intelligence.  Business intelligence generally involves the creation or use of a data warehouse and/or data mart for storage of data, and the use of front-end analytical tools such as Oracle’s Sales Analyzer and Financial Analyzer or Micro Strategy’s Web.  Such tools can be employed by end users to access data, ask queries, request ad hoc (special) reports, examine scenarios, create CRM activities, devise pricing strategies, and much more.  More advanced applications of business intelligence include outputs such as: • financial modeling • budgeting • resource allocation • and competitive intelligence.
  • 13. Data Warehouse Applications oRetail Industry: Forecasting, Market research, Merchandising etc. oManufacturing and distribution : Sales history/trends, Market demand projects etc. oBanks : Spot market trends, Marketing, Credit cards etc. oInsurance Companies : Property and casualty fraud etc. oHealth Care Providers : Fraud detection, Patient matching etc.
  • 14. Data Warehouse Applications o Government Agencies : Auditing tax records, information sharing across different agencies etc. o Internet Companies : Analyzing shopping behavior, CRM etc. o Telecommunications : Telemarketing, Product development etc. o Sports : Analyzing strategies, Winning player combinations etc.
  • 15. Data Warehouse Sizes Terabyte (10^12) - Walmart (24 TB) Petabyte (10^15) - Geographic Information Systems Exabyte (10^18) - National Medical Association Zettabyte (10^21) - Weather Images Zottabyte (10^24) - Intelligence Agency (Video)

Notes de l'éditeur

  1. في الحوسبة ، يعد مستودع البيانات (DW أو DWH) ، المعروف أيضًا باسم مستودع بيانات المؤسسة (EDW) ، نظامًا يستخدم في إعداد التقارير وتحليل البيانات ، ويعتبر مكونًا أساسيًا في ذكاء الأعمال. تعتبر DWs مستودعات مركزية للبيانات المتكاملة من واحد أو أكثر من المصادر المتباينة. تقوم بتخزين البيانات الحالية والتاريخية في مكان واحد تستخدم لإنشاء تقارير تحليلية للعمال في جميع أنحاء المؤسسة.
  2. يعود مفهوم تخزين البيانات إلى أواخر الثمانينات عندما طور باحثون IBM باري ديفلين وبول مورفي "مستودع بيانات الأعمال". في جوهره ، كان المقصود من مفهوم تخزين البيانات توفير نموذج معماري لتدفق البيانات من الأنظمة التشغيلية إلى نظام دعم القرار. لقد حاول المفهوم معالجة مختلف المشاكل المرتبطة بهذا التدفق ، خاصة التكاليف المرتفعة المرتبطة به.
  3. مستودع البيانات عبارة عن قاعدة بيانات علائقية مصممة للاستعلام والتحليل بدلاً من معالجة المعاملات. عادةً ما تحتوي على بيانات تاريخية مشتقة من بيانات المعاملات ، ولكنها يمكن أن تتضمن بيانات من مصادر أخرى. يفصل عبء العمل التحليل من عبء العمل المعاملة وتمكين مؤسسة لتوحيد البيانات من مصادر متعددة. بالإضافة إلى قاعدة البيانات العلائقية ، تشتمل بيئة مستودع البيانات على حل الاستخراج والنقل والتحويل والتحميل (ETL) ، ومحرك معالج تحليلي (OLAP) على الإنترنت ، وأدوات تحليل العميل ، وتطبيقات أخرى تدير عملية جمع البيانات و تقديمها لمستخدمي الأعمال.
  4. من الطرق الشائعة لتقديم مستودعات البيانات الرجوع إلى خصائص مستودع البيانات كما هو محدد في: الموضوع موجه متكامل غير متطاير تغير الوقت
  5. الموضوع موجه تم تصميم مستودعات البيانات لمساعدتك على تحليل البيانات. على سبيل المثال ، لمعرفة المزيد عن بيانات مبيعات شركتك ، يمكنك إنشاء مستودع يركز على المبيعات. باستخدام هذا المستودع ، يمكنك الإجابة عن أسئلة مثل "من كان أفضل عميل لدينا لهذا البند في العام الماضي؟" هذه القدرة على تحديد مستودع البيانات حسب الموضوع ، والمبيعات في هذه الحالة ، يجعل موضوع مستودع البيانات موجهًا. متكامل يرتبط التكامل ارتباطًا وثيقًا بتوجيه الموضوع. يجب أن تضع مستودعات البيانات البيانات من مصادر متباينة في تنسيق ثابت. يجب أن تحل مشاكل مثل تسمية الصراعات وعدم التناسق بين وحدات القياس. عندما يحققون ذلك ، يقال إنهم متكاملون. غير متطاير تعني كلمة Nonvolatile أنه بمجرد إدخالها في المستودع ، يجب ألا تتغير البيانات. هذا أمر منطقي لأن الغرض من المستودع هو تمكينك من تحليل ما حدث. تغير الوقت من أجل اكتشاف الاتجاهات في مجال الأعمال ، يحتاج المحللون إلى كميات كبيرة من البيانات. إن تركيز مستودع البيانات على التغيير بمرور الوقت هو المقصود بالمصطلح "متغير الوقت".
  6. سوق البيانات هو طبقة الوصول لبيئة مستودع البيانات التي يتم استخدامها للحصول على البيانات إلى المستخدمين. سوق البيانات هو مجموعة فرعية من مستودع البيانات وعادة ما يتم توجيهه إلى خط عمل أو فريق عمل محدد. إن سوق البيانات هو في الأساس نسخة مكثفة وأكثر تركيزًا من مستودع البيانات الذي يعكس اللوائح ومواصفات العملية لكل وحدة أعمال داخل المؤسسة. يتم تخصيص كل سوق بيانات لوظيفة أو منطقة عمل محددة. قد تمتد هذه المجموعة الفرعية من البيانات عبر العديد من مجالات الموضوعات الوظيفية الخاصة بالمؤسسة أو كلها. من الشائع استخدام العديد من سجلات البيانات من أجل تلبية احتياجات كل وحدة أعمال فردية (يمكن استخدام سجلات البيانات المختلفة للحصول على معلومات محددة لمختلف إدارات المؤسسات ، مثل المحاسبة والتسويق والمبيعات ، إلخ.
  7. البيانات التاريخية عادةً ما يحتوي مستودع البيانات على عدة سنوات من البيانات التاريخية. يعتمد مقدار البيانات التي تقرر توفيرها على مساحة القرص المتوفرة وأنواع التحليل التي تريد دعمها. يمكن أن تأتي هذه البيانات من أرشيف قواعد بيانات المعاملات أو مصادر أخرى.
  8. لبيانات الوصفية هي "بيانات [معلومات] توفر معلومات حول بيانات أخرى". توجد ثلاثة أنواع مميزة من البيانات الوصفية: البيانات الوصفية الوصفية والبيانات الوصفية الهيكلية والبيانات الوصفية الإدارية. تصف البيانات الوصفية الوصفية أحد الموارد لأغراض مثل الاكتشاف والتعرف. يمكن أن تتضمن عناصر مثل العنوان والملخص والمؤلف والكلمات الرئيسية. البيانات الوصفية الهيكلية عبارة عن بيانات وصفية عن حاويات البيانات وتشير إلى كيفية تجميع الكائنات المركبة ، على سبيل المثال ، كيفية ترتيب الصفحات لتشكيل فصول. فهو يصف الأنواع والإصدارات والعلاقات والخصائص الأخرى للمواد الرقمية. توفر البيانات الوصفية الإدارية معلومات للمساعدة في إدارة أحد الموارد ، مثل وقت وكيفية إنشاء الملف ونوع الملف والمعلومات الفنية الأخرى ، ومن يمكنه الوصول إليه.
  9. البيانات المشتقة عنصر البيانات المشتقة هو عنصر بيانات مشتق من عناصر البيانات الأخرى باستخدام نوع رياضي أو منطقي أو أي نوع آخر من التحويل ، على سبيل المثال ، صيغة حسابية ، تكوين ، تجميع مسودة بيانات   تُعرف أيضًا باسم البيانات الأساسية ، وهي البيانات (مثل الأرقام وقراءات الأدوات والأرقام وغير ذلك) التي يتم جمعها من المصدر.
  10. احد الاستخدامات النهائية للبيانات التي تم جمعها ومعالجتها في دورة حياة البيانات هي استخبارات الأعمال.   يتضمن ذكاء الأعمال بشكل عام إنشاء أو استخدام مستودع البيانات و / أو سوق البيانات لتخزين البيانات ، واستخدام أدوات التحليل الأمامية مثل محلل مبيعات Oracle أو محلل مالي أو Web Strategy Strategy.   يمكن استخدام هذه الأدوات من قبل المستخدمين النهائيين للوصول إلى البيانات ، وطرح الاستفسارات ، وطلب تقارير مخصصة (خاصة) ، ودراسة السيناريوهات ، وإنشاء أنشطة CRM ، ووضع استراتيجيات التسعير ، وأكثر من ذلك بكثير. تتضمن التطبيقات الأكثر تقدمًا لذكاء الأعمال نواتج مثل: • النماذج المالية • الميزنة • تخصيص الموارد • والذكاء التنافسي.
  11. صناعة البيع بالتجزئة: التنبؤ ، أبحاث السوق ، تجارة الخ التصنيع والتوزيع: تاريخ المبيعات / الاتجاهات ، ومشاريع الطلب في السوق إلخ. البنوك: اتجاهات السوق الفورية ، التسويق ، بطاقات الائتمان ، إلخ. شركات التأمين: الاحتيال في الممتلكات والاصابات الخ مقدمي الرعاية الصحية: كشف الاحتيال ، مطابقة المرضى الخ
  12. الوكالات الحكومية: تدقيق السجلات الضريبية وتبادل المعلومات بين الوكالات المختلفة إلخ.   شركات الإنترنت: تحليل سلوك التسوق ، CRM إلخ.   الاتصالات: التسويق عبر الهاتف ، وتطوير المنتجات ، إلخ.   الرياضة: تحليل الاستراتيجيات ، ومجموعات لاعب الفوز وما إلى ذلك.
  13. تيرابايت (10 ^ 12) - وول مارت (24 تيرابايت) Petabyte (10 ^ 15) - نظم المعلومات الجغرافية Exabyte (10 ^ 18) - الجمعية الطبية الوطنية Zettabyte (10 ^ 21) - صور الطقس Zottabyte (10 ^ 24) - وكالة الاستخبارات (فيديو)