You can watch the replay for this webcast in the IDERA Resource Center: http://ow.ly/xbaO50A59Ah
Deoxyribonucleic acid (DNA) is the fundamental building block that specifies the structure and function of living things. The information in DNA is stored as a code made up of four chemical bases in which the sequencing determines unique characteristics, similar to the way in which letters of the alphabet appear in a certain order to form words and sentences.
Organizations can also be regarded as organic, with a need to adapt to changes in their environment. Every aspect of an organization also has a corresponding data representation, which can be regarded as its DNA. Without the correct tools and techniques, decoding that data structure can be extremely complex. Data modeling reveals that data in most organizations follows similar patterns. Once we recognize that, we can focus on the data characteristics that make each organization unique.
Establishing a data culture is vital to success, enabling a transformational breakthrough to translate data into knowledge and ultimately, strategic advantage. IDERA’s Ron Huizenga will explain how a business-driven data architecture enables you to leverage your data as a valuable strategic asset.
About Ron: Ron Huizenga is the Senior Product Manager of Enterprise Architecture and Modeling at IDERA. Ron has over 30 years of business and IT experience across many different industries including manufacturing, retail, healthcare, and transportation. His hands-on consulting experience with large-scale data development engagements provides practical, real-world insights to enterprise data architecture, business architecture, and governance initiatives.
Organizations are continually challenged by very complex data environments. Part of this is due to a proliferation of different technologies and data platforms, but additional challenges are posed by identifying, ingesting, and utilizing data that the organization itself neither creates nor owns. This type of data requires significant analysis, scrutiny, and processing before it can be combined with trusted organizational data sources to support informed analytics and decisions.
The difficulty of understanding and managing data resources is compounded further, since data stores are now likely to be a mix of cloud and on-premises deployments with widely varying levels of data quality.
Thus, we are typically dealing with varying combinations of:
Data origin: internal vs. external environment
Data store type: relational database (RDBMS) vs. NoSQL
Deployment: on-premises vs. cloud
In discussion, it is common to refer to "the data warehouse" or "the data lake," which can leave the impression that there is only one of each. However, in our complex ecosystem, we will typically have a myriad of raw data stores, document stores, OLTP relational databases, operational data stores, and data warehouses. Likewise, the data lake is not one physical data store. Rather, it is a concept, more commonly being referred to as the Logical Data Lake.
Following the flow of the diagram from left to right, the logical data lake begins once data is ingested and proceeds through storage of raw transient data, raw data analysis (data science), approved data stores, trusted data stores, the information refinery (including ETL), and refined data (including the data warehouse), which ultimately drives analytics and reporting. I have indicated a small subset of the typical data store technologies that could be used in specific areas to provide additional context; there are many more available. In addition, the depiction of a specific technology in a given area does not mean that the technology is limited to use in only that area. Several data store platforms have been used in multiple or all areas.
In the past, we have often referred to organizations as information factories. This is more relevant today than ever before. We can't simply trust the quality of data that we find in a particular data store, particularly if it is a raw data feed that has been ingested from outside sources such as social media sites, IoT sensor data, third-party sites, and other external sources. Continuing with the manufacturing analogy, those raw materials need to be inspected and processed before they can be incorporated into any downstream manufacturing processes. Once approved, that data can be refined and combined with our trusted data sources.
Data modeling is more important now than ever before. ER/Studio will allow you to map all the relevant data stores in the Multi-Hybrid Data Ecosystem and Logical Data Lake incorporating all sources, targets and data lineage. This will provide an integrated blueprint of physical deployment models, enterprise data dictionaries and enterprise models.
From DMBOK:
Data is the representation of facts as text, numbers, graphics, images, sound, or video. Technically, data is the plural form of the Latin word datum, meaning "a fact." However, people commonly use the term as a singular thing. Facts are captured, stored, and expressed as data.
Information is data in context. Without context, data is meaningless; we create meaningful information by interpreting the context around data.
This context includes:
The business meaning of data elements and related terms.
The format in which the data is presented.
The timeframe represented by the data.
The relevance of the data to a given usage.
Data is the raw material we interpret as data consumers to continually create information.
The official or widely accepted meanings of commonly used terms also represent a valuable enterprise resource, contributing to a shared understanding of meaningful information. Data definitions are just some of the many different kinds of "data about data" known as meta-data. Meta-data, including business data definitions, helps establish the context of data, and so managing meta-data contributes directly to improved information quality. Managing information assets includes the management of data and its meta-data.
Information contributes to knowledge. Knowledge is understanding, awareness, cognizance, and the recognition of a situation and familiarity with its complexity. Knowledge is information in perspective, integrated into a viewpoint based on the recognition and interpretation of patterns, such as trends, formed with other information and experience. It may also include assumptions and theories about causes. Knowledge may be explicit (what an enterprise or community accepts as true) or tacit (inside the heads of individuals). We gain in knowledge when we understand the significance of information.
Like data and information, knowledge is also an enterprise resource. Knowledge workers seek to gain expertise through the understanding of information, and then apply that expertise by making informed and aware decisions and actions. Knowledge workers may be staff experts, managers, or executives. A learning organization is one that proactively seeks to increase the collective knowledge and wisdom of its knowledge workers.
Naming standards are a mechanism to define, apply and enforce naming conventions to model objects. The naming of objects, particularly entities and attributes is extremely important in order to understand the business context and the real world objects that they represent. Naming standards typically comprise the following:
• List of common business terms to be used in naming
• Abbreviation for each term
• Template to specify order of terms (specific to object type)
• Case standards (upper, lower, first letter capitalized, etc.)
• Prefixes and suffixes
Both tools offer the capability to set up and apply naming standards templates, as well as the ability to upload terms and abbreviations from external sources such as Microsoft® Excel spreadsheets. The naming standards templates are quite similar. The following screen capture depicts one of the ER/Studio Naming Standards Template tabs, prior to entering any of the specifications.
The typical use case for naming standards is in creating physical object names from their logical counterparts:
• entity names → table names
• attribute names → column names
The manner in which this is done differs between ERwin and ER/Studio. The basis for this difference arises from the level of coupling between logical and physical models.
The auto naming standards feature allows us to bind a naming standards template to data model objects such as entities/tables and attributes/columns. The typical use case is to have the physical name change in place as we edit the logical name. Physical-to-logical mapping (the reverse direction) can also be applied if desired.
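To make the logical-to-physical transformation concrete, here is a minimal sketch of how a naming standards template might be applied. The glossary terms, abbreviations, and case rules below are hypothetical examples, not ER/Studio's actual template contents or API:

```python
# Hypothetical glossary of business terms and their approved abbreviations.
GLOSSARY = {
    "customer": "CUST",
    "number": "NBR",
    "identifier": "ID",
    "address": "ADDR",
}

def to_physical(logical_name: str, prefix: str = "", suffix: str = "") -> str:
    """Map a logical entity/attribute name to a physical table/column name:
    abbreviate each recognized business term, join with underscores,
    apply upper case, then add any configured prefix/suffix."""
    words = logical_name.lower().split()
    abbreviated = [GLOSSARY.get(word, word) for word in words]
    return prefix + "_".join(abbreviated).upper() + suffix

print(to_physical("Customer Number"))                  # CUST_NBR
print(to_physical("Customer Address", suffix="_TXT"))  # CUST_ADDR_TXT
```

A real template also carries the term order and object-type-specific rules listed above; the point is simply that the mapping is deterministic, so physical names stay consistent wherever the same business terms appear.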
Talk about all the different instances, different names. Then address requirement of a repository based solution, allowing those links to be formalized through universal mappings.
Universal Mappings are the ability to link “like” or related objects within the same model file or across separate model files. A typical use case is linking the representations of the same real life business object that exist in different models. For example, let’s assume we are dealing with the concept of employees. Employee data may exist in many different databases across the organization. Once we have reverse engineered those databases, universal mappings would be used to link the tables (or corresponding entities) together. This provides traceability in “where used” functionality to find all instances of the object.
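The idea behind universal mappings can be sketched as a simple registry that links every representation of a business concept back to one key. This is an illustrative data structure only (class and model-file names are invented), not how ER/Studio stores mappings internally:

```python
from collections import defaultdict

class UniversalMappings:
    """Toy registry linking model objects that represent the same
    real-world business concept across separate model files."""

    def __init__(self):
        self._by_concept = defaultdict(set)

    def link(self, concept: str, model: str, obj: str) -> None:
        # Record that `obj` in `model` represents `concept`.
        self._by_concept[concept].add((model, obj))

    def where_used(self, concept: str):
        """Return every (model, object) pair linked to the concept."""
        return sorted(self._by_concept[concept])

# Example: the Employee concept reverse engineered from three databases.
um = UniversalMappings()
um.link("Employee", "HR_OLTP.dm1", "EMPLOYEE")
um.link("Employee", "Payroll.dm1", "EMP_MASTER")
um.link("Employee", "DW.dm1", "DIM_EMPLOYEE")
print(um.where_used("Employee"))
```

The "where used" query is the payoff: one lookup finds every table or entity that carries employee data, which is exactly the traceability the mappings provide.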
This image depicts the Dictionary tab in ER/Studio, showing the attachments and Data Security Information tags. Beside it, in the diagram view, a table illustrates that these can also be shown on model diagrams.
It's critical to point out that when we say data modeling, we are talking about a lot more than simple ER diagrams.
Data can be described by the way that it is created, read, updated, deleted, and searched. This life cycle is called the CRUD cycle and is different for different data element types and companies. Lifecycle is extremely important, but often overlooked in less mature organizations.
For example, in the case of master data, how a customer is created depends largely upon a company's business rules, industry segment, and data systems. One company may have multiple customer-creation vectors, such as through the Internet, directly through account representatives, or through outlet stores. Another company may only allow customers to be created through direct contact over the phone with its call center. Further, how a customer element is created is certainly different from how a vendor element is created.
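The differing creation vectors above can be captured in a CRUD matrix: which systems may create, read, update, or delete a given data element. The system and element names below are hypothetical, chosen only to mirror the example:

```python
# Toy CRUD matrix: element -> system -> permitted operations.
CRUD_MATRIX = {
    "Customer": {
        "Web Portal":     "CR",    # customers self-register and view their data
        "CRM":            "CRUD",  # account reps own the full lifecycle
        "Data Warehouse": "R",     # analytics is read-only
    },
    "Vendor": {
        "Procurement":    "CRUD",  # vendors are created only through procurement
        "Data Warehouse": "R",
    },
}

def can(system: str, element: str, operation: str) -> bool:
    """Check whether a system is allowed an operation (C, R, U, or D)."""
    return operation in CRUD_MATRIX.get(element, {}).get(system, "")

print(can("Web Portal", "Customer", "C"))  # True
print(can("Web Portal", "Customer", "D"))  # False
```

Writing the matrix down is what makes lifecycle gaps visible: the Vendor row immediately shows that no customer-facing channel can create a vendor, which is a business rule, not an accident.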
In ER/Studio, metadata lineage is supported directly in the modeling tool through the “where used” and dependent-objects functionality. It is part of the metadata that can be published in Team Server.
Within ER/Studio, data lineage is the ability to document data extraction, transformation and load parameters, which is sometimes referred to as source and target mapping. Data lineage enables you to document the movement of data from point A to point B, and any intermediate steps in between. This movement is sometimes referred to as Extraction, Transformation and Load (ETL). A model produced in ER/Studio can represent any point along the way. Data Architects need the ability to specify the "source" or "target" of data, down to the column/attribute level. Along with the metadata that defines the source and target mapping are rules for how the data is manipulated along the way.
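A column-level source-to-target mapping is, at its core, a small metadata record: source column, target column, and the transformation rule applied in between. The sketch below shows that shape with invented column names; it is not ER/Studio's internal lineage format:

```python
from dataclasses import dataclass

@dataclass
class ColumnMapping:
    source: str          # fully qualified source column
    target: str          # fully qualified target column
    transformation: str  # rule applied during ETL

# Hypothetical lineage for a sales fact table.
lineage = [
    ColumnMapping("OLTP.ORDERS.ORDER_DT", "DW.FACT_SALES.ORDER_DATE",
                  "CAST to DATE"),
    ColumnMapping("OLTP.ORDERS.AMT", "DW.FACT_SALES.ORDER_AMOUNT",
                  "SUM grouped by ORDER_DATE"),
]

# Answering "where does DW.FACT_SALES.ORDER_DATE come from?" is a filter:
sources = [m.source for m in lineage
           if m.target == "DW.FACT_SALES.ORDER_DATE"]
print(sources)  # ['OLTP.ORDERS.ORDER_DT']
```

Because each record names both endpoints and the rule, the same metadata answers impact analysis in either direction: filter on source to see everything a feed drives, or on target to trace a report column back to its origin.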
Glossary hierarchies are used to group terms in a manner that aligns with the organizational structure of your business. Typically these areas have different stewards that are responsible for maintaining/updating the definitions as well as adding new business terms that are applicable.
Note the details of the entity. Not just modeling characteristics, but also all of the associated attachments: retention policies, master data classification, business value (whatever is needed – fully definable)
Security Properties – With Alerts.
Note the alert at the top of the page due to the bound security properties.
Can link to reference data in worksheets (Google, intranet, SharePoint, MDM repository, external sources)
Data policy
Increasingly complex regulations
Imperatives
Data security
Data Privacy
Data integrity
Create, discuss, update policies
Needs to become part of corporate data culture
Associate policies to data concepts and data elements for easy identification
Policies and rules need to be visible to data users, stewards
Alerting mechanisms
Collaborative stakeholder engagement for important policy decisions, clarification
Operationalize the data
Common & consistent reference data sets
Consistent data usage
Common understanding of how reference and master data is used, stored, connected
Master Data Management (MDM) classification
Reconcile data across operational systems for standardized reporting and analytics
Ensure consistency through enterprise data dictionaries