SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
Physical Data Modeling Blunders
And How to Avoid Them



Karen Lopez
Sr. Project Manager
Abstract

What’s going on in your physical data model? How many people can
or will update it to match the reality of what’s going on in your
databases? Who decides what goes into the physical model? In this
presentation we discuss 5 physical data modeling mistakes that cost
you dearly: performance snags, development delays, bugs and
professional respect. Data Architects are often tasked to prepare
first-cut physical data models, yet these skills usually overlap those of
DBAs and Developers and this overlap can lead to contention,
confusion, and complacency.


With this presentation, you’ll learn about the 5 blunders, how to find
them, plus many tips on how to avoid them.
  PAGE 2
Speaker Bio

Karen López is a principal
consultant at InfoAdvisors. She
specializes in the practical
application of data management
principles. Karen is also the
ListMistress and moderator of
the InfoAdvisors Discussion
Groups at www.infoadvisors.com
and other online communities.




  PAGE 3
Agenda




The Problem
Blunders, FAILs, D’Oh’s, Faults and WTHs?
10 Tips
 PAGE 4
POLL: Who Are You?
POLL: Physical Model Much?
The Problem




 PAGE 7
Confirmation
                               Number




     Assuming Numbers are Numbers




PAGE 8
D’OH!


•Leading Zeros
•Non-Numeric
•Special Characters
•Sorting Issues
•Externally Managed Numbers

 PAGE 9
Avoid it

1. Use a numeric-ish datatype when
   there’s math involved.
2. Do your research
3. If it’s externally managed data, its
   format or composition could change at
   any time. Anticipate that.
4. Anticipate leading zeros.
5. Anticipate significant formatting,
   characters & other tricks.



  PAGE 10
Choosing the Wrong Primary Key




PAGE 11
32 + 8 + 32 + 32 + (0 to 22) = A really big key
              that only a Data Architect could love



PAGE 12
Avoid it


• Establish Primary Key standards and styles
• Smaller PKs lead to smaller indexes and
  smaller databases. Take advantage of that
• Choose PKs that do not change when data
  changes
• Understand how Identity columns work



  PAGE 13
Applying Surrogate Keys Incorrectly




PAGE 14
Applying a Surrogate Key




 PAGE 15
WTH?




 PAGE 16
Alternate Key




 PAGE 17
Avoid it


• Establish Primary Key standards and styles
• Smaller PKs lead to smaller indexes and
  smaller databases. Take advantage of that.
• Don’t forget the Business Key
• Choose PKs that do not change when data
  changes
• Understand how Identity columns work
  PAGE 18
Turning off RI for Development




PAGE 19
Blunder




 PAGE 20
RI = FK




  PAGE 21
Avoid it
• Educate team on the risks & nearly universal outcome of turning
  off RI
• Generate RI in every script




                                                • Develop test data
                                  • Develop scripts for loading data
                      • Ensure team has easy access to Data Models
  PAGE 22
Using the Defaults




PAGE 23
Defaults

 • Defaults <> Best Practices
 • Product Defaults aren’t YOUR defaults
 • Defaults might even be…faulty.


  PAGE 24
Defaulting
Faults




 PAGE 25
Avoid it

  • Create a DA Test database…and use it.
  • Test Generation Options to Test DB with
    Data
  • Generate Test Scripts, Varying Options
  • Review options with DBAs and
    Developers.
  • Choose Defaults, don’t default them
  • Save your Default Set
  PAGE 26
Other Blunders

• Using the wrong tool            •   Hand generating scripts
• Too much Subtyping              •   Failing to report issues
• Logical Flexible hierarchies    •   Denormalizing too soon
• Wrong Datatypes                 •   Failing to use constraints
• Dual updates                    •   Making performance more
• Forgetting to manage the data       important than integrity
• Duplicate Indexes               •   Completely separate logical
• Ignoring Clustering                 model
• Using overly generic design     •   Failing to take into account
  patterns                            whole environment.
• Column order                    •   Applying surrogate keys where
                                      they have no business
• Silly Long Names                •   Not testing your design
• Failing to compare              •   Ignoring reference data design

    PAGE 27
10 Tips for Avoiding Physical Modeling Blunders


1. Understand cost, benefit and risk
2. Get formal training on target technologies
3. Work near your DBAs and Developers
4. Ask DBAs about trade-offs, not just
   solutions
5. Build portfolio of performance vs. integrity
   trade-offs
 PAGE 28
10 Tips for Avoiding Physical Modeling Blunders


6. Profile Source Data
7. Build Test Databases, with Data
8. Test Scripts
9. Compare, Compare, Compare…then
   Compare some More.
10.Get More Training.

 PAGE 29
Summary    1.Don’t assume that
             numbers are….numbers
           2.Choose the right primary
             key
           3.Apply surrogate keys
             correctly
           4.Keep RI turned on
           5.Architect, don’t default




 PAGE 30
Questions?




Feedback, questions,
comments are always                     Karen@InfoAdvisors.com
appreciated.
http://www.speakerrate.com/karenlopez




    PAGE 31

Contenu connexe

Similaire à 5 physical data modeling blunders 09092010

Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010ERwin Modeling
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...IDERA Software
 
Product Management in the Era of Data Science
Product Management in the Era of Data ScienceProduct Management in the Era of Data Science
Product Management in the Era of Data ScienceMandar Parikh
 
Best practices with development of enterprise-scale SharePoint solutions - Pa...
Best practices with development of enterprise-scale SharePoint solutions - Pa...Best practices with development of enterprise-scale SharePoint solutions - Pa...
Best practices with development of enterprise-scale SharePoint solutions - Pa...SPC Adriatics
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldKaren Lopez
 
From NASA to Startups to Big Commerce
From NASA to Startups to Big CommerceFrom NASA to Startups to Big Commerce
From NASA to Startups to Big CommerceDaniel Greenfeld
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceInside Analysis
 
Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?IDERA Software
 
4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to AvoidSenturus
 
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBig Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBigDataExpo
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMProduct School
 
Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520MariaHalstead1
 
10 things to avoid in data model 09242010
10 things to avoid in data model 0924201010 things to avoid in data model 09242010
10 things to avoid in data model 09242010ERwin Modeling
 
Doing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentDoing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentTasktop
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementIDERA Software
 
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongThe Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongDATAVERSITY
 
Lesson 3 ai in the enterprise
Lesson 3   ai in the enterpriseLesson 3   ai in the enterprise
Lesson 3 ai in the enterpriseankit_ppt
 
Redgate How to be Friends with Developers
Redgate How to be Friends with DevelopersRedgate How to be Friends with Developers
Redgate How to be Friends with DevelopersKellyn Pot'Vin-Gorman
 
Protect your Database with Data Masking & Enforced Version Control
Protect your Database with Data Masking & Enforced Version Control	Protect your Database with Data Masking & Enforced Version Control
Protect your Database with Data Masking & Enforced Version Control DBmaestro - Database DevOps
 

Similaire à 5 physical data modeling blunders 09092010 (20)

Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010Mastering your data with ca e rwin dm 09082010
Mastering your data with ca e rwin dm 09082010
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
 
Product Management in the Era of Data Science
Product Management in the Era of Data ScienceProduct Management in the Era of Data Science
Product Management in the Era of Data Science
 
Best practices with development of enterprise-scale SharePoint solutions - Pa...
Best practices with development of enterprise-scale SharePoint solutions - Pa...Best practices with development of enterprise-scale SharePoint solutions - Pa...
Best practices with development of enterprise-scale SharePoint solutions - Pa...
 
How to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database WorldHow to Survive as a Data Architect in a Polyglot Database World
How to Survive as a Data Architect in a Polyglot Database World
 
From NASA to Startups to Big Commerce
From NASA to Startups to Big CommerceFrom NASA to Startups to Big Commerce
From NASA to Startups to Big Commerce
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
 
Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?
 
4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid4 Common Analytics Reporting Errors to Avoid
4 Common Analytics Reporting Errors to Avoid
 
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBig Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PM
 
Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520
 
10 things to avoid in data model 09242010
10 things to avoid in data model 0924201010 things to avoid in data model 09242010
10 things to avoid in data model 09242010
 
Doing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentDoing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics Environment
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change Management
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongThe Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
 
Lesson 3 ai in the enterprise
Lesson 3   ai in the enterpriseLesson 3   ai in the enterprise
Lesson 3 ai in the enterprise
 
Redgate How to be Friends with Developers
Redgate How to be Friends with DevelopersRedgate How to be Friends with Developers
Redgate How to be Friends with Developers
 
Protect your Database with Data Masking & Enforced Version Control
Protect your Database with Data Masking & Enforced Version Control	Protect your Database with Data Masking & Enforced Version Control
Protect your Database with Data Masking & Enforced Version Control
 

Plus de ERwin Modeling

Using ca e rwin modeling to asure data 09162010
Using ca e rwin modeling to asure data 09162010Using ca e rwin modeling to asure data 09162010
Using ca e rwin modeling to asure data 09162010ERwin Modeling
 
Staying relevant in todays changing dm environment 09282010
Staying relevant in todays changing dm environment 09282010Staying relevant in todays changing dm environment 09282010
Staying relevant in todays changing dm environment 09282010ERwin Modeling
 
Sneak peak ca e rwin data modeler r8 preview09222010
Sneak peak ca e rwin data modeler r8 preview09222010Sneak peak ca e rwin data modeler r8 preview09222010
Sneak peak ca e rwin data modeler r8 preview09222010ERwin Modeling
 
Monetizing data management 09162010
Monetizing data management 09162010Monetizing data management 09162010
Monetizing data management 09162010ERwin Modeling
 
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...Integrating data process a roundtrip modeling using e rwin data modeler_erwin...
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...ERwin Modeling
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010ERwin Modeling
 
Deciding to go cloud 09212010
Deciding to go cloud  09212010Deciding to go cloud  09212010
Deciding to go cloud 09212010ERwin Modeling
 
Data modeling for the business 09282010
Data modeling for the business  09282010Data modeling for the business  09282010
Data modeling for the business 09282010ERwin Modeling
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010ERwin Modeling
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010ERwin Modeling
 
Ca e rwin state of the union 09082010
Ca e rwin state of the union 09082010Ca e rwin state of the union 09082010
Ca e rwin state of the union 09082010ERwin Modeling
 
Optimizing the design of your data warehouse 09222010
Optimizing the design of your data warehouse 09222010Optimizing the design of your data warehouse 09222010
Optimizing the design of your data warehouse 09222010ERwin Modeling
 

Plus de ERwin Modeling (12)

Using ca e rwin modeling to asure data 09162010
Using ca e rwin modeling to asure data 09162010Using ca e rwin modeling to asure data 09162010
Using ca e rwin modeling to asure data 09162010
 
Staying relevant in todays changing dm environment 09282010
Staying relevant in todays changing dm environment 09282010Staying relevant in todays changing dm environment 09282010
Staying relevant in todays changing dm environment 09282010
 
Sneak peak ca e rwin data modeler r8 preview09222010
Sneak peak ca e rwin data modeler r8 preview09222010Sneak peak ca e rwin data modeler r8 preview09222010
Sneak peak ca e rwin data modeler r8 preview09222010
 
Monetizing data management 09162010
Monetizing data management 09162010Monetizing data management 09162010
Monetizing data management 09162010
 
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...Integrating data process a roundtrip modeling using e rwin data modeler_erwin...
Integrating data process a roundtrip modeling using e rwin data modeler_erwin...
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010
 
Deciding to go cloud 09212010
Deciding to go cloud  09212010Deciding to go cloud  09212010
Deciding to go cloud 09212010
 
Data modeling for the business 09282010
Data modeling for the business  09282010Data modeling for the business  09282010
Data modeling for the business 09282010
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010
 
Ca e rwin state of the union 09082010
Ca e rwin state of the union 09082010Ca e rwin state of the union 09082010
Ca e rwin state of the union 09082010
 
Optimizing the design of your data warehouse 09222010
Optimizing the design of your data warehouse 09222010Optimizing the design of your data warehouse 09222010
Optimizing the design of your data warehouse 09222010
 

Dernier

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Dernier (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

5 physical data modeling blunders 09092010

  • 1. Physical Data Modeling Blunders And How to Avoid Them Karen Lopez Sr. Project Manager
  • 2. Abstract What’s going on in your physical data model? How many people can or will update it to match the reality of what’s going on in your databases? Who decides what goes into the physical model? In this presentation we discuss 5 physical data modeling mistakes that cost you dearly: performance snags, development delays, bugs and professional respect. Data Architects are often tasked to prepare first-cut physical data models, yet these skills usually overlap those of DBAs and Developers and this overlap can lead to contention, confusion, and complacency. With this presentation, you’ll learn about the 5 blunders, how to find them, plus many tips on how to avoid them. PAGE 2
  • 3. Speaker Bio Karen López is a principal consultant at InfoAdvisors. She specializes in the practical application of data management principles. Karen is also the ListMistress and moderator of the InfoAdvisors Discussion Groups at www.infoadvisors.com and other online communities. PAGE 3
  • 4. Agenda The Problem Blunders, FAILs, D’Oh’s, Faults and WTHs? 10 Tips PAGE 4
  • 8. Confirmation Number Assuming Numbers are Numbers PAGE 8
  • 10. Avoid it 1. Use a numeric-ish datatype when there’s math involved. 2. Do your research 3. If it’s externally managed data, its format or composition could change at any time. Anticipate that. 4. Anticipate leading zeros. 5. Anticipate significant formatting, characters & other tricks. PAGE 10
  • 11. Choosing the Wrong Primary Key PAGE 11
  • 12. 32 + 8 + 32 + 32 + (0 to 22) = A really big key that only a Data Architect could love PAGE 12
  • 13. Avoid it • Establish Primary Key standards and styles • Smaller PKs lead to smaller indexes and smaller databases. Take advantage of that • Choose PKs that do not change when data changes • Understand how Identity columns work PAGE 13
  • 14. Applying Surrogate Keys Incorrectly PAGE 14
  • 15. Applying a Surrogate Key PAGE 15
  • 18. Avoid it • Establish Primary Key standards and styles • Smaller PKs lead to smaller indexes and smaller databases. Take advantage of that. • Don’t forget the Business Key • Choose PKs that do not change when data changes • Understand how Identity columns work PAGE 18
  • 19. Turning off RI for Development PAGE 19
  • 21. RI = FK PAGE 21
  • 22. Avoid it • Educate team on the risks & nearly universal outcome of turning off RI • Generate RI in every script • Develop test data • Develop scripts for loading data • Ensure team has easy access to Data Models PAGE 22
  • 24. Defaults • Defaults <> Best Practices • Product Defaults aren’t YOUR defaults • Defaults might even be…faulty. PAGE 24
  • 26. Avoid it • Create a DA Test database…and use it. • Test Generation Options to Test DB with Data • Generate Test Scripts, Varying Options • Review options with DBAs and Developers. • Choose Defaults, don’t default them • Save your Default Set PAGE 26
  • 27. Other Blunders • Using the wrong tool • Hand generating scripts • Too much Subtyping • Failing to report issues • Logical Flexible hierarchies • Denormalizing too soon • Wrong Datatypes • Failing to use constraints • Dual updates • Making performance more • Forgetting to manage the data important than integrity • Duplicate Indexes • Completely separate logical • Ignoring Clustering model • Using overly generic design • Failing to take into account patterns whole environment. • Column order • Applying surrogate keys where they have no business • Silly Long Names • Not testing your design • Failing to compare • Ignoring reference data design PAGE 27
  • 28. 10 Tips for Avoiding Physical Modeling Blunders 1. Understand cost, benefit and risk 2. Get formal training on target technologies 3. Work near your DBAs and Developers 4. Ask DBAs about trade-offs, not just solutions 5. Build portfolio of performance vs. integrity trade-offs PAGE 28
  • 29. 10 Tips for Avoiding Physical Modeling Blunders 6. Profile Source Data 7. Build Test Databases, with Data 8. Test Scripts 9. Compare, Compare, Compare…then Compare some More. 10.Get More Training. PAGE 29
  • 30. Summary 1.Don’t assume that numbers are….numbers 2.Choose the right primary key 3.Apply surrogate keys correctly 4.Keep RI turned on 5.Architect, don’t default PAGE 30
  • 31. Questions? Feedback, questions, comments are always Karen@InfoAdvisors.com appreciated. http://www.speakerrate.com/karenlopez PAGE 31