SlideShare une entreprise Scribd logo
1  sur  21
Data Modelling
It’s a lot more than drawing diagrams
George McGeachie
Metadata Matters Limited
My lightning talk at PG day in 2014
(I liked that venue – Horwood House)
My entry in the 2nd Quadrant
blog last week – used example
of automating Data Vault
design and creation
https://blog.2ndquadrant.com/data-modelling-lot-just-diagram/
2
This is my favourite theme
The right tool can
give you a lot more
than just this messy
Diagram
– would you want to
work with this
diagram?
3
This is my favourite theme
A data model is a lot more than just a Diagram
4
This is my favourite theme
 Check against your design standards
 The tedious stuff, like making sure all your
tables have the standard audit columns
 Do you need JSON?
 How much will this DB grow?
 Managing (and comparing) schema, table &
column versions
 Building Data Vaults – see 2nd Quadrant blog
5
Automate tasks – before you
build the database
Available automation
A Contextual menu is one way of
accessing automation – check the
model, export JSON to a file, apply
Naming Standards, adding audit
columns
Add your own model
checks, along with
automatic fixing for
those problems if
possible
(e.g. adding
surrogate key)
7
Check your design meets your
design standards
8
Make sure all your tables have
the standard audit columns
Don’t blink or you’ll miss it
9
The PDM – without audit columns
10
Less than 2 seconds later …
11
Do you need JSON?
{
"Name" : "Departments",
"Code" : "Departments",
"Fully Qualified Name" : "Group0.Departments",
"Fully Qualified Code" : "Group0.Departments",
"Owner" : "Group0",
"Object Type" : "Table",
"id" : "8731F3EE-8E53-46C6-A873-81C522F51717",
"description" : "contains the names and heads of th
"note" : "<NONE>",
"Columns" :
[
{
"Name" : "DepartmentID",
"Code" : "DepartmentID",
"Fully Qualified Name" : "Departments
"Fully Qualified Code" : "Group0.Depa
"Object Type" : "Column",
"id" : "D23F6064-87A8-4D1D-92D0-70F35
"description" : "short one",
"note" : "<NONE>",
"Data Type" : "INT4",
"Length" : "4",
"Precision" : "0",
"Primary?" : "TRUE",
"FK?" : "FALSE",
"Mandatory?" : "TRUE",
 Extract current statistics, define growth rates
12
How much will this database
grow?
Estimate of the size of the Database "PhysicalDataModel_1"...
Number Estimated size Object
------------------------- ----------------------- ----------------------------------------
------------
1,556 312 KB Table "Contacts"
17,370 3,475 KB Table "Customers"
917 KB Index "IX_customer_name"
130 7 KB Table "Departments"
1,945 390 KB Table "Employees"
182 13 KB Table "FinancialCodes"
2,179 39 KB Table "FinancialData"
259 130 KB Table "MarketingInformation"
259 KB Long data types
274 KB Index "MarketingTextIndex"
1,379 9,651 KB Table "Products"
9,650 KB Long data types
34 KB Index "IX_product_name"
60 KB Index "IX_product_description"
41 KB Index "IX_product_size"
41 KB Index "IX_product_color"
28,453 619 KB Table "SalesOrderItems"
382,637,520 11,595,077 KB Table "SalesOrders"
3,268 467 KB Table "SpatialContacts"
467 25 KB Table "SpatialShapes"
------------------------- ----------------------- ----------------------------------------
------------
11,621,481 KB Total estimated space
The data will be distributed on the following tablespaces:
Estimated size Tablespace
----------------------- ----------------------------------------------
1,367 KB system
13
Estimate Database size
 Write your own estimation script
14
If you don’t like the way it’s
done
 Branching
 Comparing
versions
15
Versioning
 Check models into the
repository, but don’t
update the mainline until
they’ve been approved
16
Check in model for peer review
17
Integrate the 2nd Branch back
into the 1st Branch
Models updated
with selected
changes
Still able to access version 1
18
Simon and Hannu
say …
Page 53
• Understand Database
Dependencies
◦ e.g. Table  View  Procedure
I only have the first edition of this
excellent book
 ETL Jobs
 Forms and Reports
 Applications
 XML Message Schemas
 Regulatory Requirements
 Business Processes
 Use Cases
 JIRA tickets
etc.
19
Databases have connections
20
Choose your tools carefully
What Tools are there?
The big 3
ERwin, ER/Studio, PowerDesigner
Others
Dezign
Sparx EA
ModelRight
Silverrun
IBM Infosphere Data Architect
Toad Data Modeller
might not all support PG
George McGeachie
Co-author of “Data Modeling Made Simple with
PowerDesigner”, data modeller and strategist,
SAP PowerDesigner trainer, and data modelling
tool junkie.
@metadatajunkie
Blog – metadatajunkie.wordpress.com
https://www.linkedin.com/in/georgemcgeachie/
George.McGeachie@MetadataMatters.com
Mobile: +44 (0) 794 293 0648

Contenu connexe

Tendances

SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsSlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsJohn De Goes
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010ERwin Modeling
 
MS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxMS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxDataminingTools Inc
 
01 Persistence And Orm
01 Persistence And Orm01 Persistence And Orm
01 Persistence And OrmRanjan Kumar
 
Datalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointDatalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointMike Plant
 
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessGive Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessFormant
 
Making Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIMaking Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIAvtex
 
Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Martin Bém
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010ERwin Modeling
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesBurn & Born
 
Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3DTU Library
 

Tendances (16)

Star schema PPT
Star schema PPTStar schema PPT
Star schema PPT
 
SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual AnalyticsSlamData - How MongoDB Is Powering a Revolution in Visual Analytics
SlamData - How MongoDB Is Powering a Revolution in Visual Analytics
 
Xml
XmlXml
Xml
 
Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010Effective capture of metadata using ca e rwin data modeler 09232010
Effective capture of metadata using ca e rwin data modeler 09232010
 
MS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmxMS SQL SERVER: Data mining concepts and dmx
MS SQL SERVER: Data mining concepts and dmx
 
01 Persistence And Orm
01 Persistence And Orm01 Persistence And Orm
01 Persistence And Orm
 
Datalayer Best Practices with Observepoint
Datalayer Best Practices with ObservepointDatalayer Best Practices with Observepoint
Datalayer Best Practices with Observepoint
 
Lançamento ERwin 08/02
Lançamento ERwin 08/02Lançamento ERwin 08/02
Lançamento ERwin 08/02
 
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the BusinessGive Me My Damn Report: Making NoSQL Data Accessible to the Business
Give Me My Damn Report: Making NoSQL Data Accessible to the Business
 
Making Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BIMaking Smarter Business Decisions with Power BI
Making Smarter Business Decisions with Power BI
 
Rendering The Fat
Rendering The FatRendering The Fat
Rendering The Fat
 
Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28Prague data management meetup 2017-03-28
Prague data management meetup 2017-03-28
 
Hibernate II
Hibernate IIHibernate II
Hibernate II
 
Cust experience a practical guide 09152010
Cust experience a practical guide 09152010Cust experience a practical guide 09152010
Cust experience a practical guide 09152010
 
Tableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, DisadvantagesTableau PPT Intro, Features, Advantages, Disadvantages
Tableau PPT Intro, Features, Advantages, Disadvantages
 
Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3Guidelines data cite_denmark_ver3
Guidelines data cite_denmark_ver3
 

Similaire à Data Modelling is More Than Just Diagrams

Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big DataJames Serra
 
MongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxMongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxKalpitPandit1
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011navaidkhan
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 
Best Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkBest Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkDatabricks
 
Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessyalisassoon
 
A guide to preparing your data for tableau
A guide to preparing your data for tableauA guide to preparing your data for tableau
A guide to preparing your data for tableauPhillip Reinhart
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNabclearnn
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneMongoDB
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science DemystifiedEmily Robinson
 
L’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneL’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneMongoDB
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data PlatformAndrei Savu
 
Snowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessSnowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessGiuseppe Gaviani
 

Similaire à Data Modelling is More Than Just Diagrams (20)

Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 
MongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptxMongoDb Schema Pattern - Kalpit Pandit.pptx
MongoDb Schema Pattern - Kalpit Pandit.pptx
 
Date Analysis .pdf
Date Analysis .pdfDate Analysis .pdf
Date Analysis .pdf
 
Emerging database landscape july 2011
Emerging database landscape july 2011Emerging database landscape july 2011
Emerging database landscape july 2011
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Best Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache SparkBest Practices for Building and Deploying Data Pipelines in Apache Spark
Best Practices for Building and Deploying Data Pipelines in Apache Spark
 
Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your business
 
A guide to preparing your data for tableau
A guide to preparing your data for tableauA guide to preparing your data for tableau
A guide to preparing your data for tableau
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
 
L’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazioneL’architettura di classe enterprise di nuova generazione
L’architettura di classe enterprise di nuova generazione
 
Data Science Demystified
Data Science DemystifiedData Science Demystified
Data Science Demystified
 
No SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability MeetupNo SQL and MongoDB - Hyderabad Scalability Meetup
No SQL and MongoDB - Hyderabad Scalability Meetup
 
Data engineering design patterns
Data engineering design patternsData engineering design patterns
Data engineering design patterns
 
PowerBI Training
PowerBI Training PowerBI Training
PowerBI Training
 
L’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova GenerazioneL’architettura di Classe Enterprise di Nuova Generazione
L’architettura di Classe Enterprise di Nuova Generazione
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauWebinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
 
Snowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your businessSnowplow - Evolve your analytics stack with your business
Snowplow - Evolve your analytics stack with your business
 

Dernier

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Dernier (20)

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Data Modelling is More Than Just Diagrams

  • 1. Data Modelling It’s a lot more than drawing diagrams George McGeachie Metadata Matters Limited
  • 2. My lightning talk at PG day in 2014 (I liked that venue – Horwood House) My entry in the 2nd Quadrant blog last week – used example of automating Data Vault design and creation https://blog.2ndquadrant.com/data-modelling-lot-just-diagram/ 2 This is my favourite theme
  • 3. The right tool can give you a lot more than just this messy Diagram – would you want to work with this diagram? 3 This is my favourite theme
  • 4. A data model is a lot more than just a Diagram 4 This is my favourite theme
  • 5.  Check against your design standards  The tedious stuff, like making sure all your tables have the standard audit columns  Do you need JSON?  How much will this DB grow?  Managing (and comparing) schema, table & column versions  Building Data Vaults – see 2nd Quadrant blog 5 Automate tasks – before you build the database
  • 6. Available automation A Contextual menu is one way of accessing automation – check the model, export JSON to a file, apply Naming Standards, adding audit columns
  • 7. Add your own model checks, along with automatic fixing for those problems if possible (e.g. adding surrogate key) 7 Check your design meets your design standards
  • 8. 8 Make sure all your tables have the standard audit columns Don’t blink or you’ll miss it
  • 9. 9 The PDM – without audit columns
  • 10. 10 Less than 2 seconds later …
  • 11. 11 Do you need JSON? { "Name" : "Departments", "Code" : "Departments", "Fully Qualified Name" : "Group0.Departments", "Fully Qualified Code" : "Group0.Departments", "Owner" : "Group0", "Object Type" : "Table", "id" : "8731F3EE-8E53-46C6-A873-81C522F51717", "description" : "contains the names and heads of th "note" : "<NONE>", "Columns" : [ { "Name" : "DepartmentID", "Code" : "DepartmentID", "Fully Qualified Name" : "Departments "Fully Qualified Code" : "Group0.Depa "Object Type" : "Column", "id" : "D23F6064-87A8-4D1D-92D0-70F35 "description" : "short one", "note" : "<NONE>", "Data Type" : "INT4", "Length" : "4", "Precision" : "0", "Primary?" : "TRUE", "FK?" : "FALSE", "Mandatory?" : "TRUE",
  • 12.  Extract current statistics, define growth rates 12 How much will this database grow?
  • 13. Estimate of the size of the Database "PhysicalDataModel_1"... Number Estimated size Object ------------------------- ----------------------- ---------------------------------------- ------------ 1,556 312 KB Table "Contacts" 17,370 3,475 KB Table "Customers" 917 KB Index "IX_customer_name" 130 7 KB Table "Departments" 1,945 390 KB Table "Employees" 182 13 KB Table "FinancialCodes" 2,179 39 KB Table "FinancialData" 259 130 KB Table "MarketingInformation" 259 KB Long data types 274 KB Index "MarketingTextIndex" 1,379 9,651 KB Table "Products" 9,650 KB Long data types 34 KB Index "IX_product_name" 60 KB Index "IX_product_description" 41 KB Index "IX_product_size" 41 KB Index "IX_product_color" 28,453 619 KB Table "SalesOrderItems" 382,637,520 11,595,077 KB Table "SalesOrders" 3,268 467 KB Table "SpatialContacts" 467 25 KB Table "SpatialShapes" ------------------------- ----------------------- ---------------------------------------- ------------ 11,621,481 KB Total estimated space The data will be distributed on the following tablespaces: Estimated size Tablespace ----------------------- ---------------------------------------------- 1,367 KB system 13 Estimate Database size
  • 14.  Write your own estimation script 14 If you don’t like the way it’s done
  • 16.  Check models into the repository, but don’t update the mainline until they’ve been approved 16 Check in model for peer review
  • 17. 17 Integrate the 2nd Branch back into the 1st Branch Models updated with selected changes Still able to access version 1
  • 18. 18 Simon and Hannu say … Page 53 • Understand Database Dependencies ◦ e.g. Table  View  Procedure I only have the first edition of this excellent book
  • 19.  ETL Jobs  Forms and Reports  Applications  XML Message Schemas  Regulatory Requirements  Business Processes  Use Cases  JIRA tickets etc. 19 Databases have connections
  • 20. 20 Choose your tools carefully What Tools are there? The big 3 ERwin, ER/Studio, PowerDesigner Others Dezign Sparx EA ModelRight Silverrun IBM Infosphere Data Architect Toad Data Modeller might not all support PG
  • 21. George McGeachie Co-author of “Data Modeling Made Simple with PowerDesigner”, data modeller and strategist, SAP PowerDesigner trainer, and data modelling tool junkie. @metadatajunkie Blog – metadatajunkie.wordpress.com https://www.linkedin.com/in/georgemcgeachie/ George.McGeachie@MetadataMatters.com Mobile: +44 (0) 794 293 0648