Making sense of data visually: A modern look at data visualization
VLADIMIR MILEV 
NEW VENTURE SOFTWARE
Author Bio 
Vladimir Milev 
MCPD Enterprise 
Speaker (Devreach, NTK Slovenia and others) 
DV Evangelist 
Founder at New Venture Software 
@vmilev 
www.linkedin.com/in/vladimirmilev/
http://www.newventuresoftware.com/
Agenda 
1. Big data and information overload 
2. What problems DataViz solves 
3. DataViz fundamental theory 
4. Basic visualizations 
5. Advanced visualizations
Information Overload 
Twitter: 500 million tweets per day 
Facebook: 55 million status updates per day 
Facebook: 900 million interactions per day (comments, likes etc.) 
Reddit:
Proliferation of smart devices
• We are already living in a world dominated by smart devices
• What does this mean?
• More connected, data is more accessible
• Less space for tables and text
• Must use visual communication
Making Sense of Data 
Increasing amount of data available 
Increasing number of data consumer devices 
Obtaining data is no longer a problem
We have an Information Overload issue 
Quick data analysis is the new problem 
But how quick?
A picture is worth 1,000 words
“With about 1,000,000 ganglion cells, the human retina would transmit data at roughly the rate of an Ethernet connection, or 10 million bits per second.”
- Vijay Balasubramanian, PhD, Professor of Physics at U Penn
OK – That’s a lot of bandwidth
BUT ARE WE USING IT EFFICIENTLY?
Efficiency 
Best readers usually read up to about 300 words per minute
Average word length is 5.1 letters
300 * 5.1 = 1530 characters per minute
Or 1530 / 60 = 25.5 characters per second, rounded up to 26
1 character is usually stored as 8 bits
26 * 8 = 208 bits per second
Reading bandwidth is ~0.025 KiB/s
Or 0.00208% efficiency relative to the retina's 10 million bits per second
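The arithmetic on this slide can be replayed as a short Python sketch. The constant names are illustrative only; the 10 million bits per second figure is the retina estimate quoted earlier, and rounding 25.5 up to 26 follows the slide.

```python
import math

WORDS_PER_MINUTE = 300               # fast reader, per the slide
LETTERS_PER_WORD = 5.1               # average word length, per the slide
BITS_PER_CHAR = 8                    # one character stored as one byte
RETINA_BITS_PER_SECOND = 10_000_000  # retina estimate quoted earlier

chars_per_minute = WORDS_PER_MINUTE * LETTERS_PER_WORD
chars_per_second = chars_per_minute / 60

# The slide rounds 25.5 up to 26 before converting to bits.
reading_bits_per_second = math.ceil(chars_per_second) * BITS_PER_CHAR
reading_kib_per_second = reading_bits_per_second / 8 / 1024
efficiency_percent = reading_bits_per_second / RETINA_BITS_PER_SECOND * 100

print(f"{chars_per_minute:.0f} characters per minute")
print(f"{chars_per_second:.1f} characters per second")
print(f"{reading_bits_per_second} bits per second")
print(f"{reading_kib_per_second:.3f} KiB/s")
print(f"{efficiency_percent:.5f}% of the retina's bandwidth")
```

The comparison is deliberately rough: it treats every letter as 8 bits of information and takes the retina figure at face value.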
So reading clearly isn’t the way to go…
BUT WHAT IS THE SOLUTION?
Using statistics 
For most of the 20th century
Using the arithmetic mean (average), standard deviation
Variance, correlations, regressions
Turns out this is not good enough
Anscombe’s Quartet 
    I            II           III          IV
 x      y     x      y     x      y     x      y
10   8.04    10   9.14    10   7.46     8   6.58
 8   6.95     8   8.14     8   6.77     8   5.76
13   7.58    13   8.74    13  12.74     8   7.71
 9   8.81     9   8.77     9   7.11     8   8.84
11   8.33    11   9.26    11   7.81     8   8.47
14   9.96    14   8.10    14   8.84     8   7.04
 6   7.24     6   6.13     6   6.08     8   5.25
 4   4.26     4   3.10     4   5.39    19  12.50
12  10.84    12   9.13    12   8.15     8   5.56
 7   4.82     7   7.26     7   6.42     8   7.91
 5   5.68     5   4.74     5   5.73     8   6.89
• Summary statistics are nearly identical across the four sets:
• Mean of x (9.0) and mean of y (7.5) are the same in every set
• Nearly the same variances, correlations and regression lines
• As far as statistics is concerned these sets are almost the same
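A minimal sketch that recomputes the summary statistics from the table above, using only the Python standard library. The helper name `summarize` and the dictionary layout are illustrative choices, not part of the original talk.

```python
from statistics import mean, variance

X_SHARED = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X_SHARED, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X_SHARED, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X_SHARED, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

def summarize(xs, ys):
    """Return means, sample variances, Pearson r and the least-squares line."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return {
        "mean_x": mx,
        "mean_y": my,
        "var_x": variance(xs),
        "var_y": variance(ys),
        "r": sxy / (sxx * syy) ** 0.5,
        "slope": slope,
        "intercept": my - slope * mx,
    }

SUMMARIES = {name: summarize(xs, ys) for name, (xs, ys) in QUARTET.items()}
for name, stats in SUMMARIES.items():
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

Rounded to two decimals, the four rows print almost the same numbers, which is the point of the quartet: the differences only show up when the points are plotted.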
Anscombe’s Quartet
So DataViz is very powerful 
But why does it work so well?
Gestalt Psychology 
Seeing with the brain 
The mind understands external stimuli as a whole rather than as the sum of their parts
We tend to order our experience in a manner that is regular, orderly, symmetric, and simple
Key principles of gestalt: reification, multistability, invariance 
Gestalt laws of grouping: proximity, similarity, closure, symmetry
Gestalt Principles - Reification 
Our minds tend to construct/generate information
Gestalt Principles - Multistability
The tendency of our mind to jump back and forth between ambiguous alternative interpretations
Examples: Spinning Girl, Rubin Vase
Gestalt Principles - Invariance 
The tendency to perceive simple geometric objects independent of rotation, translation, and scale
Also elastic deformations, different lighting, and different component features
Gestalt Laws of Grouping - Similarity 
We group objects based on visual similarity
Gestalt Laws of Grouping - Proximity 
We group items based on spatial proximity
Gestalt Laws of Grouping - Closure 
We perceive objects such as shapes, letters, pictures, etc., as being whole even when they are not complete
Application in Data Visualization
• Introducing the visual variables
• Fundamental properties of objects which can encode information into a picture
• Fundamental visual variables:
◦ Position
◦ Size
◦ Color
◦ Shape
◦ Orientation
Basis for all Data Visualization!
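One way to picture the visual variables is as fields on a drawn mark, each filled in from one field of a data record. The sketch below is purely illustrative: the `Mark` class, the record fields and the encoding rules are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Mark:
    x: float      # position
    y: float      # position
    size: float   # size
    color: str    # color
    shape: str    # shape
    angle: float  # orientation, in degrees

def encode(record):
    """Map one hypothetical company record onto the five visual variables."""
    return Mark(
        x=record["year"] - 2000,
        y=record["revenue"],
        size=record["employees"] / 10,
        color="green" if record["profitable"] else "red",
        shape={"retail": "circle", "tech": "square"}[record["sector"]],
        angle=45.0 if record["growing"] else -45.0,
    )

mark = encode({
    "year": 2012, "revenue": 3.5, "employees": 120,
    "profitable": True, "sector": "tech", "growing": True,
})
print(mark)
```

Every chart type discussed later can be read this way: which data field drives position, which drives size, and so on.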
Basic/Common Visualizations 
Bar graphs 
Line graphs 
Area charts 
Pie charts
Bar Graphs 
• Using color correctly to encode gender
• Using position (ordering) to create an orderly scale
• Using size to encode the values
• Using orientation to differentiate gender again
Bar Graphs continued 
• Labels are used
• Color is neutral and does not encode information
• Again, we have top-down ordering (position)
• And again size encodes the relative numeric value
Bars and Normal Distribution 
• Distribution of test scores for the Polish “Matura” exam
• A normal distribution is expected
• Red line shows the normal distribution
• 30 is the minimum passing grade (marked on the chart)
• Detecting behavioral changes
• What happened?
Line Graphs 
Confirming what we already know – paper media is declining rapidly.
• Shape encodes the value
• Color is not significant
• Design goal is to show a trend/change
Area Graphs 
Effect of school year on Team Fortress 2 players (chart annotation: “School starts”)
• Similar to line graph
• Design goal for area charts is to emphasize the value/quantity rather than the trend
• You can see both
• Color has no meaning
Area Graphs continued 
• This time color carries a meaning (legend)
• The graph is also good for displaying the ratio between data series over time
Pie Charts
Pie Charts 
Golden Rules for Pie Charts 
• Show the ratio of one piece to the whole
• Order the values
• Fewer than 6 pieces
• Avoid legends
• Pieces must sum to 100%
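The three rules that can be checked mechanically (ordering, piece count, summing to 100%) could be sketched as a small validator. The function name `check_pie`, the rounding tolerance and the sample data are assumptions made for illustration.

```python
def check_pie(slices, max_pieces=5, tolerance=0.5):
    """Return a list of rule violations for (label, percent) slices."""
    problems = []
    percents = [percent for _, percent in slices]
    if len(slices) > max_pieces:  # "fewer than 6 pieces"
        problems.append("too many pieces")
    if percents != sorted(percents, reverse=True):
        problems.append("values are not ordered")
    if abs(sum(percents) - 100) > tolerance:
        problems.append("pieces do not sum to 100%")
    return problems

# Hypothetical data, for illustration only.
good = [("Desktop", 50), ("Mobile", 30), ("Tablet", 20)]
bad = [("Yes", 30), ("No", 50), ("Maybe", 30)]
print(check_pie(good))
print(check_pie(bad))
```

The first call returns an empty list; the second reports both the ordering and the sum problem. The tolerance is there only to absorb rounding in published percentages.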
Abusing Pie Charts 
Don’t break the rules!
Maps 
Plot millions of journal entries from 18th and 19th century ship logs, and 
you reveal a picture of ocean trade you've never seen before 
• Visualization of routes
• Color saturation indicates heavily used routes
Maps are good with animations too 
• Concentration of NO2 from 2005 to 2011
• Using both color and position to encode concentration
• Using continuous color scale
• Adding another dimension: time
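A continuous color scale is commonly built by interpolating between two end colors. The sketch below assumes a simple linear blend in RGB from white to dark red; the function name, the colors and the value range are illustrative and are not taken from the NO2 map itself.

```python
def color_scale(value, low, high, start=(255, 255, 255), end=(128, 0, 0)):
    """Blend linearly from `start` to `end` as value goes from low to high."""
    t = (value - low) / (high - low)
    t = max(0.0, min(1.0, t))  # clamp values outside the range
    r, g, b = (round(s + (e - s) * t) for s, e in zip(start, end))
    return f"#{r:02x}{g:02x}{b:02x}"

# Hypothetical concentration readings on a 0 to 10 scale.
for reading in (0, 2.5, 5, 10, 12):
    print(reading, color_scale(reading, 0, 10))
```

Plain RGB interpolation is the simplest choice, not necessarily the best one for perception; it is used here only to show the idea of mapping a number to a color.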
Choropleth Maps 
Displaying the most popular name for a newborn in each state
• Using discrete palette to encode information
Heat Maps 
• Excellent for plotting recurring values
• Color saturation/brightness encodes the values
• Position also encodes information
• Easy to spot concentrations and find patterns
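As a rough sketch of how a heat map is prepared, the code below counts events per (weekday, hour) cell and scales each count to an intensity between 0 and 1, which a renderer could then map to saturation or brightness. The timestamps and the function name are hypothetical.

```python
from collections import Counter
from datetime import datetime

# Hypothetical event timestamps, for illustration only.
EVENTS = [
    datetime(2024, 1, 1, 9), datetime(2024, 1, 1, 9), datetime(2024, 1, 8, 9),
    datetime(2024, 1, 2, 14), datetime(2024, 1, 3, 9),
]

def heat_map_cells(events):
    """Map each (weekday, hour) cell to an intensity between 0 and 1."""
    counts = Counter((event.weekday(), event.hour) for event in events)
    peak = max(counts.values())
    return {cell: count / peak for cell, count in counts.items()}

cells = heat_map_cells(EVENTS)
for (weekday, hour), intensity in sorted(cells.items()):
    print(f"weekday={weekday} hour={hour} intensity={intensity:.2f}")
```

Here the cell position carries the weekday and hour, and the intensity carries how often events recur in that slot.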
Heat Maps in medicine/genetics
Tree Maps 
• Excellent for representing hierarchical data
• Color carries a meaning 
• Size carries a meaning as well 
• Position is irrelevant 
• Suitable for annotations
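To show how size can carry meaning in a tree map, here is a minimal sketch of a slice-and-dice layout, one of several possible tree map layouts and not necessarily the one used for the slide. Each rectangle gets an area proportional to its value, and the split direction alternates at each level. The hierarchy and names are hypothetical, and color is left out.

```python
def weight(node):
    """Total size of a leaf (a number) or of a subtree (a dict)."""
    if isinstance(node, dict):
        return sum(weight(child) for child in node.values())
    return node

def treemap(node, x, y, w, h, vertical=True, path=()):
    """Yield (path, x, y, w, h) for every leaf, alternating split direction."""
    if not isinstance(node, dict):
        yield path, x, y, w, h
        return
    total = weight(node)
    for name, child in node.items():
        share = weight(child) / total
        # vertical=True places children side by side; otherwise they stack.
        cw, ch = (w * share, h) if vertical else (w, h * share)
        yield from treemap(child, x, y, cw, ch, not vertical, path + (name,))
        if vertical:
            x += cw
        else:
            y += ch

# Hypothetical hierarchy: folder sizes in megabytes.
TREE = {"docs": 20, "src": {"app": 50, "lib": 30}}
rects = {path: (x, y, w, h) for path, x, y, w, h in treemap(TREE, 0, 0, 100, 100)}
for path, rect in rects.items():
    print("/".join(path), rect)
```

The exact position of each rectangle is a by-product of the layout order, which matches the slide's point that position itself carries no meaning.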
Parallel Coordinates Plot 
• Interactive visualization
• Good at displaying relationships between different dimensions of data
• Position encodes dimension
• Color encodes scale
Parallel Coordinates Plot – in action 
Selecting a subset of a dimension to display the relationships with the other dimensions
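The selection step can be sketched as a filter on one axis followed by rescaling the remaining records on another axis. The records, dimension names and helper functions below are hypothetical and stand in for whatever the interactive plot does internally.

```python
# Hypothetical records; each key is one dimension (one vertical axis).
CARS = [
    {"mpg": 18, "cylinders": 8, "horsepower": 150},
    {"mpg": 31, "cylinders": 4, "horsepower": 75},
    {"mpg": 24, "cylinders": 6, "horsepower": 110},
    {"mpg": 36, "cylinders": 4, "horsepower": 60},
]

def brush(records, dimension, low, high):
    """Keep only records whose value on one axis falls inside the selection."""
    return [r for r in records if low <= r[dimension] <= high]

def axis_positions(records, dimension):
    """Scale one dimension to 0..1, the height of each line on that axis."""
    values = [r[dimension] for r in records]
    low, high = min(values), max(values)
    span = (high - low) or 1  # avoid dividing by zero when all values match
    return [(v - low) / span for v in values]

selected = brush(CARS, "mpg", 30, 40)
print(selected)
print(axis_positions(selected, "horsepower"))
```

Selecting the high-mpg range leaves only the four-cylinder, low-horsepower records, which is the kind of relationship the brushing interaction is meant to surface.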
Chord Diagram 
• Similar to Parallel Coordinates plot
• Color and Position used to encode data
• Design is different
• Filtering of dimensions is not a design goal
• Focuses on selecting a whole dimension
Some resources 
http://www.reddit.com/r/dataisbeautiful/ 
http://blog.visual.ly/ 
http://flowingdata.com/ 
http://eagereyes.org/ 
http://www.perceptualedge.com/blog/
Thank You!
