Visual Resources Association Annual Conference
March 27-30, 2018, Philadelphia
Session: DIY: Crafting Your Space, Mission, and Situation
Presenter: Alison Blaine, Digital Technologies Development Librarian, North Carolina State University
4. high school humanities instructor
languages & global area studies
|
[Love of teaching & learning new things]
|
information science master’s degree
academic librarian/technologist/statistical
computing/coding
21. graphical symbols
points, lines, bars, bubbles, boxes, circles, etc
coordinate systems
cartesian, polar, etc
aesthetic mappings
color
fill
texture
size
length
shape
22. goal setting
Do you want to…
make graphs for a presentation or publication?
just explore your own data?
make an interactive web visualization?
make data art?
teach a workshop?
28. Helpful metaphors?
A dataset is a mysterious forest.
A dataset is a room with many doors.
A dataset is a mountain with many faces.
29. Walk into the forest
without set expectations.
But have a plan.
30. Plan to spend a lot of time getting your
data clean and tidy for visualization.
31. get your data clean
no errors, misspellings or inconsistencies
no non-data rows or text besides single column headers
Raleigh
Raleigh, NC
Raleigh N.C
36. “There are three interrelated rules which make a
dataset tidy:
1. Each variable must have its own column.
2. Each observation must have its own row.
3. Each value must have its own cell.”
-Hadley Wickham, R for Data Science
37. This is NOT tidy data.
Source: Hadley Wickham (2014) “Tidy Data.” Journal of Statistical Software. Vol. 59, Issue 10.
39. “The first step is always to figure out what the variables and
observations are. Sometimes this is easy; other times you’ll
need to consult with the people who originally generated the
data. The second step is to resolve one of two common
problems:
1. One variable might be spread across multiple columns.
2. One observation might be scattered across multiple rows.”
-Hadley Wickham, R for Data Science
40. What are the variables in this data?
Variables are the things with values that change.
Source: Hadley Wickham (2014) “Tidy Data.” Journal of Statistical Software. Vol. 59, Issue 10.
44. Does each variable have its own column?
Does each observation have its own row?
Does each value have its own cell?
45. Does each variable have its own column? NO
Does each observation have its own row? YES
Does each value have its own cell? YES
46. Tidy data version
Year is a variable
Individual years are values of that variable
Same with Land Percentage
I used Tableau
Public’s pivot
feature to
reshape the
data.
53. find relationships between variables, patterns,
outliers, correlations, interesting things, errors,
etc.
find the story you ultimately want to tell.
54. For this stage, use whatever tool you
want to use for rapid prototyping
RawGraphs - rawgraphs.io
Tableau - public.tableau.com
Google Sheets charts?
55. when it comes to presenting your
data, regardless of subject
77. Resources
flowingdata.com – info on every aspect of the data visualization process
Tidy data, by Hadley Wickham - https://vita.had.co.nz/papers/tidy-data.pdf
R for Data Science, by Hadley Wickham - http://r4ds.had.co.nz/tidy-data.html
Tableau How-to Videos - https://public.tableau.com/en-us/s/resources
“Which Chart or Graph is Right for You?” – Tableau whitepaper
https://www.tableau.com/learn/whitepapers/which-chart-or-graph-is-right-for-you
Lots of ways data can be represented graphically, and this is by no means an exhaustive display.
One thing people are faced with when they try to do data viz is -- what type of visual symbols do I use/ what type of chart will best convey my data?
Source: d3 gallery / mike bostock
Graphs are not entities unto themselves… there is a grammar to graphics. And a famous paper called, “The Grammar of Graphics” by Leland Wilkinson. These are just a few elements of that grammar.
Jacques Bertin’s (French cartographer who wrote a book on data visualization, Semologie Graphique) reorderable matrix, and example created for IEEE Vis Conference in 2014
Source: http://www.aviz.fr/diyMatrix/
Table with cell values encoded with visual symbols that represent data values. Size relates to quantity
NYTimes Interactive graphic: March 2013, http://www.nytimes.com/interactive/2013/03/29/sports/baseball/Strikeouts-Are-Still-Soaring.html
Source: March 2018, https://www.nytimes.com/interactive/2018/03/19/upshot/race-class-white-and-black-men.html