SlideShare une entreprise Scribd logo
1  sur  44
Visualising Activity Data Tony Hirst Dept of Communication and Systems, The Open University Scattered puzzle pieces next to solved fragment by HoriaVarlan
Today’s link shortener is bit.ly Read:		 [ jlKwGq ] as:		 http://bit.ly/jlKwGq
Visual Analysisvs.Presentation Graphics
This is NOT a presentation about: ,[object Object]
 data preparation
 data cleansing,[object Object]
ScraperWiki [ aGhJtK ]
Search and replace… …add regular expressions and you have search and replace “on steroids”
Google Refine [ aq1jUE ] Example: walkthrough (@jenit) [ awGQPT ] Example: merging two tables by column [ pWK3C0 ]
DataWrangler [ gmE3yz ]
Data has shape and structure
Hierarchical Data
Many Eyes [ qY5786 ] Treemaps
plot srcfile using ($1):(column(focusCar) -$2) with lines title "VET", srcfileusing ($1):(column(focusCar) -$3) with lines title "WEB", srcfileusing ($1):(column(focusCar) -$4) with lines title "HAM", srcfileusing ($1):(column(focusCar) -$5) with lines title "BUT", srcfileusing ($1):(column(focusCar) -$6) with lines title "ALO", srcfileusing ($1):(column(focusCar) -$7) with lines title "MAS", srcfileusing ($1):(column(focusCar) -$8) with lines title "SCH", srcfileusing ($1):(column(focusCar) -$9) with lines title "ROS", …
Or heatmaps in R: [ qXmPgs ]
Text processing with Unix tools[ m5tz63 ] [ lOVySX ] Count number of lines in a file: wc-l L2sample.csv View first few lines in a file: head L2sample.csv or head -n 4 L2sample.csv  View last few lines in a file: tail L2sample.csv or tail -n 15 L2sample.csv Sample contiguous rows from start or end of file: head -n 1 L2sample.csv > headers.csv 	tail -n 20 L2sample.csv > subSample.csv 	cat headers.csvsubSample.csv > subSampleWithHeaders.csv Sample contiguous rows from middle of file: head -n 15 L2sample.csv | tail -n 6 > middleSample.csv Split large file into smaller files: split -l 15 L2sample.csv subSamples Search for lines containing a term: grepmendeley L2sample.csv grepEBSCO L2sample.csv > rowsContainingEBSCO.csv
More text processing tricks Extract columns: cut -f 3 L2sample.csv 	cut -f 1,2,14,17 L2sample.csv > columnSample.csv Sort data in a column: 	cut -f 40 L2sample.csv | sort Identify distinct entries in a column: 	cut -f 40 L2sample.csv | sort | uniq Count how many times each distinct term appears in a column: 	cut -f 40 L2sample.csv | sort | uniq –c Sort can also sort by column (-k), reverse order (-r): cut -f 40 L2_2011-04.csv | sort | uniq -c | sort -k 1 -r > uniqueSID.csv
[ dAdIo3 ]
Time series data
aka “seasonal subseries” [ j3HODr ]
matplotlib Trends [ qSIcrV ] #time series data in d #first difference fd=np.diff(d) Autocorrelation
Graphs and Networks
Graphviz digraph test { CSV [shape=box] KML [shape=box] JSON [shape=box] XML [shape=box] RDF [shape=box] HTML [shape=box] GoogleSpreadsheet[shape=Msquare] RDFTripleStore [shape=Msquare] "[SPARQL]" [shape=diamond] "[YQL]" [shape=diamond] "[GoogleVizDataAPI]" [shape=diamond] "<GoogleGadgets>" [shape=doubleoctagon] "<GoogleVizDataCharts>" [shape=doubleoctagon] "<GoogleMaps>" [shape=doubleoctagon] "<GoogleEarth>" [shape=doubleoctagon] "<JQueryCharts_etc>" [shape=doubleoctagon] "[SPARQL]"->RDF; "[SPARQL]"->XML; "[SPARQL]"->CSV; "[SPARQL]"->JSON; JSON-> "<JQueryCharts_etc>"; CSV->"{GoogleRefine}" CSV->ScraperWiki JSON->ScraperWiki "[YQL]"->ScraperWiki ScraperWiki->CSV HTML->ScraperWiki HTML->"[YQL]" "[SPARQL]"->"[YQL]" "{GoogleRefine}"->CSV [style=dashed] CSV->"<Gephi>" [style=dashed] "<Gephi>"->CSV [style=dashed] RDF->"[YQL]” }
Gephi
[ nKoB4b]
[ nKoB4b]
Statistical Graphs
R
Graphics Libraries
Protovis
Processing

Contenu connexe

Tendances

Tendances (11)

The Very ^ 2 Basics of R
The Very ^ 2 Basics of RThe Very ^ 2 Basics of R
The Very ^ 2 Basics of R
 
Inside database
Inside databaseInside database
Inside database
 
Do something in 5 with gas 3-simple invoicing app
Do something in 5 with gas 3-simple invoicing appDo something in 5 with gas 3-simple invoicing app
Do something in 5 with gas 3-simple invoicing app
 
Gratest
GratestGratest
Gratest
 
Introduction to data.table in R
Introduction to data.table in RIntroduction to data.table in R
Introduction to data.table in R
 
Data visualization in python/Django
Data visualization in python/DjangoData visualization in python/Django
Data visualization in python/Django
 
Sql can be cool again
Sql can be cool againSql can be cool again
Sql can be cool again
 
Visdjango presentation django_boston_oct_2014
Visdjango presentation django_boston_oct_2014Visdjango presentation django_boston_oct_2014
Visdjango presentation django_boston_oct_2014
 
Hacking the Internet of Things for Fun & Profit
Hacking the Internet of Things for Fun & ProfitHacking the Internet of Things for Fun & Profit
Hacking the Internet of Things for Fun & Profit
 
Data warehouse or conventional database: Which is right for you?
Data warehouse or conventional database: Which is right for you?Data warehouse or conventional database: Which is right for you?
Data warehouse or conventional database: Which is right for you?
 
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
Do something useful in Apps Script 5. Get your analytics pageviews to a sprea...
 

Similaire à Jiscad viz

PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12
Andrew Dunstan
 
SQL Server - Introduction to TSQL
SQL Server - Introduction to TSQLSQL Server - Introduction to TSQL
SQL Server - Introduction to TSQL
Peter Gfader
 

Similaire à Jiscad viz (20)

Hands on Mahout!
Hands on Mahout!Hands on Mahout!
Hands on Mahout!
 
Meetup cassandra for_java_cql
Meetup cassandra for_java_cqlMeetup cassandra for_java_cql
Meetup cassandra for_java_cql
 
Skills Portfolio
Skills PortfolioSkills Portfolio
Skills Portfolio
 
The best ETL questions in a nut shell
The best ETL questions in a nut shellThe best ETL questions in a nut shell
The best ETL questions in a nut shell
 
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
 
PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12PostgreSQL 8.4 TriLUG 2009-11-12
PostgreSQL 8.4 TriLUG 2009-11-12
 
ICDM2019 table tutorial
ICDM2019 table tutorialICDM2019 table tutorial
ICDM2019 table tutorial
 
Chapter15
Chapter15Chapter15
Chapter15
 
ElasticSearch.pptx
ElasticSearch.pptxElasticSearch.pptx
ElasticSearch.pptx
 
MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011
 
Practical catalyst
Practical catalystPractical catalyst
Practical catalyst
 
SQL Server - Introduction to TSQL
SQL Server - Introduction to TSQLSQL Server - Introduction to TSQL
SQL Server - Introduction to TSQL
 
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table packageJanuary 2016 Meetup: Speeding up (big) data manipulation with data.table package
January 2016 Meetup: Speeding up (big) data manipulation with data.table package
 
MIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome MeasuresMIS5101 WK10 Outcome Measures
MIS5101 WK10 Outcome Measures
 
Lobos Introduction
Lobos IntroductionLobos Introduction
Lobos Introduction
 
Simplifying SQL with CTE's and windowing functions
Simplifying SQL with CTE's and windowing functionsSimplifying SQL with CTE's and windowing functions
Simplifying SQL with CTE's and windowing functions
 
Google cloud Dataflow & Apache Flink
Google cloud Dataflow & Apache FlinkGoogle cloud Dataflow & Apache Flink
Google cloud Dataflow & Apache Flink
 
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptxfINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
 
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website DevelopmentHow We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
How We Use the Abivia Content Pattern (ACP) Plugin in Joomla Website Development
 
Bo4301369372
Bo4301369372Bo4301369372
Bo4301369372
 

Plus de Tony Hirst

Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Tony Hirst
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
Tony Hirst
 

Plus de Tony Hirst (20)

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiesta
 
Dev8d jupyter
Dev8d jupyterDev8d jupyter
Dev8d jupyter
 
Ili 16 robot
Ili 16 robotIli 16 robot
Ili 16 robot
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptx
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptx
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacks
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyter
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 Workshop
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wire
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interest
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKX
 
Week4
Week4Week4
Week4
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefine
 
Conversations with data
Conversations with dataConversations with data
Conversations with data
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingo
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
 

Dernier

Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Dernier (20)

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

Jiscad viz

  • 1. Visualising Activity Data Tony Hirst Dept of Communication and Systems, The Open University Scattered puzzle pieces next to solved fragment by HoriaVarlan
  • 2. Today’s link shortener is bit.ly Read: [ jlKwGq ] as: http://bit.ly/jlKwGq
  • 4.
  • 5.
  • 7.
  • 9. Search and replace… …add regular expressions and you have search and replace “on steroids”
  • 10. Google Refine [ aq1jUE ] Example: walkthrough (@jenit) [ awGQPT ] Example: merging two tables by column [ pWK3C0 ]
  • 12. Data has shape and structure
  • 14. Many Eyes [ qY5786 ] Treemaps
  • 15.
  • 16.
  • 17.
  • 18.
  • 19. plot srcfile using ($1):(column(focusCar) -$2) with lines title "VET", srcfileusing ($1):(column(focusCar) -$3) with lines title "WEB", srcfileusing ($1):(column(focusCar) -$4) with lines title "HAM", srcfileusing ($1):(column(focusCar) -$5) with lines title "BUT", srcfileusing ($1):(column(focusCar) -$6) with lines title "ALO", srcfileusing ($1):(column(focusCar) -$7) with lines title "MAS", srcfileusing ($1):(column(focusCar) -$8) with lines title "SCH", srcfileusing ($1):(column(focusCar) -$9) with lines title "ROS", …
  • 20. Or heatmaps in R: [ qXmPgs ]
  • 21. Text processing with Unix tools[ m5tz63 ] [ lOVySX ] Count number of lines in a file: wc-l L2sample.csv View first few lines in a file: head L2sample.csv or head -n 4 L2sample.csv View last few lines in a file: tail L2sample.csv or tail -n 15 L2sample.csv Sample contiguous rows from start or end of file: head -n 1 L2sample.csv > headers.csv tail -n 20 L2sample.csv > subSample.csv cat headers.csvsubSample.csv > subSampleWithHeaders.csv Sample contiguous rows from middle of file: head -n 15 L2sample.csv | tail -n 6 > middleSample.csv Split large file into smaller files: split -l 15 L2sample.csv subSamples Search for lines containing a term: grepmendeley L2sample.csv grepEBSCO L2sample.csv > rowsContainingEBSCO.csv
  • 22. More text processing tricks Extract columns: cut -f 3 L2sample.csv cut -f 1,2,14,17 L2sample.csv > columnSample.csv Sort data in a column: cut -f 40 L2sample.csv | sort Identify distinct entries in a column: cut -f 40 L2sample.csv | sort | uniq Count how many times each distinct term appears in a column: cut -f 40 L2sample.csv | sort | uniq –c Sort can also sort by column (-k), reverse order (-r): cut -f 40 L2_2011-04.csv | sort | uniq -c | sort -k 1 -r > uniqueSID.csv
  • 24.
  • 27.
  • 28. matplotlib Trends [ qSIcrV ] #time series data in d #first difference fd=np.diff(d) Autocorrelation
  • 30. Graphviz digraph test { CSV [shape=box] KML [shape=box] JSON [shape=box] XML [shape=box] RDF [shape=box] HTML [shape=box] GoogleSpreadsheet[shape=Msquare] RDFTripleStore [shape=Msquare] "[SPARQL]" [shape=diamond] "[YQL]" [shape=diamond] "[GoogleVizDataAPI]" [shape=diamond] "<GoogleGadgets>" [shape=doubleoctagon] "<GoogleVizDataCharts>" [shape=doubleoctagon] "<GoogleMaps>" [shape=doubleoctagon] "<GoogleEarth>" [shape=doubleoctagon] "<JQueryCharts_etc>" [shape=doubleoctagon] "[SPARQL]"->RDF; "[SPARQL]"->XML; "[SPARQL]"->CSV; "[SPARQL]"->JSON; JSON-> "<JQueryCharts_etc>"; CSV->"{GoogleRefine}" CSV->ScraperWiki JSON->ScraperWiki "[YQL]"->ScraperWiki ScraperWiki->CSV HTML->ScraperWiki HTML->"[YQL]" "[SPARQL]"->"[YQL]" "{GoogleRefine}"->CSV [style=dashed] CSV->"<Gephi>" [style=dashed] "<Gephi>"->CSV [style=dashed] RDF->"[YQL]” }
  • 31. Gephi
  • 34.
  • 35.
  • 36.
  • 37.
  • 39. R
  • 40.
  • 44.
  • 45. I hope that’s beenouseful.info….?

Notes de l'éditeur

  1. Change the basis… eg in OU, might consider different presentations (“years”) of the same course (“month”).