Kwan-Liu Ma
Department of Computer Science
University of California at Davis
Big Data Visualization
CA Technologies 1/22/2014
Big Data: Issues
•  Volume: size/scale
•  Velocity: rate
•  Variety: type/form
•  Veracity: accuracy and completeness
Visualization
•  To explore and discover
•  To validate
•  To communicate
•  An overview, a path, an interface
Extreme-Scale Scientific Simulations
Scientific Simulations
Large Scientific Data Visualization
As we move to exascale, it's no longer feasible to store most of the data for post-processing! We must do:
•  In situ visualization
•  Parallel visualization that is highly scalable
•  In situ data reduction and triage
•  In situ data processing for interactive data exploration and analysis
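A minimal sketch of the in situ data reduction idea, not the method used in the talk: each time step's full-resolution field exists only in memory, and only a small summary (a strided sample plus a histogram) is retained. The function names, the toy `simulate_step` field, and the `stride` and `bins` parameters are all hypothetical choices made for illustration.

```python
import math


def simulate_step(step, size=64):
    """Hypothetical stand-in for one simulation time step (a 2D scalar field)."""
    return [
        [math.sin(0.1 * step + 0.2 * x) * math.cos(0.2 * y) for x in range(size)]
        for y in range(size)
    ]


def reduce_in_situ(field, stride=8, bins=4):
    """Summarize a field while it is still in memory.

    Keeps every `stride`-th value in each dimension plus a value histogram,
    instead of the full-resolution field.
    """
    sample = [row[::stride] for row in field[::stride]]
    values = [v for row in field for v in row]
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid a zero bin width for constant fields
    hist = [0] * bins
    for v in values:
        hist[min(int((v - lo) / width), bins - 1)] += 1
    return {"sample": sample, "min": lo, "max": hi, "hist": hist}


def run(steps=3):
    """Run the toy simulation, retaining only the reduced summaries."""
    kept = []
    for step in range(steps):
        field = simulate_step(step)         # full data, never written to disk
        kept.append(reduce_in_situ(field))  # only the summary is stored
    return kept
```

With the defaults above, each 64 x 64 step is reduced to an 8 x 8 sample and a 4-bin histogram; a real in situ pipeline would choose the reduction to match the analysis it has to support.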
Supernova Simulation
Simulation: John Blondin, NCSU
Fusion Simulations
Simulation: Dr. S. Ethier, the Princeton Plasma Physics Lab.
Big Network Analysis & Visualization
[Figure: layouts of a graph with 222 nodes and 2,583 edges. Panels: FM3, GRIP, Treemap, Hilbert, Sunburst, Circle]
Network Simplification/Characterization
[Figure panels: Hamas, al Qaeda]
TVCG 12(6) 2006
Network Simplification/Characterization
Using centrality sensitivity
•  Friendster social network: links exhibit negative sensitivity (red) between cluster centers
•  Astrophysics co-author network: one competitive network (red) and one collaborative network (blue)
TVCG 18(1) 2012
The Graph Layout Problem
•  The cost of displaying a graph
•  The hairball problem of large graph layouts
–  Large, dense graphs become a mess
–  Inefficient use of space
–  Details cluttered
•  Solutions
–  Filtering
–  Clustering
–  Abstraction
–  Focus+context
[Figure: California data, 6,107 nodes, 15,160 edges, high-dimensional embedding method]
A Fast Graph Layout Method
•  Hierarchically cluster the nodes (if no clustering given)
•  Traverse the hierarchy to order the nodes
•  Place the nodes in that order along a space-filling curve
[Figure: Hilbert curves of Order 1, Order 2, Order 3, Order 4, Order 5, and Order 11]
TVCG 14(6) 2008
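The three steps above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the implementation from the cited paper: the cluster hierarchy is assumed to be given as nested Python lists, the traversal is a plain depth-first walk, and nodes are spread evenly along a Hilbert curve using the standard distance-to-coordinate conversion. The names `hilbert_d2xy`, `leaf_order`, and `layout` are hypothetical.

```python
def hilbert_d2xy(order, d):
    """Map distance d along a Hilbert curve of the given order to grid (x, y)."""
    n = 1 << order  # the curve fills an n x n grid
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:  # rotate/flip the quadrant so sub-curves connect
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def leaf_order(tree):
    """Depth-first traversal of a nested-list hierarchy; returns leaves in order."""
    if not isinstance(tree, (list, tuple)):
        return [tree]
    order = []
    for child in tree:
        order.extend(leaf_order(child))
    return order


def layout(tree):
    """Place the leaves of the hierarchy along a Hilbert curve, in traversal order."""
    nodes = leaf_order(tree)
    order = 1
    while 4 ** order < len(nodes):  # smallest curve with enough cells
        order += 1
    cells = 4 ** order
    return {
        node: hilbert_d2xy(order, i * cells // len(nodes))
        for i, node in enumerate(nodes)
    }


# Example: two clusters of two nodes each end up in neighboring cells.
positions = layout([["a", "b"], ["c", "d"]])
```

Because leaves of the same cluster are adjacent in the traversal order, and points that are close along the curve are close in the plane, clustered nodes land near each other. Once the clustering exists, the layout is a single pass over the nodes.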
Fast Graph Layout
A graph with 6,107 nodes and 15,160 edges
Layouts shown: space-filling curve (Hilbert), space-filling curve (Gosper), treemap
One-time clustering: 0.5 seconds
Layout + rendering: 0.0005 seconds
High-dimensional embedding: 0.19 s
LinLog (force directed): 10,737 s
Fast Graph Layout
Internet connectivity: 41,928 nodes, 218,080 edges
Layouts shown: space-filling curve (Hilbert), space-filling curve (Gosper), treemap
One-time clustering: 18.87 seconds
Layout + rendering: 0.0036 seconds
FM3: 40.8 s
GRIP: 6.87 s
Dynamic Networks
Growing Internet
Incremental clustering-based approach – Radial treemap layout
Video
Time-Varying Networks
•  Almost all networks found in real-world applications are time-varying
•  Both nodes and edges can change
•  Visualization methods:
–  Animations
–  Small multiples visualization
–  Difference visualization
–  Storyline visualization
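As a rough illustration of what a difference visualization has to compute, the sketch below compares two snapshots of an undirected network and reports which nodes and edges appeared or disappeared. It assumes each snapshot is given as a list of edge pairs and that nodes are only known through their edges (isolated nodes are ignored). The function name `snapshot_diff` is hypothetical.

```python
def snapshot_diff(old_edges, new_edges):
    """Return the nodes and edges added or removed between two snapshots."""

    def normalize(edges):
        # Undirected edges: (a, b) and (b, a) are the same edge.
        return {tuple(sorted(edge)) for edge in edges}

    def endpoints(edges):
        return {node for edge in edges for node in edge}

    old, new = normalize(old_edges), normalize(new_edges)
    return {
        "added_edges": new - old,
        "removed_edges": old - new,
        "added_nodes": endpoints(new) - endpoints(old),
        "removed_nodes": endpoints(old) - endpoints(new),
    }


# Example: edge b-c disappears, node d joins via edge c-d.
diff = snapshot_diff([("a", "b"), ("b", "c")], [("b", "a"), ("c", "d")])
```

A renderer could then draw the added items in one color and the removed items in another on top of a shared layout.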
  
Storyline Visualization
XKCD.com
Storyline Visualization
•  Consisting of a series of lines, going from left to right along the time-axis, that converge and diverge in the course of their paths.
•  Each line represents a unique entity (character) in the data.
•  The starting & ending points of each line represent the lifespan of the corresponding entity.
•  Lines are bundled together during the time period of their interaction.
•  Existing algorithms:
1.  Rules and heuristics based [Ogawa & Ma 2008]
2.  Genetic algorithm [Tanahashi & Ma 2012]
3.  Convex quadratic optimization [Liu et al. 2013]
4.  Greedy algorithms
Star Wars
Matrix
Inception
Star Wars
Video
Enron Scandal Email Data
1230 days, 1264 employees, 495,408 messages, and 3478 email clusters
Video
Current Projects
•  Dynamic network visualization [Biological science, Internet, social networks]
•  Visual recommendations and predictive analysis [Transportation]
•  Visual analytics for cyber and airborne intelligence
•  Remote and collaborative visualization
•  Volume data visualization [Flow simulation, biomedical imaging, NDT]
•  Health record visualization
•  Visual analysis of driving behaviors and energy use [Transportation]
•  Visualization for scientific storytelling
•  Massively parallel visualization
•  In situ visualization and data reduction
•  Visualizing large scale computing [Scientific computing, cloud computing]
•  Video visualization [Security]
•  Uncertainty visualization
•  Visualization interface design
CENTER FOR VISUALIZATION
Kwan-Liu Ma
ma@cs.ucdavis.edu
http://www.cs.ucdavis.edu/~ma

Contenu connexe

Tendances

"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler..."Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...Dataconomy Media
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationIan Foster
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
"Machine Learning and Internet of Things, the future of medical prevention", ...
"Machine Learning and Internet of Things, the future of medical prevention", ..."Machine Learning and Internet of Things, the future of medical prevention", ...
"Machine Learning and Internet of Things, the future of medical prevention", ...Dataconomy Media
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterIan Foster
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchRobert Grossman
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersDatabricks
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceRobert Grossman
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilitiesIan Foster
 
Applying Machine Learning using H2O
Applying Machine Learning using H2OApplying Machine Learning using H2O
Applying Machine Learning using H2OIan Gomez
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Ian Gomez
 
Genomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesGenomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesLynn Langit
 
Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Robert Grossman
 
Keynote on 2015 Yale Day of Data
Keynote on 2015 Yale Day of Data Keynote on 2015 Yale Day of Data
Keynote on 2015 Yale Day of Data Robert Grossman
 
Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDatabricks
 
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Big Data Spain
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!Ian Foster
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science ServicesIan Foster
 
Doing Research in the Cloud - NIH Workshop Dennis Gannon
Doing Research in the Cloud - NIH Workshop Dennis GannonDoing Research in the Cloud - NIH Workshop Dennis Gannon
Doing Research in the Cloud - NIH Workshop Dennis GannonMicrosoft Azure for Research
 

Tendances (20)

"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler..."Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud Automation
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
"Machine Learning and Internet of Things, the future of medical prevention", ...
"Machine Learning and Internet of Things, the future of medical prevention", ..."Machine Learning and Internet of Things, the future of medical prevention", ...
"Machine Learning and Internet of Things, the future of medical prevention", ...
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data Clusters
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of Science
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilities
 
Applying Machine Learning using H2O
Applying Machine Learning using H2OApplying Machine Learning using H2O
Applying Machine Learning using H2O
 
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning Start Getting Your Feet Wet in Open Source Machine and Deep Learning
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
 
Genomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesGenomic Scale Big Data Pipelines
Genomic Scale Big Data Pipelines
 
Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)Health & Status Monitoring (2010-v8)
Health & Status Monitoring (2010-v8)
 
Keynote on 2015 Yale Day of Data
Keynote on 2015 Yale Day of Data Keynote on 2015 Yale Day of Data
Keynote on 2015 Yale Day of Data
 
Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn Creator
 
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
Are we reaching a Data Science Singularity? How Cognitive Computing is emergi...
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science Services
 
Doing Research in the Cloud - NIH Workshop Dennis Gannon
Doing Research in the Cloud - NIH Workshop Dennis GannonDoing Research in the Cloud - NIH Workshop Dennis Gannon
Doing Research in the Cloud - NIH Workshop Dennis Gannon
 

Similaire à Big Data Visualization

Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.KGMGROUP
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsTal Lavian Ph.D.
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Tal Lavian Ph.D.
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013Kirill Osipov
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryIan Foster
 
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...Otávio Carvalho
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Ian Foster
 
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...Lionel Briand
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsExtraHop Networks
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapSrinath Perera
 
Big Process for Big Data @ NASA
Big Process for Big Data @ NASABig Process for Big Data @ NASA
Big Process for Big Data @ NASAIan Foster
 
Grid computing by vaishali sahare [katkar]
Grid computing by vaishali sahare [katkar]Grid computing by vaishali sahare [katkar]
Grid computing by vaishali sahare [katkar]vaishalisahare123
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Ian Foster
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphsStanka Dalekova
 
DataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax Academy
 

Similaire à Big Data Visualization (20)

Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.Don't Be Scared. Data Don't Bite. Introduction to Big Data.
Don't Be Scared. Data Don't Bite. Introduction to Big Data.
 
Grid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applicationsGrid optical network service architecture for data intensive applications
Grid optical network service architecture for data intensive applications
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
 
CINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network ScienceCINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network Science
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate Discovery
 
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
Distributed Near Real-Time Processing of Sensor Network Data Flows for Smart ...
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013
 
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...
Dynamic Adaptation of Software-defined Networks for IoT Systems: A Search-bas...
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
01-pengantar.pdf
01-pengantar.pdf01-pengantar.pdf
01-pengantar.pdf
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Grid computing
Grid computingGrid computing
Grid computing
 
Bertenthal
BertenthalBertenthal
Bertenthal
 
Big Process for Big Data @ NASA
Big Process for Big Data @ NASABig Process for Big Data @ NASA
Big Process for Big Data @ NASA
 
Grid computing by vaishali sahare [katkar]
Grid computing by vaishali sahare [katkar]Grid computing by vaishali sahare [katkar]
Grid computing by vaishali sahare [katkar]
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
 
DataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and Analytics
 

Dernier

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 

Dernier (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

Big Data Visualization

  • 1. Kwan-Liu Ma Department of Computer Science University of California at Davis Big Data Visualization CA Technologies 1/22/2014
  • 2. Big Data: Issues • Volume: size/scale • Velocity: rate • Variety: type/form • Veracity: accuracy and completeness
  • 3. Visualization • To explore and discover • To validate • To communicate • An overview, a path, an interface
  • 6. Large Scientific Data Visualization As we move to Exascale, it’s no longer feasible to store most of the data for post processing! We must do: • In situ visualization • Parallel visualization that is highly scalable • In situ data reduction and triage • In situ data processing for interactive data exploration and analysis
  • 8. Fusion Simulations Simulation: Dr. S. Ethier, the Princeton Plasma Physics Lab.
  • 9. Big Network Analysis & Visualization
  • 12. Network Simplification/Characterization Using centrality sensitivity • Friendster social network: links exhibit negative sensitivity (red) between cluster centers • Astrophysics co-author network: one competitive network (red) and one collaborative network (blue) TVCG 18(1) 2012
  • 13. The Graph Layout Problem • The cost of displaying a graph • The hairball problem of large graph layouts – Large, dense graphs become a mess – Inefficient use of space – Details cluttered • Solutions – Filtering – Clustering – Abstraction – Focus+context California data: 6,107 nodes, 15,160 edges. High dimensional embedding method
  • 14. A Fast Graph Layout Method • Hierarchically cluster the nodes (if no clustering given) • Traverse the hierarchy to order the nodes • Place the nodes in that order along a space filling curve Hilbert curves: Order 1, Order 2, Order 3, Order 4, Order 5, Order 11 TVCG 14(6) 2008
  • 15. Fast Graph Layout A graph with 6,107 nodes and 15,160 edges • Hilbert • Space filling curve: Gosper • Treemap • High dimensional embedding: 0.19s • LinLog (force directed): 10,737s • One time clustering: 0.5 seconds • Layout + rendering: 0.0005 seconds
  • 16. Fast Graph Layout Internet Connectivity: 41,928 nodes, 218,080 edges • Space filling curve: Hilbert • Space filling curve: Gosper • FM3: 40.8s • GRIP: 6.87s • Treemap • One time clustering: 18.87 seconds • Layout + rendering: 0.0036 seconds
  • 18. Growing Internet Incremental clustering-based approach – Radial treemap layout Video
  • 19. Time-Varying Networks • Almost all networks found in real-world applications are time-varying • Both nodes and edges can change • Visualization methods: – Animations – Small multiples visualization – Difference visualization – Storyline visualization
  • 21. Storyline Visualization • Consisting of a series of lines, going from left to right along the time-axis, that converge and diverge in the course of their paths. • Each line represents a unique entity (character) in the data. • The starting & ending points of each line represent the lifespan of the corresponding entity. • Lines are bundled together during the time period of their interaction. • Existing algorithms: 1. Rules and heuristics based [Ogawa & Ma 2008] 2. Genetic algorithm [Tanahashi & Ma 2012] 3. Convex quadratic optimization [Liu et al. 2013] 4. Greedy algorithms
  • 26. Enron Scandal Email Data 1230 days, 1264 employees, 495,408 messages, and 3478 email clusters Video
  • 27. Current Projects • Dynamic network visualization [Biological science, Internet, social networks] • Visual recommendations and predictive analysis [Transportation] • Visual analytics for cyber and airborne intelligence • Remote and collaborative visualization • Volume data visualization [Flow simulation, biomedical imaging, NDT] • Health record visualization • Visual analysis of driving behaviors and energy use [Transportation] • Visualization for scientific storytelling • Massively parallel visualization • In situ visualization and data reduction • Visualizing large scale computing [Scientific computing, cloud computing] • Video visualization [Security] • Uncertainty visualization • Visualization interface design
  • 28. CENTER FOR VISUALIZATION Kwan-Liu Ma ma@cs.ucdavis.edu http://www.cs.ucdavis.edu/~ma
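
The three steps listed on slide 14 (cluster, traverse the hierarchy to order the nodes, place the nodes along a space filling curve) can be illustrated with a small Python sketch. This is not the implementation behind the timings quoted on the slides. It assumes the clustering is already given as a nested list, uses a plain depth-first traversal for the ordering, and spaces the nodes evenly along a Hilbert curve. The function names `hilbert_d2xy`, `leaf_order` and `layout` are hypothetical and chosen only for this example.

```python
def hilbert_d2xy(order, d):
    """Map a distance d along a Hilbert curve to a cell (x, y).

    The curve covers a 2**order by 2**order grid, so d must lie
    in the range 0 .. 4**order - 1.
    """
    n = 1 << order
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        # Rotate or flip the quadrant so sub-curves connect end to end.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def leaf_order(tree):
    """Depth-first traversal of a nested-list hierarchy (assumed format).

    Inner lists are clusters, anything else is a node id. Nodes in the
    same cluster end up next to each other in the returned order.
    """
    if not isinstance(tree, (list, tuple)):
        return [tree]
    ordered = []
    for child in tree:
        ordered.extend(leaf_order(child))
    return ordered


def layout(tree, order=4):
    """Place the nodes of the hierarchy along a Hilbert curve of the given order."""
    nodes = leaf_order(tree)
    if not nodes:
        return {}
    cells = 4 ** order
    if len(nodes) > cells:
        raise ValueError("curve order too small for the number of nodes")
    step = cells // len(nodes)  # spread the nodes evenly along the curve
    return {node: hilbert_d2xy(order, i * step) for i, node in enumerate(nodes)}


if __name__ == "__main__":
    # Hypothetical hierarchy: two top-level clusters, one with a sub-cluster.
    hierarchy = [["a", "b"], ["c", ["d", "e"]]]
    for node, (x, y) in layout(hierarchy, order=2).items():
        print(node, x, y)
```

Because consecutive positions on a Hilbert curve are always neighbouring cells, nodes that are close in the traversal order stay close on screen, which is why the layout step itself needs no iteration once the clustering exists.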