SlideShare a Scribd company logo
1 of 160
Download to read offline
Let	
  your	
  data	
  free	
  
   Empowering	
  others	
  to	
  use	
  the	
  data	
  you	
  store	
  
Big	
  Data	
  and	
  Data	
  Visualiza5on	
  as	
  
seen	
  at	
  Google,	
  Facebook,	
  Ne=lix,	
  
            Yahoo	
  and	
  TwiCer.	
  
                         	
  
       The	
  future	
  of	
  Educa5onal	
  
          Technologies	
  is	
  here	
  
My	
  big	
  data	
  journey	
  

             Happy	
  New	
  Year!	
  
Why	
  Big	
  Data?	
  
•  Spi<ng	
  out	
  a	
  log	
  that	
  my	
  monkey	
  ate	
  a	
  
   banana	
  is	
  interes@ng	
  

•  Charts	
  and	
  plots	
  of	
  it,	
  cool	
  

•  But	
  tell	
  me	
  why?	
  And	
  predict	
  the	
  next?	
  
   Supercool	
  
Data	
  stored	
  on	
  your	
  iPhone	
  
We	
  live	
  in	
  the	
  age	
  of	
  data	
  
Your	
  next	
  job	
  will	
  depend	
  on	
  data	
  
Your	
  life	
  will	
  depend	
  on	
  data	
  
Big	
  Data	
  
	
  Data	
  Mining	
  
     Analy5cs	
  
Visualiza5on	
  
Anyone	
  using	
  Big	
  Data?	
  
Google	
  Big	
  Data	
  
Big	
  Data	
  Machine	
  Learning	
  
•  Machine	
  learning	
  is	
  about	
  finding	
  paJerns	
  in	
  
   the	
  data	
  
•  Machine	
  learning	
  is	
  about	
  finding	
  meaningful	
  
   informa@on	
  
TwiCer	
  Big	
  Data	
  
"Data	
  visualiza@on	
  is	
  the	
  last	
  mile	
  
between	
  computers	
  and	
  our	
  brains."	
  
                 —@Edd	
  
                    	
  
Ne=lix	
  Big	
  Data	
  
Facebook	
  Big	
  Data	
  
My	
  toolbox	
  
•    Data	
  manipula@on:	
  iPython,	
  Vim	
  
•    Visualiza@ons:	
  Gephi,	
  Tableau	
  
•    Sta@s@cal	
  Toolkit:	
  R	
  Studio	
  
•    Version	
  Control:	
  git	
  
I	
  dream	
  of	
  this	
  
What	
  does	
  this	
  mean?	
  
•    Student	
  #1,192,187	
      •    Student	
  #11,192,173	
  
•    Student	
  #2,160,143	
      •    Student	
  #12,164,151	
  
•    Student	
  #3,183,180	
      •    Student	
  #13,184,165	
  
•    Student	
  #4,136,100	
      •    Student	
  #14,189,184	
  
•    Student	
  #5,162,180	
      •    Student	
  #15,183,170	
  
•    Student	
  #6,165,159	
      •    Student	
  #16,181,176	
  
•    Student	
  #7,181,162	
      •    Student	
  #17,188,163	
  
•    Student	
  #8,188,	
         •    Student	
  #18,191,185	
  
•    Student	
  #9,150,146	
      •    Student	
  #19,190,175	
  
•    Student	
  #10,163,159	
     •    Student	
  #20,184,171	
  
Learning	
  about	
  Visualizing	
  Data	
  
Uses	
  of	
  Big	
  Data	
  in	
  EDU	
  
•  When	
  online	
  learning	
  systems	
  use	
  data	
  to	
  
   change	
  in	
  response	
  to	
  student	
  performance,	
  
   they	
  become	
  adap$ve	
  learning	
  environments	
  
But	
  we	
  work	
  for	
  a	
  school!	
  
Anyone	
  BI?	
  
Educa@onal	
  data	
  mining	
  (EDM)	
  	
  
•  EDM	
  are	
  methods	
  in	
  sta@s@cs,	
  machine	
  
   learning,	
  and	
  data	
  mining	
  to	
  analyze	
  data	
  
•  Data	
  that	
  is	
  collected	
  during	
  teaching	
  and	
  
   learning	
  
Learning	
  analy@cs	
  
•  Learning	
  analy@cs	
  applies	
  sociology,	
  
   psychology,	
  sta@s@cs,	
  computer	
  science	
  
   concepts	
  to	
  data	
  
•  Learning	
  analy@cs	
  creates	
  applica@ons	
  that	
  
   directly	
  influence	
  educa@onal	
  prac@ce	
  
What	
  is	
  big	
  data?	
  
Hadoop
•  Open-source framework for running applications on large
   clusters built of commodity hardware

•  Distributed storage and OS

•  Way bigger than traditional databases

•  Petabytes vs gigabytes




                                                        82
Yahoo	
  
Learning	
  about	
  Data	
  Mining	
  
What	
  is	
  data	
  mining?	
  
It's	
  all	
  about	
  discovery:	
  
•  Grouping	
  similar	
  data	
  
•  Iden@fying	
  interes@ng/unique	
  data	
  
•  Detec@ng	
  rela@onships	
  
•  Discovering	
  previously	
  unknown	
  paJerns	
  
Examples	
  of	
  Machine	
  Learning	
  
•    SPAM	
  detec@on	
  
•    Handwri@ng	
  
•    Google	
  Streetview	
  
•    Speech	
  recogni@on	
  
•    Neilix	
  recommenda@on	
  
•    Robo@c	
  naviga@on	
  
Reasons	
  for	
  Analy@cs	
  
•    Predict	
  the	
  future	
  
•    Understand	
  Risk	
  and	
  Complexity	
  
•    Embrace	
  complexity	
  
•    Iden@fy	
  the	
  unusual	
  
•    Think	
  beJer	
  
Visualiza5ons	
  
Dashboards	
  
Big	
  Data	
  and	
  You	
  
Example	
  Big	
  Data	
  Use	
  Cases	
  
              Data	
                        High-­‐frequency	
                   Lower-­‐frequency	
  
             Source    	
                     opera@ons   	
                        opera@ons  	
  

                                    Write/index	
  all	
  trades,	
        Show	
  consolidated	
  risk	
  
  Capital	
  markets	
  
                                    store	
  @ck	
  data	
                 across	
  traders	
  


  Call	
  ini@a@on	
  request	
     Real-­‐@me	
  authoriza@on	
           Fraud	
  detec@on/analysis	
  


  Inbound	
  HTTP	
                 Visitor	
  logging,	
  analysis,	
  
                                                                           Traffic	
  paJern	
  analy@cs	
  
  requests	
                        aler@ng	
  

                                    Rank	
  scores:	
  
  Online	
  game	
                  • Defined	
  intervals	
                Leaderboard	
  lookups	
  
                                    • Player	
  “bests”	
  

  Real-­‐@me	
  ad	
  trading	
     Match	
  form	
  factor,	
            Report	
  ad	
  performance	
  
  systems	
                         placement	
  criteria,	
  bid/ask	
   from	
  exhaust	
  stream	
  

  Mobile	
  device	
                Loca@on	
  updates,	
  QoS,	
  
                                                                           Analy@cs	
  on	
  transac@ons	
  
  loca@on	
  sensor	
               transac@ons	
  
The	
  best	
  examples	
  
Example	
  machine	
  learning	
  
Examples	
  in	
  the	
  news	
  
What	
  is	
  a	
  data	
  scien@st?	
  
•  Person	
  who	
  understands	
  data-­‐driven	
  world	
  
•  Person	
  who	
  can	
  make	
  sense	
  of	
  big	
  data	
  
•  Person	
  who	
  has	
  tools,	
  skills	
  and	
  mindset	
  to	
  
     see	
  data	
  as	
  the	
  new	
  "oil"	
  fueling	
  a	
  company	
  
•  Person	
  who	
  programs	
  
•  Person	
  who	
  analyses	
  data	
  
•  Person	
  who	
  visualizes	
  data	
  
	
  
Big	
  data,	
  Analy5cs	
  and	
  Visualiza5on	
  
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc
Big databigideasit4bc

More Related Content

What's hot

What's hot (20)

Introduction to Big Data: Smart Factory
Introduction to Big Data: Smart FactoryIntroduction to Big Data: Smart Factory
Introduction to Big Data: Smart Factory
 
AI on Big Data
AI on Big DataAI on Big Data
AI on Big Data
 
Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps!
 
Data science
Data scienceData science
Data science
 
Big Data and Predictive Analysis
Big Data and Predictive AnalysisBig Data and Predictive Analysis
Big Data and Predictive Analysis
 
Scalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AIScalable Predictive Analysis and The Trend with Big Data & AI
Scalable Predictive Analysis and The Trend with Big Data & AI
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big Data, Baby Steps
Big Data, Baby StepsBig Data, Baby Steps
Big Data, Baby Steps
 
Are you ready for BIG DATA?
Are you ready for BIG DATA?Are you ready for BIG DATA?
Are you ready for BIG DATA?
 
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
 
Introduction to Big data with Hadoop & Spark | Big Data Hadoop Spark Tutorial...
Introduction to Big data with Hadoop & Spark | Big Data Hadoop Spark Tutorial...Introduction to Big data with Hadoop & Spark | Big Data Hadoop Spark Tutorial...
Introduction to Big data with Hadoop & Spark | Big Data Hadoop Spark Tutorial...
 
Predictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark MLPredictive Analysis of Financial Fraud Detection using Azure and Spark ML
Predictive Analysis of Financial Fraud Detection using Azure and Spark ML
 
Different Career Paths in Data Science
Different Career Paths in Data ScienceDifferent Career Paths in Data Science
Different Career Paths in Data Science
 
Data Skills for Digital Era
Data Skills for Digital EraData Skills for Digital Era
Data Skills for Digital Era
 
When Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensWhen Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic Happens
 
Data science
Data scienceData science
Data science
 
Analysis of ‘Unstructured’ Data
Analysis of ‘Unstructured’ DataAnalysis of ‘Unstructured’ Data
Analysis of ‘Unstructured’ Data
 
Adatao: Interactive, Visual, Predictive Analytics for Big Data @ Silicon Vall...
Adatao: Interactive, Visual, Predictive Analytics for Big Data @ Silicon Vall...Adatao: Interactive, Visual, Predictive Analytics for Big Data @ Silicon Vall...
Adatao: Interactive, Visual, Predictive Analytics for Big Data @ Silicon Vall...
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 

Similar to Big databigideasit4bc

Think Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial ServicesThink Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial Services
Amazon Web Services
 

Similar to Big databigideasit4bc (20)

What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data Science
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 
DataScienceIntroduction.pptx
DataScienceIntroduction.pptxDataScienceIntroduction.pptx
DataScienceIntroduction.pptx
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Think Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial ServicesThink Big Analytics AWS for Financial Services
Think Big Analytics AWS for Financial Services
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data Analytics
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Data Mining Intro
Data Mining IntroData Mining Intro
Data Mining Intro
 
Intro big data analytics
Intro big data analyticsIntro big data analytics
Intro big data analytics
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and Innovation
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing Meetup
 
Data mining applications
Data mining applicationsData mining applications
Data mining applications
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Workshop_Presentation.pptx
Workshop_Presentation.pptxWorkshop_Presentation.pptx
Workshop_Presentation.pptx
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 

More from Vincent Ohprecio (8)

ipython notebook poc memory forensics
ipython notebook poc memory forensicsipython notebook poc memory forensics
ipython notebook poc memory forensics
 
Learning iPython Notebook Volatility Memory Forensics
Learning iPython Notebook Volatility Memory ForensicsLearning iPython Notebook Volatility Memory Forensics
Learning iPython Notebook Volatility Memory Forensics
 
iPython Notebook Volatility Memory Forensics SilentBanker
iPython Notebook Volatility Memory Forensics SilentBankeriPython Notebook Volatility Memory Forensics SilentBanker
iPython Notebook Volatility Memory Forensics SilentBanker
 
iPython Notebook Volatility For Memory Forensics
iPython Notebook Volatility For Memory ForensicsiPython Notebook Volatility For Memory Forensics
iPython Notebook Volatility For Memory Forensics
 
iPhone Forensics Without iPhone using iTunes Backup
iPhone Forensics Without iPhone using iTunes BackupiPhone Forensics Without iPhone using iTunes Backup
iPhone Forensics Without iPhone using iTunes Backup
 
Forensic Challenge 10 - FC5 Attack Dataset Visualization
Forensic Challenge 10 - FC5 Attack Dataset VisualizationForensic Challenge 10 - FC5 Attack Dataset Visualization
Forensic Challenge 10 - FC5 Attack Dataset Visualization
 
Intro2 malwareanalysisshort
Intro2 malwareanalysisshortIntro2 malwareanalysisshort
Intro2 malwareanalysisshort
 
Hacking school computers for fun profit and better grades short
Hacking school computers for fun profit and better grades shortHacking school computers for fun profit and better grades short
Hacking school computers for fun profit and better grades short
 

Recently uploaded

Recently uploaded (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Big databigideasit4bc

  • 1. Let  your  data  free   Empowering  others  to  use  the  data  you  store  
  • 2. Big  Data  and  Data  Visualiza5on  as   seen  at  Google,  Facebook,  Ne=lix,   Yahoo  and  TwiCer.     The  future  of  Educa5onal   Technologies  is  here  
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. My  big  data  journey   Happy  New  Year!  
  • 9. Why  Big  Data?   •  Spi<ng  out  a  log  that  my  monkey  ate  a   banana  is  interes@ng   •  Charts  and  plots  of  it,  cool   •  But  tell  me  why?  And  predict  the  next?   Supercool  
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Data  stored  on  your  iPhone  
  • 21.
  • 22. We  live  in  the  age  of  data  
  • 23. Your  next  job  will  depend  on  data  
  • 24. Your  life  will  depend  on  data  
  • 25.
  • 26. Big  Data    Data  Mining   Analy5cs   Visualiza5on  
  • 27. Anyone  using  Big  Data?  
  • 28.
  • 30.
  • 31. Big  Data  Machine  Learning   •  Machine  learning  is  about  finding  paJerns  in   the  data   •  Machine  learning  is  about  finding  meaningful   informa@on  
  • 32.
  • 34.
  • 35. "Data  visualiza@on  is  the  last  mile   between  computers  and  our  brains."   —@Edd    
  • 36.
  • 37.
  • 39.
  • 40.
  • 41.
  • 42.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51. My  toolbox   •  Data  manipula@on:  iPython,  Vim   •  Visualiza@ons:  Gephi,  Tableau   •  Sta@s@cal  Toolkit:  R  Studio   •  Version  Control:  git  
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58. I  dream  of  this  
  • 59.
  • 60.
  • 61.
  • 62.
  • 63. What  does  this  mean?   •  Student  #1,192,187   •  Student  #11,192,173   •  Student  #2,160,143   •  Student  #12,164,151   •  Student  #3,183,180   •  Student  #13,184,165   •  Student  #4,136,100   •  Student  #14,189,184   •  Student  #5,162,180   •  Student  #15,183,170   •  Student  #6,165,159   •  Student  #16,181,176   •  Student  #7,181,162   •  Student  #17,188,163   •  Student  #8,188,   •  Student  #18,191,185   •  Student  #9,150,146   •  Student  #19,190,175   •  Student  #10,163,159   •  Student  #20,184,171  
  • 65.
  • 66.
  • 67. Uses  of  Big  Data  in  EDU   •  When  online  learning  systems  use  data  to   change  in  response  to  student  performance,   they  become  adap$ve  learning  environments  
  • 68. But  we  work  for  a  school!  
  • 69.
  • 70.
  • 71.
  • 73. Educa@onal  data  mining  (EDM)     •  EDM  are  methods  in  sta@s@cs,  machine   learning,  and  data  mining  to  analyze  data   •  Data  that  is  collected  during  teaching  and   learning  
  • 74.
  • 75.
  • 76. Learning  analy@cs   •  Learning  analy@cs  applies  sociology,   psychology,  sta@s@cs,  computer  science   concepts  to  data   •  Learning  analy@cs  creates  applica@ons  that   directly  influence  educa@onal  prac@ce  
  • 77.
  • 78.
  • 79. What  is  big  data?  
  • 80.
  • 81.
  • 82. Hadoop •  Open-source framework for running applications on large clusters built of commodity hardware •  Distributed storage and OS •  Way bigger than traditional databases •  Petabytes vs gigabytes 82
  • 83.
  • 84.
  • 85.
  • 86.
  • 87.
  • 89.
  • 90.
  • 91.
  • 92.
  • 93.
  • 94. Learning  about  Data  Mining  
  • 95.
  • 96. What  is  data  mining?   It's  all  about  discovery:   •  Grouping  similar  data   •  Iden@fying  interes@ng/unique  data   •  Detec@ng  rela@onships   •  Discovering  previously  unknown  paJerns  
  • 97. Examples  of  Machine  Learning   •  SPAM  detec@on   •  Handwri@ng   •  Google  Streetview   •  Speech  recogni@on   •  Neilix  recommenda@on   •  Robo@c  naviga@on  
  • 98. Reasons  for  Analy@cs   •  Predict  the  future   •  Understand  Risk  and  Complexity   •  Embrace  complexity   •  Iden@fy  the  unusual   •  Think  beJer  
  • 99.
  • 101.
  • 102.
  • 103.
  • 104.
  • 105.
  • 106.
  • 107.
  • 108.
  • 109.
  • 110.
  • 111.
  • 112.
  • 113.
  • 114.
  • 115.
  • 116.
  • 117.
  • 118.
  • 119.
  • 121.
  • 122. Big  Data  and  You  
  • 123.
  • 124.
  • 125.
  • 126.
  • 127.
  • 128.
  • 129.
  • 130. Example  Big  Data  Use  Cases   Data   High-­‐frequency   Lower-­‐frequency   Source   opera@ons   opera@ons   Write/index  all  trades,   Show  consolidated  risk   Capital  markets   store  @ck  data   across  traders   Call  ini@a@on  request   Real-­‐@me  authoriza@on   Fraud  detec@on/analysis   Inbound  HTTP   Visitor  logging,  analysis,   Traffic  paJern  analy@cs   requests   aler@ng   Rank  scores:   Online  game   • Defined  intervals   Leaderboard  lookups   • Player  “bests”   Real-­‐@me  ad  trading   Match  form  factor,   Report  ad  performance   systems   placement  criteria,  bid/ask   from  exhaust  stream   Mobile  device   Loca@on  updates,  QoS,   Analy@cs  on  transac@ons   loca@on  sensor   transac@ons  
  • 131.
  • 133.
  • 134.
  • 135.
  • 136.
  • 137.
  • 138.
  • 140.
  • 141.
  • 142. Examples  in  the  news  
  • 143.
  • 144.
  • 145.
  • 146. What  is  a  data  scien@st?   •  Person  who  understands  data-­‐driven  world   •  Person  who  can  make  sense  of  big  data   •  Person  who  has  tools,  skills  and  mindset  to   see  data  as  the  new  "oil"  fueling  a  company   •  Person  who  programs   •  Person  who  analyses  data   •  Person  who  visualizes  data    
  • 147.
  • 148.
  • 149.
  • 150. Big  data,  Analy5cs  and  Visualiza5on