Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

•Télécharger en tant que PPTX, PDF•

2 j'aime•597 vues

There is a flood of information online from tweets,feeds, status updates, photos, government, private, and other sources. Just how big is “big data”? This presentation will share examples of big and open data in the cloud:where it comes from, how it’s stored, and what you can do with it. Learn to incorporate real world data online for your students to analyze using Excel; create data visualizations and infographics, and understand the impact of Data as a Service as a model for cloud computing.

Drinking from the Fire Hose:
Tools for Interpreting and Teaching
with Big Data

Mark Frydenberg
Bentley University

What's your Bacon Index?

2 Ann
Joe
3
Bob

2 X
4

1 Kim
Kevin

Big Data

'Big data' refers to a
collection of tools,
techniques and
technologies which make
it easy to work with data
at any scale.

powerof60.com

3 V's

• Volume - amount of data is larger than
those conventional relational database
infrastructures can handle
• Velocity - the rate at which data is
generated, processed and analyzed in
(real) time
• Variety – data formats are unstructured
and inconsistent

Walmart

• Walmart collects more than 2.5
petabytes of data every hour from its
customer transactions.
• A petabyte is one quadrillion bytes, or the
equivalent of about 20 million filing
cabinets’ worth of text.

http://hbr.org/2012/10/big-data-the-management-revolution/ar

Velocity: Drinking from the Firehose

• Scrutinize 5 million trade events created
each day to identify potential fraud
• Analyze 500 million daily call detail
records in real-time to predict customer
churn faster

McKinsey&Company Report (2011)

• Data is part of every
industry and business
function.
• Data creates value.
• Big data becomes a basis
of competition and growth.
• Some sectors will achieve
greater gains.
• Shortage of people with
analytical skills.
• Need policies related to
privacy, security,
ownership.

Twitter

3000 tweets per second
data is disorganized
How does twitter use its data?

Big Data Technologies

• HADOOP: scalable
storage, parallel
computation
• NoSQL: distributed
querying

What this Means

• Change your web page and Google finds it
in minutes.
• Ten years ago, you would have to submit a
request to Yahoo! to reindex your site.
• All you need is a lot of servers.
• Google has a million of them.
• No problem.

Collaborative Filtering

The Black Black
Camera Tripod
Stallion Beauty

Me You

Mark Frydenberg

mfrydenberg@bentley.edu
cis.bentley.edu/mfrydenberg
CourseMate
Enhanced
Edition
Invite me to your school!

Contenu connexe

Similaire à Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

Data mining with big data implementationSandip Tipayle Patil

DataEd Online: Demystifying Big DataDATAVERSITY

Data-Ed: Demystifying Big DataData Blueprint

Bigdata " new level"Vamshikrishna Goud

Understanding big dataPraneet Samaiya

Business Intelligence & Predictive Analytic by Prof. Lili SaghafiProfessor Lili Saghafi

Bigdatasourabhdattawad

Big Data Analytics - A GlimpseLaguna State Polytechnic University

Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceJedha Bootcamp

Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Simplilearn

Intro big data analyticsHagar Alaa el-din

Big dataEnfa Rose George

Big Data By Vijay Bhaskar SemwalIIIT Allahabad

Big dataJoseph Sebastian

BigData.pptxvidhi171881

Big Data - GeramiMohammad Reza Gerami

A Review of Big data for Social Policy Decision Making Ridi Fe

Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj

Understanding The Big Data Opportunity FinalAndrew Gregoris

Big data? No. Big Decisions are What You WantStuart Miniman

Similaire à Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data (20)

Data mining with big data implementation

DataEd Online: Demystifying Big Data

Data-Ed: Demystifying Big Data

Bigdata " new level"

Understanding big data

Business Intelligence & Predictive Analytic by Prof. Lili Saghafi

Bigdata

Big Data Analytics - A Glimpse

Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France

Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...

Intro big data analytics

Big data

Big Data By Vijay Bhaskar Semwal

Big data

BigData.pptx

Big Data - Gerami

A Review of Big data for Social Policy Decision Making

Big data PPT prepared by Hritika Raj (Shivalik college of engg.)

Understanding The Big Data Opportunity Final

Big data? No. Big Decisions are What You Want

Plus de Cengage Learning

Discovering History Through Digital Newspaper CollectionCengage Learning

Are Your Students Ready for Lab?Cengage Learning

5 Course Design Tips to Increase Engagement and OutcomesCengage Learning

The Journey to Digital: Incorporating Technology to Strengthen Critical MindsCengage Learning

Google Drive Plus TexQuest Equals a Match Made in Research HeavenCengage Learning

Improving Time Management: Tips that Will Help College Students Start the Yea...Cengage Learning

Mind Tap Open Trial Cengage LearningCengage Learning

Getting Started with Enhanced WebAssign 8/11/15 Presented by: Mike Lafreniere...Cengage Learning

Taming the Digital Tiger: Implementing a Successful Digital or 1:1 InitiativeCengage Learning

Decimal and Fraction Jeopardy - A Game for Developmental MathCengage Learning

Game it up! Introducing Game Based Learning for Developmental MathCengage Learning

Overcoming Textbook FatigueCengage Learning

Adult Student Success: How Does Awareness Correlate to Program Completion?Cengage Learning

You're responsible for teaching, and your students are resonsible for learnin...Cengage Learning

What is the Impact of the New Standard on the Intermediate Accounting Course?Cengage Learning

The ABCs Approach to Goal Setting and ImplementationCengage Learning

Competency-based Education: Out with the new, in with the old? Cengage Learning

Student-to-Student Learning, Powered by FlashNotes Cengage Learning

Creating Career Success: A Flexible Plan for the World of WorkCengage Learning

Preparing Students for Career Success Cengage Learning

Plus de Cengage Learning (20)

Discovering History Through Digital Newspaper Collection

Are Your Students Ready for Lab?

5 Course Design Tips to Increase Engagement and Outcomes

The Journey to Digital: Incorporating Technology to Strengthen Critical Minds

Google Drive Plus TexQuest Equals a Match Made in Research Heaven

Improving Time Management: Tips that Will Help College Students Start the Yea...

Mind Tap Open Trial Cengage Learning

Getting Started with Enhanced WebAssign 8/11/15 Presented by: Mike Lafreniere...

Taming the Digital Tiger: Implementing a Successful Digital or 1:1 Initiative

Decimal and Fraction Jeopardy - A Game for Developmental Math

Game it up! Introducing Game Based Learning for Developmental Math

Overcoming Textbook Fatigue

Adult Student Success: How Does Awareness Correlate to Program Completion?

You're responsible for teaching, and your students are resonsible for learnin...

What is the Impact of the New Standard on the Intermediate Accounting Course?

The ABCs Approach to Goal Setting and Implementation

Competency-based Education: Out with the new, in with the old?

Student-to-Student Learning, Powered by FlashNotes

Creating Career Success: A Flexible Plan for the World of Work

Preparing Students for Career Success

Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

1. Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data Mark Frydenberg Bentley University

3. CourseMate Enhanced Edition

4. 77 Movies and TV Shows!

5. What's your Bacon Index? 2 Ann Joe 3 Bob 2 X 4 1 Kim Kevin

8. APIs

10. Friend of a Friend

11. Social Graph

12.

13. Big Data 'Big data' refers to a collection of tools, techniques and technologies which make it easy to work with data at any scale. powerof60.com

14. The Road

15.

16. 3 V's • Volume - amount of data is larger than those conventional relational database infrastructures can handle • Velocity - the rate at which data is generated, processed and analyzed in (real) time • Variety – data formats are unstructured and inconsistent

17. Volume: How Big is Big Data?

18. Yottabyte?

19. Walmart • Walmart collects more than 2.5 petabytes of data every hour from its customer transactions. • A petabyte is one quadrillion bytes, or the equivalent of about 20 million filing cabinets’ worth of text. http://hbr.org/2012/10/big-data-the-management-revolution/ar

20. Velocity: Drinking from the Firehose • Scrutinize 5 million trade events created each day to identify potential fraud • Analyze 500 million daily call detail records in real-time to predict customer churn faster

21. A Variety of Big Data Sources

22. McKinsey&Company Report (2011) • Data is part of every industry and business function. • Data creates value. • Big data becomes a basis of competition and growth. • Some sectors will achieve greater gains. • Shortage of people with analytical skills. • Need policies related to privacy, security, ownership.

23. Twitter

24. Twitter 3000 tweets per second data is disorganized How does twitter use its data?

25. Twitter Visualization

26.

27. Big Data Technologies • HADOOP: scalable storage, parallel computation • NoSQL: distributed querying

28. What this Means • Change your web page and Google finds it in minutes. • Ten years ago, you would have to submit a request to Yahoo! to reindex your site. • All you need is a lot of servers. • Google has a million of them. • No problem.

29. http://aws.amazon.com/big-data/

30.

31.

32. Collaborative Filtering

33. Collaborative Filtering The Black Black Camera Tripod Stallion Beauty Me You

34.

35. Variety: Semantic Web

36. RelFinder

37. Unstructured Data

38.

39. Health Care

40.

41.

42. Analyzing Big Data

43. explore.data.gov

44. Searching Big Data

45. Fusion Table Visualizations

46. Fusion Table Visualizations

47. Fusion Table Visualizations

48. Mark Frydenberg mfrydenberg@bentley.edu cis.bentley.edu/mfrydenberg CourseMate Enhanced Edition Invite me to your school!

Notes de l'éditeur

6 Degrees of Kevin Bacon, Name is Dumb Luck6 Degrees of Separation – within networks of people or things, there is a theoretical maximum of 6 points between any two nodesThat’s the Bacon IndexBob is 1, Ann is 2, Joe is 3. Index can only get so big because of interconnections.If Kim is connected to Bob, Kim is 2, not 4.
Twitter can’t be structured. Twitter is a bunch of words that humans are the best at parsingAnd so again we’re back to the 3 V’s, Volume, Velocity, and Variety. Not only is twitter’s data disorganized, it handles over 3000 new tweets per secondTwitter is using this data to recommend things to you, and it does it all lightning fast through an engine called Storm
If Amazon can see that lots of people buy forks and knives together, or that people buy curtains and curtain rods together how do they not recommend everyone who has bought a wrench set or a copy of black beauty buy them together if someone else has?This is where things get complicated
Twitter isn’t the only place where unstructured, realtime data is being processed. Facial recognition is a massive big data problemYour iPhone does facial recognition. Facebook does facial recognition. Aperture learns about faces from hundreds of data points and can help you find who is in what photos. Amazing.How do we do this so quickly?
Should it be opt-in only? http://www.code.org/sites/all/themes/codedotorg/logo.png
- Hereis a blood pressure monitor fromiHealththat stores yourblood pressure data in the cloud.
Here’s an appthat monitors yourheart rate fromyourphone’s camera, amazingstuffSo all thiswellness data isnowbeingcollectedubiquitously. How canitbeusedsecurely and effectively to make all of us healthier? This is the big data problem in health care

Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

Recommandé

Recommandé

Contenu connexe

Similaire à Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

Similaire à Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data (20)

Plus de Cengage Learning

Plus de Cengage Learning (20)

Course Tech 2013, Mark Frydenberg, Drinking from the Fire Hose: Tools for Interpreting and Teaching with Big Data

Notes de l'éditeur