SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
1!
Privacy, Ethics, and Future Uses of the Social Web"
Prepared for Owen Graduate School of Management (Vanderbilt University)!
April 3, 2014!
Matthew A. Russell (Chief Technology Officer @ Digital Reasoning)!
Twitter: @ptwobrussell & @dreasoning!
Overview!
•  Intro (5 mins)
•  Mining the Social Web (5 mins)
•  "Know thy data..." (10 mins)
•  "...and know thyself" (15 mins)
•  Wrap Up/Final Q&A (15 mins)
2!
INTRO!
3!
Hello, My Name Is ... Matthew!
•  Background in Computer Science
•  Data mining, AI, machine learning, etc.
•  CTO @ Digital Reasoning Systems
•  Moving toward cognitive computing
•  Author
•  5 published books on technology (just for fun)
•  CrossFit, triathlon, Bikram hot yoga
•  Stress management
4!
The only easy day was yesterday.
-- Motto of the U.S. Navy SEALs
5!
It pays to be a winner.
-- Motto of the U.S. Navy SEALs
6!
Mining the Social Web!
7!
Data Exhaust => Digital Fingerprints!
•  World population: ~7B people
•  Facebook: 1.15B users
•  Twitter: 500M users
•  Google+ 343M users
•  LinkedIn: 238M users
•  ~200M+ blogs (conservative estimate)
8!
•  An open source software (OSS) project
•  http://bit.ly/MiningTheSocialWeb2E
•  A (rewritten) book
•  http://bit.ly/135dHfs
•  Accessible to (virtually) everyone
•  Virtual machine with turn-key coding templates for
data science experiments
•  Think of the book as "premium" support for the OSS
project
Transforming Curiosity Into Insight!
9!
Table of Contents (1/2)!
•  Chapter 1 - Mining Twitter: Exploring Trending Topics,
Discovering What People Are Talking About, and More
•  Chapter 2 - Mining Facebook: Analyzing Fan Pages,
Examining Friendships, and More
•  Chapter 3 - Mining LinkedIn: Faceting Job Titles,
Clustering Colleagues, and More
•  Chapter 4 - Mining Google+: Computing Document
Similarity, Extracting Collocations, and More
•  Chapter 5 - Mining Web Pages: Using Natural Language
Processing to Understand Human Language,
Summarize Blog Posts, and More
•  Chapter 6 - Mining Mailboxes: Analyzing Who's Talking
to Whom About What, How Often, and More
10!
Table of Contents (2/2)!
•  Chapter 7 - Mining GitHub: Inspecting Software
Collaboration Habits, Building Interest Graphs, and More
•  Chapter 8 - Mining the Semantically Marked-Up Web:
Extracting Microformats, Inferencing over RDF, and More
•  Chapter 9 - Twitter Cookbook
•  Appendix A - Information About This Machine's Virtual
Machine Experience
•  Appendix B - OAuth Primer
•  Appendix C - Python and IPython Notebook Tips & Tricks
11!
Anatomy of Each Chapter!
•  Brief Intro
•  Objectives
•  API Primer
•  Analysis Technique(s)
•  Data Visualization
•  Recap
•  Suggested Exercises
•  Recommended Resources
12!
Why You Should Use IPython
Notebook!
•  Because it's great for hacking
•  And hacking is usually the first step
•  Because it's great for collaboration
•  Sharing/publishing results is trivial
•  Because the UX is as easy as working in a
notepad
•  Think of it as "executable paper"
•  In short, it's a terrific learning platform for
novices and experts alike
13!
14!
15!
"Know thy data..."!
16!
If we have data, let’s look at data. If we have
opinions, let’s go with mine.
--Jim Barksdale
17!
In God we trust. All others must bring data.
--W. Edwards Deming
18!
Communication => Data!
•  Communication
•  Senders
•  humans & machines
•  Messages
•  natural language, images, videos, etc.
•  Recipients
•  humans & machines
19!
Data Alchemy!
•  Data: Documents & document fragments (text
messages, etc.)
•  Information: "Assertions", summaries, tags, etc.
•  Knowledge: Aggregated, query-able information
•  Wisdom: “Compressed” knowledge
•  Gold: Money
20!
Data Mining = Curiosity + Stats!
•  Curiosity
•  Interests, desires, and intuitions
•  Statistics
•  Counting
•  Comparing
•  Filtering
•  Ranking
•  Hypothesis testing; knowledge discovery
21!
Machine Learning!
•  A program that learns (improves)
from experience (data)
according to some objective
•  Supervised learning
•  Unsupervised learning
•  Reinforcement learning
•  How to do it
•  Program mathematical
models and hope for the
best...
•  How to do it well
•  Program state-of-the-art
mathematical models with
sufficient representative data
22!
Knowledge is a process of piling up facts;
wisdom lies in their simplification.
--Martin Fischer
23!
Any sufficiently advanced technology is
indistinguishable from magic.
--Arthur C. Clarke
24!
"...and know thyself"!
25!
Is Privacy Already an Illusion?!
•  Digital happenings circa 2014
•  The Cloud
•  Social Media
•  Deep Learning
•  The Internet of Things
•  Internet.org
26!
Civilization is the progress toward
a society of privacy.
--Ayn Rand
27!
If you have something that you
don’t want anyone to know,
maybe you shouldn’t be doing it
in the first place.
-- Eric Schmidt, (former) CEO of Google
28!
Influences on Ethics!
•  Capitalism, economics, & marketing
•  A for-profit corporation's fiduciary duty: To
maximize the common stock's value
•  How to do it? By transacting commerce
•  How do it well? By advertising more effectively
than competitors
•  How to do it really well? With highly relevant
personalized ads (recommenders)
•  Terms of Service (ToS) - The legal extent of
ethical obligations?
29!
If you're not paying for the product, you
are the product.
-- Savvy consumers everywhere one day (?)
30!
For the wisdom of this world is
foolishness...
-- Saint Paul
31!
The Future of the Web...!
•  The Blue Pill: All of your precious data housed remotely
and controlled by a few of the world's most powerful
international corporations
•  The Red Pill: A distributed cloud controlled by no one
with decentralized data and anonymity online as the
status quo
•  The Purple Pill: Meet somewhere in the middle (?)
•  Significant legislative reforms concerning consumer
data (?)
•  Consumer education with more transparency (?)
•  Resurgence of local/offline storage and anonymity
online (?)
32!
The real danger is the gradual erosion of
individual liberties through automation,
integration, and interconnection of many
small, separate record-keeping systems,
each of which alone may seem innocuous,
even benevolent, and wholly justifiable.
-- Anonymous (U. S. Privacy Study Commission, 1977)
33!
.	
  
There are two primary choices in life:
to accept conditions as they exist,
or accept the responsibility for
changing them.
-- Dennis Waitley
34!
WRAP-UP / Q&A!
35!

Contenu connexe

En vedette

Mining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMatthew Russell
 
Mining Social Web APIs with IPython Notebook (PyCon 2014)
Mining Social Web APIs with IPython Notebook (PyCon 2014)Mining Social Web APIs with IPython Notebook (PyCon 2014)
Mining Social Web APIs with IPython Notebook (PyCon 2014)Matthew Russell
 
What convnets look at when they look at nudity
What convnets look at when they look at nudityWhat convnets look at when they look at nudity
What convnets look at when they look at nudityRyan Compton
 
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)Matthew Russell
 
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...Maryam Farooq
 
Using cognitive computing to better analyze human communication
Using cognitive computing to better analyze human communicationUsing cognitive computing to better analyze human communication
Using cognitive computing to better analyze human communicationDigital Reasoning
 
Mining the Social Web for Fun & Profit Within Your Organization
Mining the Social Web for Fun & Profit Within Your OrganizationMining the Social Web for Fun & Profit Within Your Organization
Mining the Social Web for Fun & Profit Within Your OrganizationDigital Reasoning
 
Tim Estes - Generating dynamic social networks from large scale unstructured ...
Tim Estes - Generating dynamic social networks from large scale unstructured ...Tim Estes - Generating dynamic social networks from large scale unstructured ...
Tim Estes - Generating dynamic social networks from large scale unstructured ...Digital Reasoning
 
Tim Estes - Information Systems in an Entity Centric World
Tim Estes - Information Systems in an Entity Centric WorldTim Estes - Information Systems in an Entity Centric World
Tim Estes - Information Systems in an Entity Centric WorldDigital Reasoning
 
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...Got Chaos? Extracting Business Intelligence from Email with Natural Language ...
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...Digital Reasoning
 
Unleashing twitter data for fun and insight
Unleashing twitter data for fun and insightUnleashing twitter data for fun and insight
Unleashing twitter data for fun and insightDigital Reasoning
 
Mining the Geo Needles in the Social Haystack
Mining the Geo Needles in the Social HaystackMining the Geo Needles in the Social Haystack
Mining the Geo Needles in the Social HaystackMatthew Russell
 
NYAI #5 - Fun With Neural Nets by Jason Yosinski
NYAI #5 - Fun With Neural Nets by Jason YosinskiNYAI #5 - Fun With Neural Nets by Jason Yosinski
NYAI #5 - Fun With Neural Nets by Jason YosinskiRizwan Habib
 
Building Tooling And Culture Together
Building Tooling And Culture TogetherBuilding Tooling And Culture Together
Building Tooling And Culture TogetherNishan Subedi
 
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...Rizwan Habib
 
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...Rizwan Habib
 
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)Matthew Russell
 
NYAI #9: Concepts and Questions As Programs by Brenden Lake
NYAI #9: Concepts and Questions As Programs by Brenden LakeNYAI #9: Concepts and Questions As Programs by Brenden Lake
NYAI #9: Concepts and Questions As Programs by Brenden LakeRizwan Habib
 
Virtual Madness @ Etsy
Virtual Madness @ EtsyVirtual Madness @ Etsy
Virtual Madness @ EtsyNishan Subedi
 

En vedette (20)

Mining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started Guide
 
Mining Social Web APIs with IPython Notebook (PyCon 2014)
Mining Social Web APIs with IPython Notebook (PyCon 2014)Mining Social Web APIs with IPython Notebook (PyCon 2014)
Mining Social Web APIs with IPython Notebook (PyCon 2014)
 
What convnets look at when they look at nudity
What convnets look at when they look at nudityWhat convnets look at when they look at nudity
What convnets look at when they look at nudity
 
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)
Mining Social Web APIs with IPython Notebook (Data Day Texas 2015)
 
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
NYAI #10: Building an AI Autonomous Agent Using Supervised Learning with Denn...
 
Using cognitive computing to better analyze human communication
Using cognitive computing to better analyze human communicationUsing cognitive computing to better analyze human communication
Using cognitive computing to better analyze human communication
 
Mining the Social Web for Fun & Profit Within Your Organization
Mining the Social Web for Fun & Profit Within Your OrganizationMining the Social Web for Fun & Profit Within Your Organization
Mining the Social Web for Fun & Profit Within Your Organization
 
Tim Estes - Generating dynamic social networks from large scale unstructured ...
Tim Estes - Generating dynamic social networks from large scale unstructured ...Tim Estes - Generating dynamic social networks from large scale unstructured ...
Tim Estes - Generating dynamic social networks from large scale unstructured ...
 
Tim Estes - Information Systems in an Entity Centric World
Tim Estes - Information Systems in an Entity Centric WorldTim Estes - Information Systems in an Entity Centric World
Tim Estes - Information Systems in an Entity Centric World
 
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...Got Chaos? Extracting Business Intelligence from Email with Natural Language ...
Got Chaos? Extracting Business Intelligence from Email with Natural Language ...
 
Unleashing twitter data for fun and insight
Unleashing twitter data for fun and insightUnleashing twitter data for fun and insight
Unleashing twitter data for fun and insight
 
How to Build a Tech Team
How to Build a Tech TeamHow to Build a Tech Team
How to Build a Tech Team
 
Mining the Geo Needles in the Social Haystack
Mining the Geo Needles in the Social HaystackMining the Geo Needles in the Social Haystack
Mining the Geo Needles in the Social Haystack
 
NYAI #5 - Fun With Neural Nets by Jason Yosinski
NYAI #5 - Fun With Neural Nets by Jason YosinskiNYAI #5 - Fun With Neural Nets by Jason Yosinski
NYAI #5 - Fun With Neural Nets by Jason Yosinski
 
Building Tooling And Culture Together
Building Tooling And Culture TogetherBuilding Tooling And Culture Together
Building Tooling And Culture Together
 
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...
NYAI #7 - Using Data Science to Operationalize Machine Learning by Matthew Ru...
 
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...
NYAI #7 - Top-down vs. Bottom-up Computational Creativity by Dr. Cole D. Ingr...
 
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)
Why Twitter Is All The Rage: A Data Miner's Perspective (PyTN 2014)
 
NYAI #9: Concepts and Questions As Programs by Brenden Lake
NYAI #9: Concepts and Questions As Programs by Brenden LakeNYAI #9: Concepts and Questions As Programs by Brenden Lake
NYAI #9: Concepts and Questions As Programs by Brenden Lake
 
Virtual Madness @ Etsy
Virtual Madness @ EtsyVirtual Madness @ Etsy
Virtual Madness @ Etsy
 

Similaire à Privacy, Ethics, and Future Uses of the Social Web

Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)Thinkful
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)Thinkful
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Thinkful
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptxAkhirulAminulloh2
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxHASHEMHASH
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsSloan Carne
 
Introduction to Digital Life (March 2017)
Introduction to Digital Life (March 2017)Introduction to Digital Life (March 2017)
Introduction to Digital Life (March 2017)KR_Barker
 
Thinkful DC - Intro to Data Science
Thinkful DC - Intro to Data Science Thinkful DC - Intro to Data Science
Thinkful DC - Intro to Data Science TJ Stalcup
 
Intro to Data Science
Intro to Data ScienceIntro to Data Science
Intro to Data ScienceTJ Stalcup
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalCarsten Eickhoff
 
Introduction to Digital Life (October 2016)
Introduction to Digital Life (October 2016)Introduction to Digital Life (October 2016)
Introduction to Digital Life (October 2016)KR_Barker
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantLynne Thomas
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataHamilton Public Library
 
What you did last summer?
What you did last summer?What you did last summer?
What you did last summer?DoThinger
 
Online Privacy, the next Battleground
Online Privacy, the next BattlegroundOnline Privacy, the next Battleground
Online Privacy, the next BattlegroundSensePost
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCTJ Stalcup
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiProfessor Lili Saghafi
 

Similaire à Privacy, Ethics, and Future Uses of the Social Web (20)

Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Getting started in data science (4:3)
Getting started in data science (4:3)Getting started in data science (4:3)
Getting started in data science (4:3)
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
datamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptxdatamining_Lecture_1(introduction).pptx
datamining_Lecture_1(introduction).pptx
 
Advanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU InvestigatorsAdvanced Research Investigations for SIU Investigators
Advanced Research Investigations for SIU Investigators
 
Introduction to Digital Life (March 2017)
Introduction to Digital Life (March 2017)Introduction to Digital Life (March 2017)
Introduction to Digital Life (March 2017)
 
Thinkful DC - Intro to Data Science
Thinkful DC - Intro to Data Science Thinkful DC - Intro to Data Science
Thinkful DC - Intro to Data Science
 
Intro to Data Science
Intro to Data ScienceIntro to Data Science
Intro to Data Science
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Lecture4 Social Web
Lecture4 Social Web Lecture4 Social Web
Lecture4 Social Web
 
Introduction to Digital Life (October 2016)
Introduction to Digital Life (October 2016)Introduction to Digital Life (October 2016)
Introduction to Digital Life (October 2016)
 
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership GrantPOWRR Tools: Lessons learned from an IMLS National Leadership Grant
POWRR Tools: Lessons learned from an IMLS National Leadership Grant
 
APLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with DataAPLIC 2012: Discovering & Dealing with Data
APLIC 2012: Discovering & Dealing with Data
 
What you did last summer?
What you did last summer?What you did last summer?
What you did last summer?
 
Online Privacy, the next Battleground
Online Privacy, the next BattlegroundOnline Privacy, the next Battleground
Online Privacy, the next Battleground
 
Thinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DCThinkful - Intro to Data Science - Washington DC
Thinkful - Intro to Data Science - Washington DC
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
 

Plus de Matthew Russell

Mining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMatthew Russell
 
Why Twitter Is All the Rage: A Data Miner's Perspective
Why Twitter Is All the Rage: A Data Miner's PerspectiveWhy Twitter Is All the Rage: A Data Miner's Perspective
Why Twitter Is All the Rage: A Data Miner's PerspectiveMatthew Russell
 
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014Mining Social Web APIs with IPython Notebook - Data Day Texas 2014
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014Matthew Russell
 
Mining Social Web APIs with IPython Notebook (Strata 2013)
Mining Social Web APIs with IPython Notebook (Strata 2013)Mining Social Web APIs with IPython Notebook (Strata 2013)
Mining Social Web APIs with IPython Notebook (Strata 2013)Matthew Russell
 
Mining Social Web Data Like a Pro: Four Steps to Success
Mining Social Web Data Like a Pro: Four Steps to SuccessMining Social Web Data Like a Pro: Four Steps to Success
Mining Social Web Data Like a Pro: Four Steps to SuccessMatthew Russell
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightMatthew Russell
 

Plus de Matthew Russell (6)

Mining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started GuideMining the Social Web for Fun and Profit: A Getting Started Guide
Mining the Social Web for Fun and Profit: A Getting Started Guide
 
Why Twitter Is All the Rage: A Data Miner's Perspective
Why Twitter Is All the Rage: A Data Miner's PerspectiveWhy Twitter Is All the Rage: A Data Miner's Perspective
Why Twitter Is All the Rage: A Data Miner's Perspective
 
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014Mining Social Web APIs with IPython Notebook - Data Day Texas 2014
Mining Social Web APIs with IPython Notebook - Data Day Texas 2014
 
Mining Social Web APIs with IPython Notebook (Strata 2013)
Mining Social Web APIs with IPython Notebook (Strata 2013)Mining Social Web APIs with IPython Notebook (Strata 2013)
Mining Social Web APIs with IPython Notebook (Strata 2013)
 
Mining Social Web Data Like a Pro: Four Steps to Success
Mining Social Web Data Like a Pro: Four Steps to SuccessMining Social Web Data Like a Pro: Four Steps to Success
Mining Social Web Data Like a Pro: Four Steps to Success
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and Insight
 

Dernier

Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxeditsforyah
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一z xss
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Sonam Pathan
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Sonam Pathan
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationMarko4394
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhimiss dipika
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书zdzoqco
 

Dernier (17)

Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
Q4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptxQ4-1-Illustrating-Hypothesis-Testing.pptx
Q4-1-Illustrating-Hypothesis-Testing.pptx
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170Call Girls Near The Suryaa Hotel New Delhi 9873777170
Call Girls Near The Suryaa Hotel New Delhi 9873777170
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
NSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentationNSX-T and Service Interfaces presentation
NSX-T and Service Interfaces presentation
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhi
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
 

Privacy, Ethics, and Future Uses of the Social Web

  • 1. 1! Privacy, Ethics, and Future Uses of the Social Web" Prepared for Owen Graduate School of Management (Vanderbilt University)! April 3, 2014! Matthew A. Russell (Chief Technology Officer @ Digital Reasoning)! Twitter: @ptwobrussell & @dreasoning!
  • 2. Overview! •  Intro (5 mins) •  Mining the Social Web (5 mins) •  "Know thy data..." (10 mins) •  "...and know thyself" (15 mins) •  Wrap Up/Final Q&A (15 mins) 2!
  • 4. Hello, My Name Is ... Matthew! •  Background in Computer Science •  Data mining, AI, machine learning, etc. •  CTO @ Digital Reasoning Systems •  Moving toward cognitive computing •  Author •  5 published books on technology (just for fun) •  CrossFit, triathlon, Bikram hot yoga •  Stress management 4!
  • 5. The only easy day was yesterday. -- Motto of the U.S. Navy SEALs 5!
  • 6. It pays to be a winner. -- Motto of the U.S. Navy SEALs 6!
  • 8. Data Exhaust => Digital Fingerprints! •  World population: ~7B people •  Facebook: 1.15B users •  Twitter: 500M users •  Google+ 343M users •  LinkedIn: 238M users •  ~200M+ blogs (conservative estimate) 8!
  • 9. •  An open source software (OSS) project •  http://bit.ly/MiningTheSocialWeb2E •  A (rewritten) book •  http://bit.ly/135dHfs •  Accessible to (virtually) everyone •  Virtual machine with turn-key coding templates for data science experiments •  Think of the book as "premium" support for the OSS project Transforming Curiosity Into Insight! 9!
  • 10. Table of Contents (1/2)! •  Chapter 1 - Mining Twitter: Exploring Trending Topics, Discovering What People Are Talking About, and More •  Chapter 2 - Mining Facebook: Analyzing Fan Pages, Examining Friendships, and More •  Chapter 3 - Mining LinkedIn: Faceting Job Titles, Clustering Colleagues, and More •  Chapter 4 - Mining Google+: Computing Document Similarity, Extracting Collocations, and More •  Chapter 5 - Mining Web Pages: Using Natural Language Processing to Understand Human Language, Summarize Blog Posts, and More •  Chapter 6 - Mining Mailboxes: Analyzing Who's Talking to Whom About What, How Often, and More 10!
  • 11. Table of Contents (2/2)! •  Chapter 7 - Mining GitHub: Inspecting Software Collaboration Habits, Building Interest Graphs, and More •  Chapter 8 - Mining the Semantically Marked-Up Web: Extracting Microformats, Inferencing over RDF, and More •  Chapter 9 - Twitter Cookbook •  Appendix A - Information About This Machine's Virtual Machine Experience •  Appendix B - OAuth Primer •  Appendix C - Python and IPython Notebook Tips & Tricks 11!
  • 12. Anatomy of Each Chapter! •  Brief Intro •  Objectives •  API Primer •  Analysis Technique(s) •  Data Visualization •  Recap •  Suggested Exercises •  Recommended Resources 12!
  • 13. Why You Should Use IPython Notebook! •  Because it's great for hacking •  And hacking is usually the first step •  Because it's great for collaboration •  Sharing/publishing results is trivial •  Because the UX is as easy as working in a notepad •  Think of it as "executable paper" •  In short, it's a terrific learning platform for novices and experts alike 13!
  • 14. 14!
  • 15. 15!
  • 17. If we have data, let’s look at data. If we have opinions, let’s go with mine. --Jim Barksdale 17!
  • 18. In God we trust. All others must bring data. --W. Edwards Deming 18!
  • 19. Communication => Data! •  Communication •  Senders •  humans & machines •  Messages •  natural language, images, videos, etc. •  Recipients •  humans & machines 19!
  • 20. Data Alchemy! •  Data: Documents & document fragments (text messages, etc.) •  Information: "Assertions", summaries, tags, etc. •  Knowledge: Aggregated, query-able information •  Wisdom: “Compressed” knowledge •  Gold: Money 20!
  • 21. Data Mining = Curiosity + Stats! •  Curiosity •  Interests, desires, and intuitions •  Statistics •  Counting •  Comparing •  Filtering •  Ranking •  Hypothesis testing; knowledge discovery 21!
  • 22. Machine Learning! •  A program that learns (improves) from experience (data) according to some objective •  Supervised learning •  Unsupervised learning •  Reinforcement learning •  How to do it •  Program mathematical models and hope for the best... •  How to do it well •  Program state-of-the-art mathematical models with sufficient representative data 22!
  • 23. Knowledge is a process of piling up facts; wisdom lies in their simplification. --Martin Fischer 23!
  • 24. Any sufficiently advanced technology is indistinguishable from magic. --Arthur C. Clarke 24!
  • 26. Is Privacy Already an Illusion?! •  Digital happenings circa 2014 •  The Cloud •  Social Media •  Deep Learning •  The Internet of Things •  Internet.org 26!
  • 27. Civilization is the progress toward a society of privacy. --Ayn Rand 27!
  • 28. If you have something that you don’t want anyone to know, maybe you shouldn’t be doing it in the first place. -- Eric Schmidt, (former) CEO of Google 28!
  • 29. Influences on Ethics! •  Capitalism, economics, & marketing •  A for-profit corporation's fiduciary duty: To maximize the common stock's value •  How to do it? By transacting commerce •  How do it well? By advertising more effectively than competitors •  How to do it really well? With highly relevant personalized ads (recommenders) •  Terms of Service (ToS) - The legal extent of ethical obligations? 29!
  • 30. If you're not paying for the product, you are the product. -- Savvy consumers everywhere one day (?) 30!
  • 31. For the wisdom of this world is foolishness... -- Saint Paul 31!
  • 32. The Future of the Web...! •  The Blue Pill: All of your precious data housed remotely and controlled by a few of the world's most powerful international corporations •  The Red Pill: A distributed cloud controlled by no one with decentralized data and anonymity online as the status quo •  The Purple Pill: Meet somewhere in the middle (?) •  Significant legislative reforms concerning consumer data (?) •  Consumer education with more transparency (?) •  Resurgence of local/offline storage and anonymity online (?) 32!
  • 33. The real danger is the gradual erosion of individual liberties through automation, integration, and interconnection of many small, separate record-keeping systems, each of which alone may seem innocuous, even benevolent, and wholly justifiable. -- Anonymous (U. S. Privacy Study Commission, 1977) 33!
  • 34. .   There are two primary choices in life: to accept conditions as they exist, or accept the responsibility for changing them. -- Dennis Waitley 34!