This talk is an expanded version of my previous upload from August of this year. I delivered this presentation to the Chinese University of Hong Kong (CUHK)'s Department of Statistics. In it I provided an overview of big data, the techs and skills used in the space, and some engaging stories in the space. I also focused heavily on Hong Kong with details on Open Data projects ongoing in the city right now.
8. components: creating value from data
servers
traditional
DBMS
visualization
storage
columnar DBs
network
s
hardware
software
NoSQL
platforms
Hadoop
appliances
people
traditional
IT
platform
architects
comp.
scientists
stats.
people
15. privacy
• what is your expectation of your data’s
lifespan?
• what is the relationship between privacy and
intellectual property protection?
• do you know your digital exhaust?
• should you be compensated for helping
Google earn another billion dollars?
18. big data: where are we
today?
adapted from Gartner hype cycle
visibility/expectations
this will be caused by
a lack of statistics knowledge
time
trigger
inflated
disillusionment
productivity
expectations
enlightenment
28. want to get involved?
• decision tree:
– individual?
• learn: join G+ group, ask Scott for reading
recommendations
• work: Scott knows some recruiters and hiring
businesses
– research ideas?
• meet with CS students about HK data research platform
– social or civic engagement?
• come to regular ODHK meetings
29. changing future
• borderless big data will increasingly become
invasive. how will regional laws keep up?
• “free” services will shift money from many
small contributors to a few large businesses.
• data must be properly valued which requires a
market.
• those with computer science and statistics
skills will be very well paid for may years.
30. want more?
Google+: Hong Kong Big Data
http://www.infoincog.com/
scott@infoincog.com
all content by Scott Brady Drummonds – scott@infoincog.com
Notes de l'éditeur
$6.3B in 2012, $48.3B by 2018, CAGR 40.5%
2009: http://www.nature.com/nature/journal/v457/n7232/full/nature07634.htmlCDC, 60 years old, dept. of health and human services
Gartner, 2001, three v’s. veracity added by others later.
“too big from which to derive value”
76% of analysts use MS Excel: http://www.billingviews.com/microsoft-excel-king-analytics-hill/
US leads the globe by about six months.Asia trails the US by about 18 monthsHong Kong trails Asia by about six months
http://www.gov.hk/en/theme/psi/datasets/questions:what is the relationship between traffic and air pollution? (data joining)how does property value lead/trail changes in population (historical analysis)what are the trends for weather-related closures (trending)
http://www.gov.hk/en/theme/psi/datasets/questions:what is the relationship between traffic and air pollution? (data joining)how does property value lead/trail changes in population (historical analysis)what are the trends for weather-related closures (trending)
博文约礼
http://www.gov.hk/en/theme/psi/datasets/questions:what is the relationship between traffic and air pollution? (data joining)how does property value lead/trail changes in population (historical analysis)what are the trends for weather-related closures (trending)