2. 2
Confidential and
Proprietary VDPFinder
Inc., 2012
Big Data TrendConnect Vertical
IT
Big Data Demographic Summary Professional Data
10% Scientist/
Targeted Analyst
Community 2%
Number of Members 188,312
Number Tweets 632,674
Media/
Number Keywords Entered 130 Press
Number Keywords Extracted 1,436 12%
Unclassified
Number “IT Professionals” 19,112 36% Industry
“IT Professional” Incidence Rate Medium; 10% Analyst
1%
Start Date 1-June-2012
End Date 17-September-2012
Potential IT
“Reach” – Total Number Followers 107,325,975
10%
Average “Follower/Following” Ratio 16.9 Missing
Average “IT Pro” “Follower/Following” Ratio 7.0 Info
15% Marketing
7%
Recruiters/
Swear- Student Jobs
Spam 2% 3%
2%
3. 3
Confidential and
Proprietary VDPFinder
Inc., 2012
Big Data IT Professionals by Title
Developer/Programmer
Engineer
Technologist/IT Pro.
Architect
Systems Administrator
CIO/VP
DBA
IT Director/Manager
Business Analyst
0 2000 4000 6000 8000 10000
4. 4
Confidential and
Proprietary VDPFinder
Inc., 2012
Big Data Tweets
Top 30 Topics by Number of Tweets
Big Data Community Tweets by Topic
big data
#bigdata 136343
244320
• Tweets around verticals typically
analytics
data analytics
94543
62472
produce a “long tail” effect
hadoop
mongodb
61415
34928
▫ The top 10 topics account for almost two-
#io12 30692 thirds of all topics
data science 28558
machine learning 26868 ▫ Over time, new interest areas will emerge
big data analytics 24188 and the order of topics will change to help
cloud data 20813
social data 17439 identify new industry trends or changes in
#hadoop 15414 the market
hadoop data 13840
mobile data
data insights
12393
10847 • Not surprisingly, references to big
predictive analytics 10641
data scientist 10239
data and (especially) analytics
nosql 9824
real-time 7700
dominate the social media
artificial intelligence 6831
cloudera 6723
conversation.
business analytics
social media data
6619
6506 • Opensource and Nosql products are
#hana 6322
data discovery 6269 overwhelmingly mentioned in
data analysis 6182
#nosql 6020 conversation compared to
data mining 5745
hortonworks 5727 commercial products
real-time analytics 5589
5. 5
Confidential and
Big Data Conversations
Proprietary VDPFinder
Inc., 2012
Top 20 Topics by Type of Member
100%
#nosql
• Conversation of
90%
cloudera interest differ
real-time
significantly by
data scientist
80% type of member
mobile data
70%
predictive analytics
• IT Professionals
data insights
nosql tend to have very
60% hadoop data different
machine learning
conversations
50% social data
#hadoop relative to the rest
40% data science of the industry, at
cloud data
#io12
the moment focused
30%
big data analytics on open-source
mongodb
20% solutions.
hadoop
data analytics
10% analytics
#bigdata
0% big data
IT Professional Data Media/Press Industry Analyst
Scientist/Analyst
6. 6
Confidential and
Big Data Conversations
Proprietary VDPFinder
Inc., 2012
Top 20 Topics by Type of Member
Normalized for “Big Data”
100%
#nosql
• Conversation of
90% cloudera interest differ
real-time significantly by
80% data scientist
type of member
mobile data
70% predictive analytics • IT Professionals
data insights tend to have very
60% nosql
different
hadoop data
conversations
50% machine learning
social data relative to the rest
40% #hadoop of the industry, at
data science
the moment focused
30% cloud data
#io12
on open-source
20% big data analytics solutions.
mongodb
10% hadoop
data analytics
0% analytics
IT Professional Data Media/Press Industry Analyst
Scientist/Analyst
7. 7
Confidential and
Proprietary VDPFinder
Big Data Conversations: ITPros and Data Scientists
Inc., 2012
Top 20 Topics by Month
100% hadoop analytics • Conversations among
mapreduce IT pros and Data
90% social data Scientists are among
apache
the most relevant to the
data scientist
80% overall Big Data
predictive analytics
hortonworks industry
70% 10gen • Industry conference
cloudera
60% #nosql
Google IO12 had a big
data science
impact on end-users
50% hadoop data conversations in June
cloud data
• Conversations around
nosql
40% machine learning
Hadoop are beginning
big data analytics
to temper in
30% #hadoop July/August as
#io12 analytics discussions
20% data analytics
become more topical
analytics
mongodb
10%
hadoop
#bigdata
0% big data
JUNE JULY AUGUST SEPTEMBER
8. 8
Confidential and
Proprietary VDPFinder
Big Data Conversations: ITPros and Data Scientists
Inc., 2012
Top 20 Topics by Month
Normalized for “Big Data”
100% hadoop analytics • Conversations among
mapreduce IT pros and Data
90% social data Scientists are among
apache the most relevant to the
80% data scientist overall Big Data
predictive analytics
industry
70% hortonworks
10gen
• Industry conference
60% cloudera
Google IO12 had a big
#nosql impact on end-users
50% data science conversations in June
hadoop data • Conversations around
cloud data
40% Hadoop are beginning
nosql
to temper in
machine learning
30%
big data analytics
July/August as
#hadoop
analytics discussions
20% #io12
become more topical
data analytics
10% analytics
mongodb
0% hadoop
JUNE JULY AUGUST SEPTEMBER
9. 9
Confidential and
Proprietary VDPFinder
Inc., 2012
Big Data: Psychographic Profile of IT
professionals & Data Scientists:
-- Software Engineers, Architects, Developers
10. 10
Confidential and
Proprietary VDPFinder
Inc., 2012
Big Data: Psychographic Profile of IT
professionals & Data Scientists:
-- Software Engineers, Architects, Developers
• Titles • Descriptors
▫ SW Developer ▫ Geek/Nerd
▫ Engineer ▫ Hacker
▫ Architect ▫ Father/Dad
▫ Sysadmin ▫ Husband
▫ Consultant ▫ Entrepreneur
• Major Qualifications ▫ Founder
▫ Cloud • Likes
▫ Python ▫ Sports
▫ Hadoop ▫ Music
▫ Linux ▫ Photography
▫ Software ▫ Gamer
▫ Web
▫ Ruby
▫ php
▫ Sql
▫ security
▫ Java
▫ Android/apple/iOS
▫ Network