Get a preview of the contents in DZone's free 2014 Guide to Big Data.
2. ABOUT DZONE’S GUIDE TO BIG DATA
EXCLUSIVE PREVIEW OF DZONE’S 2014 GUIDE TO BIG DATA
DZONE.COM
DZone’s 2014 Guide to Big Data, released this week, offers a thorough and insightful look into the
growing world of big data through the lens of developers, data scientists, and other IT
professionals. The guide explores a number of facets of big data including a survey of big data
practitioners, a thoughtfully curated collection of introductory and expert articles, and a comprehensive
directory of tools and solutions for implementing big data strategies.
Contents of the Guide Include:
• BIG DATA SURVEY FINDINGS (850+ DEVELOPERS)
• ARTICLE: THE NO FLUFF INTRODUCTION TO BIG DATA
• ARTICLE: THE EVOLUTION OF MAPREDUCE AND HADOOP
• ARTICLE: THE DEVELOPER’S GUIDE TO DATA SCIENCE
• ARTICLE: THE DIY BIG DATA CLUSTER
• PRACTITIONER CHECKLIST: FINDING THE DATABASE FOR YOUR USE CASE
• BIG DATA SOLUTION & VENDOR DIRECTORY
• BIG DATA GLOSSARY
3. AN EXCERPT FROM OUR KEY RESEARCH FINDINGS
More than 850 IT professionals responded to DZone’s 2014 Big Data Survey:
• Developers (43%) and development team leads (26%) were the most common roles.
• 60% of respondents come from large organizations (100 or more employees) and 40% come from small organizations (under 100 employees).
• The majority of respondents are headquartered in the US (35%) or Europe (38%).
• Over half of the respondents (63%) have over 10 years of experience as IT professionals.
• A large majority of respondents’ organizations use Java (86%). Python is the next highest (37%).
SURVEY FINDINGS INCLUDE:
• Files and Logs are the Most Common Data Sources
• Unstructured Data is More Common in Hadoop and NoSQL Orgs
• Cloud Storage Customers Take Full Advantage
• Almost All Orgs Expect Their Storage Needs to Grow Exponentially
• Hadoop Usage is High Despite the Learning Curve
• Teams Running Hadoop and Column Stores Tend to Have Bigger Analytics Clusters
TO SEE THE COMPLETE SURVEY RESULTS, DOWNLOAD THE GUIDE HERE
4. FINDING THE DATABASE FOR YOUR USE CASE
This chart will help you find the best types of databases to try testing with your software.
DB Types
• Relational DB. Examples: MySQL, PostgreSQL, SQL Server
• Key-Value Store. Examples: Redis, Riak, DynamoDB
• Document Store. Examples: MongoDB, Couchbase, RavenDB
Strong Use Cases
Relational DB:
• When ACID transactions are required
• Looking up data by different keys with secondary indexes (also a feature of several NoSQL DBs)
• When strong consistency for results and queries is required
• Conventional online transaction processing
• Risk-averse projects seeking very mature technologies and widely available skills
• Products for enterprise customers that are more familiar with relational DBs
Key-Value Store:
• Handling lots of small, continuous, and potentially volatile reads and writes; also look for any DB with fast in-memory access or SSD storage
• Storing session information, user preferences, and e-commerce carts
• Simplifying the upgrade path of your software without having to build a schema migration framework
Document Store:
• Handling a wide variety of access patterns and data types
• Handling reads with low latency
• Handling frequently changing, user generated data
• Simplifying the upgrade path of your software without having to build a schema migration framework
• Deployment on a mobile device (Mobile Couchbase)
When high availability is crucial, and eventual consistency
Weak Use Cases
Relational DB:
• Systems that need to tolerate partition failures
• Schema-free management
• Handling any complex / rich entities that require you to do multiple joins to get the entire entity back
Key-Value Store:
• Correlating data between different sets of keys
• Saving multiple transactions
• Performing well during key searches based on values
• Operating on multiple keys (it’s only possible through the client side)
• Returning only partial values is required
• Updates in place are necessary
Document Store:
• Atomic cross-document operations (RavenDB is exempt from this weakness)
• Querying large aggregate data structures that frequently change
• Returning only partial values is required
• Joins are desired
• Foreign key usage is desired
• Partial updates of documents (especially child/sub-documents)
Early prototyping or situations where
TO SEE THE FULL CHART, DOWNLOAD THE GUIDE HERE
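To make one row of the chart concrete, here is a minimal sketch of why key-value stores suit session storage but are weak at "key searches based on values". It uses a plain Python dictionary as a hypothetical stand-in for a store; the `sessions` data and both helper functions are illustrative assumptions, not the API of Redis, Riak, or DynamoDB.

```python
# Hypothetical in-memory stand-in for a key-value store.
# This is NOT a real Redis, Riak, or DynamoDB client.
sessions = {
    "session:1001": {"user": "alice", "cart": ["book", "lamp"]},
    "session:1002": {"user": "bob", "cart": []},
    "session:1003": {"user": "carol", "cart": ["lamp"]},
}


def get_by_key(store, key):
    """Direct lookup by key: the access pattern a key-value store is built for."""
    return store.get(key)


def find_keys_by_value(store, predicate):
    """Search by value: without a secondary index, every entry is scanned."""
    return [key for key, value in store.items() if predicate(value)]


# Fetching one session by its key touches a single entry.
alice_session = get_by_key(sessions, "session:1001")

# Finding every session whose cart holds a lamp walks the whole store.
lamp_sessions = find_keys_by_value(sessions, lambda v: "lamp" in v["cart"])

print(alice_session["user"])
print(lamp_sessions)
```

The lookup by key stays cheap as the store grows, while the value search grows with the number of entries, which is roughly the trade-off the chart points to when it steers value-based queries toward databases with secondary indexes.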
5. Excerpt from DIVING DEEPER INTO BIG DATA: THE RESOURCES YOU NEED TO LEARN MORE
TOP 10 #BIGDATA TWITTER FEEDS
• @KirkDBorne
• @kdnuggets
• @data_nerd
• @BigDataGal
• @medriscoll
• @spyced
• @InformaticaCorp
• @marcusborba
• @jameskobielus
• @IBMbigdata
BIG DATA REFCARDZ
• Big Data Machine Learning: Patterns for Predictive Analytics bit.ly/WUFymk
• Apache HBase: The NoSQL Database for Hadoop and Big Data bit.ly/1nRgDYh
• Getting Started with Apache Hadoop bit.ly/1wnFbQI
• MongoDB: Flexible NoSQL for Humongous Data bit.ly/YEgXEi
BIG DATA LEARNING ZONES
• Big Data Zone (dzone.com/mz/big-data): We're on top of all the best tips and news for Hadoop, R, and data visualization technologies. We also give you advice from data science experts on how to understand and present that data.
• NoSQL Zone (dzone.com/mz/nosql): DZone's portal for following the news and trends of the non-relational database ecosystem, including solutions such as MongoDB, Cassandra, Redis, and many others.
• SQL Zone (dzone.com/mz/sql): DZone's portal for following the news and trends of the relational database ecosystem, which includes solutions such as MySQL, PostgreSQL, SQL Server, and many others.
DOWNLOAD THE GUIDE HERE
6. THANK YOU FOR READING
an excerpt from DZONE’S 2014 GUIDE TO BIG DATA
The full version of DZone’s Guide to Big Data includes expert and thought leader articles, a comparison tool of various solutions, and additional industry insights.
Visit dzone.com/research/guide-to-big-data to download the complete version of the 2014 Guide to Big Data for free!