2. Who am I?
• Software guy
• Technology leader with experience in software
development as CTOs and development managers
of mid-sized teams.
• Doing big data hands-on since 2009
• Running http://meetup.com/bigdatabe since 2011
(1700 members!)
2
@wimvanleuven
wim@bigboards.io
5. “Big data is data that exceeds the processing
capacity of conventional database systems.
The data is too big, moves too fast, or doesn’t fit
the strictures of your database architectures.”
5
–Edd Dumbill, O’Reilly
What is big data?
http://radar.oreilly.com/2012/01/what-is-big-data.html
10. What is Big Data not?
• not a delivery model (on-premise vs hosted vs
cloud vs IaaS/PaaS/SaaS vs serverless)
• not a deployment model (private, public, hybrid)
• not a revenue model (license vs subscription vs
Pay-as-you-Go)
• not software architecture
10
11. “We don’t do Hadoop because we have Big
Data; we do Big Data because we have
Hadoop.”
11
–Unknown developer, Facebook
What is Big Data? — revisited
12. New tools and technologies to capture and
process data on a cluster of commodity
hardware so that the system acts as one,
is resilient to failures and scales linearly.
12
What is Big Data? — revisited
13. Big Data is no panacea
13
• First decide what problem you want to solve; pick
a real business problem to add immediate value
• Start small, the technology is made for linear
scalability (a 3-node cluster is a cluster!)
• Then become lean: learn through experimentation
14. Big Data challenges
• Beware of hype, Big Data - washing and fad
• Tech infancy
• IT | Biz
• Data is hard
• Lack of skills!
14
15. Benefits
• Scalability of course
• Collect more and more data
• Robustness inherent to the setup
• More predictable performance
15