2. 2
Special thanks to Amazon Web Services
for supporting AWS's credit to run
EMR Hadoop cluster
3. 3
Schedule
13 March
– 16.00 - 18.00 Workshop / Demo on Big Data Analytics
using Amazon EMR
– 18.00: Start registration for those who interested in running
the cluster for 30 Hours & Account access to Amazon EMR
will be given
14 March
– 06.00 Amazon EMR Cluster will be opened
– Participant will be discussed via online / Social Media
15 March (@ EGA Office)
– 12.00 Amazon EMR will be closed
– 13.00 Presentation by each competitor on the result
– 15.30 Winner Announcement
5. 5
Hadoop Cluster for the challenge
10 AWS’s m3.xlarge EC2 server each with
4vCPU, 15 GByte Memory, 80 GB SSD Memory
A sample data set with more than 10 million
records will be given
6. 6
Challenge rules
A competitor can use a sample data to analyse
with Hive, Pig or Map/Reduce
In addition, a competitor can use own large set of
data.
A winner will be judged from those who have a
best innovation / result from the analytics.
Those who are just would like to try using the
cluster are also welcome
9. 9
Awards
The best winner will receive an Apple TV.
Two winners will be selected for two free training
courses on
– Big Data using Hadoop Workshop; 30-31 March 2015
– Business Intelligence Design and Process; 18-20, 25-26
May 2015
Starbucks Card 200 Baht
15. Thanachart Numnonda, thanachart@imcinstitute.com Feb 2015Big Data Hadoop on Amazon EMR – Hands On Workshop
Creating a cluster in EMR (cont.)
Leave the Hardware Configuration as default
Choose an exisitng EC2 key pair
17. Thanachart Numnonda, thanachart@imcinstitute.com Feb 2015Big Data Hadoop on Amazon EMR – Hands On Workshop
EMR Cluster Details
Note on the Master public DNS:
To see the details on how to connect to the Master Node using SSH click at SSH
19. 19
Set Up an SSH Tunnel to the Master Node
– See instruction at
– http://docs.aws.amazon.com/ElasticMapReduce/latest/
DeveloperGuide/emr-ssh-tunnel.html
23. 23
Launch the Hue Web Interface
Set Up an SSH Tunnel to the Master Node
– See instruction at
– http://docs.aws.amazon.com/ElasticMapReduce/latest/Devel
operGuide/emr-ssh-tunnel.html
Configure Proxy Settings to View Websites
– See instruction at
– http://docs.aws.amazon.com/ElasticMapReduce/latest/Devel
operGuide/emr-connect-master-node-proxy.html
24. 24
Launch the Hue Web Interface (Cont.)
http://master-public-dns-name:8888/
40. 40
Registration
Provide your name, organization, mobile, e-mail
address
On-site registartion at 17.00 pm, 13 March
E-mail: contact@imcinstitute.com
Facebook message to Thanachart Numnonda
Your username & password & key & public DNS will
be send to your e-mail by 6 am, 14 March