SlideShare a Scribd company logo
1 of 18
Download to read offline
Web
Scrapping
with
ABOUT ME
Hengki Sihombing
Building Karejo.com - Organizer JakartaJS
twitter @hengkiardo
github @aredo
hengki@karejo.com
Schedule
- What is Web Scrapping
- Why we do that
- How we do that in Node.js
- Our Target
- Demo Code
What is Web
Scrapping
Web scraping is a computer
software technique of
extracting information from
websites.
https://en.wikipedia.org/wiki/Web_scraping
Why we do that
• To get more accessible data
• Improve our hacking skill
• Have fun and build some nice
project
Some product doing
Web scrapping
• Wego, Skyscanner
• Flipboard, Instapaper, Nuzzel
• Telunjuk.com
• Karejo.com
• etc..
How we do
with
Our Dependencies
Request
Cheerio
Async
Mongoose
Express
Agenda
Step by step
Scapping
Request
Cheerio
Our Target
DemoCode
Agenda
Please!!
Let’s Join Us
Software Engineer
Front-End Developer
UI/UX Designer
let me know by email :
hengki@karejo.com
Thank you
https://jakartajs-join.herokuapp.com
meetup.com/JakartaJS

More Related Content

What's hot

SlickGrid Touch: Making complex JavaScript widgets work on mobile devices
SlickGrid Touch: Making complex JavaScript widgets work on mobile devicesSlickGrid Touch: Making complex JavaScript widgets work on mobile devices
SlickGrid Touch: Making complex JavaScript widgets work on mobile devicesreebalazs
 
Building fast webapps, fast - Velocity 2010
Building fast webapps, fast - Velocity 2010Building fast webapps, fast - Velocity 2010
Building fast webapps, fast - Velocity 2010marcuswestin
 
Automated perf optimization - jQuery Conference
Automated perf optimization - jQuery ConferenceAutomated perf optimization - jQuery Conference
Automated perf optimization - jQuery ConferenceMatthew Lancaster
 
Wulin kungfu final
Wulin kungfu finalWulin kungfu final
Wulin kungfu finalJimmy Huang
 
Web Front End Performance
Web Front End PerformanceWeb Front End Performance
Web Front End PerformanceChris Love
 
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012WordCamp Sydney
 
WeBB MeetUp#1 Web applications caching techniques
WeBB MeetUp#1 Web applications caching techniquesWeBB MeetUp#1 Web applications caching techniques
WeBB MeetUp#1 Web applications caching techniquesMaciej Grajcarek
 
High performance java script why everything youve been taught is wrong
High performance java script why everything youve been taught is wrongHigh performance java script why everything youve been taught is wrong
High performance java script why everything youve been taught is wrongTao Gao
 
Meebo performance ny_web_performance
Meebo performance ny_web_performanceMeebo performance ny_web_performance
Meebo performance ny_web_performancemarcuswestin
 
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 20173 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017Alexandra_CaptainForm
 
5 Quick JavaScript Performance Improvement Tips
5 Quick JavaScript Performance Improvement Tips5 Quick JavaScript Performance Improvement Tips
5 Quick JavaScript Performance Improvement TipsTroy Miles
 
Staying Connected: Securing Your WordPress Website
Staying Connected: Securing Your WordPress WebsiteStaying Connected: Securing Your WordPress Website
Staying Connected: Securing Your WordPress WebsiteRaymund Mitchell
 
Top 10 WordPress Plugins
Top 10 WordPress PluginsTop 10 WordPress Plugins
Top 10 WordPress PluginsManny Sarmiento
 
Transforming Front-End Disaster Code™ Into A Maintainable Masterpiece
Transforming Front-End Disaster Code™ Into A Maintainable MasterpieceTransforming Front-End Disaster Code™ Into A Maintainable Masterpiece
Transforming Front-End Disaster Code™ Into A Maintainable MasterpieceDan Gribbin
 
WordCamp Birmingham 2015 - Theme building workshop
WordCamp Birmingham 2015 - Theme building workshopWordCamp Birmingham 2015 - Theme building workshop
WordCamp Birmingham 2015 - Theme building workshopJonny Allbut
 
New Perspectives on Performance
New Perspectives on PerformanceNew Perspectives on Performance
New Perspectives on Performancemennovanslooten
 
Prerendering with Nokogiri
Prerendering with NokogiriPrerendering with Nokogiri
Prerendering with NokogiriAlyss Noland
 
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deploymentDuc Lai Trung Minh
 

What's hot (20)

SlickGrid Touch: Making complex JavaScript widgets work on mobile devices
SlickGrid Touch: Making complex JavaScript widgets work on mobile devicesSlickGrid Touch: Making complex JavaScript widgets work on mobile devices
SlickGrid Touch: Making complex JavaScript widgets work on mobile devices
 
Building fast webapps, fast - Velocity 2010
Building fast webapps, fast - Velocity 2010Building fast webapps, fast - Velocity 2010
Building fast webapps, fast - Velocity 2010
 
Automated perf optimization - jQuery Conference
Automated perf optimization - jQuery ConferenceAutomated perf optimization - jQuery Conference
Automated perf optimization - jQuery Conference
 
Wulin kungfu final
Wulin kungfu finalWulin kungfu final
Wulin kungfu final
 
Web Front End Performance
Web Front End PerformanceWeb Front End Performance
Web Front End Performance
 
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012
What Multisite can do for You - Anthony Cole - WordCamp Sydney 2012
 
WeBB MeetUp#1 Web applications caching techniques
WeBB MeetUp#1 Web applications caching techniquesWeBB MeetUp#1 Web applications caching techniques
WeBB MeetUp#1 Web applications caching techniques
 
High performance java script why everything youve been taught is wrong
High performance java script why everything youve been taught is wrongHigh performance java script why everything youve been taught is wrong
High performance java script why everything youve been taught is wrong
 
Meebo performance ny_web_performance
Meebo performance ny_web_performanceMeebo performance ny_web_performance
Meebo performance ny_web_performance
 
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 20173 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017
3 Gifts My Users Gave Me - Alexandra Draghici - WordCamp Europe 2017
 
5 Quick JavaScript Performance Improvement Tips
5 Quick JavaScript Performance Improvement Tips5 Quick JavaScript Performance Improvement Tips
5 Quick JavaScript Performance Improvement Tips
 
Staying Connected: Securing Your WordPress Website
Staying Connected: Securing Your WordPress WebsiteStaying Connected: Securing Your WordPress Website
Staying Connected: Securing Your WordPress Website
 
A team 43 C
A team 43 CA team 43 C
A team 43 C
 
15 months of AMP
15 months of AMP15 months of AMP
15 months of AMP
 
Top 10 WordPress Plugins
Top 10 WordPress PluginsTop 10 WordPress Plugins
Top 10 WordPress Plugins
 
Transforming Front-End Disaster Code™ Into A Maintainable Masterpiece
Transforming Front-End Disaster Code™ Into A Maintainable MasterpieceTransforming Front-End Disaster Code™ Into A Maintainable Masterpiece
Transforming Front-End Disaster Code™ Into A Maintainable Masterpiece
 
WordCamp Birmingham 2015 - Theme building workshop
WordCamp Birmingham 2015 - Theme building workshopWordCamp Birmingham 2015 - Theme building workshop
WordCamp Birmingham 2015 - Theme building workshop
 
New Perspectives on Performance
New Perspectives on PerformanceNew Perspectives on Performance
New Perspectives on Performance
 
Prerendering with Nokogiri
Prerendering with NokogiriPrerendering with Nokogiri
Prerendering with Nokogiri
 
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment
[MSPVN - Azure Workshop] Day 1 - Azure Web App with WordPress deployment
 

Viewers also liked

Jakarta js meetup kudo
Jakarta js meetup kudoJakarta js meetup kudo
Jakarta js meetup kudoAgung Julisman
 
WINNING STORY STARTS WITH US
WINNING STORY STARTS WITH USWINNING STORY STARTS WITH US
WINNING STORY STARTS WITH USVani Vandana
 
Music box startup next nov 2014
Music box startup next nov 2014Music box startup next nov 2014
Music box startup next nov 2014Michael Hoy
 
fall 2015 annual appeal newsletter
fall 2015 annual appeal newsletterfall 2015 annual appeal newsletter
fall 2015 annual appeal newsletterStephanie Koclanis
 
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠN
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠNCÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠN
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠNTan Nguyen
 
High Performance NodeJS
High Performance NodeJSHigh Performance NodeJS
High Performance NodeJSDicoding
 
Susan Peterson Résumé
Susan Peterson RésuméSusan Peterson Résumé
Susan Peterson RésuméSusan Peterson
 

Viewers also liked (13)

OLX-Tech-Talk
OLX-Tech-TalkOLX-Tech-Talk
OLX-Tech-Talk
 
Node.JS Workshop
Node.JS WorkshopNode.JS Workshop
Node.JS Workshop
 
Jakarta js meetup kudo
Jakarta js meetup kudoJakarta js meetup kudo
Jakarta js meetup kudo
 
How to start a hope squad
How to start a hope squadHow to start a hope squad
How to start a hope squad
 
2628 10
2628 102628 10
2628 10
 
Resume
ResumeResume
Resume
 
WINNING STORY STARTS WITH US
WINNING STORY STARTS WITH USWINNING STORY STARTS WITH US
WINNING STORY STARTS WITH US
 
Music box startup next nov 2014
Music box startup next nov 2014Music box startup next nov 2014
Music box startup next nov 2014
 
fall 2015 annual appeal newsletter
fall 2015 annual appeal newsletterfall 2015 annual appeal newsletter
fall 2015 annual appeal newsletter
 
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠN
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠNCÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠN
CÔNG TY TNHH KỸ THUẬT CÔNG NGHỆ NAM SƠN
 
High Performance NodeJS
High Performance NodeJSHigh Performance NodeJS
High Performance NodeJS
 
Golonka
GolonkaGolonka
Golonka
 
Susan Peterson Résumé
Susan Peterson RésuméSusan Peterson Résumé
Susan Peterson Résumé
 

Similar to 12-2015-Meetup

How Open Source / Open Technology Could Help On Your Project
How Open Source / Open Technology Could Help On Your ProjectHow Open Source / Open Technology Could Help On Your Project
How Open Source / Open Technology Could Help On Your ProjectWan Leung Wong
 
Overboard.js - where are we going with with jsconfasia / devfestasia
Overboard.js - where are we going with with jsconfasia / devfestasiaOverboard.js - where are we going with with jsconfasia / devfestasia
Overboard.js - where are we going with with jsconfasia / devfestasiaChristian Heilmann
 
McrFRED talk 25/09/2014
McrFRED talk 25/09/2014McrFRED talk 25/09/2014
McrFRED talk 25/09/2014Jake Smith
 
Stapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San FranciscoStapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San FranciscoChristian Heilmann
 
Deep crawl the chaotic landscape of JavaScript
Deep crawl the chaotic landscape of JavaScript Deep crawl the chaotic landscape of JavaScript
Deep crawl the chaotic landscape of JavaScript Onely
 
Build and Deploy a Python Web App to Amazon in 30 Mins
Build and Deploy a Python Web App to Amazon in 30 MinsBuild and Deploy a Python Web App to Amazon in 30 Mins
Build and Deploy a Python Web App to Amazon in 30 MinsJeff Hull
 
Responsive Web Design and Sitecore
Responsive Web Design and SitecoreResponsive Web Design and Sitecore
Responsive Web Design and SitecoreJohn Eckman
 
The more information Website Design_New.pdf
The more information Website Design_New.pdfThe more information Website Design_New.pdf
The more information Website Design_New.pdfssuser088e5b
 
Distributed: Reinventing the Workplace
Distributed: Reinventing the WorkplaceDistributed: Reinventing the Workplace
Distributed: Reinventing the WorkplaceYoav Farhi
 
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...Jorge Ferreiro
 
Why Your Site is Slow: Performance Answers for Your Clients
Why Your Site is Slow: Performance Answers for Your ClientsWhy Your Site is Slow: Performance Answers for Your Clients
Why Your Site is Slow: Performance Answers for Your ClientsPantheon
 
Web-D club Orientation
Web-D club OrientationWeb-D club Orientation
Web-D club OrientationAbhishek Dutt
 
Single Page Application Development with backbone.js and Simple.Web
Single Page Application Development with backbone.js and Simple.WebSingle Page Application Development with backbone.js and Simple.Web
Single Page Application Development with backbone.js and Simple.WebChris Canal
 
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...SEO Camp Association
 

Similar to 12-2015-Meetup (20)

How Open Source / Open Technology Could Help On Your Project
How Open Source / Open Technology Could Help On Your ProjectHow Open Source / Open Technology Could Help On Your Project
How Open Source / Open Technology Could Help On Your Project
 
Web Design Trends: 2018 Edition
Web Design Trends: 2018 EditionWeb Design Trends: 2018 Edition
Web Design Trends: 2018 Edition
 
Overboard.js - where are we going with with jsconfasia / devfestasia
Overboard.js - where are we going with with jsconfasia / devfestasiaOverboard.js - where are we going with with jsconfasia / devfestasia
Overboard.js - where are we going with with jsconfasia / devfestasia
 
Ice dec05-04-wan leung
Ice dec05-04-wan leungIce dec05-04-wan leung
Ice dec05-04-wan leung
 
McrFRED talk 25/09/2014
McrFRED talk 25/09/2014McrFRED talk 25/09/2014
McrFRED talk 25/09/2014
 
Stapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San FranciscoStapling and patching the web of now - ForwardJS3, San Francisco
Stapling and patching the web of now - ForwardJS3, San Francisco
 
Angular SEO
Angular SEO Angular SEO
Angular SEO
 
Deep crawl the chaotic landscape of JavaScript
Deep crawl the chaotic landscape of JavaScript Deep crawl the chaotic landscape of JavaScript
Deep crawl the chaotic landscape of JavaScript
 
Build and Deploy a Python Web App to Amazon in 30 Mins
Build and Deploy a Python Web App to Amazon in 30 MinsBuild and Deploy a Python Web App to Amazon in 30 Mins
Build and Deploy a Python Web App to Amazon in 30 Mins
 
Responsive Web Design and Sitecore
Responsive Web Design and SitecoreResponsive Web Design and Sitecore
Responsive Web Design and Sitecore
 
The more information Website Design_New.pdf
The more information Website Design_New.pdfThe more information Website Design_New.pdf
The more information Website Design_New.pdf
 
Distributed: Reinventing the Workplace
Distributed: Reinventing the WorkplaceDistributed: Reinventing the Workplace
Distributed: Reinventing the Workplace
 
Asp.net orientation
Asp.net orientationAsp.net orientation
Asp.net orientation
 
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...
Codemotion Progressive Web Applications Pwa Webinar - Jorge Ferreiro - @jgfer...
 
Ember
EmberEmber
Ember
 
Why Your Site is Slow: Performance Answers for Your Clients
Why Your Site is Slow: Performance Answers for Your ClientsWhy Your Site is Slow: Performance Answers for Your Clients
Why Your Site is Slow: Performance Answers for Your Clients
 
Web-D club Orientation
Web-D club OrientationWeb-D club Orientation
Web-D club Orientation
 
Single Page Application Development with backbone.js and Simple.Web
Single Page Application Development with backbone.js and Simple.WebSingle Page Application Development with backbone.js and Simple.Web
Single Page Application Development with backbone.js and Simple.Web
 
Wiu talk
Wiu talkWiu talk
Wiu talk
 
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...
L’importance du crawl du JavaScript : pourquoi, comment et pour quels bénéfic...
 

12-2015-Meetup