Inside Buzztter


  1. RubyConf 2009, 2009-11-19. Inside buzztter. 設樂 洋爾 (Yoji Shidara), yoji.shidara@enishi-tech.com
  2. My name is Yoji SHIDARA.
  3. Enishi Tech Inc. Small company. Only 2 staff members.
  4. Small office. (CEO's flat)
  5. I'm from Sapporo! (@darashi is me.) Sapporo to Tokyo: 1.4 hrs. Tokyo to San Francisco: 9.5 hrs.
  6. Sapporo is a beautiful provincial city of Japan.
  7. Delicious foods are waiting for you!
  8. Do you know buzztter.com?
  9. RubyConf 2009, 2009-11-19, LT. Outside buzztter. 設樂 洋爾 (Yoji Shidara), yoji.shidara@enishi-tech.com
  10. http://buzztter.com
  11. http://buzztter.com: buzz-phrases (auto update); full-text search results (auto update)
  12. Servers are in my room! Next to my bed.
  13. We crawl: 1,932,823 tweets/day (Japanese), 607,749 tweets/day (English). (We're now focusing on Japanese users due to API limits.) We have: 1,336,444 PVs/month.
  14. Server 1: Phenom 9950, 8 GB memory, 80 GB SSD + 80 GB HDD. Server 2: Phenom II X4 955, 8 GB memory, 500 GB HDD.
  15. @buzztter
  16. The bot for English was banned because it tweeted spam as HOT :-<
  17. Buzztter:
  18. buzz-phrase detection + real-time full-text search
  19. ... like the official "trending topics"
  20. buzztter was released before the official one launched!
  21. buzztter supports English and Japanese!
  22. 1-2 minutes for buzz analysis per language.
  23. HOT notification bot is available.
  24. RubyConf 2009, 2009-11-19, LT. Inside buzztter. 設樂 洋爾 (Yoji Shidara), yoji.shidara@enishi-tech.com
  25. System Diagram: Rough Sketch. Components shown: crawler, Analyzer module, Fulltext Search module, Web Frontend, Notification module. BuzzDAS: Buzz Data Analysis System.
  26. System Diagram: Crawler. Labels shown: Twitter; crawler (Typhoeus, EventMachine); MQ (RabbitMQ); crawl controller (crawl scheduling); language guesser (libtextcat); Users DB (PostgreSQL; language, post frequency).
  27. System Diagram. Labels shown: crawler; MQ; importer; analyzer; fulltext search engine (groonga); Web Service API (Sinatra); memcached; Web Frontend (Rails); page cache (varnish); notifier; bot (Twitter4R, net/irc). BuzzDAS: Buzz Data Analysis System.
  28. Analyzer: tokenizer, phrase extractor; detects change of phrase occurrence. Reference Index keeps posts from the last 24 hours. Recent Index keeps the most recent 1 hour.
  29. Try http://buzztter.com and give me your feedback, please! If you are interested in these keywords: groonga, AMQP, RabbitMQ, Typhoeus, EventMachine, PostgreSQL, libtextcat, Sinatra, Rails, Twitter4R, net/irc, memcached, PrefixSpan, MeCab, TF-IDF, ... or our BuzzDAS Engine, please contact me! My name is Yoji Shidara.
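
The analyzer slide says only that buzz is detected from a change in phrase occurrence between a 24-hour reference index and a 1-hour recent index. The Ruby sketch below is one possible reading of that idea, not the BuzzDAS implementation: it flags a token when its count in the last hour is well above its average hourly count over the last 24 hours. The class name `BuzzDetector`, the thresholds `min_ratio` and `min_count`, the whitespace tokenizer, and the use of single words instead of extracted phrases are all assumptions made for illustration. The slides list MeCab, PrefixSpan, and TF-IDF as keywords but do not say how they are used.

```ruby
# Hypothetical sketch of recent-vs-reference buzz detection.
# Only the two window sizes come from the slides; everything else is assumed.
class BuzzDetector
  RECENT_WINDOW = 60 * 60          # recent index: last 1 hour (seconds)
  REFERENCE_WINDOW = 24 * 60 * 60  # reference index: last 24 hours (seconds)

  def initialize
    @posts = [] # each entry is [Time, array of tokens]
  end

  # Store one post with its timestamp.
  def add(text, at)
    @posts << [at, tokenize(text)]
    self
  end

  # Return [token, ratio] pairs, highest ratio first. The ratio compares the
  # token's count in the recent window with its average count per hour over
  # the reference window (which includes the recent window).
  def buzz_phrases(now, min_ratio: 3.0, min_count: 2)
    recent = Hash.new(0)
    reference = Hash.new(0)

    @posts.each do |at, tokens|
      age = now - at
      next if age.negative? || age > REFERENCE_WINDOW

      # Count each token once per post so one repetitive post cannot dominate.
      tokens.uniq.each do |token|
        reference[token] += 1
        recent[token] += 1 if age <= RECENT_WINDOW
      end
    end

    hours = REFERENCE_WINDOW / RECENT_WINDOW.to_f # 24.0
    scored = recent.map { |token, count| [token, count * hours / reference[token]] }

    scored.select { |token, ratio| recent[token] >= min_count && ratio >= min_ratio }
          .sort_by { |token, ratio| [-ratio, token] }
  end

  private

  # Stand-in tokenizer: lowercase and split on whitespace. Japanese text
  # would need a morphological analyzer instead.
  def tokenize(text)
    text.downcase.split
  end
end
```

With this scoring, a word posted steadily once an hour gets a ratio near 1, while a word that appears only in the last hour reaches the maximum ratio of 24. A production system would also expire old posts rather than rescanning all of them on every call.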