SlideShare une entreprise Scribd logo
1  sur  76
Télécharger pour lire hors ligne
Fluentd:
Data streams in Ruby world
@tagomoris
RedDotRubyConf 2014
Day1, 26 June 2014
14年6月26日木曜日
TAGOMORI Satoshi a.k.a. @tagomoris
14年6月26日木曜日
14年6月26日木曜日
14年6月26日木曜日
14年6月26日木曜日
Fluentd
Fluentd is an open source data collector
to simplify log management.
Fluentd is designed to process high-volume data streams reliably. Use cases
include real-time search and monitoring, Big Data analytics, reliable archiving and
more.
http://www.fluentd.org/
14年6月26日木曜日
Before Fluentd:
Access logs Metrics
Archives
apache
nginx
graphs
Amazon S3
Filesystem
tail -f
scp
python
Error handling? Buffering?
14年6月26日木曜日
Before Fluentd:
Access logs Metrics
Analytics
Archives
apache
nginx
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
tail -f
scp
python
ruby
ruby
cmd
Error handling? Buffering?
Routing? API Keys?
14年6月26日木曜日
Before Fluentd:
Access logs
App logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
tail -f
scp
python
ruby
ruby
cmd
file ruby
logger
Error handling? Buffering?
Routing? API Keys? Formats?
14年6月26日木曜日
Before Fluentd:
Access logs
App logs
System logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
tail -f
scp
python
ruby
ruby
cmd
file ruby
logger
Error handling? Buffering?
Routing? API Keys? Formats?
14年6月26日木曜日
Before Fluentd: CHAOS
Access logs
App logs
System logs
Various logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
tail -f
scp
python
ruby
ruby
cmd
file ruby
logger
file
logger
ruby
cmd
ruby
Error handling? Buffering?
Routing? API Keys? Formats?
14年6月26日木曜日
After Fluentd: Controllable
Access logs
App logs
System logs
Various logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
14年6月26日木曜日
Access logs
App logs
System logs
Various logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
Fluentd does:
Format, Buffer, Retry, Route
After Fluentd: Controllable
14年6月26日木曜日
Fluentd
Open source data collector
Written in Ruby, runs on CRuby on UNIX-like OS
With error handling and routing in core
Plugin systems
Input, Output and Buffer (w/ many built-in plugins)
Distributed on rubygems.org
Fluentd and its plugins: gem install fluentd
rpm/deb are also available (td-agent)
14年6月26日木曜日
Why Fluentd?
14年6月26日木曜日
Why Fluentd?
Fluentd’s logo is very cute!
14年6月26日木曜日
He is also very cute...
14年6月26日木曜日
Why Fluentd?
Simple data structure
tag, time and record(hash)
Apache-like configuration syntax
Simple / powerful routing
Many public plugins
Just few steps for custom plugins
Scalability
14年6月26日木曜日
Fluentd Event
app.device.ios
2014-06-24 16:28:50
{
“username”: “tagomoris”,
“fullname”: “TAGOMORI Satoshi”,
“age”: 34,
“device”: “iPhone 5”,
...
}
Event
14年6月26日木曜日
Fluentd Event
app.device.ios
1403512916 (2014-06-23 16:41:56 +0800)
{
“username”: “tagomoris”,
“fullname”: “TAGOMORI Satoshi”,
“age”: 34,
“device”: “iPhone 5”,
...
}
tag
time
record
14年6月26日木曜日
Fluentd Event
app.device.ios
1403512916 (2014-06-23 16:41:56 +0800)
{
“username”: “tagomoris”,
“fullname”: “TAGOMORI Satoshi”,
“age”: 34,
“device”: “iPhone 5”,
...
}
tag for routing
record
structured data
time by unix time
14年6月26日木曜日
# read from a file and parse
<source>
type tail
path /var/log/httpd.log
format apache2
tag web.access
</source>
# logs from client libraries
<source>
type forward
port 24224
</source>
# store logs to MongoDB and S3
<match app.**>
type copy
<store>
type mongo
host mongo.example.com
capped
capped_size 200m
</store>
<store>
type s3
path archive/
</store>
</match>
Fluentd Configuration
14年6月26日木曜日
# read from a file and parse
<source>
type tail
path /var/log/httpd.log
format apache2
tag web.access
</source>
# logs from client libraries
<source>
type forward
port 24224
</source>
# store logs to MongoDB and S3
<match app.**>
type copy
<store>
type mongo
host mongo.example.com
capped
capped_size 200m
</store>
<store>
type s3
path archive/
</store>
</match>
Fluentd Configuration
for input for output
14年6月26日木曜日
# read from a file and parse
source {
type ”tail”
path “/var/log/httpd.log”
format “apache2”
tag ”web.access”
}
# logs from client libraries
source {
type ”forward”
port 24224
}
# store logs to MongoDB and S3
match(“app.**”) {
type ”copy”
store {
type ”mongo”
host “mongo.example.com”
capped
capped_size “200m”
}
store {
type ”s3”
path “archive/”
}
}
Fluentd Configuration DSL
14年6月26日木曜日
Tag based routing
input
input
output
output
input
output
output
core
tag
time
record
web.log
sys.*
app.**
**
14年6月26日木曜日
Tag based routing
input
input
output
output
input
output
output
core
tag
time
record
web.log
sys.*
app.**
**
converted.web.log
14年6月26日木曜日
300+ Public Plugins
access, add, aes-forward, airbrake-python, amazon_sns, amplifier-filter, amqp, amqp2, andon,
anomalydetect, anonymizer, arango, arduino, axlsx, backlog, bigquery, boundio, buffer-
lightening, buffered-filter, buffered-hipchat, buffered-stdout, bufferize, calc, cassandra,
cassandra-cql, cloudstack, cloudwatch, cloudwatch_ya, combiner, conditional_filter, config-
expander, config_pit, config_reloader, convert-value-to-sha, copy_ex, couch, couch-sharded,
couchbase, dashing, data-rejecter, datacalculator, datacounter, dbi, dd, debug, delay-
inspector, delayed, derive, df, droonga, dstat, dummydata-producer, dynamodb, ec2-metadata,
elapsed-time, elasticsearch, elasticsearch-cluster, elasticsearch-ruby, elb-log, embedded-
elasticsearch, eval-filter, event-tail, extract_query_params, file-alternative, file-sprintf,
filter, filter_keys, flatten, flatten-hash, flowcounter, flowcounter-simple, flume,
fnordmetric, forest, fork, format, forward-aws, ftp, gamobile, ganglia, gc, geoip, glusterfs,
graphite, grassland, gree_community, grep, grepcounter, groonga, groupcounter, growl,
growthforecast, gstore, hash-forward, hato, hbase, hekk_redshift, heroku-postgres, heroku-
syslog, hipchat, histogram, hoop, hostname, hrforecast, http-enhanced, http-ex, http-list,
http-status, https-json, idobata, ikachan, imagefile, imkayac, in-udp-event, incremental,
influxdb, influxdb_metrics, inline-classifier, irc, jabber, json-api, json-nest2flat,
jsonbucket, jstat, jubatus, jvmwatcher, kafka, kanicounter, keep-forward, kestrel, kibana-
server, kinesis-alt, latency, leftronic, librato-metrics, loggly, lossycount, mackerel, mail,
map, measure_time, mecab, metricsense, mixi_community, mixpanel, mobile-carrier, mongo,
mongo-typed, mongokpi, mqtt, msgpack-rpc, mssql, multiprocess, munin, mysql, mysql-binlog,
mysql-bulk, mysql-load, mysql-prepared-statement, mysql-query, mysql-replicator,
mysqlslowquery, mysqlslowquerylog, nats, network-probe, nginx-status, nicorepo, norikra,
notifier, numeric-counter, numeric-monitor, onlineuser, openldap-monitor, opentsdb, order,
out-http, out-http-buffered, out-solr, parser, pgdist, pghstore, pgjson, ping-message,
postgres, qqwry, rambler, rawexec, rds-log, rds-slowlog, reassemble, record
http://www.fluentd.org/plugins
14年6月26日木曜日
Fluentd patterns
14年6月26日木曜日
1.
read logs from file
and write these on storages
file in_tail
read, parse
out_file
format, write
file
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse insert
MongoDBout_mongo
https://github.com/fluent/fluent-plugin-mongo
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_mysql
insert
MySQL
https://github.com/tagomoris/fluent-plugin-mysql
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_elasticsearch
send
Elasticsearch
https://github.com/uken/fluent-plugin-elasticsearch
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_webhdfs
format, write
Hadoop HDFS
https://github.com/fluent/fluent-plugin-webhdfs
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_s3
format, write
Amazon S3
https://github.com/fluent/fluent-plugin-s3
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_redshift
insert
Amazon
Redshift
https://github.com/hapyrus/fluent-plugin-redshift
in_tail
14年6月26日木曜日
1.
read logs from file
and write these on storages
file
read, parse
out_bigquery
insert
Google
BigQuery
https://github.com/tagomoris/fluent-plugin-bigquery
in_tail
14年6月26日木曜日
2.
receive and forward data
from/to other node
forward
forward
forward
input
events
input
events
output
events
fluent-logger-ruby
fluent-logger-java
...
send events over TCP
14年6月26日木曜日
2.
receive and forward data
from/to other node
forward
forward
forward
load balance, active-standby forward
forward
forward
14年6月26日木曜日
datacenter
2’.
receive and forward data
from/to other node, over internet & SSL
secure-forward
secure-forward
datacenter
secure-forward
send events over SSL
with authentication
https://github.com/tagomoris/fluent-plugin-secure-forward
14年6月26日木曜日
3.
connect with other middleware
in_syslog
syslog
Flume
Scribe
Kafka
in_flume
in_scribe
in_kafka
out_flume
in_scribe
in_kafka
Flume
Scribe
Kafka
https://github.com/fluent/fluent-plugin-flume
https://github.com/fluent/fluent-plugin-scribe
https://github.com/htgc/fluent-plugin-kafka/
14年6月26日木曜日
4.
copy events
forward copy
forward
webhdfs Hadoop HDFS
14年6月26日木曜日
5.
count events by string values
forward any outputs
count records
by regexp patterns
events
{
“pattern1_count”: 60,
“pattern1_rate” : 1.0,
“pattern2_count”: 20,
“pattern2_rate” : 0.33,
...
}
datacounter
https://github.com/tagomoris/fluent-plugin-datacounter
14年6月26日木曜日
5.
count events by numeric values
forward numeric-counter any outputs
count records
by numerical range
https://github.com/tagomoris/fluent-plugin-numeric-counter
events
{
“pattern1_count”: 60,
“pattern1_rate” : 1.0,
“pattern2_count”: 20,
“pattern2_rate” : 0.33,
...
}
14年6月26日木曜日
5.
aggregate numeric values
forward numeric-monitor any outputs
calculate real-time metrics
of numeric values
{
“max”: 128, “min”: 16,
“avg”: 64.0, “sum”: 1024,
“num”: 20,
“percentile_50”: 48,
“percentile_90”: 112,
...
}
https://github.com/tagomoris/fluent-plugin-numeric-monitor
events
14年6月26日木曜日
6.
various inputs: Linux performance (dstat)
in_dstatdstat
collect server performance data
https://github.com/shun0102/fluent-plugin-dstat
14年6月26日木曜日
6.
various inputs: SQL execution
in_sql
input from SELECT
RDBMS
https://github.com/fluent/fluent-plugin-sql
14年6月26日木曜日
6.
various inputs: external command
in_execany commands
input from STDOUT of any commands
14年6月26日木曜日
7.
various outpus: notification on IRC
out_ikachan
notice on IRC channel
IRC
https://github.com/tagomoris/fluent-plugin-ikachan
14年6月26日木曜日
7.
various outpus: notification on IRC
out_ikachan
notice on IRC channel
IRC
https://github.com/tagomoris/fluent-plugin-ikachan
14:56 ikachan: HTTP status_4xx crit [2014-06-23 14:56:29 +0900] serviceX: 100.00 (threshold
75.0) http://graph.tool.local/view_graph/accesslog/httpstatus/serviceX_4xx_percentage
14:57 kazeburo: ↑ 40x 100%...
14年6月26日木曜日
7.
various outpus: notification on HipChat
out_hipchat
notice on HipChat
HipChat
https://github.com/hotchpotch/fluent-plugin-hipchat
14年6月26日木曜日
7.
various outpus: graph tools
out_growthforecast
POST data into graph tools
GrowthForecast
or
Focuslight
https://github.com/tagomoris/fluent-plugin-growthforecast
14年6月26日木曜日
7.
various outpus
out_growthforecast
POST data into graph tools
GrowthForecast
or
Focuslight
https://github.com/tagomoris/fluent-plugin-growthforecast
14年6月26日木曜日
7.
various outpus: external command
out_exec any commands
output into STDIN of any commands
14年6月26日木曜日
8. filters:
stream processing: external command
any inputs any outputs
format & write
into STDIN
exec_filter
any commands
read & parse
from STDOUT
read from STDIN
do WHATEVER you want
write into STDOUT
ex: tail -f | grep ... | sed ... | cat
events
14年6月26日木曜日
8. filters:
stream processing w/ external server RPC
any inputs any outputs
send
out_norikra
fetch
stream processing w/ SQL
in_norikra
http://norikra.github.io/
SELECT stage, score, COUNT(*) AS c
FROM results.win:time_batch(1 min)
WHERE stage > 1 AND user.valid
GROUP BY stage, score
events
14年6月26日木曜日
... And,
Fluentd does
error handling and retries
for all of these plugins!
14年6月26日木曜日
Before Fluentd: CHAOS
Access logs
App logs
System logs
Various logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
tail -f
scp
python
ruby
ruby
cmd
file ruby
logger
file
logger
ruby
cmd
ruby
14年6月26日木曜日
After Fluentd: Controllable
Access logs
App logs
System logs
Various logs
Metrics
Analytics
Archives
apache
nginx
frontend
backend
syslogd
snmp data
graphs
Hadoop
MySQL
MongoDB
Redshift
Amazon S3
Filesystem
14年6月26日木曜日
Fluentd: Now and then
14年6月26日木曜日
Fluentd versions
Latest: v0.10.50
released on Jun 17, 2014
v0.10.x: Stable versions
many minor feature updates, bug fixes
new features for v1
14年6月26日木曜日
Fluentd v1
Planned as the first major release
someday in 2014 (?)
100% Compatible with v0.10.x
New (and additional) features on v1.x loadmap
https://github.com/fluent/fluentd/issues/251
new configuration syntax, plugin backends
daemon process management
multi core CPU supports
14年6月26日木曜日
Fluentd on JRuby
Under development!
trying to fix Cool.io to support JRuby
14年6月26日木曜日
Fluentd on Windows
Under development!
“windows” branch on github fluent/fluend
14年6月26日木曜日
Use case in LINE
14年6月26日木曜日
Analytics data flow overview
servers Fluentd
Cluster
archive
visualization
notifications
Hadoop
Fluentd
Norikra
application
metrics
14年6月26日木曜日
servers Fluentd
Cluster
archive
visualization
notifications
Hadoop
Fluentd
Norikra
application
metrics
delivery/stream-map
aggregate/stream-reduce
14年6月26日木曜日
archive
visualization
notifications
Hadoop
Norikra
application
metrics
fluent-agent-lite
non-parsed raw logs
non-parsed
access logs
deliver: receive/archive/load-balance
worker:
parse/store/forward
watcher: monitor/notify
cep:
general-purpose
stream processing
14年6月26日木曜日
Fluentd cluster statistics
Fluentd nodes
access/application logs from 600+ nodes
receiver: 5 server (60 process)
parser/converter: 10 server (90 process)
stream processing: 3 server
14年6月26日木曜日
Fluentd cluster statistics
Daily:
5.5+ Billion events, 1.5TB+ data
Peak time:
150,000+ events /sec, 300+ Mbps
14年6月26日木曜日
Fluentd is the best partner
for stream-processing newbies
and rubyists!
Check out sites and code!
http://fluentd.org/
https://github.com/fluent/fluentd
14年6月26日木曜日
FAQ
14年6月26日木曜日
Fault-tolerance?
Node level fault-tolerance
File buffer: processing data can be serialized on disk
Cluster level fault-tolerance
Copy + Forward(load balance, active-standby)
Event level assurance: ACK?
NO (for performance reason)
14年6月26日木曜日
Performance?
NOT SO BAD:
real throughput depends on plugin/configuration
simple event transferring: 10-20k events/sec
14年6月26日木曜日
vs Scribe? vs Flume?
14年6月26日木曜日
vs Storm?
14年6月26日木曜日
Eco-system? Clones?
ik
fluent-agent-lite
fluenpy
14年6月26日木曜日

Contenu connexe

Tendances

Norikra: Stream Processing with SQL
Norikra: Stream Processing with SQLNorikra: Stream Processing with SQL
Norikra: Stream Processing with SQLSATOSHI TAGOMORI
 
TenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingTenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingChen-en Lu
 
Real Time Data Streaming using Kafka & Storm
Real Time Data Streaming using Kafka & StormReal Time Data Streaming using Kafka & Storm
Real Time Data Streaming using Kafka & StormRan Silberman
 
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...SF Big Analytics 20191112: How to performance-tune Spark applications in larg...
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...Chester Chen
 
Spark stream - Kafka
Spark stream - Kafka Spark stream - Kafka
Spark stream - Kafka Dori Waldman
 
HadoopCon 2016 - 用 Jupyter Notebook Hold 住一個上線 Spark Machine Learning 專案實戰
HadoopCon 2016  - 用 Jupyter Notebook Hold 住一個上線 Spark  Machine Learning 專案實戰HadoopCon 2016  - 用 Jupyter Notebook Hold 住一個上線 Spark  Machine Learning 專案實戰
HadoopCon 2016 - 用 Jupyter Notebook Hold 住一個上線 Spark Machine Learning 專案實戰Wayne Chen
 
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...InfluxData
 
SparkR: Enabling Interactive Data Science at Scale on Hadoop
SparkR: Enabling Interactive Data Science at Scale on HadoopSparkR: Enabling Interactive Data Science at Scale on Hadoop
SparkR: Enabling Interactive Data Science at Scale on HadoopDataWorks Summit
 
Building production spark streaming applications
Building production spark streaming applicationsBuilding production spark streaming applications
Building production spark streaming applicationsJoey Echeverria
 
Storm crawler apachecon_na_2015
Storm crawler apachecon_na_2015Storm crawler apachecon_na_2015
Storm crawler apachecon_na_2015ontopic
 
Monitoring Spark Applications
Monitoring Spark ApplicationsMonitoring Spark Applications
Monitoring Spark ApplicationsTzach Zohar
 
Spark Summit EU talk by Steve Loughran
Spark Summit EU talk by Steve LoughranSpark Summit EU talk by Steve Loughran
Spark Summit EU talk by Steve LoughranSpark Summit
 
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...PGConf APAC
 
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...InfluxData
 
ApacheCon 2021 - Apache NiFi Deep Dive 300
ApacheCon 2021 - Apache NiFi Deep Dive 300ApacheCon 2021 - Apache NiFi Deep Dive 300
ApacheCon 2021 - Apache NiFi Deep Dive 300Timothy Spann
 
Memory Management in Apache Spark
Memory Management in Apache SparkMemory Management in Apache Spark
Memory Management in Apache SparkDatabricks
 
Apache Spark Introduction - CloudxLab
Apache Spark Introduction - CloudxLabApache Spark Introduction - CloudxLab
Apache Spark Introduction - CloudxLabAbhinav Singh
 
Managing Apache Spark Workload and Automatic Optimizing
Managing Apache Spark Workload and Automatic OptimizingManaging Apache Spark Workload and Automatic Optimizing
Managing Apache Spark Workload and Automatic OptimizingDatabricks
 
Expand data analysis tool at scale with Zeppelin
Expand data analysis tool at scale with ZeppelinExpand data analysis tool at scale with Zeppelin
Expand data analysis tool at scale with ZeppelinDataWorks Summit
 

Tendances (20)

Norikra: Stream Processing with SQL
Norikra: Stream Processing with SQLNorikra: Stream Processing with SQL
Norikra: Stream Processing with SQL
 
TenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingTenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience Sharing
 
Real Time Data Streaming using Kafka & Storm
Real Time Data Streaming using Kafka & StormReal Time Data Streaming using Kafka & Storm
Real Time Data Streaming using Kafka & Storm
 
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...SF Big Analytics 20191112: How to performance-tune Spark applications in larg...
SF Big Analytics 20191112: How to performance-tune Spark applications in larg...
 
Spark stream - Kafka
Spark stream - Kafka Spark stream - Kafka
Spark stream - Kafka
 
HadoopCon 2016 - 用 Jupyter Notebook Hold 住一個上線 Spark Machine Learning 專案實戰
HadoopCon 2016  - 用 Jupyter Notebook Hold 住一個上線 Spark  Machine Learning 專案實戰HadoopCon 2016  - 用 Jupyter Notebook Hold 住一個上線 Spark  Machine Learning 專案實戰
HadoopCon 2016 - 用 Jupyter Notebook Hold 住一個上線 Spark Machine Learning 專案實戰
 
PySaprk
PySaprkPySaprk
PySaprk
 
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...
RESTful API – How to Consume, Extract, Store and Visualize Data with InfluxDB...
 
SparkR: Enabling Interactive Data Science at Scale on Hadoop
SparkR: Enabling Interactive Data Science at Scale on HadoopSparkR: Enabling Interactive Data Science at Scale on Hadoop
SparkR: Enabling Interactive Data Science at Scale on Hadoop
 
Building production spark streaming applications
Building production spark streaming applicationsBuilding production spark streaming applications
Building production spark streaming applications
 
Storm crawler apachecon_na_2015
Storm crawler apachecon_na_2015Storm crawler apachecon_na_2015
Storm crawler apachecon_na_2015
 
Monitoring Spark Applications
Monitoring Spark ApplicationsMonitoring Spark Applications
Monitoring Spark Applications
 
Spark Summit EU talk by Steve Loughran
Spark Summit EU talk by Steve LoughranSpark Summit EU talk by Steve Loughran
Spark Summit EU talk by Steve Loughran
 
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...
PGConf APAC 2018 - Managing replication clusters with repmgr, Barman and PgBo...
 
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...
Jorge de la Cruz [Veeam Software] | RESTful API – How to Consume, Extract, St...
 
ApacheCon 2021 - Apache NiFi Deep Dive 300
ApacheCon 2021 - Apache NiFi Deep Dive 300ApacheCon 2021 - Apache NiFi Deep Dive 300
ApacheCon 2021 - Apache NiFi Deep Dive 300
 
Memory Management in Apache Spark
Memory Management in Apache SparkMemory Management in Apache Spark
Memory Management in Apache Spark
 
Apache Spark Introduction - CloudxLab
Apache Spark Introduction - CloudxLabApache Spark Introduction - CloudxLab
Apache Spark Introduction - CloudxLab
 
Managing Apache Spark Workload and Automatic Optimizing
Managing Apache Spark Workload and Automatic OptimizingManaging Apache Spark Workload and Automatic Optimizing
Managing Apache Spark Workload and Automatic Optimizing
 
Expand data analysis tool at scale with Zeppelin
Expand data analysis tool at scale with ZeppelinExpand data analysis tool at scale with Zeppelin
Expand data analysis tool at scale with Zeppelin
 

En vedette

Batch processing and Stream processing by SQL
Batch processing and Stream processing by SQLBatch processing and Stream processing by SQL
Batch processing and Stream processing by SQLSATOSHI TAGOMORI
 
Landscape of Norikra features
Landscape of Norikra featuresLandscape of Norikra features
Landscape of Norikra featuresSATOSHI TAGOMORI
 
Dynamic Inventory: no more host lists!
Dynamic Inventory: no more host lists!Dynamic Inventory: no more host lists!
Dynamic Inventory: no more host lists!SATOSHI TAGOMORI
 
8 language deployments on GCE and GAE #gcpja
8 language deployments on GCE and GAE #gcpja8 language deployments on GCE and GAE #gcpja
8 language deployments on GCE and GAE #gcpjaSATOSHI TAGOMORI
 
Log Analysis System And its designs in LINE Corp. 2014 early
Log Analysis System And its designs in LINE Corp. 2014 earlyLog Analysis System And its designs in LINE Corp. 2014 early
Log Analysis System And its designs in LINE Corp. 2014 earlySATOSHI TAGOMORI
 
Stream processing and Norikra
Stream processing and NorikraStream processing and Norikra
Stream processing and NorikraSATOSHI TAGOMORI
 
NorikraのJVMチューンで苦労している話
NorikraのJVMチューンで苦労している話NorikraのJVMチューンで苦労している話
NorikraのJVMチューンで苦労している話SATOSHI TAGOMORI
 
Shib: WebUI tool provides crossover of Hive and MPP
Shib: WebUI tool provides crossover of Hive and MPPShib: WebUI tool provides crossover of Hive and MPP
Shib: WebUI tool provides crossover of Hive and MPPSATOSHI TAGOMORI
 

En vedette (8)

Batch processing and Stream processing by SQL
Batch processing and Stream processing by SQLBatch processing and Stream processing by SQL
Batch processing and Stream processing by SQL
 
Landscape of Norikra features
Landscape of Norikra featuresLandscape of Norikra features
Landscape of Norikra features
 
Dynamic Inventory: no more host lists!
Dynamic Inventory: no more host lists!Dynamic Inventory: no more host lists!
Dynamic Inventory: no more host lists!
 
8 language deployments on GCE and GAE #gcpja
8 language deployments on GCE and GAE #gcpja8 language deployments on GCE and GAE #gcpja
8 language deployments on GCE and GAE #gcpja
 
Log Analysis System And its designs in LINE Corp. 2014 early
Log Analysis System And its designs in LINE Corp. 2014 earlyLog Analysis System And its designs in LINE Corp. 2014 early
Log Analysis System And its designs in LINE Corp. 2014 early
 
Stream processing and Norikra
Stream processing and NorikraStream processing and Norikra
Stream processing and Norikra
 
NorikraのJVMチューンで苦労している話
NorikraのJVMチューンで苦労している話NorikraのJVMチューンで苦労している話
NorikraのJVMチューンで苦労している話
 
Shib: WebUI tool provides crossover of Hive and MPP
Shib: WebUI tool provides crossover of Hive and MPPShib: WebUI tool provides crossover of Hive and MPP
Shib: WebUI tool provides crossover of Hive and MPP
 

Similaire à Fluentd: Data streams in Ruby world #rdrc2014

Linked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaLinked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaThomas Kurz
 
OSS EU: Deep Dive into Building Streaming Applications with Apache Pulsar
OSS EU:  Deep Dive into Building Streaming Applications with Apache PulsarOSS EU:  Deep Dive into Building Streaming Applications with Apache Pulsar
OSS EU: Deep Dive into Building Streaming Applications with Apache PulsarTimothy Spann
 
Pulsar summit asia 2021 apache pulsar with mqtt for edge computing
Pulsar summit asia 2021   apache pulsar with mqtt for edge computingPulsar summit asia 2021   apache pulsar with mqtt for edge computing
Pulsar summit asia 2021 apache pulsar with mqtt for edge computingTimothy Spann
 
Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Databricks
 
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...Data Con LA
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkTimothy Spann
 
Integrating ChatGPT with Apache Airflow
Integrating ChatGPT with Apache AirflowIntegrating ChatGPT with Apache Airflow
Integrating ChatGPT with Apache AirflowTatiana Al-Chueyr
 
Native support of Prometheus monitoring in Apache Spark 3
Native support of Prometheus monitoring in Apache Spark 3Native support of Prometheus monitoring in Apache Spark 3
Native support of Prometheus monitoring in Apache Spark 3Dongjoon Hyun
 
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...Guido Schmutz
 
Use drupal 8 as a framework the romance recalibration
Use drupal 8 as a framework   the romance recalibrationUse drupal 8 as a framework   the romance recalibration
Use drupal 8 as a framework the romance recalibrationKevin Wenger
 
Dok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on KubernetesDok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on KubernetesDoKC
 
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021StreamNative
 
Austin Data Meetup 092014 - Spark
Austin Data Meetup 092014 - SparkAustin Data Meetup 092014 - Spark
Austin Data Meetup 092014 - SparkSteve Blackmon
 
Camel and JBoss
Camel and JBossCamel and JBoss
Camel and JBossJBug Italy
 
Fluentd Project Intro at Kubecon 2019 EU
Fluentd Project Intro at Kubecon 2019 EUFluentd Project Intro at Kubecon 2019 EU
Fluentd Project Intro at Kubecon 2019 EUN Masahiro
 
Kafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Kafka Connect & Kafka Streams/KSQL - the ecosystem around KafkaKafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Kafka Connect & Kafka Streams/KSQL - the ecosystem around KafkaGuido Schmutz
 
REST in Peace. Long live gRPC!
REST in Peace. Long live gRPC!REST in Peace. Long live gRPC!
REST in Peace. Long live gRPC!QAware GmbH
 
CoC23_Utilizing Real-Time Transit Data for Travel Optimization
CoC23_Utilizing Real-Time Transit Data for Travel OptimizationCoC23_Utilizing Real-Time Transit Data for Travel Optimization
CoC23_Utilizing Real-Time Transit Data for Travel OptimizationTimothy Spann
 

Similaire à Fluentd: Data streams in Ruby world #rdrc2014 (20)

Linked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaLinked Media Management with Apache Marmotta
Linked Media Management with Apache Marmotta
 
Fluentd meetup #2
Fluentd meetup #2Fluentd meetup #2
Fluentd meetup #2
 
OSS EU: Deep Dive into Building Streaming Applications with Apache Pulsar
OSS EU:  Deep Dive into Building Streaming Applications with Apache PulsarOSS EU:  Deep Dive into Building Streaming Applications with Apache Pulsar
OSS EU: Deep Dive into Building Streaming Applications with Apache Pulsar
 
The basics of fluentd
The basics of fluentdThe basics of fluentd
The basics of fluentd
 
Pulsar summit asia 2021 apache pulsar with mqtt for edge computing
Pulsar summit asia 2021   apache pulsar with mqtt for edge computingPulsar summit asia 2021   apache pulsar with mqtt for edge computing
Pulsar summit asia 2021 apache pulsar with mqtt for edge computing
 
Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0
 
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and Flink
 
Integrating ChatGPT with Apache Airflow
Integrating ChatGPT with Apache AirflowIntegrating ChatGPT with Apache Airflow
Integrating ChatGPT with Apache Airflow
 
Native support of Prometheus monitoring in Apache Spark 3
Native support of Prometheus monitoring in Apache Spark 3Native support of Prometheus monitoring in Apache Spark 3
Native support of Prometheus monitoring in Apache Spark 3
 
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...
Spark (Structured) Streaming vs. Kafka Streams - two stream processing platfo...
 
Use drupal 8 as a framework the romance recalibration
Use drupal 8 as a framework   the romance recalibrationUse drupal 8 as a framework   the romance recalibration
Use drupal 8 as a framework the romance recalibration
 
Dok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on KubernetesDok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on Kubernetes
 
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
 
Austin Data Meetup 092014 - Spark
Austin Data Meetup 092014 - SparkAustin Data Meetup 092014 - Spark
Austin Data Meetup 092014 - Spark
 
Camel and JBoss
Camel and JBossCamel and JBoss
Camel and JBoss
 
Fluentd Project Intro at Kubecon 2019 EU
Fluentd Project Intro at Kubecon 2019 EUFluentd Project Intro at Kubecon 2019 EU
Fluentd Project Intro at Kubecon 2019 EU
 
Kafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Kafka Connect & Kafka Streams/KSQL - the ecosystem around KafkaKafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Kafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
 
REST in Peace. Long live gRPC!
REST in Peace. Long live gRPC!REST in Peace. Long live gRPC!
REST in Peace. Long live gRPC!
 
CoC23_Utilizing Real-Time Transit Data for Travel Optimization
CoC23_Utilizing Real-Time Transit Data for Travel OptimizationCoC23_Utilizing Real-Time Transit Data for Travel Optimization
CoC23_Utilizing Real-Time Transit Data for Travel Optimization
 

Plus de SATOSHI TAGOMORI

Ractor's speed is not light-speed
Ractor's speed is not light-speedRactor's speed is not light-speed
Ractor's speed is not light-speedSATOSHI TAGOMORI
 
Good Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/OperationsGood Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/OperationsSATOSHI TAGOMORI
 
Invitation to the dark side of Ruby
Invitation to the dark side of RubyInvitation to the dark side of Ruby
Invitation to the dark side of RubySATOSHI TAGOMORI
 
Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)SATOSHI TAGOMORI
 
Make Your Ruby Script Confusing
Make Your Ruby Script ConfusingMake Your Ruby Script Confusing
Make Your Ruby Script ConfusingSATOSHI TAGOMORI
 
Hijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in RubyHijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in RubySATOSHI TAGOMORI
 
Lock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive OperationsLock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive OperationsSATOSHI TAGOMORI
 
Data Processing and Ruby in the World
Data Processing and Ruby in the WorldData Processing and Ruby in the World
Data Processing and Ruby in the WorldSATOSHI TAGOMORI
 
Planet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: BigdamPlanet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: BigdamSATOSHI TAGOMORI
 
Technologies, Data Analytics Service and Enterprise Business
Technologies, Data Analytics Service and Enterprise BusinessTechnologies, Data Analytics Service and Enterprise Business
Technologies, Data Analytics Service and Enterprise BusinessSATOSHI TAGOMORI
 
Ruby and Distributed Storage Systems
Ruby and Distributed Storage SystemsRuby and Distributed Storage Systems
Ruby and Distributed Storage SystemsSATOSHI TAGOMORI
 
Perfect Norikra 2nd Season
Perfect Norikra 2nd SeasonPerfect Norikra 2nd Season
Perfect Norikra 2nd SeasonSATOSHI TAGOMORI
 
To Have Own Data Analytics Platform, Or NOT To
To Have Own Data Analytics Platform, Or NOT ToTo Have Own Data Analytics Platform, Or NOT To
To Have Own Data Analytics Platform, Or NOT ToSATOSHI TAGOMORI
 
The Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and ContainersThe Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and ContainersSATOSHI TAGOMORI
 
How To Write Middleware In Ruby
How To Write Middleware In RubyHow To Write Middleware In Ruby
How To Write Middleware In RubySATOSHI TAGOMORI
 
Modern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real WorldModern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real WorldSATOSHI TAGOMORI
 
Open Source Software, Distributed Systems, Database as a Cloud Service
Open Source Software, Distributed Systems, Database as a Cloud ServiceOpen Source Software, Distributed Systems, Database as a Cloud Service
Open Source Software, Distributed Systems, Database as a Cloud ServiceSATOSHI TAGOMORI
 
Fluentd Overview, Now and Then
Fluentd Overview, Now and ThenFluentd Overview, Now and Then
Fluentd Overview, Now and ThenSATOSHI TAGOMORI
 

Plus de SATOSHI TAGOMORI (20)

Ractor's speed is not light-speed
Ractor's speed is not light-speedRactor's speed is not light-speed
Ractor's speed is not light-speed
 
Good Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/OperationsGood Things and Hard Things of SaaS Development/Operations
Good Things and Hard Things of SaaS Development/Operations
 
Maccro Strikes Back
Maccro Strikes BackMaccro Strikes Back
Maccro Strikes Back
 
Invitation to the dark side of Ruby
Invitation to the dark side of RubyInvitation to the dark side of Ruby
Invitation to the dark side of Ruby
 
Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)Hijacking Ruby Syntax in Ruby (RubyConf 2018)
Hijacking Ruby Syntax in Ruby (RubyConf 2018)
 
Make Your Ruby Script Confusing
Make Your Ruby Script ConfusingMake Your Ruby Script Confusing
Make Your Ruby Script Confusing
 
Hijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in RubyHijacking Ruby Syntax in Ruby
Hijacking Ruby Syntax in Ruby
 
Lock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive OperationsLock, Concurrency and Throughput of Exclusive Operations
Lock, Concurrency and Throughput of Exclusive Operations
 
Data Processing and Ruby in the World
Data Processing and Ruby in the WorldData Processing and Ruby in the World
Data Processing and Ruby in the World
 
Planet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: BigdamPlanet-scale Data Ingestion Pipeline: Bigdam
Planet-scale Data Ingestion Pipeline: Bigdam
 
Technologies, Data Analytics Service and Enterprise Business
Technologies, Data Analytics Service and Enterprise BusinessTechnologies, Data Analytics Service and Enterprise Business
Technologies, Data Analytics Service and Enterprise Business
 
Ruby and Distributed Storage Systems
Ruby and Distributed Storage SystemsRuby and Distributed Storage Systems
Ruby and Distributed Storage Systems
 
Perfect Norikra 2nd Season
Perfect Norikra 2nd SeasonPerfect Norikra 2nd Season
Perfect Norikra 2nd Season
 
Fluentd 101
Fluentd 101Fluentd 101
Fluentd 101
 
To Have Own Data Analytics Platform, Or NOT To
To Have Own Data Analytics Platform, Or NOT ToTo Have Own Data Analytics Platform, Or NOT To
To Have Own Data Analytics Platform, Or NOT To
 
The Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and ContainersThe Patterns of Distributed Logging and Containers
The Patterns of Distributed Logging and Containers
 
How To Write Middleware In Ruby
How To Write Middleware In RubyHow To Write Middleware In Ruby
How To Write Middleware In Ruby
 
Modern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real WorldModern Black Mages Fighting in the Real World
Modern Black Mages Fighting in the Real World
 
Open Source Software, Distributed Systems, Database as a Cloud Service
Open Source Software, Distributed Systems, Database as a Cloud ServiceOpen Source Software, Distributed Systems, Database as a Cloud Service
Open Source Software, Distributed Systems, Database as a Cloud Service
 
Fluentd Overview, Now and Then
Fluentd Overview, Now and ThenFluentd Overview, Now and Then
Fluentd Overview, Now and Then
 

Dernier

Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptrcbcrtm
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 

Dernier (20)

Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.ppt
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 

Fluentd: Data streams in Ruby world #rdrc2014

  • 1. Fluentd: Data streams in Ruby world @tagomoris RedDotRubyConf 2014 Day1, 26 June 2014 14年6月26日木曜日
  • 2. TAGOMORI Satoshi a.k.a. @tagomoris 14年6月26日木曜日
  • 6. Fluentd Fluentd is an open source data collector to simplify log management. Fluentd is designed to process high-volume data streams reliably. Use cases include real-time search and monitoring, Big Data analytics, reliable archiving and more. http://www.fluentd.org/ 14年6月26日木曜日
  • 7. Before Fluentd: Access logs Metrics Archives apache nginx graphs Amazon S3 Filesystem tail -f scp python Error handling? Buffering? 14年6月26日木曜日
  • 8. Before Fluentd: Access logs Metrics Analytics Archives apache nginx graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem tail -f scp python ruby ruby cmd Error handling? Buffering? Routing? API Keys? 14年6月26日木曜日
  • 9. Before Fluentd: Access logs App logs Metrics Analytics Archives apache nginx frontend backend graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem tail -f scp python ruby ruby cmd file ruby logger Error handling? Buffering? Routing? API Keys? Formats? 14年6月26日木曜日
  • 10. Before Fluentd: Access logs App logs System logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem tail -f scp python ruby ruby cmd file ruby logger Error handling? Buffering? Routing? API Keys? Formats? 14年6月26日木曜日
  • 11. Before Fluentd: CHAOS Access logs App logs System logs Various logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem tail -f scp python ruby ruby cmd file ruby logger file logger ruby cmd ruby Error handling? Buffering? Routing? API Keys? Formats? 14年6月26日木曜日
  • 12. After Fluentd: Controllable Access logs App logs System logs Various logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem 14年6月26日木曜日
  • 13. Access logs App logs System logs Various logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem Fluentd does: Format, Buffer, Retry, Route After Fluentd: Controllable 14年6月26日木曜日
  • 14. Fluentd Open source data collector Written in Ruby, runs on CRuby on UNIX-like OS With error handling and routing in core Plugin systems Input, Output and Buffer (w/ many built-in plugins) Distributed on rubygems.org Fluentd and its plugins: gem install fluentd rpm/deb are also available (td-agent) 14年6月26日木曜日
  • 16. Why Fluentd? Fluentd’s logo is very cute! 14年6月26日木曜日
  • 17. He is also very cute... 14年6月26日木曜日
  • 18. Why Fluentd? Simple data structure tag, time and record(hash) Apache-like configuration syntax Simple / powerful routing Many public plugins Just few steps for custom plugins Scalability 14年6月26日木曜日
  • 19. Fluentd Event app.device.ios 2014-06-24 16:28:50 { “username”: “tagomoris”, “fullname”: “TAGOMORI Satoshi”, “age”: 34, “device”: “iPhone 5”, ... } Event 14年6月26日木曜日
  • 20. Fluentd Event app.device.ios 1403512916 (2014-06-23 16:41:56 +0800) { “username”: “tagomoris”, “fullname”: “TAGOMORI Satoshi”, “age”: 34, “device”: “iPhone 5”, ... } tag time record 14年6月26日木曜日
  • 21. Fluentd Event app.device.ios 1403512916 (2014-06-23 16:41:56 +0800) { “username”: “tagomoris”, “fullname”: “TAGOMORI Satoshi”, “age”: 34, “device”: “iPhone 5”, ... } tag for routing record structured data time by unix time 14年6月26日木曜日
  • 22. # read from a file and parse <source> type tail path /var/log/httpd.log format apache2 tag web.access </source> # logs from client libraries <source> type forward port 24224 </source> # store logs to MongoDB and S3 <match app.**> type copy <store> type mongo host mongo.example.com capped capped_size 200m </store> <store> type s3 path archive/ </store> </match> Fluentd Configuration 14年6月26日木曜日
  • 23. # read from a file and parse <source> type tail path /var/log/httpd.log format apache2 tag web.access </source> # logs from client libraries <source> type forward port 24224 </source> # store logs to MongoDB and S3 <match app.**> type copy <store> type mongo host mongo.example.com capped capped_size 200m </store> <store> type s3 path archive/ </store> </match> Fluentd Configuration for input for output 14年6月26日木曜日
  • 24. # read from a file and parse source { type ”tail” path “/var/log/httpd.log” format “apache2” tag ”web.access” } # logs from client libraries source { type ”forward” port 24224 } # store logs to MongoDB and S3 match(“app.**”) { type ”copy” store { type ”mongo” host “mongo.example.com” capped capped_size “200m” } store { type ”s3” path “archive/” } } Fluentd Configuration DSL 14年6月26日木曜日
  • 27. 300+ Public Plugins access, add, aes-forward, airbrake-python, amazon_sns, amplifier-filter, amqp, amqp2, andon, anomalydetect, anonymizer, arango, arduino, axlsx, backlog, bigquery, boundio, buffer- lightening, buffered-filter, buffered-hipchat, buffered-stdout, bufferize, calc, cassandra, cassandra-cql, cloudstack, cloudwatch, cloudwatch_ya, combiner, conditional_filter, config- expander, config_pit, config_reloader, convert-value-to-sha, copy_ex, couch, couch-sharded, couchbase, dashing, data-rejecter, datacalculator, datacounter, dbi, dd, debug, delay- inspector, delayed, derive, df, droonga, dstat, dummydata-producer, dynamodb, ec2-metadata, elapsed-time, elasticsearch, elasticsearch-cluster, elasticsearch-ruby, elb-log, embedded- elasticsearch, eval-filter, event-tail, extract_query_params, file-alternative, file-sprintf, filter, filter_keys, flatten, flatten-hash, flowcounter, flowcounter-simple, flume, fnordmetric, forest, fork, format, forward-aws, ftp, gamobile, ganglia, gc, geoip, glusterfs, graphite, grassland, gree_community, grep, grepcounter, groonga, groupcounter, growl, growthforecast, gstore, hash-forward, hato, hbase, hekk_redshift, heroku-postgres, heroku- syslog, hipchat, histogram, hoop, hostname, hrforecast, http-enhanced, http-ex, http-list, http-status, https-json, idobata, ikachan, imagefile, imkayac, in-udp-event, incremental, influxdb, influxdb_metrics, inline-classifier, irc, jabber, json-api, json-nest2flat, jsonbucket, jstat, jubatus, jvmwatcher, kafka, kanicounter, keep-forward, kestrel, kibana- server, kinesis-alt, latency, leftronic, librato-metrics, loggly, lossycount, mackerel, mail, map, measure_time, mecab, metricsense, mixi_community, mixpanel, mobile-carrier, mongo, mongo-typed, mongokpi, mqtt, msgpack-rpc, mssql, multiprocess, munin, mysql, mysql-binlog, mysql-bulk, mysql-load, mysql-prepared-statement, mysql-query, mysql-replicator, mysqlslowquery, mysqlslowquerylog, nats, network-probe, nginx-status, nicorepo, norikra, notifier, numeric-counter, numeric-monitor, onlineuser, openldap-monitor, opentsdb, order, out-http, out-http-buffered, out-solr, parser, pgdist, pghstore, pgjson, ping-message, postgres, qqwry, rambler, rawexec, rds-log, rds-slowlog, reassemble, record http://www.fluentd.org/plugins 14年6月26日木曜日
  • 29. 1. read logs from file and write these on storages file in_tail read, parse out_file format, write file 14年6月26日木曜日
  • 30. 1. read logs from file and write these on storages file read, parse insert MongoDBout_mongo https://github.com/fluent/fluent-plugin-mongo in_tail 14年6月26日木曜日
  • 31. 1. read logs from file and write these on storages file read, parse out_mysql insert MySQL https://github.com/tagomoris/fluent-plugin-mysql in_tail 14年6月26日木曜日
  • 32. 1. read logs from file and write these on storages file read, parse out_elasticsearch send Elasticsearch https://github.com/uken/fluent-plugin-elasticsearch in_tail 14年6月26日木曜日
  • 33. 1. read logs from file and write these on storages file read, parse out_webhdfs format, write Hadoop HDFS https://github.com/fluent/fluent-plugin-webhdfs in_tail 14年6月26日木曜日
  • 34. 1. read logs from file and write these on storages file read, parse out_s3 format, write Amazon S3 https://github.com/fluent/fluent-plugin-s3 in_tail 14年6月26日木曜日
  • 35. 1. read logs from file and write these on storages file read, parse out_redshift insert Amazon Redshift https://github.com/hapyrus/fluent-plugin-redshift in_tail 14年6月26日木曜日
  • 36. 1. read logs from file and write these on storages file read, parse out_bigquery insert Google BigQuery https://github.com/tagomoris/fluent-plugin-bigquery in_tail 14年6月26日木曜日
  • 37. 2. receive and forward data from/to other node forward forward forward input events input events output events fluent-logger-ruby fluent-logger-java ... send events over TCP 14年6月26日木曜日
  • 38. 2. receive and forward data from/to other node forward forward forward load balance, active-standby forward forward forward 14年6月26日木曜日
  • 39. datacenter 2’. receive and forward data from/to other node, over internet & SSL secure-forward secure-forward datacenter secure-forward send events over SSL with authentication https://github.com/tagomoris/fluent-plugin-secure-forward 14年6月26日木曜日
  • 40. 3. connect with other middleware in_syslog syslog Flume Scribe Kafka in_flume in_scribe in_kafka out_flume in_scribe in_kafka Flume Scribe Kafka https://github.com/fluent/fluent-plugin-flume https://github.com/fluent/fluent-plugin-scribe https://github.com/htgc/fluent-plugin-kafka/ 14年6月26日木曜日
  • 41. 4. copy events forward copy forward webhdfs Hadoop HDFS 14年6月26日木曜日
  • 42. 5. count events by string values forward any outputs count records by regexp patterns events { “pattern1_count”: 60, “pattern1_rate” : 1.0, “pattern2_count”: 20, “pattern2_rate” : 0.33, ... } datacounter https://github.com/tagomoris/fluent-plugin-datacounter 14年6月26日木曜日
  • 43. 5. count events by numeric values forward numeric-counter any outputs count records by numerical range https://github.com/tagomoris/fluent-plugin-numeric-counter events { “pattern1_count”: 60, “pattern1_rate” : 1.0, “pattern2_count”: 20, “pattern2_rate” : 0.33, ... } 14年6月26日木曜日
  • 44. 5. aggregate numeric values forward numeric-monitor any outputs calculate real-time metrics of numeric values { “max”: 128, “min”: 16, “avg”: 64.0, “sum”: 1024, “num”: 20, “percentile_50”: 48, “percentile_90”: 112, ... } https://github.com/tagomoris/fluent-plugin-numeric-monitor events 14年6月26日木曜日
  • 45. 6. various inputs: Linux performance (dstat) in_dstatdstat collect server performance data https://github.com/shun0102/fluent-plugin-dstat 14年6月26日木曜日
  • 46. 6. various inputs: SQL execution in_sql input from SELECT RDBMS https://github.com/fluent/fluent-plugin-sql 14年6月26日木曜日
  • 47. 6. various inputs: external command in_execany commands input from STDOUT of any commands 14年6月26日木曜日
  • 48. 7. various outpus: notification on IRC out_ikachan notice on IRC channel IRC https://github.com/tagomoris/fluent-plugin-ikachan 14年6月26日木曜日
  • 49. 7. various outpus: notification on IRC out_ikachan notice on IRC channel IRC https://github.com/tagomoris/fluent-plugin-ikachan 14:56 ikachan: HTTP status_4xx crit [2014-06-23 14:56:29 +0900] serviceX: 100.00 (threshold 75.0) http://graph.tool.local/view_graph/accesslog/httpstatus/serviceX_4xx_percentage 14:57 kazeburo: ↑ 40x 100%... 14年6月26日木曜日
  • 50. 7. various outpus: notification on HipChat out_hipchat notice on HipChat HipChat https://github.com/hotchpotch/fluent-plugin-hipchat 14年6月26日木曜日
  • 51. 7. various outpus: graph tools out_growthforecast POST data into graph tools GrowthForecast or Focuslight https://github.com/tagomoris/fluent-plugin-growthforecast 14年6月26日木曜日
  • 52. 7. various outpus out_growthforecast POST data into graph tools GrowthForecast or Focuslight https://github.com/tagomoris/fluent-plugin-growthforecast 14年6月26日木曜日
  • 53. 7. various outpus: external command out_exec any commands output into STDIN of any commands 14年6月26日木曜日
  • 54. 8. filters: stream processing: external command any inputs any outputs format & write into STDIN exec_filter any commands read & parse from STDOUT read from STDIN do WHATEVER you want write into STDOUT ex: tail -f | grep ... | sed ... | cat events 14年6月26日木曜日
  • 55. 8. filters: stream processing w/ external server RPC any inputs any outputs send out_norikra fetch stream processing w/ SQL in_norikra http://norikra.github.io/ SELECT stage, score, COUNT(*) AS c FROM results.win:time_batch(1 min) WHERE stage > 1 AND user.valid GROUP BY stage, score events 14年6月26日木曜日
  • 56. ... And, Fluentd does error handling and retries for all of these plugins! 14年6月26日木曜日
  • 57. Before Fluentd: CHAOS Access logs App logs System logs Various logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem tail -f scp python ruby ruby cmd file ruby logger file logger ruby cmd ruby 14年6月26日木曜日
  • 58. After Fluentd: Controllable Access logs App logs System logs Various logs Metrics Analytics Archives apache nginx frontend backend syslogd snmp data graphs Hadoop MySQL MongoDB Redshift Amazon S3 Filesystem 14年6月26日木曜日
  • 59. Fluentd: Now and then 14年6月26日木曜日
  • 60. Fluentd versions Latest: v0.10.50 released on Jun 17, 2014 v0.10.x: Stable versions many minor feature updates, bug fixes new features for v1 14年6月26日木曜日
  • 61. Fluentd v1 Planned as the first major release someday in 2014 (?) 100% Compatible with v0.10.x New (and additional) features on v1.x loadmap https://github.com/fluent/fluentd/issues/251 new configuration syntax, plugin backends daemon process management multi core CPU supports 14年6月26日木曜日
  • 62. Fluentd on JRuby Under development! trying to fix Cool.io to support JRuby 14年6月26日木曜日
  • 63. Fluentd on Windows Under development! “windows” branch on github fluent/fluend 14年6月26日木曜日
  • 64. Use case in LINE 14年6月26日木曜日
  • 65. Analytics data flow overview servers Fluentd Cluster archive visualization notifications Hadoop Fluentd Norikra application metrics 14年6月26日木曜日
  • 67. archive visualization notifications Hadoop Norikra application metrics fluent-agent-lite non-parsed raw logs non-parsed access logs deliver: receive/archive/load-balance worker: parse/store/forward watcher: monitor/notify cep: general-purpose stream processing 14年6月26日木曜日
  • 68. Fluentd cluster statistics Fluentd nodes access/application logs from 600+ nodes receiver: 5 server (60 process) parser/converter: 10 server (90 process) stream processing: 3 server 14年6月26日木曜日
  • 69. Fluentd cluster statistics Daily: 5.5+ Billion events, 1.5TB+ data Peak time: 150,000+ events /sec, 300+ Mbps 14年6月26日木曜日
  • 70. Fluentd is the best partner for stream-processing newbies and rubyists! Check out sites and code! http://fluentd.org/ https://github.com/fluent/fluentd 14年6月26日木曜日
  • 72. Fault-tolerance? Node level fault-tolerance File buffer: processing data can be serialized on disk Cluster level fault-tolerance Copy + Forward(load balance, active-standby) Event level assurance: ACK? NO (for performance reason) 14年6月26日木曜日
  • 73. Performance? NOT SO BAD: real throughput depends on plugin/configuration simple event transferring: 10-20k events/sec 14年6月26日木曜日
  • 74. vs Scribe? vs Flume? 14年6月26日木曜日