Personal Information
Company/Workplace
San Francisco, CA United States
Profession
Senior Data Engineer at Workday
Industry
Technology / Software / Internet
Website
github.com/erenavsarogullari
About
Eren is a highly motivated senior software developer and an enthusiast of JVM-based technologies.
His areas of interest are Scala, Akka, Apache Spark, Apache Hadoop, Big Data, Distributed & Parallel Computing, High Availability & Scalability.
He holds a B.Sc. degree in Electrical & Electronics Engineering and an M.Sc. degree in Control & Automation Engineering.
Technical Articles: https://dzone.com/users/938353/eren_avsarogullari.html
GitHub: https://github.com/erenavsarogullari
Keywords
apache spark
batch processing
spark on yarn
springone
spring
spring integration
multi tenancy
spark sql metrics
apache spark upgrade
etl
sql on hadoop
distributed computing engine
sql
distributed sql engine
gc policy
storage level
job scheduling
data locality
data skew
serialization
checkpointing
event sourcing
partitioning
persistency
data structures
best practices
apache pulsar
stream processing
streaming
data processing patterns
data pipelines
rdd persistency
catalyst optimizer
tungsten
spark job lifecycle
spark ecosystem
spark internals
dataset
dataframe
rdd
hazelcast
Presentations
(6)