kaidata Lee

0 Abonné

Présentations
Documents
Infographies

Plus récents Les plus populaires

data science @NYT ; inaugural Data Science Initiative Lecture

chris wiggins • il y a 8 ans

How to Become a Data Scientist

ryanorban • il y a 9 ans

Paris ML meetup

Yves Raimond • il y a 8 ans

Building Robust ETL Pipelines with Apache Spark

Databricks • il y a 6 ans

Why apache Flink is the 4G of Big Data Analytics Frameworks

Slim Baltagi • il y a 8 ans

Extreme Apache Spark: how in 3 months we created a pipeline that can process 2.5 billion rows a day

Josef A. Habdank • il y a 8 ans

Large Scale Deep Learning with TensorFlow

Jen Aman • il y a 7 ans

Time Series Analysis with Spark by Sandy Ryza

Spark Summit • il y a 8 ans

The Parquet Format and Performance Optimization Opportunities

Databricks • il y a 4 ans

Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Architect Things Right

Databricks • il y a 4 ans

Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang

Databricks • il y a 5 ans

Apache Arrow: In Theory, In Practice

Dremio Corporation • il y a 6 ans

Dynamic Partition Pruning in Apache Spark

Databricks • il y a 4 ans

Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East talk by DB Tsai

Spark Summit • il y a 7 ans

Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache

Dremio Corporation • il y a 6 ans

Machine learning pipeline with spark ml

datamantra • il y a 7 ans

Jumpstart on Apache Spark 2.2 on Databricks

Databricks • il y a 6 ans

Apache Spark Core – Practical Optimization

Databricks • il y a 4 ans

Why is My Stream Processing Job Slow? with Xavier Leaute

Databricks • il y a 5 ans

Building Reliable Data Lakes at Scale with Delta Lake

Databricks • il y a 4 ans