Personal Information
Entreprise/Lieu de travail
Hangzhou, Zhejiang, China China
Profession
海康研究院 大数据架构工程师 大数据安防 平安城市 智慧城市
Secteur d’activité
Technology / Software / Internet
Site Web
kaidata.github.io
À propos
大数据处理
- Présentations
- Documents
- Infographies
data science @NYT ; inaugural Data Science Initiative Lecture
chris wiggins
•
il y a 8 ans
How to Become a Data Scientist
ryanorban
•
il y a 9 ans
Paris ML meetup
Yves Raimond
•
il y a 8 ans
Building Robust ETL Pipelines with Apache Spark
Databricks
•
il y a 6 ans
Why apache Flink is the 4G of Big Data Analytics Frameworks
Slim Baltagi
•
il y a 8 ans
Extreme Apache Spark: how in 3 months we created a pipeline that can process 2.5 billion rows a day
Josef A. Habdank
•
il y a 8 ans
Large Scale Deep Learning with TensorFlow
Jen Aman
•
il y a 7 ans
Time Series Analysis with Spark by Sandy Ryza
Spark Summit
•
il y a 8 ans
The Parquet Format and Performance Optimization Opportunities
Databricks
•
il y a 4 ans
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Architect Things Right
Databricks
•
il y a 4 ans
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Databricks
•
il y a 5 ans
Apache Arrow: In Theory, In Practice
Dremio Corporation
•
il y a 6 ans
Dynamic Partition Pruning in Apache Spark
Databricks
•
il y a 4 ans
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East talk by DB Tsai
Spark Summit
•
il y a 7 ans
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
il y a 6 ans
Machine learning pipeline with spark ml
datamantra
•
il y a 7 ans
Jumpstart on Apache Spark 2.2 on Databricks
Databricks
•
il y a 6 ans
Apache Spark Core – Practical Optimization
Databricks
•
il y a 4 ans
Why is My Stream Processing Job Slow? with Xavier Leaute
Databricks
•
il y a 5 ans
Building Reliable Data Lakes at Scale with Delta Lake
Databricks
•
il y a 4 ans