alluxio data orchestration storage open source presto cloud big data spark file system hybrid cloud summit data management distributed computing memory tachyon project analytics aws alluxio day machine learning performance hadoop hdfs separation of compute and storage hive apache spark s3 distributed systems ai cloud storage distributed storage alluxio engineering aws s3 caching multi cloud data engineering cloud computing emr data analytics kubernetes sql object store data platform meetup data lake data architecture tachyon object stores compute tech talk release data deep learning fuse data locality rocksdb architecture facebook intel cloud bursting google dataproc unified namespace posix orchestration tensorflow use case apache hudi apache ozone local cache raft office hour scale hybrid cloud bursting overview compute storage separation uber metadata community tencent ceph memory centric database product school zookeeper ml apache iceberg microsoft data lakes fluid alibaba datasapiens under file system zero copy bursting on-prem analytics zoo amazon emr structured data management rakuten object storage query engine data stores conference computer baidu data warehouse grpc data stack demo amazon web services data ecosystem jd kyligence olap memory-centric product release analytics and ai cloud migration cloud architecture twitter virtual file system apache ranger hybrid big data netapp bilibili data tagging open data platform presto caching metadata management shadow cache cache tiktok cache layer prometheus metrics grafana optane persistent memory raptorx disaggregated storage rapids accelerator gpu analytics data lake analytics dask aspect analytics webinar terraform eks t3go walkme unisound atlas starburst robinhood data catalog paypal gimel sql workloads jd.com distributed applications ing tech dataproc google cloud hybrid data lake helixa comcast china unicom aunalytics hub hybrid shannondb storagequery s3 api analytic workloads public cloud deep learning applications high performance high-performance scalable metadata services structured data services catalog service spark workloads remote data software testing unified data zero copy hybrid bursting mapr cloud workloads nfs dc/os object store analytics on-premise compute e-commerce datasets pipeline api usability concurrency iceberg netflix alibaba cloud gene computing structured data search queries ryte zero-copy burst distributed data caching distributed query walmartlabs global namespace multi-tiering 2.0 preview unified bigdata tutorial storage system security parquet amazon amplab pingo tachyon nexus elastic mapreduce developers developer datawarehouse etl financial services decoupling compute and storage data unification virtualization distributed system in-memory storage qiniu sogou business intelligence ctrip momo talking data nvidia mesosphere qunar strata
Tout plus