Session Details: Current Bengaluru 2025

Session Type

Breakout Session

Name

Inside Uber's Large-Scale Real-Time Analytics Platform

Date

Wednesday, March 19, 2025

Time

4:00 PM - 4:45 PM

Location Name

Scarlet 1

Description

Abstract: At Uber, the EVA platform that drives substantial advancements in our real-time analytics capabilities, empowering various business use cases across marketing, engineering, data science, and operations and internal use cases around metrics, logs & query analytics. The platform features Apache Kafka for realtime data transport, Apache Flink for stream processing, Spark for batch processing, HDFS for deep storage needs, and Apache Pinot as the core analytics engine. Additionally, it features internal service Neutrion for Presto-like queries on Pinot and metadata service for dataset management. Core Theme As part of the talk, we cover the matured architecture for realtime analytics ecosystem powering Uber’s usecases that serve up to 10s of thousands of queries/sec, several million writes/sec and host up to tens of Petabytes of Pinot datasets. We also cover two critical business and observability usecase. Technical Depth 1. Realtime processing and ingestion using AthenaX(SQL based transformation on Flink), Flink and Kafka to provide analytics on realtime data. 2. Realtime Analytics powered by Apache Pinot to serve analytics at high QPS with sub-second latency 3. Disaster resiliency and disaster recovery strategies for Apache Pinot datasets. Relevance The talk covers Uber’s two usecases that solve realtime analytics challenges for business and observability. 1. Use case 1: Business usecase(rides/eats related) 2. Use case 2: Observability usecase (metrics/logs related) Audience Takeaways The audience will gain practical insights into designing real-time analytics systems centered around Apache Pinot and effectively leveraging complementary real-time technologies to build robust and high-performing solutions.

Speakers

Rohit Yadav, Uber
Satish Duggana, Uber

Level

Intermediate

Audience

Architect, Developer

Industry

Technology

Tags

Analytics, Apache Flink, Apache Kafka, Storage