Session Type
Lightning Talk
Name
Next-Gen RAG Architectures for Streaming Vector Data
Date
Tuesday, May 20, 2025
Time
12:30 PM - 12:45 PM
Location Name
Breakout Room 1
Description

Real-time retrieval-augmented generation (RAG) is poised to revolutionize how businesses leverage streaming vector data, but many current RAG architectures fall short of meeting the demands of real-time use cases. These architectures, originally designed for batch-based workflows, struggle with latency issues that prevent applications like real-time personalization, financial analysis, and fleet optimization from achieving their full potential. In this session, we’ll introduce an emerging real-time RAG reference architecture - originally designed by Uber - designed specifically to handle the complexities of streaming vector data. We’ll explore how this architecture overcomes the limitations of traditional RAG systems by enabling real-time analysis on freshly created vector embeddings. Attendees will leave this session with actionable insights into building and deploying real-time RAG systems, unlocking new possibilities for applications that demand both speed and accuracy in vector-driven analysis.

Chad Meley
Level
Advanced
Target Audience
Architect, Data Engineer/Scientist, Developer, Executive (Technical)
Industry
Advertising/Media, Banking/Finance, Gaming, Government, Healthcare, Manufacturing, Retail/E-Commerce, Telecommunications, Transportation
Tags
ML/AI Application