Session Type
Lightning Talk
Name
Next-Gen RAG Architectures for Streaming Vector Data
Date
Wednesday, May 21, 2025
Time
11:00 AM - 11:15 AM
Location Name
Breakout Room 4
Description
Real-time retrieval-augmented generation (RAG) is poised to change how businesses use streaming vector data, but many current RAG architectures fall short of the demands of real-time use cases. Originally designed for batch workflows, these architectures introduce latency that keeps applications such as real-time personalization, financial analysis, and fleet optimization from reaching their full potential.
In this session, we’ll introduce an emerging real-time RAG reference architecture, originally developed at Uber, that is built specifically to handle the complexities of streaming vector data. We’ll explore how this architecture overcomes the limitations of traditional RAG systems by enabling real-time analysis of freshly created vector embeddings.
Attendees will leave with actionable insights into building and deploying real-time RAG systems for applications that demand both speed and accuracy in vector-driven analysis.
Speakers

Level
Advanced
Target Audience
Architect, Executive (Technical), Data Engineer/Scientist, Developer
Industry
Banking/Finance, Advertising/Media, Gaming, Government, Healthcare, Manufacturing, Retail/E-Commerce, Telecommunications, Transportation
Tags
ML/AI Application