Context Management for Agentic RAG | Johan Jern, Co-founder & CTO at Realm
Some queries are hard to solve with "basic" RAG. When questions require multi-step reasoning, full-document understanding (not just chunks), or aggregating many results that match specific criteria, simple retrieve-and-generate pipelines break down, we need agentic RAG. But this added capability comes at a cost: as agents plan, search, read, and iterate, they quickly use up a lot of context, which both degrades answer quality and increases costs and latency.
In this session, we’ll focus on how to tackle the context-related challenges with agentic systems focused on retrieval.
Connect With Us
Website: http://aiven.io
LinkedIn: https://www.linkedin.com/company/aiven/
GitHub: https://github.com/aiven
X: https://twitter.com/aiven_io