Agentic RAG

Agentic RAG embeds autonomous AI agents into the retrieval-augmented generation pipeline, enabling dynamic decisions about when, what, and how to retrieve based on task requirements.

Traditional RAG follows a fixed flow: query → retrieve → generate. Agentic RAG adds a reasoning layer that can:

  • Decide when to retrieve: not every query needs external knowledge
  • Reformulate queries: rewrite based on initial results or task decomposition
  • Iterate multi-step: retrieve, reason, retrieve again as needed
  • Coordinate tools: use retrieval alongside other capabilities (calculation, API calls, etc.)

Two paradigms have emerged:

  • Predefined reasoning: structured pipelines with fixed decision points (rule-based when to retrieve, what to do with results)
  • Agentic reasoning: the model autonomously orchestrates tool interaction during inference

This shift parallels the evolution from static websites to interactive applications, from passive content delivery to dynamic, context-aware behavior.

The term reflects broader AI agent architectures: reflection, planning, tool use, and multi-agent collaboration applied specifically to the retrieval-generation problem.

Related: 05-atom—rag-definition, 05-molecule—rag-evolution-timeline