Typical RAG pipelines usually include:

  • generating embeddings
  • similarity search on embeddings
  • piping outputs from the vector DB, e. g. found similar documents into an LLM Thus, the LLM generated is augumented by the retrieval system.