Skip to content
Carbonfay
RU

Knowledge & search

Corporate Knowledge RAG Agent

A RAG agent answers staff and customer questions from your documents and policies — grounded in the source, with a citation. Not in the base — it honestly says so.

RAG Systems

“Staff ask the same thing a hundred times, let’s put an AI on our knowledge base” is a reasonable request. But “give the model the documents and ask” breeds confident inventions: the model will fabricate an answer that isn’t in the policy, and do it convincingly. A RAG agent is built differently: first retrieval over your base, an answer strictly from what was found with a link to the source, and an honest “not found” where there are no sources.

What it does

It takes a question from staff or a customer in any channel, retrieves relevant fragments from the vector index of your documents, policies and ticket history, and composes an answer strictly from what it found. Each answer carries a link to the document and section so the person sees the original. If nothing relevant is in the base, it answers “not found in the sources” and hands off to a human when needed. This isn’t a chat “inspired by” corporate knowledge but an auditable answer traced to a specific document.

Where the line is

The main RAG risk isn’t “no answer found” but a confidently issued invention indistinguishable from fact. So refusal is part of the agent’s contract: a relevance threshold, an explicit “not in the base”, a source link under every answer. The knowledge retrieval inside the agent is a full RAG system, and its quality depends directly on how the vector knowledge base is built: without a clean index, even a perfect model answers from garbage.

How the chain works

  1. 01
    Knowledge-base retrieval · embedder

    For a question it finds relevant fragments in the vector index of documents, policies and history — by meaning, not word overlap.

  2. 02
    Source-grounded answer · RAG + mid model

    Composes an answer strictly from the retrieved fragments and attaches a link to the document and section. Grounded in the source, not the model's memory.

  3. 03
    Coverage check · rule + model

    If there are no relevant fragments or they're weak — it answers 'not found in the base' rather than inventing. A boundary instead of a hallucination.

Integrations

OpenAI YandexGPT Google Sheets Telegram

+ any external API

Cost calculator

200
3
Tokens, ₽/mo
Development, ₽
Support, ₽/mo

Estimate at a blended per-token rate (input+output). Exact cost depends on context length, number of calls and the share of manual review — we scope it to your process.

related cases

faq

Straight answers

How is a RAG agent different from ChatGPT over our documents?
ChatGPT answers from the model's memory and sounds confident even when it's making things up. A RAG agent first retrieves fragments from your base, answers strictly from what it found and attaches a link to the source. If the answer isn't in the documents, it says so rather than filling in a plausible invention. For policies and customer answers that distinction is fundamental.
What does the agent do if the answer isn't in the base?
It honestly answers that it didn't find it in the sources and, if needed, hands the question to a human. This is a deliberate boundary, not a gap: 'I don't know' beats a confident wrong answer about a policy or contract. The relevance threshold is tuned to the cost of error in the specific process.
Can you check where the agent got an answer?
Yes, that's the point. Each answer comes with source fragments and a link to the document and section — staff or customers see what the answer is based on and can open the original. This removes the 'the agent said so' dispute and makes answers auditable.
How does the agent stay current as documents change?
It answers from a live index, not 'baked-in' knowledge. When a policy is updated, its document is re-indexed and the agent immediately answers from the new version. Building and updating that index is a separate vector knowledge base task that the agent relies on.

Next step

Let's design an AI-native automation layer for your operations.

DBCV