Advanced RAG Settings for AI Agents- Chipp Docs

When you add knowledge sources (documents, websites, etc.) to your Chipp app, the AI uses Retrieval-Augmented Generation (RAG) to find and use relevant information. The Advanced RAG Settings let you fine-tune this process.

How Your AI Uses Your Documents

Before diving into settings, here's how your AI actually finds and uses information from your uploaded documents.

What is RAG?

RAG (Retrieval-Augmented Generation) is how your AI "reads" your documents to answer questions. Instead of memorizing everything, the AI:

Searches your documents for relevant information when a user asks a question
Retrieves the most relevant sections
Generates a response using that information

Think of it like a research assistant who quickly scans your files for relevant passages before answering, rather than trying to memorize every document.

What is a "Chunk"?

When you upload a document, Chipp automatically breaks it into smaller pieces called chunks. A chunk is typically a paragraph or a few paragraphs of text.

Why chunks?

AI models have limits on how much text they can process at once
Smaller pieces let the AI find the exact relevant section, not the whole document
It's faster and more accurate than searching full documents

Example: A 50-page PDF might be split into 200 chunks. When a user asks a question, the AI searches all 200 chunks and retrieves only the 5 most relevant ones.

How Search Works

When a user asks a question, Chipp converts both the question and your document chunks into mathematical representations (called "embeddings"). It then finds chunks whose meaning is closest to the question's meaning.

This is semantic search - it understands meaning, not just keywords. So "How do I cancel my subscription?" will match a chunk about "ending your membership" even if those exact words aren't used.

Standard Search vs. Hybrid Search

Standard Search (Default)

Looks at each chunk independently and finds the ones most similar to the user's question.

Works great for: FAQ-style content, focused topics, smaller knowledge bases
Best when: Each chunk contains self-contained answers

Hybrid Search

Combines two search strategies:

Chunk search: Which individual chunks match the question best?
Document search: Which overall documents are most relevant?

Then it boosts chunks that come from highly relevant documents, even if those chunks aren't the top individual matches.

Works great for: Multiple documents on similar topics, long documents, broad questions
Best when: Context from the same document matters, or you have 10+ files

Example: A user asks about "return policies." Standard search might find one great chunk from your FAQ and another from a random product page. Hybrid search recognizes that your "Returns & Refunds" document is highly relevant overall, so it prioritizes chunks from that document.

Accessing RAG Settings

Go to your app in the Chipp dashboard
Navigate to Build tab
Scroll to Knowledge Sources
Click Advanced RAG Settings to expand

Settings Reference

Relevance Threshold

Default: 0.15 | Range: 0.00 - 1.00

Controls how closely a document chunk must match the user's query to be included in the AI's context.

Value	Behavior
Lower (0.00 - 0.15)	More results included, may include loosely related content
Medium (0.15 - 0.30)	Balanced approach (recommended starting point)
Higher (0.30 - 1.00)	Stricter matching, only highly relevant content

When to adjust:

Lower the threshold if users report the AI doesn't find relevant information that exists in your documents
Raise the threshold if responses include too much irrelevant information or the AI seems confused by conflicting sources

Max Chunks

Default: 5 | Range: 1 - 20

The maximum number of document chunks retrieved for each user query. Each chunk is a section of your uploaded content.

Value	Behavior
Fewer chunks (1-3)	Focused responses, lower token usage, faster
Moderate (4-7)	Balanced context (recommended)
More chunks (8-20)	Comprehensive coverage, higher token usage

When to adjust:

Use fewer chunks for simple Q&A where answers are typically in one place
Use more chunks when questions require synthesizing information from multiple sources, or for complex topics with scattered information

Note: More chunks means more context for the AI, but also increases token usage and may slow responses.

Hybrid Search

Default: Off

When enabled, combines two search strategies:

Chunk-level search: Finds individual chunks most similar to the query
Document-level search: Considers which documents overall are most relevant

This helps when the best answer comes from a document that's highly relevant overall, even if no single chunk perfectly matches the query.

When to enable:

You have multiple documents on similar topics
Users ask broad questions that span document sections
You notice the AI missing context from relevant documents

When to keep off:

Simple, focused knowledge bases
When chunk-level matching works well

Document Weight

Default: 0.30 | Range: 0.00 - 1.00 | Only visible when Hybrid Search is enabled

Controls the balance between chunk-level and document-level relevance when Hybrid Search is on.

Value	Behavior
Lower (0.00 - 0.25)	Prioritize individual chunk matches (chunk focus)
Balanced (0.25 - 0.50)	Equal weight to both signals
Higher (0.50 - 1.00)	Prioritize chunks from relevant documents (document focus)

When to adjust:

Lower values when your documents cover very different topics and chunk relevance is most important
Higher values when you have long documents where context from the same document is valuable

Recommended Configurations

Customer Support Bot

Relevance Threshold: 0.20
Max Chunks: 5
Hybrid Search: Off

Good for FAQ-style knowledge bases where answers are self-contained.

Research Assistant

Relevance Threshold: 0.10
Max Chunks: 10
Hybrid Search: On
Document Weight: 0.40

Retrieves more context for synthesizing comprehensive answers.

Technical Documentation

Relevance Threshold: 0.25
Max Chunks: 7
Hybrid Search: On
Document Weight: 0.30

Balances precision with coverage for technical queries.

Legal/Compliance Bot

Relevance Threshold: 0.15
Max Chunks: 8
Hybrid Search: On
Document Weight: 0.50

Ensures relevant policy documents are well-represented.

Troubleshooting

AI doesn't find information that exists in my documents

Lower the Relevance Threshold to 0.10
Increase Max Chunks to 8-10
Enable Hybrid Search if you have multiple related documents

Responses include irrelevant information

Raise the Relevance Threshold to 0.25-0.35
Decrease Max Chunks to 3-4
Review your document organization

Token usage is too high

Reduce Max Chunks to 3-4
Raise Relevance Threshold to reduce low-quality matches
Consider splitting large documents into focused topics

AI gives conflicting information

Raise Relevance Threshold to be more selective
If using Hybrid Search, increase Document Weight to favor cohesive document sources
Review documents for outdated or contradictory content

Resetting to Defaults

Click Reset to defaults in the Advanced RAG Settings panel to restore all settings to their default values. This is useful if you've experimented and want to start fresh.

Using with MCP

You can manage knowledge sources and RAG settings programmatically using the Chipp MCP Server.

Available Tools

Tool	Description
`list_knowledge_sources`	List all sources with processing status
`add_url_source`	Add URL with crawl depth options
`add_document_source`	Upload documents (base64 encoded)
`delete_knowledge_source`	Remove a knowledge source
`refresh_knowledge_source`	Re-crawl or re-process a source
`update_settings`	Modify RAG settings (topK, temperature, etc.)

Example Prompts

"List all knowledge sources for my FAQ bot"
"Add https://docs.example.com with crawl depth 3"
"Refresh the outdated product manual"
"Update my app to retrieve 10 chunks instead of 5"

Workflow: Knowledge Base Setup

You: Set up knowledge sources for my support bot

AI: [Calls add_url_source for docs site]
[Calls add_url_source for FAQ page]
Added 2 knowledge sources:
- docs.example.com (crawling, depth 2)
- example.com/faq (crawling, depth 1)

You: Increase the max chunks to 8 for better coverage

AI: [Calls update_settings with topK=8]
Updated RAG settings. Your app will now retrieve up to 8
chunks per query.

💡

MCP server access requires a Pro plan or higher. See the MCP Setup Guide to get started.

Advanced RAG Settings

How Your AI Uses Your Documents

What is RAG?

What is a "Chunk"?

How Search Works

Standard Search vs. Hybrid Search

Accessing RAG Settings

Settings Reference

Relevance Threshold

Max Chunks

Hybrid Search

Document Weight

Recommended Configurations

Customer Support Bot

Research Assistant

Technical Documentation

Legal/Compliance Bot

Troubleshooting

AI doesn't find information that exists in my documents

Responses include irrelevant information

Token usage is too high

AI gives conflicting information

Resetting to Defaults

Using with MCP

Available Tools

Example Prompts

Workflow: Knowledge Base Setup

Continue Reading

How Knowledge Sources Work

Building Custom MCP Servers

Understanding AI Evaluations

Want to learn more?