Advanced Settings

βš™οΈ Advanced Settings Guide

What Are Advanced Settings?

Once you've set up your API keys, AI BotKit offers several advanced settings that control how your chatbot processes content and interacts with users. Think of these as the "fine-tuning knobs" that help you optimize your chatbot's performance and behavior.

πŸ’‘ Good News: The default settings work well for most websites! You only need to adjust these if you have specific requirements or want to optimize performance.


🧠Embedding Models

Embedding models are AI tools that turn text (or other data) into numbers β€” specifically, vectors β€” so that computers can understand and compare their meanings.

πŸ“¦ Imagine This:

You give the model a sentence like:

β€œHow’s the weather today?”

The model turns it into a list of numbers like:

[0.45, -0.13, 0.88, ...]

This list of numbers is called an embedding. Similar sentences will have similar embeddings.


πŸ” Why Is This Useful?

Embedding models help with tasks like:

  • Search: Find content that's similar in meaning (even if it's not the same words).

  • Recommendations: Show related articles, documents, or products.

  • Clustering: Group similar ideas together.

  • Chatbots & AI Assistants: Understand and match user questions to the right answers.


πŸ› οΈ Real-World Example:

If someone types:

β€œTell me a joke”

And your knowledge base has a document that says:

β€œHere’s a funny story…”

The embedding model can tell those two are close in meaning β€” even if they use different words β€” and show the right content.


πŸ“„ Chunk Size

What It Does:

When AI BotKit processes your documents, it breaks them into smaller pieces called "chunks." Think of it like cutting a book into chapters - each chunk contains related information that the AI can understand and reference.

How It Works:

  • Larger chunks = More context, but fewer total chunks

  • Smaller chunks = More precise targeting, but less context per chunk

Default Setting:

1000 characters (roughly 150-200 words)

Range:

  • Minimum: 100 characters

  • Maximum: 2000 characters

When To Adjust:

Increase Chunk Size (1200-1500) When:

βœ… Your content has long explanations (tutorials, detailed guides) βœ… You want more context in answers βœ… Your documents have complex, interconnected topics βœ… You're getting fragmented or incomplete answers

Decrease Chunk Size (600-800) When:

βœ… Your content is mostly short FAQs βœ… You want more precise, focused answers βœ… Your documents have many separate topics βœ… You're getting overly long or unfocused answers

Real-World Examples:

E-commerce Store (Recommend: 800-1000)

  • Product descriptions are usually concise

  • Users ask specific questions about features

Educational Site (Recommend: 1200-1500)

  • Content is detailed and explanatory

  • Concepts build on each other


πŸ”— Chunk Overlap

What It Does:

This setting ensures that important information doesn't get "lost" between chunks. It's like having overlapping sections in a book so you don't miss the connection between chapters.

How It Works:

  • Takes the last X characters from one chunk

  • Includes them at the beginning of the next chunk

  • Ensures context continuity between chunks

Default Setting:

200 characters (roughly 30-40 words)

Range:

  • Minimum: 0 characters (no overlap)

  • Maximum: 200 characters (20% of default chunk size)

When To Adjust:

Increase Overlap (300-400) When:

βœ… Your content has lots of cross-references βœ… Topics are highly interconnected βœ… You're getting answers that miss important connections βœ… Your content has step-by-step processes

Decrease Overlap (50-100) When:

βœ… Your content is mostly independent sections βœ… You want to reduce processing time βœ… Your content has clear topic boundaries βœ… Storage/cost optimization is important

Impact on Performance:

  • More overlap = Better context but higher processing costs

  • Less overlap = Faster processing but potential context loss


⏱️ Rate Limit Window

What It Does:

Controls the time period for counting API requests. Think of it as a "rolling window" that tracks how many requests users make.

Simple Analogy: Like having a timer that resets every X seconds to count fresh requests.

How It Works:

  • Tracks requests made in the last X seconds

  • When window expires, counter resets

  • Prevents abuse and controls costs

Default Setting:

60 seconds (1 minute)

Range:

  • Minimum: 1 second

  • Maximum: 3600 seconds (1 hour)

When To Adjust:

Increase Window (300-900 seconds) When:

βœ… You want longer-term usage tracking βœ… Your site has steady, consistent traffic βœ… You prefer hourly rather than minute-based limits βœ… You want to smooth out traffic spikes

Decrease Window (15-30 seconds) When:

βœ… You want strict, immediate rate limiting βœ… Your site gets sudden traffic bursts βœ… You want to prevent rapid-fire requests βœ… Cost control is critical


🚦 Max Tokens per Conversation

What It Does:

Sets the maximum number of chatbot tokens utilization allowed per chat. This is your primary tool for controlling usage and costs.

Simple Analogy: Like setting a limit - it determines how many requests users can make!

Default Setting:

100k Tokens per chat

When To Adjust:

  • If users frequently reach the token limit and get cut off mid-conversation.

  • If you’re trying to reduce API usage costs, especially under high traffic conditions.

Usage Scenarios:

Business Website (e.g., SaaS, Corporate)

  • Recommended Limit: 100k–200k tokens

  • Why: These users may ask in-depth questions about services, pricing, integrations, or documentation. Conversations are longer and more informative.

  • Tip: Ensure longer context retention if the chatbot provides guided walkthroughs or complex troubleshooting.

Small E-Commerce Website

  • Recommended Limit: 50k–100k tokens

  • Why: Customers typically ask about products, availability, shipping, and return policies β€” short and focused interactions.

  • Tip: Monitor usage for product-specific FAQs to optimize response quality and token limits.


πŸ“© Max Messages in 24 hours

What It Does:

Sets the maximum number of messages a single user can send to the chatbot within a 24-hour period. This helps control excessive usage and ensures fair access across users.

Simple Analogy: Like placing a limit on the number of customer support calls you can make per day β€” it prevents overuse and ensures others also get served.

Default Setting:

60 messages per user per day

When To Adjust:

  • Increase the limit if users often need multi-turn conversations or the chatbot handles complex workflows (e.g., step-by-step guidance).

  • Decrease the limit if you're seeing abuse, spam, or resource strain from a few users.

  • Balance the limit based on your backend resource capacity and pricing plan (especially if using a paid API).

Usage Scenarios:

Small E-Commerce Website

  • Recommended Limit: 10 messages per day

  • Why: Most customers ask 3–10 quick questions per session β€” product queries, order status, or returns. Rarely will one user exceed this unless abusing the system.

Business Website (e.g., SaaS, Corporate)

  • Recommended Limit: 20-30 messages per day

  • Why: Users may engage in longer sessions involving feature walkthroughs, documentation help, or troubleshooting. More back-and-forth is expected.

🎯 Choosing The Right Settings For Your Site

πŸ›’ E-commerce Product Catalog

Chunk Size: 800-1000 characters
Chunk Overlap: 150-200 characters
Max Tokens per Conversation: 50k-100k
Max Messages: 10

Why: Product info is concise, but you need to handle customer service spikes.

πŸ’Ό Professional Services Site

Chunk Size: 1000-1200 characters
Chunk Overlap: 200-250 characters
Max Tokens per Conversation: 100k
Max Messages: 40-80

Why: Professional content needs context, but traffic is usually moderate.


🚨 Troubleshooting Common Issues

Issues

Solution

Answers seem incomplete or cut off

Increase chunk size to 1200–1500 characters

Chatbot loses context between related topics

Increase chunk overlap to 250–300 characters

Users complain about rate limiting

Increase max requests or extend the rate limit window

AI costs are too high

Decrease max requests and optimize chunk size

Responses are too slow

Decrease chunk size and overlap

Getting irrelevant answers

Decrease chunk size for more precise content targeting


πŸ“ž When To Seek Help

Contact support or consider professional setup if:

  • Your costs are unexpectedly high despite optimization

  • Users consistently report poor answer quality

  • You're experiencing frequent rate limiting issues

  • Your content has unique requirements not covered here

Remember: These settings can always be adjusted! Start with defaults, monitor performance, and fine-tune based on your specific needs and user feedback.


Last updated