Advanced Settings
βοΈ Advanced Settings Guide
What Are Advanced Settings?
Once you've set up your API keys, AI BotKit offers several advanced settings that control how your chatbot processes content and interacts with users. Think of these as the "fine-tuning knobs" that help you optimize your chatbot's performance and behavior.
π‘ Good News: The default settings work well for most websites! You only need to adjust these if you have specific requirements or want to optimize performance.
π§ Embedding Models
Embedding models are AI tools that turn text (or other data) into numbers β specifically, vectors β so that computers can understand and compare their meanings.
π¦ Imagine This:
You give the model a sentence like:
βHowβs the weather today?β
The model turns it into a list of numbers like:
[0.45, -0.13, 0.88, ...]
This list of numbers is called an embedding. Similar sentences will have similar embeddings.
π Why Is This Useful?
Embedding models help with tasks like:
Search: Find content that's similar in meaning (even if it's not the same words).
Recommendations: Show related articles, documents, or products.
Clustering: Group similar ideas together.
Chatbots & AI Assistants: Understand and match user questions to the right answers.
π οΈ Real-World Example:
If someone types:
βTell me a jokeβ
And your knowledge base has a document that says:
βHereβs a funny storyβ¦β
The embedding model can tell those two are close in meaning β even if they use different words β and show the right content.
π Chunk Size
What It Does:
When AI BotKit processes your documents, it breaks them into smaller pieces called "chunks." Think of it like cutting a book into chapters - each chunk contains related information that the AI can understand and reference.
How It Works:
Larger chunks = More context, but fewer total chunks
Smaller chunks = More precise targeting, but less context per chunk
Default Setting:
1000 characters (roughly 150-200 words)
Range:
Minimum: 100 characters
Maximum: 2000 characters
When To Adjust:
Increase Chunk Size (1200-1500) When:
β Your content has long explanations (tutorials, detailed guides) β You want more context in answers β Your documents have complex, interconnected topics β You're getting fragmented or incomplete answers
Decrease Chunk Size (600-800) When:
β Your content is mostly short FAQs β You want more precise, focused answers β Your documents have many separate topics β You're getting overly long or unfocused answers
Real-World Examples:
E-commerce Store (Recommend: 800-1000)
Product descriptions are usually concise
Users ask specific questions about features
Educational Site (Recommend: 1200-1500)
Content is detailed and explanatory
Concepts build on each other
π Chunk Overlap
What It Does:
This setting ensures that important information doesn't get "lost" between chunks. It's like having overlapping sections in a book so you don't miss the connection between chapters.
How It Works:
Takes the last X characters from one chunk
Includes them at the beginning of the next chunk
Ensures context continuity between chunks
Default Setting:
200 characters (roughly 30-40 words)
Range:
Minimum: 0 characters (no overlap)
Maximum: 200 characters (20% of default chunk size)
When To Adjust:
Increase Overlap (300-400) When:
β Your content has lots of cross-references β Topics are highly interconnected β You're getting answers that miss important connections β Your content has step-by-step processes
Decrease Overlap (50-100) When:
β Your content is mostly independent sections β You want to reduce processing time β Your content has clear topic boundaries β Storage/cost optimization is important
Impact on Performance:
More overlap = Better context but higher processing costs
Less overlap = Faster processing but potential context loss
β±οΈ Rate Limit Window
What It Does:
Controls the time period for counting API requests. Think of it as a "rolling window" that tracks how many requests users make.
Simple Analogy: Like having a timer that resets every X seconds to count fresh requests.
How It Works:
Tracks requests made in the last X seconds
When window expires, counter resets
Prevents abuse and controls costs
Default Setting:
60 seconds (1 minute)
Range:
Minimum: 1 second
Maximum: 3600 seconds (1 hour)
When To Adjust:
Increase Window (300-900 seconds) When:
β You want longer-term usage tracking β Your site has steady, consistent traffic β You prefer hourly rather than minute-based limits β You want to smooth out traffic spikes
Decrease Window (15-30 seconds) When:
β You want strict, immediate rate limiting β Your site gets sudden traffic bursts β You want to prevent rapid-fire requests β Cost control is critical
π¦ Max Tokens per Conversation
What It Does:
Sets the maximum number of chatbot tokens utilization allowed per chat. This is your primary tool for controlling usage and costs.
Simple Analogy: Like setting a limit - it determines how many requests users can make!
Default Setting:
100k Tokens per chat
When To Adjust:
If users frequently reach the token limit and get cut off mid-conversation.
If youβre trying to reduce API usage costs, especially under high traffic conditions.
Usage Scenarios:
Business Website (e.g., SaaS, Corporate)
Recommended Limit: 100kβ200k tokens
Why: These users may ask in-depth questions about services, pricing, integrations, or documentation. Conversations are longer and more informative.
Tip: Ensure longer context retention if the chatbot provides guided walkthroughs or complex troubleshooting.
Small E-Commerce Website
Recommended Limit: 50kβ100k tokens
Why: Customers typically ask about products, availability, shipping, and return policies β short and focused interactions.
Tip: Monitor usage for product-specific FAQs to optimize response quality and token limits.
π© Max Messages in 24 hours
What It Does:
Sets the maximum number of messages a single user can send to the chatbot within a 24-hour period. This helps control excessive usage and ensures fair access across users.
Simple Analogy: Like placing a limit on the number of customer support calls you can make per day β it prevents overuse and ensures others also get served.
Default Setting:
60 messages per user per day
When To Adjust:
Increase the limit if users often need multi-turn conversations or the chatbot handles complex workflows (e.g., step-by-step guidance).
Decrease the limit if you're seeing abuse, spam, or resource strain from a few users.
Balance the limit based on your backend resource capacity and pricing plan (especially if using a paid API).
Usage Scenarios:
Small E-Commerce Website
Recommended Limit: 10 messages per day
Why: Most customers ask 3β10 quick questions per session β product queries, order status, or returns. Rarely will one user exceed this unless abusing the system.
Business Website (e.g., SaaS, Corporate)
Recommended Limit: 20-30 messages per day
Why: Users may engage in longer sessions involving feature walkthroughs, documentation help, or troubleshooting. More back-and-forth is expected.
π― Choosing The Right Settings For Your Site
π E-commerce Product Catalog
Chunk Size: 800-1000 characters
Chunk Overlap: 150-200 characters
Max Tokens per Conversation: 50k-100k
Max Messages: 10
Why: Product info is concise, but you need to handle customer service spikes.
πΌ Professional Services Site
Chunk Size: 1000-1200 characters
Chunk Overlap: 200-250 characters
Max Tokens per Conversation: 100k
Max Messages: 40-80
Why: Professional content needs context, but traffic is usually moderate.
π¨ Troubleshooting Common Issues
Issues
Solution
Answers seem incomplete or cut off
Increase chunk size to 1200β1500 characters
Chatbot loses context between related topics
Increase chunk overlap to 250β300 characters
Users complain about rate limiting
Increase max requests or extend the rate limit window
AI costs are too high
Decrease max requests and optimize chunk size
Responses are too slow
Decrease chunk size and overlap
Getting irrelevant answers
Decrease chunk size for more precise content targeting
π When To Seek Help
Contact support or consider professional setup if:
Your costs are unexpectedly high despite optimization
Users consistently report poor answer quality
You're experiencing frequent rate limiting issues
Your content has unique requirements not covered here
Remember: These settings can always be adjusted! Start with defaults, monitor performance, and fine-tune based on your specific needs and user feedback.
Last updated