# Cache Management Overview
Cache Management is one of the most powerful features of the HFIM Admin Panel. This page explains what caching is, why it matters, and how to use it effectively.
## What is Cache Management?
Cache Management lets you create and maintain a library of pre-written question-and-answer pairs. When users ask questions similar to your cached entries, the chatbot returns the cached answer immediately without searching through documents.
### Think of It Like...
Imagine the chatbot is a librarian:
- Without cache: Every time someone asks "Where is the circulation desk?", the librarian searches through maps, directories, and signs before answering.
- With cache: The librarian memorizes "The circulation desk is on the first floor near the entrance" and immediately answers without searching.
Cache entries are like the librarian's memorized answers - faster, consistent, and always ready.
## Why Use Cache Management?
Cache management provides three major benefits:
### 1. ⚡ Faster Responses

- Without cache: 2-8 seconds (AI search + generation)
- With cache: 50-500 milliseconds (instant retrieval)
Users get answers 10-15x faster when questions match cache entries.
### 2. ✅ Consistent Answers
Cache ensures the chatbot gives the same accurate answer every time:
- Official program requirements stay consistent
- Important deadlines are always correct
- Contact information never varies
Example: Without cache, "How many credits is HFIM?" might generate slightly different responses ("120 credits" vs. "120 total credits" vs. "120 semester hours"). With cache, every user gets the exact same verified answer.
### 3. 💰 Lower Costs
Each cached response avoids:
- OpenAI API calls for embedding and generation
- Pinecone vector searches
- Database lookups
Impact: A cache hit rate of 70% can reduce AI costs by 60-80%.
If 100 users ask "What is HFIM?" in a week:
- Without cache: 100 AI generation calls = ~$0.50-$1.00
- With cache: 1 AI call (first time) + 99 cache hits = ~$0.005
Savings: 99%+ cost reduction for frequently asked questions!
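The arithmetic above can be sketched in a few lines. The 1-cent-per-call figure is hypothetical, chosen at the top of the ~$0.50-$1.00-per-100-calls range quoted above; integer cents keep the math exact.

```python
# Illustrative weekly-cost comparison for one frequently asked question.
# COST_PER_AI_CALL_CENTS is a hypothetical figure, not a measured rate.
COST_PER_AI_CALL_CENTS = 1

def weekly_cost_cents(asks: int, cache_enabled: bool) -> int:
    """Estimated AI spend (in cents) for a question asked `asks` times."""
    if cache_enabled:
        # Only the first ask triggers AI generation; the rest hit the cache.
        return COST_PER_AI_CALL_CENTS
    return asks * COST_PER_AI_CALL_CENTS

without_cache = weekly_cost_cents(100, cache_enabled=False)  # 100 cents ($1.00)
with_cache = weekly_cost_cents(100, cache_enabled=True)      # 1 cent
savings = 1 - with_cache / without_cache                     # 0.99 -> 99% cheaper
```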
## How Cache Works

### The Cache Matching Process
When a user asks a question, the chatbot follows this process:

1. **User asks:** "What is the HFIM program?"
2. **Cache check:** the chatbot looks for similar cached questions.
3. **Outcome:**
   - **Match found** → return the cached response. Done in 50-500 ms.
   - **No match** → search documents and generate a response. Takes 2-8 seconds.
### What Makes a Good Match?
The chatbot considers:
- Similarity score - How closely the question matches
- Confidence level - How confident you are in the answer
- Status - Whether the cache entry is active
- Variations - Alternative ways to ask the same question
Matching threshold: Questions must be at least 70% similar to return the cached response.
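As a rough sketch of how these factors combine, here is a minimal lookup with a 70% threshold. Word-overlap (Jaccard) similarity stands in for whatever similarity measure the panel actually uses, and the entry field names are illustrative assumptions, not the real schema.

```python
# Minimal sketch of the cache-matching step, under assumed field names.
def similarity(a: str, b: str) -> float:
    """Fraction of shared words between two questions (0.0-1.0)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb) if wa | wb else 0.0

def lookup(question: str, cache: list, threshold: float = 0.70):
    """Return the best-matching active entry at or above the threshold."""
    best, best_score = None, 0.0
    for entry in cache:
        if entry["status"] != "active":  # inactive entries never match
            continue
        # Compare against the main question and every variation.
        for candidate in [entry["question"], *entry.get("variations", [])]:
            score = similarity(question, candidate)
            if score >= threshold and score > best_score:
                best, best_score = entry, score
    return best  # None means: fall back to document search (RAG)

cache = [{
    "question": "What is the HFIM program?",
    "variations": ["Tell me about HFIM"],
    "response": "HFIM stands for Hospitality and Food Industry Management...",
    "status": "active",
}]
lookup("What is the HFIM program?", cache)    # identical wording: match
lookup("Where is the parking garage?", cache)  # below threshold -> None
```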
## Cache Entry Components
Each cache entry has several fields:
### Core Fields
| Field | Purpose | Example |
|---|---|---|
| Question | Main question this entry answers | "What is the HFIM program?" |
| Response | The answer to return | "HFIM stands for Hospitality and Food Industry Management..." |
| Confidence | Your confidence in accuracy (0-1) | 0.95 (very confident) |
| Status | Active (used) or Inactive (disabled) | Active |
| Sources | JSON array of source documents | [{"filename": "HFIM_Handbook.pdf", "page": 5}] |
### Optional Fields
| Field | Purpose | Example |
|---|---|---|
| Question Variations | Alternative ways to ask | "Tell me about HFIM\nExplain the Hospitality program" |
| TTL | How long answer stays valid (days) | 90 days |
| Admin Notes | Internal notes for other admins | "Updated for Fall 2026 requirements" |
The "sources" field uses JSON format to list documents used to create the response. This helps with transparency and allows you to update entries when source documents change.
Format: `[{"filename": "Document.pdf", "page": 12, "section": "Overview"}]`
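Because `sources` is plain JSON, it can be parsed with any standard JSON library. A hypothetical helper for rendering citations (the field layout follows the format shown above; the function name is illustrative):

```python
import json

# Hypothetical helper that turns the JSON "sources" array into readable
# citations for display in the admin panel.
def describe_sources(raw: str) -> list:
    """Parse the sources JSON and format one citation per document."""
    return [f'{src["filename"]}, p. {src["page"]}' for src in json.loads(raw)]

sources_json = '[{"filename": "HFIM_Handbook.pdf", "page": 5, "section": "Overview"}]'
describe_sources(sources_json)  # -> ['HFIM_Handbook.pdf, p. 5']
```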
## Cache Statistics
The admin panel tracks performance metrics for each cache entry:
- Times Served: How many times this entry was returned to users
- Success Rate: Percentage of times it successfully matched questions
- Last Updated: When you last modified this entry
- Last Served: When it was most recently used
These metrics help you identify:
- ✅ High-value entries (frequently used)
- ⚠️ Unused entries (never or rarely matched)
- 📅 Entries needing updates (old "Last Updated" dates)
Learn more: Performance Metrics
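The triage these metrics enable can be sketched as a small classifier. The field names (`times_served`, `last_updated`) and the 90-day staleness cutoff are illustrative assumptions, not the panel's actual schema:

```python
from datetime import date, timedelta

# Hypothetical triage of a cache entry using the metrics listed above.
def triage(entry: dict, today: date, stale_after_days: int = 90) -> str:
    """Classify an entry as 'needs-update', 'unused', or 'high-value'."""
    if today - entry["last_updated"] > timedelta(days=stale_after_days):
        return "needs-update"  # old "Last Updated" date
    if entry["times_served"] == 0:
        return "unused"        # never matched a user question
    return "high-value"        # actively serving responses

triage({"times_served": 42, "last_updated": date(2026, 1, 10)},
       today=date(2026, 2, 1))  # -> 'high-value'
```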
## Getting Started with Cache

### Recommended Workflow
For new admin users, follow this process:

1. **Week 1: Observe**
   - Review existing cache entries
   - Check Dashboard metrics (hit rate, popular questions)
   - Read Best Practices
2. **Week 2: Maintain**
   - Edit outdated entries
   - Fix entries with low confidence
   - Add variations to underperforming entries
3. **Week 3+: Expand**
   - Create new entries for frequently asked questions
   - Use the variation generator
   - Analyze conversation feedback
### What to Cache First

**Highest Priority** (cache these immediately):

- ✅ Program overview and mission
- ✅ Admission requirements
- ✅ Core faculty contact information
- ✅ Course prerequisites
- ✅ Internship requirements

**Medium Priority** (cache within 1-2 months):

- ⚠️ Popular questions (check Dashboard)
- ⚠️ Questions with negative feedback
- ⚠️ Degree requirements and pathways

**Lower Priority** (cache as time allows):

- ⬜ Rarely asked questions
- ⬜ Questions with consistent positive feedback (AI already answers well)
Don't try to cache everything at once! Focus on the 10-20 most common questions first, then expand gradually based on usage patterns.
## Common Cache Management Tasks
Quick links to specific guides:
| Task | Guide | Frequency |
|---|---|---|
| Find a cache entry | Searching Cache | Daily |
| Edit an entry | Editing Entries | Weekly |
| Create variations | Generating Variations | Bi-weekly |
| Bulk updates | Bulk Operations | Monthly |
| Check performance | Performance Metrics | Weekly |
| Follow best practices | Best Practices | Always! |
| Fix problems | Troubleshooting | As needed |
## Cache vs. RAG: When to Use Each
Understanding when to use cache vs. letting the AI search (RAG) helps you make better decisions.
### Use Cache For:

- ✅ **Frequently asked questions** - asked 10+ times
- ✅ **Official information** - program requirements, policies, contact info
- ✅ **Consistent answers** - the same answer every time
- ✅ **Time-sensitive info** - deadlines, current semester dates (with an appropriate TTL)
- ✅ **Approved messaging** - when specific phrasing matters
### Let AI Search (RAG) For:

- ❌ **Rarely asked questions** - asked fewer than 5 times
- ❌ **Context-dependent questions** - "What about MY situation?"
- ❌ **Exploratory questions** - "Tell me about career paths"
- ❌ **Complex queries** - require synthesizing multiple sources
- ❌ **Recent updates** - brand-new information not yet cached
Caching too many entries can:
- Make management difficult (hundreds of entries to maintain)
- Increase false matches (similar questions get wrong cached answers)
- Reduce flexibility (different users may need slightly different answers)
Rule of thumb: If a question is asked < 5 times per month, consider letting RAG handle it.
## Understanding Cache Lifecycle
Cache entries go through several stages:
### 1. Creation
- Manual creation: You write question + response
- Conversion: From good conversations
- Import: Bulk upload (coming soon)
### 2. Active Use
- Serves responses to users
- Tracks "Times Served" metric
- Generates performance data
### 3. Maintenance
- Review based on TTL
- Update for accuracy
- Add variations if underperforming
### 4. Retirement
- Set to "Inactive" when outdated
- Delete if permanently irrelevant
- Archive notes explain why
Learn more: Best Practices - Status Management
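The TTL review in the maintenance stage can be sketched as a simple date check, assuming TTL is stored in days and each entry records when it was last updated (as described in the Optional Fields table):

```python
from datetime import date, timedelta

# Sketch of a TTL review check; the function name is illustrative.
def is_due_for_review(last_updated: date, ttl_days: int, today: date) -> bool:
    """True once the entry's TTL window has elapsed since its last update."""
    return today > last_updated + timedelta(days=ttl_days)

is_due_for_review(date(2026, 1, 1), 90, today=date(2026, 5, 1))  # True: past 90 days
is_due_for_review(date(2026, 1, 1), 90, today=date(2026, 2, 1))  # False: still fresh
```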
## Next Steps
Ready to start working with cache? Here's what to do next:
- Learn to search cache entries - Find existing entries
- Practice editing - Update an entry safely
- Generate variations - Improve matching
- Read best practices - Avoid common mistakes
Start by editing a low-impact entry (something with few "Times Served") to get comfortable with the interface before modifying high-traffic entries.
## Questions?
Remember: Cache management is a powerful tool for improving the chatbot. Start small, follow best practices, and expand gradually based on usage data!