Skip to main content

Performance Metrics

Learn how to interpret cache performance metrics and use them to improve your chatbot's effectiveness.

Why Metrics Matter​

Performance metrics help you:

  • πŸ“Š Measure cache effectiveness - Is your cache working well?
  • 🎯 Identify improvements - Which entries need attention?
  • πŸš€ Track progress - Are your changes making a difference?
  • πŸ’‘ Make data-driven decisions - What should you cache next?

Rule of thumb: Check metrics weekly to stay on top of cache performance.


Dashboard Metrics​

The Dashboard provides high-level cache performance statistics.

1. Total Cache Entries​

What it means: Total number of cache entries (active + inactive)

Displayed: Count with icon (e.g., "9 Cache Entries")

Interpretation:

CountStatusRecommendation
0-10Small cacheStart caching frequently asked questions
11-50Growing cacheGood foundation, continue expanding
51-100Mature cacheFocus on quality and maintenance
100+Large cachePrioritize high-impact entries, prune unused ones

Action items:

  • Too few: Review Popular Questions and create more entries
  • Too many: Audit for unused or redundant entries

2. Cache Hit Rate​

What it means: Percentage of user questions answered using cached responses

Formula: (Cache hits) Γ· (Total questions) Γ— 100%

Displayed: Percentage with icon (e.g., "85.7% Hit Rate")

Interpretation:

Hit RatePerformanceInterpretation
80%+βœ… ExcellentCache is comprehensive and well-maintained
60-79%βœ… GoodSolid coverage, room for improvement
40-59%⚠️ FairMany questions bypass cache
Below 40%❌ PoorCache needs significant expansion

What affects hit rate:

  • βœ… Number of cache entries
  • βœ… Quality of question variations
  • βœ… Match between cache topics and user questions
  • βœ… Confidence scores (low confidence = fewer matches)

Action items:

  • Low hit rate: Add more cache entries for popular topics
  • High hit rate: Maintain quality, focus on niche topics
Industry Benchmark

A well-maintained chatbot cache typically achieves 65-75% hit rate. Rates above 80% are exceptional!


3. Success Rate​

What it means: Percentage of cache entries that successfully matched at least one user question

Formula: (Entries with Times Served > 0) Γ· (Total active entries) Γ— 100%

Displayed: Percentage with icon (e.g., "0% Success Rate")

Interpretation:

Success RatePerformanceInterpretation
80%+βœ… ExcellentMost cache entries are useful
60-79%βœ… GoodMajority of entries are matching
40-59%⚠️ FairMany entries aren't matching
Below 40%❌ PoorToo many unused entries

Why success rate might be 0%:

  • ⏰ New cache: Entries just created, haven't had time to match
  • 🎯 Poor variations: Questions don't match how users ask
  • πŸ“‰ Low traffic: Few users asking questions
  • ❌ Inactive entries: All entries are disabled

Action items:

  • 0% initially: Normal for new cache, wait 1-2 weeks
  • 0% after 2+ weeks: Generate more variations, check if entries are active
  • Low success rate: Review unused entries (Times Served = 0) and improve variations

4. Total Variations​

What it means: Total count of all question variations across all cache entries

Displayed: Count with icon (e.g., "0 Variations")

Interpretation:

Variations per EntryCoverageRecommendation
0-2❌ PoorGenerate variations immediately
3-5⚠️ MinimalAdd more variations
6-10βœ… GoodSolid coverage
11-15βœ… ExcellentComprehensive coverage
15+⚠️ ExcessiveMay cause false matches

Calculating average: Total Variations Γ· Total Cache Entries

Example: 90 variations Γ· 10 entries = 9 variations per entry (Good!)

Action items:

  • 0 variations: Use bulk variation generator immediately
  • Low average: Generate more variations for underperforming entries

Entry-Level Metrics​

Each cache entry has individual performance metrics visible in the cache list.

1. Times Served​

What it means: How many times this specific entry was returned to users

Location: Cache Management table, column "Times Served"

Interpretation:

Times ServedStatusAction
100+πŸ”₯ High-valueMonitor closely, keep updated
50-99βœ… ValuableMaintain accuracy
10-49⚠️ ModerateConsider adding variations
1-9πŸ“‰ Low usageReview variations and relevance
0❌ UnusedImprove variations or deactivate

High-value entries (50+ times served):

  • Deserve extra attention
  • Small improvements have big impact
  • Review monthly for accuracy

Unused entries (0 times served):

  • Not matching user questions
  • Poor variations or irrelevant topic
  • Consider improving or removing

Action items:

  • Sort by Times Served to find highest-value entries
  • Focus maintenance on top 20% of entries
  • Improve or remove entries with 0 after 1+ month

2. Last Served​

What it means: Timestamp of when this entry was most recently used

Location: Cache Management table, column "Last Served"

Interpretation:

Last ServedStatusAction
Today/YesterdayπŸ”₯ ActiveCurrently being used
This weekβœ… RecentRegularly used
This month⚠️ OccasionalCheck if still relevant
1+ months ago❌ StaleReview for relevance
Never❌ UnusedSame as "Times Served = 0"

Why "Never"?

  • Entry is new (just created)
  • Variations don't match user questions
  • Topic isn't relevant to users
  • Status is "Inactive"

Action items:

  • Old "Last Served": Verify information is still accurate
  • Never served: Improve variations or remove if irrelevant

3. Last Updated​

What it means: Timestamp of when this entry was last edited

Location: Cache Management table, column "Last Updated"

Interpretation:

Last UpdatedStatusAction
Todayβœ… FreshJust edited
This week/monthβœ… CurrentRecently maintained
1-3 months ago⚠️ CheckReview for accuracy
3+ months ago❌ StaleNeeds update
6+ months ago🚨 CriticalUpdate immediately

Combined with TTL: If "Last Updated" exceeds TTL, entry needs review.

Example:

  • TTL: 30 days
  • Last Updated: 60 days ago
  • Action: Review and update entry now!

Action items:

  • Sort by "Last Updated" (oldest first) to find stale entries
  • Set reminders based on TTL values
  • Review high-traffic entries more frequently

Weekly Review Workflow​

Spend 15-30 minutes weekly reviewing metrics:

Step 1: Check Dashboard (5 minutes)​

  1. Note Cache Hit Rate - Is it improving or declining?
  2. Compare to last week - What changed?
  3. Check Success Rate - Are new entries matching?

Record: Keep a simple log (spreadsheet or document) tracking these numbers weekly.

Example Log:

Date       | Hit Rate | Success Rate | Total Entries
-----------+----------+--------------+--------------
1/1/2026 | 65% | 75% | 8
1/8/2026 | 72% | 80% | 9
1/15/2026 | 78% | 85% | 12

Step 2: Identify Top Performers (5 minutes)​

  1. Go to Cache Management
  2. Sort by Times Served (High β†’ Low)
  3. Review top 5-10 entries

Questions to ask:

  • Are these entries still accurate?
  • Do they need updated information?
  • Can I create similar entries for related topics?

Step 3: Find Underperformers (10 minutes)​

  1. Sort by Times Served (Low β†’ High)
  2. Focus on entries with 0 or very low "Times Served"

Questions to ask:

  • Are variations matching how users ask?
  • Is the topic relevant to users?
  • Should I improve or deactivate?

Action: Generate variations or deactivate if irrelevant.


Step 4: Review Negative Feedback (10 minutes)​

  1. Go to Conversations
  2. Filter by Negative Feedback
  3. Identify if any negative feedback relates to cached responses

Questions to ask:

  • Did the cache return a wrong answer?
  • Is the cached response outdated?
  • Do variations cause false matches?

Action: Update or fix problematic cache entries.


Monthly Deep Dive​

Spend 1-2 hours monthly for comprehensive analysis:

1. Calculate Key Ratios​

Variation Density: Total Variations Γ· Total Entries

  • Target: 7-12 variations per entry

Utilization Rate: Entries with Times Served > 10 Γ· Total Active Entries

  • Target: 60%+

Update Frequency: Entries updated this month Γ· Total Entries

  • Target: 20-30% (regular maintenance)

2. Topic Analysis​

Group entries by topic and compare performance:

Example:

TopicEntriesAvg Times ServedHit Rate
Admissions545High
Courses812Medium
Faculty43Low

Insights:

  • Admissions cache is working well (keep maintaining)
  • Courses need more variations (medium hit rate)
  • Faculty cache needs improvement or removal (low hit rate)

Track metrics across semesters:

Example observations:

  • Fall: Higher hit rate (new students ask common questions)
  • Spring: Lower hit rate (experienced students ask specific questions)
  • Summer: Medium hit rate (prospective students research program)

Action: Adjust cache strategy seasonally.


Using Metrics to Prioritize Work​

Priority Matrix​

Use this matrix to decide which entries need attention:

High Times ServedLow Times Served
Recent Updateβœ… Maintain⚠️ Monitor
Old Update🚨 Update NOW❌ Deactivate/Improve

Quadrant Actions:

  1. High Times Served + Recent Update (βœ… Maintain)

    • Keep monitoring
    • Make small refinements
    • Ensure accuracy
  2. High Times Served + Old Update (🚨 Update NOW)

    • HIGHEST PRIORITY
    • Many users see this entry
    • Outdated info affects most users
    • Update immediately!
  3. Low Times Served + Recent Update (⚠️ Monitor)

    • Give it time to match users
    • Check again in 2-4 weeks
    • Add variations if still low
  4. Low Times Served + Old Update (❌ Deactivate/Improve)

    • LOWEST PRIORITY
    • Not matching users
    • Hasn't been updated
    • Consider deactivating

Setting Performance Goals​

Short-Term Goals (1-3 Months)​

Example Goals:

  • βœ… Increase Cache Hit Rate from 60% to 70%
  • βœ… Reduce entries with "Times Served = 0" by 50%
  • βœ… Add 3-5 variations to all active entries
  • βœ… Update all entries with "Last Updated > 60 days"

How to achieve:

  1. Generate variations for underperforming entries
  2. Review and update stale entries
  3. Create new entries for popular uncached topics

Long-Term Goals (6-12 Months)​

Example Goals:

  • 🎯 Maintain Cache Hit Rate > 75%
  • 🎯 Success Rate > 85%
  • 🎯 Average 8+ variations per entry
  • 🎯 Update all entries at least quarterly

How to achieve:

  1. Establish regular maintenance schedule
  2. Monitor metrics consistently
  3. Respond quickly to negative feedback
  4. Expand cache based on usage patterns

Exporting Metrics (If Available)​

Some admin panels may allow exporting metrics:

Useful exports:

  • πŸ“Š Cache performance over time (CSV)
  • πŸ“ˆ Individual entry statistics (Excel)
  • πŸ“‰ Hit rate trends (graphs)

Uses:

  • Track progress over time
  • Share with stakeholders
  • Identify long-term trends
  • Justify resource allocation

How to export (if available):

  • Look for "Export" or "Download" buttons
  • Choose format (CSV, Excel, PDF)
  • Save and analyze in spreadsheet software

Common Metric Misinterpretations​

Mistake 1: "Success Rate is 0%, cache isn't working!"​

Reality: If you just created entries, success rate will be 0% initially. Give it 1-2 weeks for users to ask matching questions.

Fix: Wait patiently and generate variations.


Mistake 2: "100% Hit Rate is best!"​

Reality: Extremely high hit rates (95%+) may indicate:

  • Cache is too aggressive (matching unrelated questions)
  • Not enough unique questions being asked
  • Need to allow more RAG searches for nuanced questions

Fix: Monitor user feedback for false matches.


Mistake 3: "Times Served = 0 means entry is bad"​

Reality: New entries need time. Also, niche topics naturally have low "Times Served."

Fix: Wait 2-4 weeks before judging new entries. Niche entries are okay if accurate.


Mistake 4: "I should cache everything to get 100% hit rate"​

Reality: Over-caching can:

  • Make maintenance overwhelming
  • Cause false matches
  • Reduce answer flexibility

Fix: Cache strategicallyβ€”focus on frequently asked questions with consistent answers.


Next Steps​

Now that you understand performance metrics:

  1. Follow best practices for effective cache management
  2. Troubleshoot issues when metrics don't improve
  3. Review metrics weekly to track progress

Remember: Metrics are tools, not goals. Use them to improve user experience, not to chase perfect numbers!