CACHING_MIGRATION.md

Caching Migration: From In-Memory to Redis

Overview

This document describes the migration from problematic in-memory caching to a Redis-based distributed caching solution for the opensensor-api running in Kubernetes.

Problem Statement

The original implementation used simple in-memory caching with global dictionaries:

# Problematic in-memory cache
_cache = {}
_cache_timestamps = {}

Issues with In-Memory Caching in Kubernetes:

Pod Isolation: Each of the 4 replicas has its own memory space, so cached data isn't shared
Cache Inconsistency: Different pods may have different cached values for the same data
Memory Waste: Each pod duplicates the same cached data
Pod Restarts: Cache is lost when pods restart (common in K8s)
Scaling Issues: Adding more replicas multiplies memory usage and cache inconsistency

Solution: Redis-Based Distributed Caching

Architecture Changes

New Cache Module: opensensor/cache.py
- Redis connection management with connection pooling
- Graceful fallback when Redis is unavailable
- Comprehensive error handling and logging
Updated Collection APIs: opensensor/collection_apis.py
- Replaced simple_cache decorator with redis_cache
- Updated get_device_info_cached function to use Redis
Cache Management Endpoints: Added to opensensor/app.py
- /cache/stats - Get cache statistics
- /cache/clear - Clear all cache entries
- /cache/invalidate - Invalidate specific cache patterns

Key Features

Redis Cache Decorator

@redis_cache(ttl_seconds=300)
def get_device_info_cached(device_id: str):
    """Cached device information lookup using Redis"""
    api_keys, _ = get_api_keys_by_device_id(device_id)
    return reduce_api_keys_to_device_ids(api_keys, device_id)

Graceful Fallback

If Redis is unavailable, functions execute without caching
No service disruption when Redis is down
Automatic reconnection attempts

Connection Management

Uses existing REDIS_URL environment variable
Connection pooling for optimal performance
Health checks and timeout handling

Configuration

Environment Variables

REDIS_URL: Redis connection string (already available in deployment)

Dependencies

Added to Pipfile:

redis = "*"

Cache Management

Monitoring

# Get cache statistics
GET /cache/stats

Response includes:

Redis connection status
Number of opensensor cache keys
Redis version and memory usage
Cache hit/miss ratios

Maintenance

# Clear all cache
POST /cache/clear

# Invalidate specific patterns
POST /cache/invalidate
{
  "pattern": "get_device_info_cached:*"
}

Benefits

Shared Cache: All pods share the same cache, ensuring consistency
Persistence: Cache survives pod restarts
Scalability: Adding more API pods doesn't duplicate cache data
Performance: Redis is optimized for caching workloads
Monitoring: Built-in metrics and monitoring capabilities
Reliability: Graceful degradation when Redis is unavailable

Deployment Notes

Kubernetes Deployment

No changes required to existing deployment YAML
Uses existing REDIS_URL environment variable
Backward compatible - works with or without Redis

Rolling Update Strategy

Deploy new image with Redis caching
Old in-memory cache will be gradually replaced
No downtime or service interruption

Monitoring

Check /cache/stats endpoint for Redis connectivity
Monitor Redis metrics through existing infrastructure
Log analysis for cache hit/miss ratios

Testing

Local Development

# Install dependencies
pipenv install

# Set Redis URL (if testing locally)
export REDIS_URL="redis://localhost:6379"

# Run the application
uvicorn opensensor.app:app --reload

Cache Verification

# Check cache stats
curl -X GET "http://localhost:8000/cache/stats" \
  -H "Authorization: Bearer <token>"

# Test cache invalidation
curl -X POST "http://localhost:8000/cache/invalidate" \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{"pattern": "*"}'

Migration Checklist

Rollback Plan

If issues arise, the system gracefully falls back to no caching when Redis is unavailable. For complete rollback:

Revert to previous image version
In-memory caching will resume automatically
No data loss or service interruption

Performance Expectations

Cache Hit Ratio: Expected 80-90% for device info lookups
Response Time: 10-50ms improvement for cached requests
Memory Usage: Reduced per-pod memory usage
Consistency: 100% cache consistency across all pods

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Caching Migration: From In-Memory to Redis

Overview

Problem Statement

Issues with In-Memory Caching in Kubernetes:

Solution: Redis-Based Distributed Caching

Architecture Changes

Key Features

Redis Cache Decorator

Graceful Fallback

Connection Management

Configuration

Environment Variables

Dependencies

Cache Management

Monitoring

Maintenance

Benefits

Deployment Notes

Kubernetes Deployment

Rolling Update Strategy

Monitoring

Testing

Local Development

Cache Verification

Migration Checklist

Rollback Plan

Performance Expectations

FilesExpand file tree

CACHING_MIGRATION.md

Latest commit

History

CACHING_MIGRATION.md

File metadata and controls

Caching Migration: From In-Memory to Redis

Overview

Problem Statement

Issues with In-Memory Caching in Kubernetes:

Solution: Redis-Based Distributed Caching

Architecture Changes

Key Features

Redis Cache Decorator

Graceful Fallback

Connection Management

Configuration

Environment Variables

Dependencies

Cache Management

Monitoring

Maintenance

Benefits

Deployment Notes

Kubernetes Deployment

Rolling Update Strategy

Monitoring

Testing

Local Development

Cache Verification

Migration Checklist

Rollback Plan

Performance Expectations