Tag: caching

2 articles filed under this tag, newest first. If you are new here, start with the featured pick.

Featured

Cost Optimization in LLM Applications

Token budgeting, semantic and exact caching, model routing tiers, and fallback strategies to control spend without turning the product into a smaller model glued to a spreadsheet of hacks.

Mar 3, 2026 · 6 min read

Caching Strategies for Low-Latency APIs (Redis + In-Memory)

How layered caching reduces database load by serving hot data from memory before hitting persistent storage, and how to keep those layers correct, consistent, and stampede-proof.

Jul 10, 2025 · 9 min read