SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
Updated Mar 4, 2026 · Python
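The semantic caching that SmarterRouter advertises can be sketched in a few lines: instead of looking up a prompt by exact string match, the cache embeds each prompt and returns a stored response when a new prompt is similar enough. The sketch below is illustrative only and is not taken from the SmarterRouter codebase; it uses a toy bag-of-words embedding (a real gateway would use a sentence-embedding model) and a hypothetical `SemanticCache` class with an assumed similarity threshold of 0.8.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a real router would call an embedding model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse token-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    """Return a cached response when a new prompt is semantically close enough."""

    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold
        self.entries: list[tuple[Counter, str]] = []

    def get(self, prompt: str):
        vec = embed(prompt)
        best = max(self.entries, key=lambda e: cosine(vec, e[0]), default=None)
        if best is not None and cosine(vec, best[0]) >= self.threshold:
            return best[1]
        return None  # cache miss: the router would forward to a backend model

    def put(self, prompt: str, response: str):
        self.entries.append((embed(prompt), response))
```

A miss falls through to whichever backend (Ollama, llama.cpp, OpenAI) the router selects, and the response is then stored with `put` so near-duplicate prompts are served from cache.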
Dropbox and pastebin for AI agents. Open and free. Useful for caching contexts and bridging multi-agent workflows.