Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.
A lightweight terminal chat interface for the llama.cpp server, written in C++, with many features and Windows/Linux support.
A DevOps-friendly local LLM proxy.
A robust, production-ready Python toolkit that automates synchronization between a directory of .gguf model files and a llama-swap config.yaml.
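As a rough illustration of the idea (not the toolkit's actual code), the sketch below scans a folder of .gguf files and writes a llama-swap-style config.yaml. The /models path, the models:/cmd schema, and the ${PORT} placeholder are assumptions and may need adjusting for your llama-swap version.

```python
"""Sketch: regenerate a llama-swap-style config.yaml from a folder of .gguf files.
The exact llama-swap schema is assumed here; adjust keys to match your version."""
from pathlib import Path
import yaml  # pip install pyyaml

MODELS_DIR = Path("/models")       # assumed location of the .gguf files
CONFIG_PATH = Path("config.yaml")  # assumed llama-swap config location

def build_config(models_dir: Path) -> dict:
    models = {}
    for gguf in sorted(models_dir.glob("*.gguf")):
        # One llama-server command per model; llama-swap fills in ${PORT} at launch time.
        models[gguf.stem] = {"cmd": f"llama-server --port ${{PORT}} -m {gguf}"}
    return {"models": models}

if __name__ == "__main__":
    config = build_config(MODELS_DIR)
    CONFIG_PATH.write_text(yaml.safe_dump(config, sort_keys=True))
    print(f"Wrote {CONFIG_PATH} with {len(config['models'])} model entries")
```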
A production-grade Python SDK for llama-server that streamlines authentication, token rotation, observability, and PII masking—helping AI architects ship secure, traceable LLM systems with enterprise-ready guardrails.
A simple web application for real-time AI vision analysis using SmolVLM-500M-Instruct with live camera feed processing and text-to-speech.
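A minimal sketch of the kind of request such an app makes, assuming a vision-capable model (e.g. SmolVLM with its projector) is already loaded in llama-server on localhost:8080 and that the server accepts OpenAI-style image_url content parts. The endpoint, port, and prompt are assumptions; this is not the app's actual code.

```python
"""Sketch: send one camera frame to a local llama-server for description.
Assumes a vision-capable model is loaded and the server runs on localhost:8080."""
import base64
import requests

def describe_frame(jpeg_bytes: bytes) -> str:
    data_uri = "data:image/jpeg;base64," + base64.b64encode(jpeg_bytes).decode()
    payload = {
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what you see in one sentence."},
                {"type": "image_url", "image_url": {"url": data_uri}},
            ],
        }],
        "max_tokens": 128,
    }
    resp = requests.post("http://localhost:8080/v1/chat/completions", json=payload, timeout=60)
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    with open("frame.jpg", "rb") as f:  # stand-in for a live camera capture
        print(describe_frame(f.read()))
```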
A Bash script that automatically launches llama-server, detects available .gguf models, and selects GPU layers based on your free VRAM.
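The script itself is Bash; as a hedged Python sketch of the same idea, the snippet below reads free VRAM from nvidia-smi and derives a --n-gpu-layers value before launching llama-server. The per-layer memory estimate, model path, and port are placeholder assumptions, and the NVIDIA-only query is an assumption as well.

```python
"""Sketch: pick --n-gpu-layers from free VRAM, then launch llama-server.
The MiB-per-layer figure is a rough guess and varies by model and quantization."""
import subprocess

def free_vram_mib() -> int:
    # Query the first GPU's free memory via nvidia-smi (NVIDIA-only assumption).
    out = subprocess.check_output(
        ["nvidia-smi", "--query-gpu=memory.free", "--format=csv,noheader,nounits"],
        text=True,
    )
    return int(out.splitlines()[0].strip())

def launch(model_path: str, mib_per_layer: int = 350, max_layers: int = 99) -> None:
    layers = min(max_layers, free_vram_mib() // mib_per_layer)
    subprocess.run([
        "llama-server", "-m", model_path,
        "--n-gpu-layers", str(layers),
        "--port", "8080",
    ], check=True)

if __name__ == "__main__":
    launch("/models/example.gguf")  # hypothetical model path
```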
Create a code completion model & tool for IDEs that can run locally on consumer hardware and rival the performance of commercial products like Cursor.
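One way such a tool can talk to a local backend is llama.cpp's llama-server, which exposes a fill-in-the-middle /infill endpoint when a FIM-capable model is loaded. The sketch below is an illustration under those assumptions (host, port, and parameters are placeholders), not the project's implementation.

```python
"""Sketch: fill-in-the-middle completion against llama-server's /infill endpoint.
Assumes a FIM-capable code model is loaded and the server runs on localhost:8080."""
import requests

def complete_middle(prefix: str, suffix: str) -> str:
    payload = {
        "input_prefix": prefix,
        "input_suffix": suffix,
        "n_predict": 64,
        "temperature": 0.2,
    }
    resp = requests.post("http://localhost:8080/infill", json=payload, timeout=30)
    resp.raise_for_status()
    return resp.json()["content"]

if __name__ == "__main__":
    before = "def add(a, b):\n    "
    after = "\n\nprint(add(2, 3))\n"
    print(before + complete_middle(before, after) + after)
```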
Hikma is a minimal GTK4 chat client in Vala for OpenAI‑compatible APIs. It renders messages as plain text, stores settings securely via libsecret, and builds with Meson/Ninja plus a simple Debian packaging flow.
FIMpad is a FIM-focused local LLM interface in the form of a tabbed GUI text editor.