self-hosted-ai

Here are 123 public repositories matching this topic...

Mintplex-Labs / anything-llm

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

no-code ai-agents multimodal rag vector-database llm localai local-ai agentic-ai self-hosted-ai agent-harness open-claw hermes-agent

Updated Jun 17, 2026
JavaScript

ItzCrazyKns / Vane

Star

Vane is an AI-powered answering engine.

search-engine machine-learning artificial-intelligence ai-agents rag vane answering-engine searxng llm ai-search-engine open-source-ai-search-engine perplexica searxng-copilot self-hosted-ai

Updated Apr 11, 2026
TypeScript

SamurAIGPT / Vibe-Workflow

Star

Free, open-source alternative to Weavy AI, Krea Nodes, Freepik Spaces & FloraFauna AI — node-based AI workflow builder for generative image & video pipelines

Updated Jun 12, 2026
JavaScript

izwi-ai / izwi

Star

Voice AI runtime. Local first transcription, speaker diarization, TTS, and voice cloning with an OpenAI compatible API.

text-to-speech tts speech-to-text asr speaker-diarization voice-cloning local-first openai-compatible-api self-hosted-ai audio-inference

Updated Jun 17, 2026
Rust

thushan / olla

Sponsor

Star

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

Updated Jun 17, 2026
Go

peva3 / SmarterRouter

Star

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

docker self-hosted model-serving gpu-monitoring fastapi llm openai-proxy semantic-cache local-llm ollama llm-proxy ollama-api ai-gateway llm-router self-hosted-ai ai-cache

Updated May 10, 2026
Python

AVADSA25 / codec

Star

Open-Source Intelligent Command Layer

open-source self-hosted mac-os mlx voice-assistant python-automation opensource-projects voice-assistant-ai llm-agent local-ai qwen voice-assistant-free self-hosted-ai local-ai-development llm-agent-framework local-ai-agents local-ai-llm

Updated Jun 15, 2026
Python

tanaos / artifex

Star

Small Language Model Inference, Fine-Tuning and Observability.

sentiment-analysis text-classification named-entity-recognition emotion-detection intent-classification reranker text-anonymization pre-trained-models ai-observability llm-inference local-ai llm-finetuning task-specific-model small-language-models self-hosted-ai guardrail-models

Updated Apr 11, 2026
Python

Recallium is a local, self-hosted universal AI memory system providing a persistent knowledge layer for developer tools (Copilot, Cursor, Claude Desktop). It eliminates "AI amnesia" by automatically capturing, clustering, and surfacing decisions and patterns across all projects. It uses the MCP for universal compatibility and ensures privacy

Updated May 15, 2026
Batchfile

sanath-kumar-s / CleverWick

Star

Electron + Next.js desktop AI assistant that runs GGUF models locally using llama.cpp. Designed for offline use, portability, and zero-install deployment.

electron react nodejs desktop-app ai nextjs on-device-ai ai-assistant llm llama-cpp local-llm local-ai gguf offline-ai self-hosted-ai offline-ai-solutions

Updated Mar 30, 2026
JavaScript

alez007 / modelship

Star

Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT, embeddings, and image generation with an OpenAI-compatible API.

ai inference self-hosted embeddings tts openai image-generation ray stt ai-platform llm diffusers vllm self-hosted-ai

Updated Jun 17, 2026
Python

KaiFelixBennett / hermes-claude-code-local

Star

Run Hermes Agent + Claude Code locally on llama.cpp — zero API costs. A 4h / 7M-token session that would have cost $94 on Claude Opus 4.7

telegram-bot wsl2 autonomous-agent ai-agent llama-cpp local-llm llm-inference gguf speculative-decoding litellm private-ai self-hosted-ai claude-code agentic-coding multi-token-prediction hermes-agent qwen3-6-27b hip-rocm

Updated Jun 8, 2026
Shell

OlgaKalinina101 / victor_ai_backend

Star

emotional AI Companions for personal relationships

Updated Mar 13, 2026
Python

sprklai / zenii

Star

Your machine's AI brain. One 20MB binary gives every tool, script, and cron job shared AI memory + 136 API endpoints. Desktop app, CLI, Telegram — all connected. Rust-powered.

Updated Jun 16, 2026
Rust

youngharold / tightwad

Star

Mixed-vendor GPU inference cluster manager with speculative decoding

Updated Apr 28, 2026
Python

aisecuritygateway / aisecuritygateway

Star

Self-hosted AI security proxy. Redact PII, block prompt injection, route to any LLM provider. OpenAI-compatible.

Updated Jun 9, 2026
Python

ClawixAI / clawix

Star

Open-source, self-hosted multi-agent AI orchestration platform. Run agents in isolated Docker containers with swarm coordination, RBAC, token governance, and multi-channel support.

ai assistant multi-agent swarm ai-platform self-hosted-ai ai-swarm agent-orchestration docker-ai

Updated Jun 17, 2026
TypeScript

Jewelzufo / granitepi-4-nano

Star

Run IBM Granite 4.0 locally on Raspberry Pi 5 with Ollama.This is a privacy-first AI. Your data never leaves your device because it runs 100% locally. There are no cloud uploads and no third-party tracking.

linux open-source raspberry-pi ai ibm arm64 embedded-linux ai-project edge-ai huggingface on-device-ai llm local-ai ollama small-language-models offline-ai private-ai self-hosted-ai mamba-2

Updated Mar 27, 2026
Shell

FAI-Solutions / open-webui-extensions

Star

A growing collection of practical filter, pipeline & tool extensions for Open WebUI — solving real usability gaps in local & cloud LLM workflows.

pipelines usage-monitor llm ollama open-webui self-hosted-ai ai-extensions ollama-cloud self-hosted-ai-stack open-webui-extensions filter-extension open-webui-filters open-webui-marketplace ollama-usage-monitor ollama-cloud-usage-monitor

Updated May 30, 2026

geeks-accelerator / ollama-herd

Star

Local AI load balancer for Ollama fleets — auto-discovery, smart routing, OpenAI-compatible API, zero config. Perfect for Mac Minis & Studios.

python load-balancer embeddings zero-config fleet-management multimodal fastapi edge-ai apple-silicon llm local-llm ollama ai-gateway ai-infrastructure self-hosted-ai openai-compatible ai-router inference-router ollama-cluster

Updated Jun 10, 2026
Python

Improve this page

Add a description, image, and links to the self-hosted-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the self-hosted-ai topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

self-hosted-ai

Here are 123 public repositories matching this topic...

Mintplex-Labs / anything-llm

ItzCrazyKns / Vane

SamurAIGPT / Vibe-Workflow

izwi-ai / izwi

thushan / olla

peva3 / SmarterRouter

AVADSA25 / codec

tanaos / artifex

recallium-ai / recallium

sanath-kumar-s / CleverWick

alez007 / modelship

KaiFelixBennett / hermes-claude-code-local

OlgaKalinina101 / victor_ai_backend

sprklai / zenii

youngharold / tightwad

aisecuritygateway / aisecuritygateway

ClawixAI / clawix

Jewelzufo / granitepi-4-nano

FAI-Solutions / open-webui-extensions

geeks-accelerator / ollama-herd

Improve this page

Add this topic to your repo