self-hosted-ai

Here are 54 public repositories matching this topic...

ItzCrazyKns / Vane

Vane is an AI-powered answering engine.

search-engine machine-learning artificial-intelligence ai-agents rag vane answering-engine searxng llm ai-search-engine open-source-ai-search-engine perplexica searxng-copilot self-hosted-ai

Updated Apr 9, 2026
TypeScript

izwi-ai / izwi

Star

On-device audio AI runtime. Local first transcription, speaker diarization, TTS, and voice cloning with an OpenAI compatible API.

text-to-speech tts speech-to-text asr speaker-diarization voice-cloning local-first openai-compatible-api self-hosted-ai audio-inference

Updated Apr 9, 2026
Rust

thushan / olla

Sponsor

Star

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

Updated Apr 1, 2026
Go

SamurAIGPT / Vibe-Workflow

Star

Free, open-source alternative to Weavy AI, Krea Nodes, Freepik Spaces & FloraFauna AI — node-based AI workflow builder for generative image & video pipelines

Updated Apr 6, 2026
JavaScript

peva3 / SmarterRouter

Star

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

docker self-hosted model-serving gpu-monitoring fastapi llm openai-proxy semantic-cache local-llm ollama llm-proxy ollama-api ai-gateway llm-router self-hosted-ai ai-cache

Updated Apr 6, 2026
Python

tanaos / artifex

Star

Small Language Model Inference, Fine-Tuning and Observability.

sentiment-analysis text-classification named-entity-recognition emotion-detection intent-classification reranker text-anonymization pre-trained-models ai-observability llm-inference local-ai llm-finetuning task-specific-model small-language-models self-hosted-ai guardrail-models

Updated Apr 9, 2026
Python

AVADSA25 / codec

Star

Open-Source Intelligent Command Layer

open-source self-hosted mac-os mlx voice-assistant python-automation opensource-projects voice-assistant-ai llm-agent local-ai qwen voice-assistant-free self-hosted-ai local-ai-development llm-agent-framework local-ai-agents local-ai-llm

Updated Apr 10, 2026
Python

augmentedmike / miniclaw-os

Star

We gave AI agents a brain. Memory, planning, continuity, and self-repair — the missing cognitive architecture layer. Runs on your Mac.

Updated Mar 24, 2026
TypeScript

Recallium is a local, self-hosted universal AI memory system providing a persistent knowledge layer for developer tools (Copilot, Cursor, Claude Desktop). It eliminates "AI amnesia" by automatically capturing, clustering, and surfacing decisions and patterns across all projects. It uses the MCP for universal compatibility and ensures privacy

Updated Apr 9, 2026
Batchfile

OlgaKalinina101 / victor_ai_backend

Star

emotional AI Companions for personal relationships

Updated Mar 13, 2026
Python

sprklai / zenii

Star

Your machine's AI brain. One 20MB binary gives every tool, script, and cron job shared AI memory + 114 API routes. Desktop app, CLI, Telegram — all connected. Rust-powered.

Updated Apr 9, 2026
Rust

Jewelzufo / granitepi-4-nano

Star

Run IBM Granite 4.0 locally on Raspberry Pi 5 with Ollama.This is a privacy-first AI. Your data never leaves your device because it runs 100% locally. There are no cloud uploads and no third-party tracking.

linux open-source raspberry-pi ai ibm arm64 embedded-linux ai-project edge-ai huggingface on-device-ai llm local-ai ollama small-language-models offline-ai private-ai self-hosted-ai mamba-2

Updated Mar 27, 2026
Shell

dwain-barnes / flowise-private-doc-chat-rag-blog

Star

A private, local RAG (Retrieval-Augmented Generation) system using Flowise, Ollama, and open-source LLMs to chat with your documents securely and offline.

open-source data-privacy rag local-llm retrieval-augmented-generation flowise ollama pdf-chatbot flowise-ai offline-ai private-doc-chat self-hosted-ai

Updated Nov 21, 2024

Mr-Dark-debug / sms-ai-agent

Star

AI SMS Auto-Responder for Android. Turn your Android device into an autonomous AI communication hub. A Python-based SMS auto-responder running natively on Termux, powered by LLMs (OpenRouter/Ollama) with a sleek Web & Terminal UI.

python automation chatbot python3 termux ai-agents android-automation oep sms-bot termux-tools fastapi mobile-ai llm generative-ai openrouter groq-ai self-hosted-ai textual-ui

Updated Feb 15, 2026
Python

sanath-kumar-s / CleverWick

Star

Electron + Next.js desktop AI assistant that runs GGUF models locally using llama.cpp. Designed for offline use, portability, and zero-install deployment.

electron react nodejs desktop-app ai nextjs on-device-ai ai-assistant llm llama-cpp local-llm local-ai gguf offline-ai self-hosted-ai offline-ai-solutions

Updated Mar 30, 2026
JavaScript

alez007 / yasha

Star

Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT, embeddings, and image generation with an OpenAI-compatible API.

ai inference self-hosted embeddings tts openai image-generation ray stt ai-platform llm diffusers vllm self-hosted-ai

Updated Apr 9, 2026
Python

smansf / juso

Star

Defense-in-depth platform for running OpenClaw agents on personal hardware

security ubuntu ubuntu-server utm security-tools ai-agents defense-in-depth self-hosted-agent ollama self-hosted-ai openclaw openclaw-security

Updated Apr 2, 2026
Shell

westailabs / nebulus-gantry

Star

Self-hosted AI chat interface with RAG, long-term memory, and admin controls. Works with TabbyAPI, Ollama, vLLM, and any OpenAI-compatible API.

Updated Apr 3, 2026
Python

FAI-Solutions / open-webui-extensions

Star

A growing collection of practical filter, pipeline & tool extensions for Open WebUI — solving real usability gaps in local & cloud LLM workflows.

pipelines usage-monitor llm ollama open-webui self-hosted-ai ai-extensions ollama-cloud self-hosted-ai-stack open-webui-extensions filter-extension open-webui-filters open-webui-marketplace ollama-usage-monitor ollama-cloud-usage-monitor

Updated Apr 5, 2026

fiv3fingers / openclaw-telegram-ai-agent

Star

Production-ready guide for connecting OpenClaw to a Telegram Bot. Build a self-hosted Telegram AI Agent using OpenClaw Gateway, pairing, and streaming responses.

telegram-bot ai-agent ai-automation llm ai-gateway telegram-integration telegram-ai-bot self-hosted-ai openclaw openclaw-telegram

Updated Feb 27, 2026

Improve this page

Add a description, image, and links to the self-hosted-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the self-hosted-ai topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

self-hosted-ai

Here are 54 public repositories matching this topic...

ItzCrazyKns / Vane

izwi-ai / izwi

thushan / olla

SamurAIGPT / Vibe-Workflow

peva3 / SmarterRouter

tanaos / artifex

AVADSA25 / codec

augmentedmike / miniclaw-os

recallium-ai / recallium

OlgaKalinina101 / victor_ai_backend

sprklai / zenii

Jewelzufo / granitepi-4-nano

dwain-barnes / flowise-private-doc-chat-rag-blog

Mr-Dark-debug / sms-ai-agent

sanath-kumar-s / CleverWick

alez007 / yasha

smansf / juso

westailabs / nebulus-gantry

FAI-Solutions / open-webui-extensions

fiv3fingers / openclaw-telegram-ai-agent

Improve this page

Add this topic to your repo