Production ready toolkit to run AI locally
-
Updated
Apr 23, 2026 - C++
Production ready toolkit to run AI locally
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.
Low-latency AI engine for mobile devices & wearables
AI You Control: Choose your models. Own your data. Eliminate vendor lock-in.
An AI-powered file management tool that ensures privacy by organizing local texts, images. Using Llama3.2 3B and Llava v1.6 models with the Nexa SDK, it intuitively scans, restructures, and organizes files for quick, seamless access and easy retrieval.
Run Claude Code 100% on-device with local AI on Apple Silicon. MLX-native Anthropic-API server, 65 tok/s Qwen 3.5 122B, Llama 3.3 70B, Gemma 4 31B. Private, offline, airgap-ready. Built for NDA / legal / healthcare workflows.
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Declarative way to run AI models in React Native on device, powered by ExecuTorch.
On-device LLM execution in React Native with Vercel AI SDK compatibility
Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.
Android on_device Face Recognition 、 Liveness detection and 1:N & M:N Face Search SDK 离线版设备端人脸识别 活体检测 以及1:N M:N 人脸搜索SDK
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
NativeMind: Your fully private, open-source, on-device AI assistant
Mano-P: Open-source GUI-VLA agent for edge devices. #1 on OSWorld (specialized, 58.2%). Runs locally on Apple M4 Mac mini/MacBook — no data leaves your device.Mano-P 是一个开源 GUI-VLA 项目,支持在 Mac mini/MacBook 上或通过算力棒本地运行推理,实现纯视觉驱动的跨平台 GUI 自动化操作。数据完全本地处理,支持复杂多步骤任务规划与执行。
Example apps for Foundation Models Framework in iOS 26 and macOS 26
TinyChatEngine: On-Device LLM Inference Library
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
Local-first, open-source AI assistant for your data. Unify tasks, notes, docs, photos, and bookmarks. Private, self-hosted, and extensible via APIs.
Add a description, image, and links to the on-device-ai topic page so that developers can more easily learn about it.
To associate your repository with the on-device-ai topic, visit your repo's landing page and select "manage topics."