Turn your PC into a private, autonomous AI lab, without melting your GPU.
IronSilo is a completely local, cross-platform (Windows, macOS, Linux) AI development sandbox. It packages a state-of-the-art coding assistant, a wiki RAG engine, an autonomous WebAssembly agent, and a context-compression proxy into a single, resource-capped environment.
It runs on low-to-mid spec machines by strictly limiting background RAM to ~4GB, dedicating 100% of your GPU to your actual AI model.
This workspace abandons brittle IDE extensions in favor of a Terminal-First, Dual-Agent Swarm. We split responsibilities between two specialized engines to maximize token efficiency and system security:
The Action Layer (Runs Locally for File/System Access):
- The Hands (Aider CLI): Your specialized coding engine. Aider maps your project's Abstract Syntax Tree (AST) to use 4x fewer tokens than standard agents. It runs natively in your terminal to safely execute bash commands, read linter errors, and apply complex line-by-line file diffs.
- The Brain (IronClaw PAI): Your Personal AI and orchestrator. Running natively, it executes web research, schedule management, and background cron jobs strictly inside a zero-trust WebAssembly (WASM) sandbox, ensuring your API keys and host OS are never exposed to malicious LLM outputs.
The Intelligence Layer (Locked in 4GB Docker Container):
- Khoj: Your private Wiki RAG engine. Drop in PDFs, markdown files, and notes, and ask your AI questions about them via its native Web UI.
- Genesys & pgvector: The Long-Term Memory (LTM) database. This utilizes an active causal graph, allowing autonomous agents to remember your preferences and causal reasoning across sessions.
- LLMLingua Proxy: The central hub. It intercepts massive prompts and uses a tiny CPU model to compress the text by up to 40% before sending it to your GPU, saving your VRAM from crashing. It also has the added benefit of Token Optimization.
If you are starting from a fresh computer, you must install these core tools first:
- Git: Aider requires Git to track code changes. Download at git-scm.com (Linux:
sudo apt install gitorsudo pacman -S git). - Python / pip: Required to install Aider natively (
pip install aider-chat).
You need Docker to run the background databases and proxies safely.
- Windows & macOS: Download and install Docker Desktop. Windows users: Ensure WSL2 is enabled during installation. Open the app and make sure it is running in your system tray.
- Linux (Ubuntu/Debian): Run
sudo apt install docker.io docker-compose-v2and start the daemon withsudo systemctl enable --now docker. - Linux (Arch/CachyOS): Run
sudo pacman -S docker docker-composeand start the daemon withsudo systemctl enable --now docker.
You need a program running on your computer to host your AI model (we highly recommend downloading the Qwen 2.5 Coder 7B model). Install one of the following:
- LM Studio: Best for Windows/Mac beginners. Features a great UI.
- Ollama: Best for command-line users. (Run
ollama run qwen2.5-coder). - Lemonade: Best for Arch Linux/AMD GPU users seeking maximum ROCm performance. (Arch users:
yay -S lemonade-bin).
Once your prerequisites are installed, you are ready to go.
Step 1: Start your AI Model
Open your AI Host and start a local server. (By default, our proxy looks for an AI running on port 8000. See the 'Documentation' section below if using Ollama, which uses port 11434).
Step 2: Boot the Workspace
- Windows: Double-click
Start_Workspace.bat - Mac/Linux: Open a terminal in this folder and run
./Start_Workspace.sh(Note: The very first time you do this, Docker will download the required tools. It will be instant next time).
Step 3: Code! Your tools are securely routed and ready to use natively.
- To Code (Aider): Open your terminal and start Aider by pointing it to your local proxy:
export OPENAI_API_BASE="[http://127.0.0.1:8001/api/v1](http://127.0.0.1:8001/api/v1)" export OPENAI_API_KEY="local-sandbox" aider
- To Research (Khoj): Open your web browser and navigate to
http://127.0.0.1:42110to access your private Wiki UI. - To Automate (IronClaw): Navigate to
http://127.0.0.1:8080in your browser to chat with your WASM agent.
When you are done working, get your computer's RAM back:
- Windows: Double-click
Stop_Workspace.bat - Mac/Linux: Run
./Stop_Workspace.sh
IronSilo includes comprehensive unit and integration tests to ensure reliability.
# Install test dependencies
pip install -e ".[dev]"
# Run all tests
pytest tests/
# Run unit tests only
pytest tests/unit/
# Run integration tests only
pytest tests/integration/
# Run with coverage report
pytest --cov=. --cov-report=html
# Run specific test file
pytest tests/unit/test_proxy_proxy.py -v- Total Tests: 664 tests (all passing)
- Code Coverage: 81.6%
- Test Types:
- Unit tests for all core modules
- Integration tests for proxy and security
- Mock-based testing for external dependencies
- TUI Pilot tests for headless interface testing
- MCP server tests with mocked HTTP clients
IronSilo includes a comprehensive security framework:
- AES-256-GCM encryption for data at rest
- PBKDF2 key derivation with 100,000 iterations
- Secure key management with rotation support
- Input validation via Pydantic models
- See SECURITY_AUDIT_REPORT.md for full security assessment
When adding new features:
- Write tests first (TDD approach)
- Ensure tests pass:
pytest tests/ - Check coverage:
pytest --cov=.
- Simple Manual - Getting started guide
- Advanced Architecture - Technical deep dive
- Roadmap - Future improvements
- Architecture - System design overview
- Security Audit Report - Enterprise security assessment
Contributions are welcome! Please read CONTRIBUTING.md for guidelines.
# Clone the repository
git clone https://github.com/iknowkungfubar/IronSilo.git
cd IronSilo
# Create virtual environment
python -m venv .venv
source .venv/bin/activate # or .venv\Scripts\activate on Windows
# Install dependencies
pip install -e ".[dev]"
# Run pre-commit hooks
pre-commit install
pre-commit run --all-files