Install Guide
Deploy NGARi on NVIDIA Jetson AGX Orin — bare metal to sovereign AI.
This guide walks through the full deployment of NGARi on an NVIDIA Jetson AGX Orin 64GB. These are the exact steps used to deploy the reference instance running at ngari.com. Total time: approximately 2 hours depending on download speeds.
1. Prerequisites
- NVIDIA Jetson AGX Orin Developer Kit (64GB recommended)
- NVMe SSD (512GB+) or high-endurance SD card (128GB+)
- USB-C power supply (supplied with the dev kit)
- Ethernet cable or WiFi (dual-band preferred)
- Another computer for flashing (Ubuntu or macOS)
- USB flash drive (16GB+) for flashing
2. Flash Ubuntu 24.04 LTS
NVIDIA provides JetPack and the L4T BSP through the SDK Manager tool. Flash the Jetson with Ubuntu 24.04 LTS (or 22.04 LTS if 24.04 is not yet available for your Jetson revision).
3. Install NVIDIA Drivers
Verify the NVIDIA drivers and CUDA are properly installed:
4. Install Ollama
NGARi uses Ollama for local model inference. Install the official build for Linux (aarch64):
~/.ollama/models. If your home partition is small, symlink it to a larger drive: mkdir -p /mnt/ssd/ollama-models && \ ln -s /mnt/ssd/ollama-models ~/.ollama/models5. Pull Models
Pull the models used by NGARi. These are the models deployed on the reference Orin at ngari.com:
6. Install Docker
NGARi runs as a Docker container for easy deployment and updates.
7. Clone and Build NGARi
docker-compose.yml in the repository.8. Start Agent Services
NGARi includes 6 autonomous agents and a Control Tower API. Start them using the deployment scripts in the repository:
9. Access the System
Once everything is running, access NGARi at these endpoints:
- Chat UI: http://localhost:3001— private AI chat with all models
- Control Tower: http://localhost:8888— agent monitoring, revenue, system health (JWT auth required)
- Sandbox: http://localhost:3001/sandbox— browser sessions and code execution
- API: http://localhost:3001/api/v1/chat/completions— OpenAI-compatible API with Bearer auth
- BTCPay Server: http://localhost:3003— sovereign payment processing
10. Optional: Remote Access
To make your NGARi instance accessible from the internet (e.g., to share with your team), set up a Cloudflare tunnel:
Troubleshooting
Ollama not reachable. Ensure the service is running and the OLLAMA_HOST environment variable is set correctly in your Docker/NGARi configuration.
Disk space low. Ollama models and Docker images consume significant space. Consider moving Ollama models to a secondary drive or NVMe.
Web UI not loading. Check that port 3001 is not blocked by a firewall. Run ss -tlnp | grep 3001 to verify the process is listening.
Questions? [email protected] · [email protected]