Download Inferno
Get started with high-performance local AI inference
Recommended for Your System
💻
Choose Your Platform
Select a platform above to view installation options.
All Platforms
macOS
Linux
Windows
Docker
Pull from GitHub Container Registry
docker pull ghcr.io/ringo380/inferno:latest
Run Container
docker run -p 8080:8080 ghcr.io/ringo380/inferno:latest
System Requirements
Minimum Requirements
- CPU: x86_64 or ARM64 processor
- RAM: 4GB (8GB recommended)
- Storage: 500MB + model sizes
- OS: macOS 11+, Linux (kernel 4.x+), Windows 10+
GPU Acceleration Support
- Apple Silicon: Full Metal GPU supportProductionM1/M2/M3/M4 • 13x faster • Full GPU offloading
- NVIDIA: CUDA 11.x+ accelerationSupportedRequires CUDA drivers and compatible GPU
- AMD: ROCm accelerationSupportedRequires ROCm drivers and compatible GPU
- Intel: OpenCL supportExperimentalArc series • Limited functionality
Note: GPU features may vary by distribution method. Build from source for full control.
Recommended Setup
- CPU: Modern multi-core processor
- RAM: 16GB+ for larger models
- GPU: 8GB+ VRAM
- Storage: SSD for model storage
Verify Your Download
All releases are signed and include checksums for verification. See our GitHub releases page for SHA256 checksums.