Download Inferno

Get started with high-performance local AI inference

Recommended for Your System

💻

Choose Your Platform

Select a platform above to view installation options.

All Platforms

macOS

Homebrew (Recommended)

brew install inferno

Build from Source

Download →

Linux

Binary Download

Download →

Cargo Install

cargo install inferno

Build from Source

Download →

Windows

Binary Download

Download →

Cargo Install

cargo install inferno

Build from Source

Download →

Docker

Pull from GitHub Container Registry

docker pull ghcr.io/ringo380/inferno:latest

Run Container

docker run -p 8080:8080 ghcr.io/ringo380/inferno:latest

System Requirements

Minimum Requirements

  • CPU: x86_64 or ARM64 processor
  • RAM: 4GB (8GB recommended)
  • Storage: 500MB + model sizes
  • OS: macOS 11+, Linux (kernel 4.x+), Windows 10+

GPU Acceleration Support

  • Apple Silicon: Full Metal GPU supportProduction
    M1/M2/M3/M4 • 13x faster • Full GPU offloading
  • NVIDIA: CUDA 11.x+ accelerationSupported
    Requires CUDA drivers and compatible GPU
  • AMD: ROCm accelerationSupported
    Requires ROCm drivers and compatible GPU
  • Intel: OpenCL supportExperimental
    Arc series • Limited functionality

Note: GPU features may vary by distribution method. Build from source for full control.

Recommended Setup

  • CPU: Modern multi-core processor
  • RAM: 16GB+ for larger models
  • GPU: 8GB+ VRAM
  • Storage: SSD for model storage

Need Help?

Check our documentation for installation guides and troubleshooting.

View Documentation

Verify Your Download

All releases are signed and include checksums for verification. See our GitHub releases page for SHA256 checksums.

2025 © Inferno AI - Open Source AI Inference Server