System Requirements
OpenRAG requires significant resources to run a local LLM (llama3.1:8b) and the complete infrastructure.
Hardware Configuration
MINIMUM Configuration (CPU-only Mode)
CPU
Minimum: 8 cores (x86_64)
The LLM uses 80-100% of all cores during generation.
RAM
Minimum: 16 GB
- LLM (llama3.1:8b): ~5.5 GB
- Services (PostgreSQL, Redis, Qdrant, MinIO): ~2 GB
- Streamlit frontends: ~500 MB
- OS + buffers: ~8 GB
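The four estimates above add up to the 16 GB minimum. A minimal sketch of that arithmetic, which also reads the machine's physical memory for comparison. The dictionary keys and function names are illustrative, and the `os.sysconf` lookup is an assumption that holds on Linux and macOS but not on native Windows.

```python
import os

# Approximate RAM budget in GB, copied from the breakdown above.
RAM_BUDGET_GB = {
    "llm_llama3.1_8b": 5.5,
    "services": 2.0,             # PostgreSQL, Redis, Qdrant, MinIO
    "streamlit_frontends": 0.5,
    "os_and_buffers": 8.0,
}


def required_ram_gb(budget=RAM_BUDGET_GB):
    """Sum the per-component estimates."""
    return sum(budget.values())


def physical_ram_gb():
    """Return installed RAM in GiB, or None where sysconf is unavailable."""
    try:
        page_size = os.sysconf("SC_PAGE_SIZE")
        page_count = os.sysconf("SC_PHYS_PAGES")
    except (AttributeError, ValueError, OSError):
        return None
    return page_size * page_count / 1024**3


if __name__ == "__main__":
    print(f"Budgeted: {required_ram_gb():.1f} GB")
    installed = physical_ram_gb()
    if installed is None:
        print("Installed: unknown on this platform")
    else:
        print(f"Installed: {installed:.1f} GiB")
```

The operating system typically reports slightly less than the nominal installed capacity, so the script prints both figures instead of returning a strict pass or fail.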
Storage
Minimum: 50 GB SSD
- Docker images: ~8 GB
- Ollama model (llama3.1:8b): 4.9 GB
- Embeddings: ~400 MB
- Data + documents: 10+ GB
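The itemized storage figures sum to roughly 23.3 GB, which leaves about half of the 50 GB minimum as headroom for growing data. A minimal sketch of that calculation, together with a free-space lookup. The names are illustrative, and the path passed to `free_disk_gb` is an assumption: it should point at the filesystem that actually holds Docker's data and the Ollama model.

```python
import shutil

# Approximate storage budget in GB, copied from the breakdown above.
STORAGE_BUDGET_GB = {
    "docker_images": 8.0,
    "ollama_model": 4.9,
    "embeddings": 0.4,
    "data_and_documents": 10.0,  # listed as "10+ GB", so a lower bound
}
MINIMUM_DISK_GB = 50


def baseline_storage_gb(budget=STORAGE_BUDGET_GB):
    """Sum the itemized storage estimates."""
    return sum(budget.values())


def free_disk_gb(path="."):
    """Free space on the filesystem containing `path`, in GB (10^9 bytes)."""
    return shutil.disk_usage(path).free / 1e9


if __name__ == "__main__":
    baseline = baseline_storage_gb()
    print(f"Itemized baseline: {baseline:.1f} GB")
    print(f"Headroom within the minimum: {MINIMUM_DISK_GB - baseline:.1f} GB")
    print(f"Free on this filesystem: {free_disk_gb():.1f} GB")
```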
Network
Required: Stable internet connection
Needed to download the LLM model (4.9 GB) and the Docker images.
RECOMMENDED Configuration (With GPU)
CPU
Recommended: 12+ cores
RAM
Recommended: 32 GB
More RAM allows loading larger models and handling more simultaneous users.
GPU
Recommended: NVIDIA GPU with 12+ GB VRAM
- RTX 3060 (12GB): Good for llama3.1:8b
- RTX 4090 (24GB): Excellent for larger models
- A100 (40/80GB): Production
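To see how an installed card compares with the 12 GB recommendation, the VRAM can be read from `nvidia-smi`. A minimal sketch, assuming the NVIDIA driver is installed and `nvidia-smi` is on the PATH. The function names are illustrative and not part of OpenRAG.

```python
import shutil
import subprocess

RECOMMENDED_VRAM_GB = 12


def parse_gpu_list(csv_text):
    """Parse 'name, memory_in_MiB' lines into (name, vram_gib) tuples."""
    gpus = []
    for line in csv_text.splitlines():
        if not line.strip():
            continue
        # Split on the last comma so commas in the GPU name are preserved.
        name, _, memory_mib = line.rpartition(",")
        gpus.append((name.strip(), int(memory_mib.strip()) / 1024))
    return gpus


def detect_gpus():
    """Return the detected NVIDIA GPUs, or an empty list if none are visible."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,memory.total",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=False,
    )
    return parse_gpu_list(result.stdout) if result.returncode == 0 else []


if __name__ == "__main__":
    gpus = detect_gpus()
    if not gpus:
        print("No NVIDIA GPU detected: CPU-only mode")
    for name, vram in gpus:
        verdict = "meets" if vram >= RECOMMENDED_VRAM_GB else "is below"
        print(f"{name}: {vram:.1f} GiB, {verdict} the 12 GB recommendation")
```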
Storage
Recommended: 100+ GB NVMe SSD
For better I/O performance on PostgreSQL and Qdrant.
RAM Usage Breakdown (Production System)
Required Software
Docker & Docker Compose (REQUIRED)
- Linux (Debian/Ubuntu)
- macOS
- Windows + WSL2
Git (REQUIRED)
Recommended Utilities
These tools make testing and debugging easier but are not required:
- curl: Test REST API (HTTP requests)
- jq: Parse and format JSON responses
Network Ports Used
OpenRAG uses 10 services with the following ports:
Public ports (accessible from browser)
| Service | Port | URL | Description |
|---|---|---|---|
| Chat Interface | 8501 | http://localhost:8501 | Streamlit user interface |
| Admin Panel | 8502 | http://localhost:8502 | Administration dashboard |
| REST API | 8000 | http://localhost:8000 | API entry point |
| MinIO Console | 9001 | http://localhost:9001 | Storage management (admin/admin123456) |
| Qdrant Dashboard | 6333 | http://localhost:6333/dashboard | Vector DB |
Internal ports (between Docker containers)
| Service | Port | Usage |
|---|---|---|
| PostgreSQL | 5432 | Database |
| Redis | 6379 | Cache and queues |
| MinIO API | 9000 | S3 storage |
| Qdrant gRPC | 6334 | Vector DB gRPC |
| Ollama | 11434 | LLM Server |
| Orchestrator | 8001 | Orchestration service |
| Embedding | 8002 | Embeddings service |
Check if a Port is Available
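One way to check availability is to try binding to the port: if the bind fails, another process already holds it. A minimal sketch under that assumption. It covers only the public ports from the table above, since the internal ports are used between containers and may not be published on the host. The function name is illustrative.

```python
import socket

# Public ports from the table above.
PUBLIC_PORTS = {
    8501: "Chat Interface",
    8502: "Admin Panel",
    8000: "REST API",
    9001: "MinIO Console",
    6333: "Qdrant Dashboard",
}


def port_is_free(port, host="0.0.0.0"):
    """Return True if a TCP socket can bind to host:port right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            return False
        return True


if __name__ == "__main__":
    for port, service in PUBLIC_PORTS.items():
        status = "available" if port_is_free(port) else "IN USE"
        print(f"{port:>5}  {service:<18} {status}")
```

Run it before starting the stack. Once OpenRAG is up, its own containers hold these ports and every line will report them as in use.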
GPU Support (Optional - 10-50x Performance)
NVIDIA GPU on Linux (Recommended for Production)
Apple Silicon (M1/M2/M3)
Ollama supports Metal acceleration on Apple Silicon. Performance is better than CPU-only but typically not as fast as NVIDIA GPUs.
Configuration:
- No special setup required
- Docker Desktop for Mac handles acceleration
- Performance: ~2-5 seconds per query
Quick Requirements Verification
Before installing OpenRAG, verify your system against the checklist below.
Pre-installation Checklist
Server with 16 GB+ RAM
50 GB+ SSD disk space
Docker 26.0+ installed
Docker Compose 2.26+ installed
User in docker group
Ports 8000, 8501, 8502 available
Stable internet connection (5 GB model download)
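The software items in this checklist can be verified automatically. A minimal sketch, assuming `docker` and `git` are on the PATH when installed. The function names are illustrative, the docker group test applies to Linux only, and `os.cpu_count()` reports logical CPUs, which may exceed the number of physical cores.

```python
import os
import re
import shutil
import subprocess

MIN_CORES = 8
MIN_DOCKER = (26, 0)
MIN_COMPOSE = (2, 26)


def parse_version(text):
    """Extract (major, minor) from text such as 'Docker version 26.1.3' or 'v2.26.1'."""
    match = re.search(r"(\d+)\.(\d+)", text)
    return (int(match.group(1)), int(match.group(2))) if match else None


def tool_version(command):
    """Run a version command and parse its output; None if the tool is missing."""
    if shutil.which(command[0]) is None:
        return None
    result = subprocess.run(command, capture_output=True, text=True, check=False)
    return parse_version(result.stdout)


def in_docker_group():
    """Linux only: True or False, or None when the check does not apply."""
    try:
        import grp
        docker_gid = grp.getgrnam("docker").gr_gid
    except (ImportError, KeyError):
        return None
    # Reflects the current session; a new group membership needs a fresh login.
    return docker_gid in os.getgroups()


def run_checks():
    docker = tool_version(["docker", "--version"])
    compose = tool_version(["docker", "compose", "version", "--short"])
    return {
        "CPU: 8+ logical cores": (os.cpu_count() or 0) >= MIN_CORES,
        "Docker 26.0+": docker is not None and docker >= MIN_DOCKER,
        "Docker Compose 2.26+": compose is not None and compose >= MIN_COMPOSE,
        "Git installed": shutil.which("git") is not None,
        "User in docker group": in_docker_group(),
    }


if __name__ == "__main__":
    labels = {True: "OK", False: "MISSING", None: "n/a"}
    for item, passed in run_checks().items():
        print(f"[{labels[passed]:^7}] {item}")
```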
Configuration per Use Case
- Simple Usage (no GPU)
- Production (with GPU)
- Lightweight Test Environment
Hardware:
- CPU: 8 cores
- RAM: 16 GB
- SSD: 50 GB
Performance:
- Vector search: 100-200 ms
- LLM first response: 50-75 s (includes model loading)
- LLM subsequent responses: 5-15 s
Next Steps
Once requirements are met, consult the Quick Start Guide to install OpenRAG in 5 minutes.
Quick Start: install and launch OpenRAG with docker compose up.