Distributed ARM64 Kubernetes cluster powering AI inference, vector search, object storage, CI/CD automation, and self-hosted backend services.
A self-hosted distributed platform combining Kubernetes, AI inference, data systems, storage, and automation on ARM64 infrastructure.
LLM inference, embeddings, RAG pipelines, and agent workflows running on-cluster.
PostgreSQL with the Apache AGE graph extension, Redis, Weaviate vector search, and search indexing.
MinIO S3-compatible object storage for datasets, model weights, and media assets.
Visual representation of compute, storage, AI, and backend layers across the cluster.
Each domain is independently documented with architecture, design decisions, and component details.
RAG · Agents · Inference · Embeddings
LLM inference pipeline, vector retrieval, semantic search, and agent orchestration, all running on-cluster.
APIs · Async · Services · Databases
FastAPI services, Redis caching, PostgreSQL with Apache AGE graph extension, job queues.
Kubernetes · ARM64 · Homelab · CI/CD
k3s cluster across 8 Raspberry Pi 5 nodes with Gitea CI/CD, Prometheus, and Grafana.