Self-Hosted Kubernetes Infrastructure

Private Cloud AI Platform

Distributed ARM64 Kubernetes cluster powering AI inference, vector search, object storage, CI/CD automation, and self-hosted backend services.

Request Lifecycle
User
Ingress
FastAPI
Redis / PostgreSQL
AI Layer
MinIO
Response
Requests enter through Kubernetes ingress, pass through FastAPI services, leverage Redis/PostgreSQL data stores, invoke AI workloads, persist artifacts to MinIO, and return results to the client.
Cluster
8
Kubernetes Nodes
Compute
ARM64
Raspberry Pi 5 Cluster
Intelligence
AI
Inference + RAG + Embeddings
Delivery
CI/CD
Automated Deployments
Technology Stack
Kubernetesk3sRaspberry Pi 5FastAPIRedisPostgreSQLApache AGEWeaviateMinIOGiteaActions RunnersLLMs
System Architecture

Core Systems

A self-hosted distributed platform combining Kubernetes, AI inference, data systems, storage, and automation on ARM64 infrastructure.

AI Systems

LLM inference, embeddings, RAG pipelines, and agent workflows running on-cluster.

Llama 3.2WeaviateRAGEmbeddings

Data Layer

PostgreSQL, Redis, Weaviate vector search, Apache AGE graph engine, and search indexing.

PostgreSQLRedisWeaviateApache AGE

Storage

MinIO S3-compatible object storage for datasets, model weights, and media assets.

MinIOS3-CompatiblePre-signed URLs
ARM64 Cluster
8 Raspberry Pi 5 nodes running k3s
Orchestration
Scheduling, networking, autoscaling
CI/CD
Automated builds via Gitea runners
Observability
Metrics, logs, dashboards, alerting
Architecture Visualization

System Architecture Diagram

Visual representation of compute, storage, AI, and backend layers across the cluster.

Loading diagram…

Install @vue-flow/core to render this diagram.
HTTP request
Data ops
AI inference
Storage