Infrastructure Design

Donavan Jones Published November 17, 2025 infrastructure-engineering

Infrastructure Design

Introduction

Infrastructure design is not just about servers, clusters, or networking—it’s about building systems that can evolve without breaking under pressure. In my case, I approach infrastructure as a living system that supports both software development and real-world deployment needs: AI workloads, web applications, CI/CD pipelines, and experimental services.

My current setup reflects this philosophy. I run a homelab rack powered by a K3s Kubernetes cluster on multiple nodes, combined with a separate development machine running an RTX 3090 for local model execution and experimentation. I also use Gitea as the central Git system, paired with self-hosted CI runners that act as the glue between code and deployment. This separation of concerns—compute, orchestration, and development—lets me iterate quickly without sacrificing stability.

Core Principles

1. Separation of Concerns

Each part of the system has a defined role:

Kubernetes (K3s) handles orchestration and service deployment
The 3090 development machine handles AI workloads and local inference
CI runners handle builds, tests, and deployment automation
Gitea acts as the source of truth for all code and pipelines

This prevents the system from becoming tightly coupled or fragile.

2. Edge-First Homelab Architecture

My rack is designed as a small-scale production environment. Instead of treating it as a toy lab, I treat it like a distributed edge cluster. Services deployed to K3s behave as if they are in production, which helps surface real issues early.

3. CI/CD as the Glue Layer

Gitea combined with self-hosted runners is what connects everything. When I push code:

Pipelines build containers
Tests run automatically
Deployments are pushed into the K3s cluster

This removes manual deployment friction and enforces consistency.

4. Hybrid Compute Model

Not everything belongs in Kubernetes. My RTX 3090 machine runs:

AI model training
Inference services inside Docker containers
Experimental workloads that don’t need orchestration overhead

Kubernetes handles stable services; the GPU box handles heavy computation.

Current Homelab Architecture

At a high level, my infrastructure looks like this:

K3s Cluster (Rack Nodes)
- Runs production-like services
- Hosts APIs, backend services, and supporting infrastructure
- Handles service discovery and networking
Development Machine (RTX 3090)
- Docker-based AI services
- Model experimentation and testing
- Local inference endpoints
Gitea Server
- Central repository for all projects
- Triggers CI/CD pipelines
- Stores infrastructure-as-code
CI/CD Runners
- Build and deploy containers
- Push artifacts into the cluster
- Automate infrastructure updates

This setup allows me to simulate real-world distributed systems while still maintaining full control locally.

Design Philosophy

The goal of this infrastructure is not maximum scale—it’s maximum learning velocity with production-grade patterns.

I prioritize:

Observability over complexity
Reproducibility over manual configuration
Automation over direct intervention
Modular services over monoliths

Every new service I add must fit into this model or it gets redesigned.

Scalability Strategy

Instead of scaling vertically, I scale horizontally through:

Adding new nodes to the K3s cluster
Splitting services into microservices when necessary
Offloading heavy compute to dedicated machines
Using CI/CD to ensure deployment consistency

This allows the system to grow organically without requiring a full redesign.

Conclusion

This infrastructure is designed to behave like a small production cloud, but fully controlled in a homelab environment. The combination of a K3s cluster, a GPU-powered development machine, and a CI/CD-driven workflow creates a system where experimentation and production engineering coexist.

The real value is not in the hardware itself, but in the workflow it enables: rapid iteration, safe deployment, and the ability to treat infrastructure as code rather than manual setup. As the system grows, the goal remains the same—keep it modular, automated, and close to real-world production standards while still flexible enough for experimentation.

Keep Reading

Case Study

In Progress

Bible Verse — Case Study

Production SaaS Platform · Full-Stack · Founder & Sole Engineer

A domain-driven SaaS platform with five independently scalable system boundaries: scripture content delivery, RAG-backed AI study, real-time community interaction, async media processing, and infrastructure services — built and operated end-to-end.

Our Results

37K+

Verses Indexed

AI Models

Bounded Domains

Job Queues

How We Built It

RAG pipeline grounding AI responses in actual scripture rather than model memory
Hybrid Llama / OpenAI routing — local inference for cost, API fallback for quality at the edge
Non-blocking media processing — FFmpeg jobs enqueued via BullMQ, API never waits on transcoding
Cross-instance real-time consistency via Redis pub/sub behind WebSocket and WebRTC layers

Lessons Learned

Domain boundaries enforced at the service layer prevent coupling long before scale demands microservices.
RAG retrieval quality matters more than model size — better embeddings outperform a larger model on poor context.
Async queue design should be first-class, not bolted on; BullMQ worker isolation saved the request path repeatedly.

Stack

Nuxt 3TypeScriptNitroPostgreSQLPrismaRedisBullMQWeaviateMinIOFFmpegWebRTCWebSocketsLlama 3.2OpenAI APIKubernetes

View Full Case Study

Written by

Donavan Jones Full-Stack Engineer & Systems Architect

5+ years building production systems · AI, Backend & Infrastructure · Founder of Bible Logic

Full-stack engineer with 5+ years of hands-on experience designing and shipping production systems — from Nuxt 3 frontends and Nitro APIs to self-hosted Kubernetes clusters, RAG pipelines, and real-time AI applications. Everything I write comes from systems I've designed, deployed, and operated in production.

5+ Years Experience AI Systems Specialist Kubernetes & Infrastructure

Nuxt 3TypeScriptPostgreSQLKubernetesRAG / LLMWebRTCAWS IVSRedis

Full Author Bio GitHub LinkedIn Resume Systems

Menu