Pod Layouts

Donavan Jones Published December 12, 2025 infrastructure-engineering

Pod Layouts

Introduction

In my homelab Kubernetes environment, running on a Raspberry Pi-based K3s cluster inside my rack, pod layout is one of the most important design decisions. Since I’m not working with cloud-managed infrastructure, I have to be intentional about how workloads are distributed across constrained ARM nodes, a lightweight control plane, and external compute resources like my development machine with a GPU.

This setup forces me to think differently about architecture: instead of scaling vertically with cloud instances, I design around node specialization, workload isolation, and predictable scheduling behavior. Pod layout becomes the backbone of system reliability, performance, and maintainability in my cluster.

Understanding Pod Layouts in My Cluster

Pod layouts refer to how workloads are organized, scheduled, and distributed across nodes in a Kubernetes cluster. In my setup, this is shaped heavily by the physical constraints of my rack environment and the mixed hardware roles.

1. Control Plane vs Worker Distribution

My cluster is built using K3s on Raspberry Pi nodes, which handle most of the orchestration. The control plane remains lightweight, while worker nodes handle application workloads.

Typical layout:

Control Plane Node:
- Manages scheduling, API server, and cluster state
- Kept free from heavy workloads
Worker Nodes (Raspberry Pi):
- Run application pods
- Handle lightweight services like APIs, dashboards, and automation tools
- Optimized for ARM-compatible containers

This separation ensures stability even when workload demand increases.

2. Workload Categorization Strategy

Instead of randomly deploying pods, I organize them by function:

Infrastructure Pods

These run core services for the cluster itself:

DNS resolution (CoreDNS)
Ingress controllers
Metrics and monitoring tools

Application Pods

These are services tied to my Bible app ecosystem:

API services
Authentication systems
Social features (posts, feeds, reactions)
Devotional content services

Data Layer Pods

These require more care due to persistence:

MySQL or PostgreSQL databases
Redis caching layers
Vector or search services

In my rack, I prioritize data consistency by pinning these pods to more stable nodes and using persistent volumes.

3. Node Affinity and Scheduling Rules

Since my cluster runs on limited hardware, I use scheduling rules to control where pods land.

Examples of scheduling strategy:

Lightweight services → distributed across all Pi nodes
Stateful workloads → pinned to specific reliable nodes
Experimental services → isolated node groups
GPU-related workloads → offloaded to my external dev machine (Docker-based model server outside the cluster)

This hybrid approach keeps the cluster responsive even under load.

4. External Compute Integration (Rack + GPU Machine)

One of the most important parts of my setup is that not everything runs inside Kubernetes.

My external development machine (with an RTX 3090) runs AI workloads in Docker containers. The cluster communicates with it through APIs.

This creates a layout like:

Kubernetes cluster → handles app logic and orchestration
External GPU machine → handles inference, model execution, AI services
Communication layer → REST/gRPC between cluster and GPU host

This separation prevents the Raspberry Pi nodes from being overwhelmed while still allowing AI features in my applications.

5. Deployment Strategy (GitOps Style Thinking)

My deployment workflow is evolving toward a Git-driven model using Gitea and CI runners in the rack.

Typical flow:

Code pushed to repository
CI pipeline builds container images
Images are deployed to K3s cluster
Pods update using rolling updates

This ensures that pod layouts remain consistent and reproducible instead of being manually adjusted.

6. Failure Handling and Resilience

Because my cluster is running on homelab hardware, failure is expected, not theoretical.

To handle this:

Pods are designed to restart automatically
Stateless services are preferred where possible
Replicas are used for critical services
Node failures are tolerated without cluster collapse

The pod layout is intentionally designed to assume hardware will eventually fail.

Conclusion

Pod layout design in my Kubernetes cluster is less about abstract theory and more about adapting to real physical constraints inside my rack. Running K3s on Raspberry Pi nodes forces a disciplined approach to scheduling, resource management, and service separation.

By combining:

ARM-based worker nodes
A lightweight control plane
External GPU compute for AI workloads
Git-based deployment pipelines

I’ve built a hybrid system that behaves like a small-scale cloud, but remains fully under my control.

As the cluster evolves, pod layout will continue to be one of the most important factors in scaling services like my Bible app, AI systems, and future microservices architecture.

Keep Reading

Case Study

In Progress

Bible Verse — Case Study

Production SaaS Platform · Full-Stack · Founder & Sole Engineer

A domain-driven SaaS platform with five independently scalable system boundaries: scripture content delivery, RAG-backed AI study, real-time community interaction, async media processing, and infrastructure services — built and operated end-to-end.

Our Results

37K+

Verses Indexed

AI Models

Bounded Domains

Job Queues

How We Built It

RAG pipeline grounding AI responses in actual scripture rather than model memory
Hybrid Llama / OpenAI routing — local inference for cost, API fallback for quality at the edge
Non-blocking media processing — FFmpeg jobs enqueued via BullMQ, API never waits on transcoding
Cross-instance real-time consistency via Redis pub/sub behind WebSocket and WebRTC layers

Lessons Learned

Domain boundaries enforced at the service layer prevent coupling long before scale demands microservices.
RAG retrieval quality matters more than model size — better embeddings outperform a larger model on poor context.
Async queue design should be first-class, not bolted on; BullMQ worker isolation saved the request path repeatedly.

Stack

Nuxt 3TypeScriptNitroPostgreSQLPrismaRedisBullMQWeaviateMinIOFFmpegWebRTCWebSocketsLlama 3.2OpenAI APIKubernetes

View Full Case Study

Written by

Donavan Jones Full-Stack Engineer & Systems Architect

5+ years building production systems · AI, Backend & Infrastructure · Founder of Bible Logic

Full-stack engineer with 5+ years of hands-on experience designing and shipping production systems — from Nuxt 3 frontends and Nitro APIs to self-hosted Kubernetes clusters, RAG pipelines, and real-time AI applications. Everything I write comes from systems I've designed, deployed, and operated in production.

5+ Years Experience AI Systems Specialist Kubernetes & Infrastructure

Nuxt 3TypeScriptPostgreSQLKubernetesRAG / LLMWebRTCAWS IVSRedis

Full Author Bio GitHub LinkedIn Resume Systems

Menu

Pod Layouts

Pod Layouts

Introduction

Understanding Pod Layouts in My Cluster

1. Control Plane vs Worker Distribution

2. Workload Categorization Strategy

Infrastructure Pods

Application Pods

Data Layer Pods

3. Node Affinity and Scheduling Rules

4. External Compute Integration (Rack + GPU Machine)

5. Deployment Strategy (GitOps Style Thinking)

6. Failure Handling and Resilience

Conclusion

More in infrastructure-engineering

K3s on Raspberry Pis

Hardware List and Costs

Flashing Raspberry Pi OS

Case Study

Our Results

How We Built It

Lessons Learned

Stack