Pod Layouts

Introduction

In my homelab Kubernetes environment, running on a Raspberry Pi-based K3s cluster inside my rack, pod layout is one of the most important design decisions. Since I’m not working with cloud-managed infrastructure, I have to be intentional about how workloads are distributed across constrained ARM nodes, a lightweight control plane, and external compute resources like my development machine with a GPU.

This setup forces me to think differently about architecture: instead of scaling vertically with cloud instances, I design around node specialization, workload isolation, and predictable scheduling behavior. Pod layout becomes the backbone of system reliability, performance, and maintainability in my cluster.

Understanding Pod Layouts in My Cluster

Pod layouts refer to how workloads are organized, scheduled, and distributed across nodes in a Kubernetes cluster. In my setup, this is shaped heavily by the physical constraints of my rack environment and the mixed hardware roles.

1. Control Plane vs Worker Distribution

My cluster is built using K3s on Raspberry Pi nodes, which handle most of the orchestration. The control plane remains lightweight, while worker nodes handle application workloads.

Typical layout:

Control Plane Node:
- Manages scheduling, API server, and cluster state
- Kept free from heavy workloads
Worker Nodes (Raspberry Pi):
- Run application pods
- Handle lightweight services like APIs, dashboards, and automation tools
- Optimized for ARM-compatible containers

This separation ensures stability even when workload demand increases.

2. Workload Categorization Strategy

Instead of randomly deploying pods, I organize them by function:

Infrastructure Pods

These run core services for the cluster itself:

DNS resolution (CoreDNS)
Ingress controllers
Metrics and monitoring tools

Application Pods

These are services tied to my Bible app ecosystem:

API services
Authentication systems
Social features (posts, feeds, reactions)
Devotional content services

Data Layer Pods

These require more care due to persistence:

MySQL or PostgreSQL databases
Redis caching layers
Vector or search services

In my rack, I prioritize data consistency by pinning these pods to more stable nodes and using persistent volumes.

3. Node Affinity and Scheduling Rules

Since my cluster runs on limited hardware, I use scheduling rules to control where pods land.

Examples of scheduling strategy:

Lightweight services → distributed across all Pi nodes
Stateful workloads → pinned to specific reliable nodes
Experimental services → isolated node groups
GPU-related workloads → offloaded to my external dev machine (Docker-based model server outside the cluster)

This hybrid approach keeps the cluster responsive even under load.

4. External Compute Integration (Rack + GPU Machine)

One of the most important parts of my setup is that not everything runs inside Kubernetes.

My external development machine (with an RTX 3090) runs AI workloads in Docker containers. The cluster communicates with it through APIs.

This creates a layout like:

Kubernetes cluster → handles app logic and orchestration
External GPU machine → handles inference, model execution, AI services
Communication layer → REST/gRPC between cluster and GPU host

This separation prevents the Raspberry Pi nodes from being overwhelmed while still allowing AI features in my applications.

5. Deployment Strategy (GitOps Style Thinking)

My deployment workflow is evolving toward a Git-driven model using Gitea and CI runners in the rack.

Typical flow:

Code pushed to repository
CI pipeline builds container images
Images are deployed to K3s cluster
Pods update using rolling updates

This ensures that pod layouts remain consistent and reproducible instead of being manually adjusted.

6. Failure Handling and Resilience

Because my cluster is running on homelab hardware, failure is expected, not theoretical.

To handle this:

Pods are designed to restart automatically
Stateless services are preferred where possible
Replicas are used for critical services
Node failures are tolerated without cluster collapse

The pod layout is intentionally designed to assume hardware will eventually fail.

Conclusion

Pod layout design in my Kubernetes cluster is less about abstract theory and more about adapting to real physical constraints inside my rack. Running K3s on Raspberry Pi nodes forces a disciplined approach to scheduling, resource management, and service separation.

By combining:

ARM-based worker nodes
A lightweight control plane
External GPU compute for AI workloads
Git-based deployment pipelines

I’ve built a hybrid system that behaves like a small-scale cloud, but remains fully under my control.

As the cluster evolves, pod layout will continue to be one of the most important factors in scaling services like my Bible app, AI systems, and future microservices architecture.

Menu

Pod Layouts

Pod Layouts

Understanding Pod Layouts in My Cluster