Kubernetes Networking Tutorial: A Guide for Developers

Kubernetes networking is one of the most critical and complex parts of running containerized workloads in production. This tutorial will walk you through both the theory as well as some hands-on examples and best practices for mastering Kubernetes networking.


Table of Contents

  1. Introduction to Kubernetes Networking

  2. Core Concepts in Kubernetes Networking

  3. Cluster Networking Components

  4. DNS and Service Discovery

  5. Pod Networking Deep Dive

  6. Services and Load Balancing

  7. Network Policies and Security

  8. Common Pitfalls and Troubleshooting

  9. Summary and Next Steps

What is Kubernetes Networking?

Kubernetes abstracts much of the complexity involved in networking, but understanding its internal workings helps you optimize and troubleshoot applications.

Each pod gets a unique IP address and can communicate with all other pods without Network Address Translation (NAT). This simple yet powerful model supports complex distributed systems.

NAT (Network Address Translation) refers to the process of rewriting the source or destination IP address (and possibly port) of packets as they pass through a router or gateway.

Because NAT alters packet headers, it breaks the “end-to-end” transparency of the network:

  1. The receiving host sees the NAT device’s address instead of the original sender’s.

  2. Packet captures (for example, via tcpdump) only show the translated addresses, obscuring which internal endpoint truly sent the traffic.

Example: Home Wi-Fi Router NAT

Imagine your home network: you have a laptop, a phone, and a smart TV all connected to the same Wi-Fi. Your Internet provider assigns you one public IP address (say, 203.0.113.5). Internally, your router gives each device a private IP (for example, 192.168.1.10 for your laptop, 192.168.1.11 for your phone, and so on).

  • Outbound traffic: When your laptop (192.168.1.10) requests a webpage, the router rewrites the packet’s source IP from 192.168.1.10 → 203.0.113.5 (and tracks which internal port maps to which device).

  • Inbound traffic: When the webpage replies, it arrives at 203.0.113.5, and the router uses its NAT table to forward that packet back to 192.168.1.10.

Because of this translation:

  1. External servers only see the router’s IP (203.0.113.5), not your laptop’s.

  2. Packets are “masqueraded” so multiple devices can share one public address.

In contrast, Kubernetes pods communicate without this extra translation layer – each pod IP is “real” within the cluster, so no router-like step obscures who talked to whom.

Example: E-Commerce Microservices

Consider an online store built as separate microservices, each running in its own pod with a unique IP:

  • Product Catalog Service: 10.244.1.2

  • Shopping Cart Service: 10.244.2.3

  • User Authentication Service: 10.244.1.4

  • Payment Processing Service: 10.244.3.5

When a shopper adds an item to their cart, the Shopping Cart Pod reaches out directly to the Product Catalog Pod at 10.244.1.2. Because there’s no NAT or external proxy in the data path, this communication is fast and reliable – which is crucial for delivering a snappy, real-time user experience.

Tip: For a complete, hands-on implementation of this scenario (and others), check out the networking-concepts-practice section of my Learn-DevOps-by-building project.

Importance in Distributed Systems

Networking in distributed systems facilitates the interaction of multiple services, enabling microservices architectures to function efficiently. Reliable networking supports redundancy, scalability, and fault tolerance.

Kubernetes Networking Model Principles

Kubernetes networking operates on three foundational pillars that create a consistent and high-performance network environment:

1. Unique IP per Pod

Every pod receives its own routable IP address, eliminating port conflicts and simplifying service discovery. This design treats pods like traditional VMs or physical hosts: each can bind to standard ports (for example, 80/443) without remapping.

This helps developers avoid port-management complexity, and tools (like monitoring, tracing) work seamlessly, since pods appear as first-class network endpoints.

2. NAT-Free Pod Communication

Pods communicate directly without Network Address Translation (NAT). Packets retain their original source/destination IPs, ensuring end-to-end visibility. This simplifies debugging (for example, tcpdump shows real pod IPs) and enables precise network policies. No translation layer also means lower latency and no hidden stateful bottlenecks.
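As a quick illustration of that visibility, a packet capture on a node shows the pod IPs directly. A hedged sketch, reusing the sample pod addresses from the e-commerce example above:

# Run on the node hosting one of the pods; requires root
sudo tcpdump -ni any host 10.244.1.2 and host 10.244.2.3
# Each captured packet lists the real pod IPs as source and destination,
# with no translated node or gateway address in between.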

3. Direct Node-Pod Routing

Nodes route traffic to pods without centralized gateways. Each node handles forwarding decisions locally (via CNI plugins), creating a flat L3 network. This avoids single points of failure and optimizes performance – cross-node traffic flows directly between nodes, not through proxies. Scalability is inherent, and adding nodes expands capacity linearly.

Challenges in Container Networking

Common challenges include managing dynamic IP addresses, securing communications, and scaling networks without performance degradation. While Kubernetes abstracts away much networking complexity, real-world deployments still face hurdles such as:

Dynamic IP Management:

Pods are ephemeral – IPs change constantly during scaling, failures, or updates. Hard-coded IPs break, and DNS caching (with misconfigured TTLs) risks routing to stale endpoints. Solutions like CoreDNS dynamically track pod IPs via the Kubernetes API, while readiness probes ensure only live pods are advertised.
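A minimal sketch of that readiness-probe behavior (the image, port, and /healthz path are assumptions): until the probe succeeds, the pod is kept out of its Service's endpoints, so DNS and kube-proxy never advertise it.

apiVersion: v1
kind: Pod
metadata:
  name: catalog
  labels:
    app: catalog
spec:
  containers:
  - name: catalog
    image: catalog:v1            # illustrative image
    ports:
    - containerPort: 8080
    readinessProbe:              # gate for being listed as a Service endpoint
      httpGet:
        path: /healthz           # assumed health endpoint
        port: 8080
      initialDelaySeconds: 5
      periodSeconds: 10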

Secure Communication:

Default cluster-wide pod connectivity exposes "east-west" threats. Compromised workloads can scan internal services, and encrypting traffic (for example, mTLS) adds CPU overhead. Network Policies enforce segmentation (for example, isolating PCI-compliant services), and service meshes automate encryption without app changes.
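As a hedged sketch of that segmentation, a NetworkPolicy that only allows checkout pods to reach payment pods (the labels and port are assumptions, and enforcement requires a CNI plugin that supports policies, such as Calico or Cilium):

apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: payment-allow-checkout-only
spec:
  podSelector:
    matchLabels:
      app: payment               # the pods this policy protects (assumed label)
  policyTypes:
  - Ingress
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: checkout          # only checkout pods may connect (assumed label)
    ports:
    - protocol: TCP
      port: 8443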

Performance at Scale:

Large clusters strain legacy tooling. iptables rules explode with thousands of services, slowing packet processing. Overlay networks (for example, VXLAN) fragment packets, and centralized load balancers bottleneck traffic. Modern CNIs (Cilium/eBPF, Calico/BGP) bypass kernel bottlenecks, while IPVS replaces iptables for O(1) lookups.

Core Concepts in Kubernetes Networking

What are Pods and Nodes?

Pods are the smallest deployable units. Each pod runs on a node, which could be a virtual or physical machine.

Scenario Example: Web Application Deployment

A typical web application might have:

  • Three frontend pods running NGINX (distributed across two nodes)

  • Five backend API pods running Node.js (distributed across three nodes)

  • Two database pods running PostgreSQL (on dedicated nodes with SSD storage)

# View pods distributed across nodes
kubectl get pods -o wide

NAME                        READY   STATUS    NODE
frontend-6f4d85b5c9-1p4z2   1/1     Running   worker-node-1
frontend-6f4d85b5c9-2m5x3   1/1     Running   worker-node-1
frontend-6f4d85b5c9-3n6c4   1/1     Running   worker-node-2
backend-7c8d96b6b8-4q7d5    1/1     Running   worker-node-2
backend-7c8d96b6b8-5r8e6    1/1     Running   worker-node-3
...

What are Services?

Services expose pods using selectors. They provide a stable network identity even as pod IPs change.

kubectl expose pod nginx-pod --port=80 --target-port=80 --name=nginx-service

Scenario Example: Database Service Migration

A team needs to migrate their database from MySQL to PostgreSQL without disrupting application functionality:

  1. Deploy PostgreSQL pods alongside existing MySQL pods

  2. Create a database service that initially selects only MySQL pods:

apiVersion: v1
kind: Service
metadata:
  name: database-service
spec:
  selector:
    app: mysql
  ports:
  - port: 3306
    targetPort: 3306
  3. Update application to be compatible with both databases

  4. Update the service selector to include both MySQL and PostgreSQL pods:

selector:
  app: database  # New label applied to both MySQL and PostgreSQL pods
  5. Gradually remove MySQL pods while the service routes traffic to available PostgreSQL pods

The service abstraction allows for zero-downtime migration by providing a consistent endpoint throughout the transition.

Communication Paths

A communication path is simply the route that network traffic takes from its source to its destination within (or into/out of) the cluster. In Kubernetes, the three main paths are:

  • Pod-to-Pod: Direct traffic between two pods (possibly on different nodes).

  • Pod-to-Service: Traffic from a pod destined for a Kubernetes Service (which then load-balances to one of its backend pods).

  • External-to-Service: Traffic originating outside the cluster (e.g. from an end-user or external system) directed at a Service (often via a LoadBalancer or Ingress).

Pod-to-Pod Communication

Pods communicate directly with each other using their IP addresses without NAT. For example:

kubectl exec -it pod-a -- ping pod-b

Scenario Example: Sidecar Logging

In a log aggregation setup, each application pod has a sidecar container that processes and forwards logs:

  1. Application container writes logs to a shared volume

  2. Sidecar container reads from the volume and forwards to a central logging service

# Check communication between application and sidecar
kubectl exec -it app-pod -c app -- ls -la /var/log/app
kubectl exec -it app-pod -c log-forwarder -- tail -f /var/log/app/application.log

Because both containers are in the same pod, they can communicate via localhost and shared volumes without any network configuration.
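A minimal sketch of such a pod, assuming illustrative image names and an emptyDir volume shared between the two containers:

apiVersion: v1
kind: Pod
metadata:
  name: app-pod
spec:
  volumes:
  - name: app-logs
    emptyDir: {}                     # scratch space shared by both containers
  containers:
  - name: app
    image: my-app:v1                 # illustrative application image
    volumeMounts:
    - name: app-logs
      mountPath: /var/log/app        # the app writes its logs here
  - name: log-forwarder
    image: fluent/fluent-bit:2.2     # illustrative log-shipper image
    volumeMounts:
    - name: app-logs
      mountPath: /var/log/app        # the sidecar tails the same files
      readOnly: true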

Pod-to-Service Communication

Pods communicate with services using DNS names, enabling load-balanced access to multiple pods:

kubectl exec -it pod-a -- curl http://my-service.default.svc.cluster.local

Scenario Example: API Gateway Pattern

A microservices architecture uses an API gateway pattern:

  1. Frontend pods need to access fifteen or more backend microservices

  2. Instead of tracking individual pod IPs, the frontend connects to service names:

// Frontend code
const authService = 'http://auth-service.default.svc.cluster.local';
const userService = 'http://user-service.default.svc.cluster.local';
const productService = 'http://product-service.default.svc.cluster.local';

async function getUserProducts(userId) {
  const authResponse = await fetch(`${authService}/validate`);
  if (!authResponse.ok) {
    throw new Error('Authentication failed');
  }
  // Parse the JSON bodies rather than returning raw Response objects
  const user = await (await fetch(`${userService}/users/${userId}`)).json();
  const products = await (await fetch(`${productService}/products?user=${userId}`)).json();
  return { user, products };
}

Each service name resolves to a stable endpoint, even as the underlying pods are scaled, replaced, or rescheduled.

External-to-Service Communication

External communication is facilitated through service types like NodePort or LoadBalancer. An example of NodePort usage:

apiVersion: v1
kind: Service
metadata:
  name: my-nodeport-service
spec:
  type: NodePort
  ports:
  - port: 80
    targetPort: 80
    nodePort: 30080
  selector:
    app: my-app

Now, this service can be accessed externally via:

curl http://<node-ip>:30080
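Any node's address works, since NodePort opens the same port on every node. One way to find an address to use (a hedged sketch; column names vary by provider):

kubectl get nodes -o wide
# Use a value from the EXTERNAL-IP column (or INTERNAL-IP from inside the network)
# in place of <node-ip> above.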

Scenario Example: Public-Facing Web Application

A company runs a public-facing web application that needs external access:

  1. Deploy the application pods with three replicas

  2. Create a LoadBalancer service to expose the application:

apiVersion: v1
kind: Service
metadata:
  name: web-app
  annotations:
    service.beta.kubernetes.io/aws-load-balancer-type: nlb  # Cloud-specific annotation
spec:
  type: LoadBalancer
  ports:
  - port: 80
    targetPort: 8080
  selector:
    app: web-app
  3. When deployed on AWS, this automatically provisions a Network Load Balancer with a public IP

  4. External users access the application through the load balancer, which distributes traffic across all three pods

# Check the external IP assigned to the service
kubectl get service web-app

NAME     TYPE          CLUSTER-IP     EXTERNAL-IP        PORT(S)
web-app  LoadBalancer  10.100.41.213  a1b2c3.amazonaws.com  80:32456/TCP

Cluster Networking Components

Kubernetes networking transforms abstract principles into reality through tightly orchestrated components. Central to this is the Container Network Interface (CNI), a standardized specification that governs how network connectivity is established for containers.

What is a Container Network Interface (CNI)?

At its essence, CNI acts as Kubernetes' networking plugin framework. It’s responsible for dynamically assigning IP addresses to pods, creating virtual network interfaces (like virtual Ethernet pairs), and configuring routes whenever a pod starts or stops.

Crucially, Kubernetes delegates these low-level networking operations to CNI plugins, allowing you to choose implementations aligned with your environment’s needs: whether that’s Flannel’s simple overlay networks for portability, Calico’s high-performance BGP routing for bare-metal efficiency, or Cilium’s eBPF-powered data plane for advanced security and observability.

Working alongside CNI, kube-proxy operates on every node, translating Service abstractions into concrete routing rules within the node’s kernel (using iptables or IPVS). Meanwhile, CoreDNS provides seamless service discovery by dynamically mapping human-readable names (for example, cart-service.production.svc.cluster.local) to stable Service IPs. Together, these components form a cohesive fabric, ensuring pods can communicate reliably whether they’re on the same node or distributed across global clusters.

High-Level CNI Plugin Differences:

  • Flannel: Simple overlay (VXLAN, host-gw) for basic multi-host networking.

  • Calico: Pure-L3 routing using BGP or IP-in-IP, plus rich network policies.

  • Cilium: eBPF-based dataplane for ultra-fast packet processing and advanced features like API-aware policies.

These high-level plugins all implement the CNI standard for managing pod IPs and routing. To see which plugin a cluster is running, list the pods in the kube-system namespace:

kubectl get pods -n kube-system

Scenario Example: Multi-Cloud Deployment with Calico

A company operates a hybrid deployment across AWS and Azure:

  1. Choose Calico as the CNI plugin for consistent networking across clouds:
# Install Calico on both clusters
kubectl apply -f https://docs.projectcalico.org/manifests/calico.yaml

# Verify Calico pods are running
kubectl get pods -n kube-system -l k8s-app=calico-node

Calico provides:

  • Consistent IPAM (IP Address Management) across both clouds

  • Network policy enforcement in both environments

  • BGP routing for optimized cross-node traffic

  2. When migrating workloads between clouds, the networking layer behaves consistently despite different underlying infrastructure.

What is kube-proxy?

kube-proxy is a network component that runs on each node and implements Kubernetes’ Service abstraction. Its responsibilities include:

  • Watching the API server for Service and Endpoint changes.

  • Programming the node’s packet-filtering layer (iptables or IPVS) so that traffic to a Service ClusterIP:port gets load-balanced to one of its healthy backend pods.

  • Handling session affinity, if configured (so repeated requests from the same client go to the same pod).

By doing this per-node, kube-proxy ensures any pod on that node can reach any Service IP without needing a central gateway.
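Session affinity, mentioned above, is configured on the Service itself; a minimal hedged sketch (the names, ports, and timeout are illustrative):

apiVersion: v1
kind: Service
metadata:
  name: cart-service
spec:
  selector:
    app: cart
  sessionAffinity: ClientIP        # kube-proxy keeps routing a given client IP to the same pod
  sessionAffinityConfig:
    clientIP:
      timeoutSeconds: 3600         # affinity window (the default is 10800 seconds)
  ports:
  - port: 80
    targetPort: 8080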

What are iptables & IPVS?

Both iptables and IPVS are Linux kernel subsystems that kube-proxy can use to manage Service traffic:

iptables mode

kube-proxy generates a set of NAT rules (in the nat table) so that when a packet arrives for a Service IP, the kernel rewrites its destination to one of the backend pod IPs.

IPVS mode

IPVS (IP Virtual Server) runs as part of the kernel’s Netfilter framework. Instead of dozens or hundreds of iptables rules, it keeps a high-performance hash table of virtual services and real servers.

Here's a side-by-side comparison of the two modes:

  • iptables pros: simple and universally available on Linux; battle-tested and easy to debug.

  • iptables cons: rule complexity grows linearly with Services and Endpoints; packet processing slows at scale due to sequential rule checks; Service updates trigger full rule reloads.

  • IPVS pros: O(1) lookup time regardless of cluster size; built-in load-balancing algorithms (round-robin, least connections, source hashing); incremental updates without full rule recomputation; lower CPU overhead for large clusters.

  • IPVS cons: requires Linux kernel 4.4 or newer with the IPVS modules loaded; more complex initial configuration; limited visibility with traditional tools.
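Switching modes is a kube-proxy configuration change. A hedged sketch for kubeadm-style clusters, where kube-proxy reads its settings from the kube-proxy ConfigMap (the resource names below are the kubeadm defaults; verify them in your cluster):

# 1. Confirm the IPVS kernel modules are available on every node
lsmod | grep -e ip_vs -e nf_conntrack

# 2. Set mode: "ipvs" in the kube-proxy configuration
kubectl -n kube-system edit configmap kube-proxy
#   (inside config.conf)
#   mode: "ipvs"

# 3. Restart kube-proxy so the new mode takes effect
kubectl -n kube-system rollout restart daemonset kube-proxy

# 4. Verify: ipvsadm should now list a virtual server per Service
sudo ipvsadm -Ln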

Scenario Example: Debugging Service Connectivity

When troubleshooting service connectivity issues in a production cluster:

  1. First, check if kube-proxy is functioning:
# Check kube-proxy pods
kubectl get pods -n kube-system -l k8s-app=kube-proxy

# Examine kube-proxy logs
kubectl logs -n kube-system kube-proxy-a1b2c
  2. Inspect the iptables rules created by kube-proxy on a node:
# Connect to a node
ssh worker-node-1

# View iptables rules for a specific service
sudo iptables-save | grep my-service
  3. The output reveals how traffic to ClusterIP 10.96.45.10 is load-balanced across multiple backend pod IPs:
-A KUBE-SVC-XYZAB12345 -m comment --comment "default/my-service" -m statistic --mode random --probability 0.33332 -j KUBE-SEP-POD1
-A KUBE-SVC-XYZAB12345 -m comment --comment "default/my-service" -m statistic --mode random --probability 0.50000 -j KUBE-SEP-POD2
-A KUBE-SVC-XYZAB12345 -m comment --comment "default/my-service" -j KUBE-SEP-POD3

Understanding these rules helps diagnose why traffic might not be reaching certain pods.

DNS and Service Discovery

Every service in Kubernetes relies on DNS to map a human-friendly name (for example, my-svc.default.svc.cluster.local) to its ClusterIP. When pods come and go, DNS records must update quickly so clients never hit stale addresses.

Kubernetes uses CoreDNS as a cluster DNS server. When you create a Service, an A record is added pointing to its ClusterIP. Endpoints (the pod IPs) are published as SRV (Service) records. If a pod crashes or is rescheduled, CoreDNS watches the Endpoints API and updates its records in near–real time.

Key mechanics:

  1. Service A record → ClusterIP

  2. Endpoint SRV records → backend pod IPs & ports

  3. TTL tuning → how long clients cache entries

Why recovery matters:

  • A DNS TTL that’s too long can leave clients retrying an old IP.

  • A TTL that’s too short increases DNS load.

  • Readiness probes must signal “not ready” before CoreDNS removes a pod’s record.
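The TTL from the trade-off above is tuned in CoreDNS's Corefile, stored in the coredns ConfigMap in kube-system; a hedged excerpt of the relevant block (values shown are the common defaults):

# kubectl -n kube-system edit configmap coredns   (Corefile excerpt)
.:53 {
    errors
    health
    kubernetes cluster.local in-addr.arpa ip6.arpa {
        pods insecure
        fallthrough in-addr.arpa ip6.arpa
        ttl 30                     # how long clients may cache cluster records (seconds)
    }
    forward . /etc/resolv.conf
    cache 30
}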

CoreDNS

CoreDNS provides DNS resolution for services inside the cluster.

kubectl exec -it busybox -- nslookup nginx-service

Service discovery is automatic, using:

<service-name>.<namespace>.svc.cluster.local
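Short names such as nginx-service resolve because the kubelet writes cluster search domains into each pod's /etc/resolv.conf. A hedged illustration (the values shown are the common defaults):

kubectl exec -it busybox -- cat /etc/resolv.conf

# Typical contents:
# search default.svc.cluster.local svc.cluster.local cluster.local
# nameserver 10.96.0.10        <- the ClusterIP of the kube-dns/CoreDNS Service
# options ndots:5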

Scenario Example: Microservices Environment Variables vs. DNS

A team is migrating from hardcoded environment variables to Kubernetes DNS:

Before: Configuration via environment variables

apiVersion: v1
kind: Pod
metadata:
  name: order-service
spec:
  containers:
  - name: order-app
    image: order-service:v1
    env:
    - name: PAYMENT_SERVICE_HOST
      value: "10.100.45.12"
    - name: INVENTORY_SERVICE_HOST
      value: "10.100.67.34"
    - name: USER_SERVICE_HOST
      value: "10.100.23.78"

After: Using Kubernetes DNS service discovery

apiVersion: v1
kind: Pod
metadata:
  name: order-service
spec:
  containers:
  - name: order-app
    image: order-service:v2
    env:
    - name: PAYMENT_SERVICE_HOST
      value: "payment-service.default.svc.cluster.local"
    - name: INVENTORY_SERVICE_HOST
      value: "inventory-service.default.svc.cluster.local"
    - name: USER_SERVICE_HOST
      value: "user-service.default.svc.cluster.local"

When the team needs to relocate the payment service to a dedicated namespace for PCI compliance:

  1. Move payment service to "finance" namespace

  2. Update only one environment variable:

- name: PAYMENT_SERVICE_HOST
  value: "payment-service.finance.svc.cluster.local"
  3. The application continues working without rebuilding container images or updating other services

Pod Networking Deep Dive

Under the hood, each pod has its own network namespace, virtual Ethernet (veth) pair, and an interface like eth0. The CNI plugin glues these into the cluster fabric.

When the kubelet creates a pod, it calls your CNI plugin, which:

  1. Allocates an IP from a pool.

  2. Creates a veth pair and moves one end into the pod’s netns.

  3. Programs routes on the host so that other nodes know how to reach this IP.

Namespaces and Virtual Ethernet

Each pod gets a Linux network namespace and connects to the host via a virtual Ethernet pair.

kubectl exec -it nginx-pod -- ip addr

Scenario Example: Debugging Network Connectivity

When troubleshooting connectivity issues between pods:

  1. Examine the network interfaces inside a pod:
kubectl exec -it web-frontend-pod -- ip addr

1: lo:  mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
2: eth0@if18:  mtu 1450 qdisc noqueue state UP group default
    link/ether 82:cf:d8:e9:7a:12 brd ff:ff:ff:ff:ff:ff link-netnsid 0
    inet 10.244.2.45/24 scope global eth0
    inet6 fe80::80cf:d8ff:fee9:7a12/64 scope link
  2. Trace the path from pod to node:
# On the node hosting the pod
sudo ip netns list
# Shows namespace like: cni-1a2b3c4d-e5f6-7890-a1b2-c3d4e5f6g7h8

# Examine connections on the node
sudo ip link | grep veth
# Shows virtual ethernet pairs like: veth123456@if2: ...

# Check routes on the node
sudo ip route | grep 10.244.2.45
# Shows how traffic reaches the pod

This investigation reveals how traffic flows from the pod through its namespace, via virtual ethernet pairs, then through the node's routing table to reach other pods.

Shared Networking in Multi-Container Pods

Multi-container pods share the same network namespace. Use this for sidecar and helper containers.
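Because the containers share one network namespace, a helper container can reach the application over localhost; a hedged sketch (images and port are assumptions):

apiVersion: v1
kind: Pod
metadata:
  name: web-with-helper
spec:
  containers:
  - name: web
    image: nginx:1.25              # serves on port 80 inside the shared namespace
  - name: helper
    image: curlimages/curl:8.5.0   # illustrative helper image
    # Polls the web container over localhost; no Service or pod IP needed
    command: ["sh", "-c", "while true; do curl -s http://localhost:80 > /dev/null; sleep 30; done"]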

Scenario Example: Service Mesh Sidecar

When implementing Istio service mesh with automatic sidecar injection:

  1. Deploy an application with Istio sidecar injection enabled:
apiVersion: v1
kind: Pod
metadata:
  name: api-service
  annotations:
    sidecar.istio.io/inject: "true"
spec:
  containers:
  - name: api-app
    image: api-service:v1
    ports:
    - containerPort: 8080
  2. After deployment, the pod has two containers sharing the same network namespace:
kubectl describe pod api-service

Name:         api-service
...
Containers:
  api-app:
    ...
    Ports:          8080/TCP
    ...
  istio-proxy:
    ...
    Ports:          15000/TCP, 15001/TCP, 15006/TCP, 15008/TCP
    ...
  3. The sidecar container intercepts all network traffic:
kubectl exec -it api-service -c istio-proxy -- netstat -tulpn

Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address     Foreign Address     State       PID/Program name
tcp        0      0 0.0.0.0:15001     0.0.0.0:*           LISTEN      1/envoy
tcp        0      0 0.0.0.0:15006     0.0.0.0:*           LISTEN      1/envoy
  4. Traffic to the application container is transparently intercepted without requiring application changes:
kubectl exec -it api-service -c api-app -- curl localhost:8080
# Actually goes through the proxy even though it looks direct to the app

This shared network namespace enables the service mesh to implement features like traffic encryption, routing, and metrics collection without application modifications.

Services and Load Balancing

Kubernetes Services abstract a set of pods behind a single virtual IP. That virtual IP can be exposed in several ways:

A Service object defines a stable IP (ClusterIP), DNS entry, and a selector. kube-proxy then programs the node to intercept traffic to that IP and forward it to one of the pods.

Service types:

  • ClusterIP (default): internal only

  • NodePort: opens the Service on every node’s port (e.g. 30080)

  • LoadBalancer: asks your cloud provider for an external LB

  • ExternalName: CNAME to an outside DNS name
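As a hedged sketch of the ExternalName type (the external hostname is an assumption):

apiVersion: v1
kind: Service
metadata:
  name: billing-db
spec:
  type: ExternalName
  externalName: db.example.com     # in-cluster lookups of billing-db return a CNAME to this host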

Load-balancing mechanics:

  • kube-proxy with iptables (random endpoint selection) or IPVS (round-robin, least connections, and other algorithms)

  • External Ingress (NGINX, Traefik) for HTTP/S with host/path routing