Scripts, services, and configuration for running MakerHouse's home network and cluster. Features include:
- external HTTPS endpoints (Traefik & Cert-Manager)
- virtual services accessible via IP address (MetalLB)
- DNS-level adblocking (PiHole)
- inter-device pub/sub messaging via MQTT (Mosquitto)
- distributed storage (Longhorn)
- automation flows (NodeRed)
- monitoring, dashboarding, and alerting (Prometheus / Grafana)
- custom container hosting (via private registry)
For more high-level details, see this blog post.
For additional services deployed on top of this stack, see the other *.md
files in this repository.
For setup, we will install a 64-bit ARM version of Ubuntu onto several raspberry pi's.
Then, we will install a version of kubernetes called k3s, which is optimized for running on IoT and "edge" devices but is otherwise a drop-in replacement for k8s (i.e. kubernetes). This lets us run replicated services with all of the fancy features listed above.
Setting up a basic, replicated pi cluster is an involved process consisting of several steps:
- Setting up the cluster
  - Purchasing the hardware
  - (optional) Network setup
  - Flashing the OS
  - Installing K3S and linking the nodes together
- Configuring the cluster to be useful
  - Configuring the load balancer and reverse proxy
  - Installing a distributed storage solution
Knowledge prerequisites for this guide:
- Some basic networking (e.g. how to find a remote device's IP address and SSH into it)
- Linux command line fundamentals (navigating to files, opening and editing them, and running commands)
- For the optional network setup step, it's also useful to know what DHCP is and how to configure it (and subnets) in your router
Be prepared to spend several hours on initial setup, plus an hour or two here and there for further refinements (such as those in advanced_setup.md)
For the cluster network, you will need:
- A gigabit ethernet switch with as many ports as nodes in your cluster, plus one (such as this one)
- An ethernet cable connected to your existing network
For each node, you will need:
- A raspberry pi 4 (or better), with 4GB of RAM recommended. Ideally all nodes are the same type of pi with the same hardware specs.
- A USB-C power supply (5V with at least 2A, such as this one)
- A short ethernet cable to connect the pi to the switch (such as this pack)
For sufficient storage, you will need (per node):
- A USB 3 NVMe M.2 SSD enclosure (such as this one)
- An NVMe M.2 SSD (I picked this 256GB one)
You will also likely need an SD card to flash a "USB boot" bootloader onto each raspberry pi. This is only needed for initial setup - it will not be used after the pi's boot via SSD.
Before continuing on:
- Connect your switch to power and the LAN
- Connect each pi to the switch via ethernet (which port doesn't matter)
- Install an SSD into each enclosure, then plug one enclosure into one of the blue USB 3 ports on each pi
- At this point, it helps to label the SSDs with the name you expect each node to have, e.g. `k3s1`, `k3s2` etc., to keep track of where each image 'lives'.
TL;DR: Try to avoid using raspberry pi's earlier than the pi 4, and distributions running ARMv6. If you use a Pi 4 and follow the instructions below, you're in the clear.
For compatibility with published k3s binaries, you must also be running an ARMv7 or higher OS; this is only supported on the Raspberry Pi 2B and higher. This issue describes how kubernetes support for ARMv6 was dropped.
This guide will assume your router is set up with a LAN subnet of 192.168.0.0/23 (i.e. allowing for IP addresses from 192.168.0.1 all the way to 192.168.1.254). You may also use 192.168.0.0/16 if you don't plan to have any other subnets on your local network (it takes up the full 192.168.* range).
- 192.168.0.1 is the address of the router
- IP addresses from 192.168.0.2-254 are for exposed cluster services (i.e. virtual devices)
- IP addresses from 192.168.1.2-254 are for physical devices (the pi's, other IoT devices, laptops, phones etc.)
- We recommend having a static IP address range not managed by DHCP, e.g. 192.168.1.2-30, and avoiding leasing 192.168.1.1 as it'd be confusing.
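If you want to double-check the boundaries of the /23 subnet above, python3's built-in `ipaddress` module gives a quick sanity check (this is just a convenience, not part of the setup):

```shell
# Print the first and last usable addresses of the subnet.
python3 -c "import ipaddress; n = ipaddress.ip_network('192.168.0.0/23'); print(n[1], '-', n[-2])"
# 192.168.0.1 - 192.168.1.254
```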
Follow these instructions to install a USB bootloader onto each pi. Stop when you get to step 9 (inserting the Raspberry Pi OS) as we'll be installing Ubuntu for Servers instead.
Use https://www.balena.io/etcher/ or similar to write an Ubuntu 20.04 ARM 64-bit LTS image to one of the SSDs. We'll do the majority of setup on this drive, then clone it to the other pi's (with some changes).
On your local machine (not the pi), unplug and re-plug the SSD, then navigate to the `boot` partition and ensure there's a file labeled `ssh` there (if not, create a blank one). This allows us to remote into the pi's.
Now we will enable cgroups which are used by k3s to manage the resources of processes that are running on the cluster.
Append to /boot/firmware/cmdline.txt (see here):
cgroup_enable=memory cgroup_memory=1
Example of a correct config:
ubuntu@k3s1:~$ cat /boot/firmware/cmdline.txt
net.ifnames=0 dwc_otg.lpm_enable=0 console=serial0,115200 console=tty1 root=LABEL=writable rootfstype=ext4 elevator=deadline rootwait fixrtc cgroup_enable=memory cgroup_memory=1
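If you'd rather script the change than edit the file by hand, something like this works (a sketch; it assumes cmdline.txt is a single line, which it is on stock images):

```shell
# Append the cgroup flags to the end of the (single-line) cmdline.txt,
# but only if they aren't already present, so re-running is safe.
CMDLINE=/boot/firmware/cmdline.txt
grep -q 'cgroup_enable=memory' "$CMDLINE" || \
  sudo sed -i '$ s/$/ cgroup_enable=memory cgroup_memory=1/' "$CMDLINE"
```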
You can now unmount and remove the SSD from your local machine.
Plug in the SSD to your pi, then plug in USB-C power to turn it on.
Look on your router to find the IP address of the pi. You should be able to SSH into it with username and password `ubuntu`.
Run `passwd` to change away from the default password.
Run `sudo shutdown now` (sudo password is `ubuntu`) and unplug power once its LED stops blinking.
Remove the SSD and use your software of choice (e.g. `gparted` for linux) to clone it to the other blank SSDs. For each SSD, mount it and edit `/etc/hostname` to be something unique (e.g. `k3s1`, `k3s2` ...)
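If you prefer the command line over gparted, the clone-and-rename step can be sketched as follows. The device names `/dev/sdX` and `/dev/sdY` are placeholders - double-check with `lsblk` before running `dd`, since picking the wrong disk is destructive:

```shell
# Clone the prepared SSD onto a blank one (placeholders: sdX = source, sdY = target).
sudo dd if=/dev/sdX of=/dev/sdY bs=4M status=progress conv=fsync

# Mount the clone's root partition (partition number may differ) and
# give it a unique hostname.
sudo mount /dev/sdY2 /mnt
echo "k3s2" | sudo tee /mnt/etc/hostname
sudo umount /mnt
```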
Now's a good time to edit your router settings and assign static IP addresses to each pi for easier access later.
For a three node cluster, we'll have one "master" node named `k3s1` and two worker nodes (`k3s2` and `k3s3`). These instructions generally follow the installation guide from Rancher.
SSH into the pi, and run the install script from get.k3s.io (see install options for more details):
export INSTALL_K3S_VERSION=v1.24.3+k3s1
curl -sfL https://get.k3s.io | sh -s - --disable servicelb --disable local-storage
Before exiting `k3s1`, run `sudo cat /var/lib/rancher/k3s/server/node-token` and copy it for the next step of linking the client nodes.
Notes:
- We include the K3S version for repeatability
- ServiceLB and local storage are disabled to make way for MetalLB and Longhorn (distributed storage) configured later in this guide
To install on worker nodes and add them to the cluster, run the installation script with the K3S_URL and K3S_TOKEN environment variables. Note use of raw IP - this is more reliable than depending on the cluster DNS (Pihole) to be serving, since that service will itself be hosted on k3s.
export K3S_URL=https://<k3s1 IP address>:6443
export INSTALL_K3S_VERSION=v1.24.3+k3s1
export K3S_TOKEN=<token from k3s1>
curl -sfL https://get.k3s.io | sh -
Where K3S_URL is the URL and port of a k3s server, and K3S_TOKEN comes from `/var/lib/rancher/k3s/server/node-token` on the server node (described in the prior step).
That should be it! You can confirm the node successfully joined the cluster by running `kubectl get nodes` when SSH'd into `k3s1`:
```
~ kubectl get nodes
NAME   STATUS   ROLES                  AGE   VERSION
k3s1   Ready    control-plane,master   5m    v1.21.0+k3s1
k3s2   Ready    <none>                 1m    v1.21.0+k3s1
k3s3   Ready    <none>                 1m    v1.21.0+k3s1
```
It's convenient to run cluster management commands from a personal computer rather than having to SSH into the master every time.
Let's grab the k3s.yaml file from master, and convert it into our local config:
ssh ubuntu@k3s1 "sudo cat /etc/rancher/k3s/k3s.yaml" > ~/.kube/config
Now edit the server address to be the address of the pi, since from the server's perspective the master is `localhost`:
sed -i "s/127.0.0.1/<actual server IP address>/g" ~/.kube/config
We will be using MetalLB to allow us to "publish" virtual cluster services on actual IP addresses (in our 192.168.0.2-254 range). This allows us to type e.g. 192.168.0.10 into a browser and see a webpage hosted from our cluster, without having a device with that specific IP address.
Install MetalLB onto the cluster following https://metallb.universe.tf/installation/:
kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/v0.13.4/config/manifests/metallb-native.yaml
kubectl apply -f metallb-addresspool.yml
kubectl apply -f metallb-l2advertisement.yml
See the yml files in `core/` for details on what's being deployed.
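For reference, a minimal address pool and L2 advertisement look roughly like the following sketch (the resource names and exact address range here are illustrative assumptions - check the files in `core/` for the real values):

```yaml
apiVersion: metallb.io/v1beta1
kind: IPAddressPool
metadata:
  name: default-pool
  namespace: metallb-system
spec:
  addresses:
    - 192.168.0.2-192.168.0.254
---
apiVersion: metallb.io/v1beta1
kind: L2Advertisement
metadata:
  name: default-l2
  namespace: metallb-system
spec:
  ipAddressPools:
    - default-pool
```

The `L2Advertisement` tells MetalLB to answer ARP requests for pool addresses directly on the LAN, which is why services become reachable without any router configuration.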
Test whether metallb is working by starting an exposed service, then cleaning up after:
kubectl apply -f ./core/lbtest.yaml
kubectl describe service hello
- Look for "IPAllocated" in the event log
- Visit 192.168.0.3 and confirm "Welcome to nginx!" is visible
kubectl delete service hello
kubectl delete deployment hello
Some failure modes of MetalLB cause a fraction of the VIPs (Virtual IPs) to become unresponsive.
Check whether all MetalLB pods are in state "Running" via `kubectl get pods -n metallb-system -o wide`:
```
speaker-7l7kv 1/1 Running 2 16d 192.168.1.5 pi4-1 <none> <none>
controller-65db86ddc6-fkpnj 1/1 Running 2 16d 10.42.0.75 pi4-1 <none> <none>
speaker-st749 1/1 Running 1 16d 192.168.1.7 pi4-3 <none> <none>
speaker-8wcwj 1/1 Running 0 16m 192.168.1.6 pi4-2 <none> <none>
```
For more details, download kubetail (see the bottom of this page) and run:
./kubetail.sh -l component=speaker -n metallb-system
- If you see an error like "connection refused" referencing 192.168.1.#:7946, check whether one of the "speaker" pods isn't actually running.
Now we can set up a distributed storage solution. Storage should be replicated just as with nodes in a cluster - in this way, no storage location is a single point of failure for the cluster.
We'll be using Longhorn, the recommended solution from Rancher, which you can read more about here. In addition to data redundancy, it abstracts away "where the data is stored" and lets us specify a size of storage to use as a persistent volume for any individual service, regardless of which node it's hosted on.
Follow the installation guide to set it up. See `core/longhorn.yaml` for the MakerHouse configured version, which pins the specific config and version, and tells MetalLB to expose it on 192.168.0.4 (see the `loadBalancerIP` setting).
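The MetalLB exposure boils down to a Service stanza roughly like the following (an illustrative sketch - the selector and ports here are assumptions, and the actual manifest lives in `core/longhorn.yaml`):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: longhorn-frontend
  namespace: longhorn-system
spec:
  type: LoadBalancer
  loadBalancerIP: 192.168.0.4   # MetalLB assigns this VIP from its pool
  selector:
    app: longhorn-ui
  ports:
    - port: 80
      targetPort: 8000
```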
Note: this solution requires an arm64 architecture, NOT armhf/armv7l which is the default for Raspbian / Raspberry PI OS. If you followed the instructions for OS installation above, you should be OK.
Be sure to set Longhorn as the default storage class, so that service configs without an explicit `storageClass` specified can automatically use it:
kubectl patch storageclass longhorn -p '{"metadata": {"annotations":{"storageclass.kubernetes.io/is-default-class":"true"}}}'
Ensure longhorn is up and running by visiting the exposed IP address (192.168.0.4).
At this point, you should have a home raspberry pi cluster running k3s, with locally available endpoints and distributed block storage. Congrats!
While this is entirely functional on its own for running simple home services, you may find it lacking if you want to host your own custom/private container images, publish webpages / APIs for external use, or integrate with home automation services (e.g. control homebrew IoT devices with Google Assistant).
Head over to advanced_setup.md to learn how to get the most out of your k3s cluster!