Improve resilience by changing routing to traefik and setting up more servers.

The cluster now runs on 3 VMs because the NUC failed and the replacement hardware has not arrived yet.
Ingress routing from the internet now goes through a cloudflared tunnel to traefik instead of a specific host.
2025-10-11 14:42:02 +11:00
parent 7f3161b2bb
commit b53bfe4075
11 changed files with 127 additions and 45 deletions

@@ -16,3 +16,16 @@ Put the age keys in /home/<user>/.config/sops/age/keys.txt
## Adding Secrets
Edit the secrets using `sops secrets.enc.json`
# Bootstrapping (starting without PostgreSQL running)
`terraform apply -target=nomad_job.postgres`
## Restoring PostgreSQL DBs
`psql -h jaglan-beta-m21 -p 5432 -U postgres -f ~/Downloads/all_databases.sql postgres`
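The two steps above can be chained in one place. The following is only a sketch: the `run` helper, the `DRY_RUN` switch and the `bootstrap_postgres` function name are hypothetical additions, and the host, port and dump path defaults are copied from the commands above.

```shell
#!/bin/sh
# Sketch only: bring up PostgreSQL first, then restore a dump into it.
# Defaults mirror the commands in this README; override them via the environment.
PG_HOST="${PG_HOST:-jaglan-beta-m21}"
PG_PORT="${PG_PORT:-5432}"
DUMP_FILE="${DUMP_FILE:-$HOME/Downloads/all_databases.sql}"

# Hypothetical helper: print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

bootstrap_postgres() {
  # Apply only the PostgreSQL job; stop if Terraform reports an error.
  run terraform apply -target=nomad_job.postgres || return 1
  # Restore all databases from the dump into the fresh instance.
  run psql -h "$PG_HOST" -p "$PG_PORT" -U postgres -f "$DUMP_FILE" postgres
}
```

Running `DRY_RUN=1 bootstrap_postgres` prints both commands without executing them. In a real run PostgreSQL may need a moment to accept connections after the job starts, so the restore step might have to be retried.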
# Deploying and testing changes
Sometimes a Nomad job fails because of a problem in a different job. Once that other job is fixed, the failed job is unchanged, so Nomad has to be told to retry it.
`nomad job eval -force-reschedule glance`
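When several unchanged jobs need a retry, the same command can be wrapped in a loop. This is a minimal sketch: `retry_jobs` and the `NOMAD_BIN` override are hypothetical names introduced here, and only `glance` is a job name taken from this README.

```shell
#!/bin/sh
# Sketch only: force Nomad to re-evaluate jobs whose definitions did not change.
# NOMAD_BIN is a hypothetical override, useful for dry runs (NOMAD_BIN=echo).
NOMAD_BIN="${NOMAD_BIN:-nomad}"

retry_jobs() {
  for job in "$@"; do
    # -force-reschedule asks the scheduler to reschedule failed allocations
    # even if they are not currently eligible for rescheduling.
    "$NOMAD_BIN" job eval -force-reschedule "$job" || return 1
  done
}
```

For example, `retry_jobs glance` has the same effect as the single command above, and any further job names can be added as extra arguments.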