kubernetes

Author	SHA1	Message	Date
Dustin C. Hatch	ab107022f4	home-assistant: Remove Tonight's Forecast sensor This template sensor will be migrated to a helper, since Home Assitant removed the `forecast` attribute of weather sensors and now requires calling an action (service) to get those data.	2024-08-27 09:46:56 -05:00
Dustin C. Hatch	b60ed65c80	home-assistant: whisper: Add tmp volume `faster-whisper` now requires writable temporary storage.	2024-08-27 09:35:57 -05:00
Dustin C. Hatch	7fb0932084	home-assistant: Remove unused template sensors	2024-08-27 09:34:08 -05:00
Dustin C. Hatch	01e95d22db	home-assistant: Remove Matrix integration The _hatch.chat_ Matrix homeserver is being retired. We don't use Matrix for any notifications any more.	2024-08-27 09:27:37 -05:00
Dustin C. Hatch	bcfd94948d	home-assistant: Remove deprecated YAML config These configuration settings are no longer supported in the YAML document, but configured via the UI.	2024-08-27 09:12:34 -05:00
Dustin	fd7b90bb1c	Merge pull request 'home-assistant: Update to 2024.8.3' (#10 ) from updatebot/home-assistant into master Reviewed-on: #10	2024-08-27 13:58:02 +00:00
Dustin C. Hatch	1267032847	argocd: apps/home-assistant: Enable auto-sync This way, merging PRs from updatebot will automatically trigger updating Home Assistant et al.	2024-08-27 08:57:03 -05:00
bot	ca80663c29	zwavejs2mqtt: Update to 9.17.0	2024-08-26 15:22:17 +00:00
bot	d16cca534a	zigbee2mqtt: Update to 1.39.1	2024-08-26 15:22:17 +00:00
bot	d78f17f529	piper: Update to 1.5.0	2024-08-26 15:22:17 +00:00
bot	5a33f55d38	whisper: Update to 2.1.0	2024-08-26 15:22:16 +00:00
bot	39c576a6eb	home-assistant: Update to 2024.8.3	2024-08-26 15:22:16 +00:00
Dustin C. Hatch	9c50acb6b9	ntfy: Handle ntfy.pyrocufflink.net name Now that the reverse proxy that handles requests from the Internet uses TLS pass-through, the Ingress for _ntfy_ needs to recognize both the internal and external name.	2024-08-24 11:31:47 -05:00
Dustin C. Hatch	a443929c0c	websites: Manage dcow cert via Ingress annotation Now that the reverse proxy for Internet-facing sites uses TLS passthrough, the certificate for the _darkchestofwonders.us_ Ingress needs to be correct. Since Ingress resources can only use either the default certificate (_*.pyrocufflink.blue_) or a certificate from their same namespace, we have to move the Certificate and its corresponding Secret into the _websites_ namespace. Fortunately, this is easy enoug to do, by setting the appropriate annotations on the Ingress. To keep the existing certificate (until it expires), I moved the Secret manually: ```sh kubectl get secret dcow-cert -o yaml \| grep -v namespace \| kubectl create -n websites -f - ```	2024-08-24 11:30:56 -05:00
Dustin C. Hatch	78afee9abc	v-m/scrape: Remove static VM hosts from collectd The VM hosts are now managed by the "main" Ansible inventory and thus appear in the host list ConfigMap. As such, they do not need to be listed explicitly in the static targets list.	2024-08-23 09:28:05 -05:00
Dustin C. Hatch	94b7168b1e	home-assistant: Add restart MQTTMarionette script There's obviously a bug or something in `mqttmarionette` because it occasionally gets "stuck" in a state where it is running but does not reconnect to the MQTT broker. In such situations, it has to be restarted (and even then it doesn't shut down correctly but has to be killed with SIGKILL, usually). I have been doing this manually, but with this shell script and a corresponding "shell command" integration in Home Assistant, it can be done automatically. This is similar to how Home Assistant restarts Mopidy on the living room stereo when it gets into the same kind of state.	2024-08-23 09:24:46 -05:00
Dustin C. Hatch	7dffb5195a	v-m: alertmanager: Group disk usage alerts Some machines have the same volume mounted multiple times (e.g. container hosts, BURP). Alerts will fire for all of these simultaneously when the filesystem usage passes the threshold. To avoid getting spammed with a bunch of messages about the same filesystem, we'll group alerts from the same machine.	2024-08-17 10:59:05 -05:00
Dustin C. Hatch	02001f61db	v-m/scrape: webistes: Stop scraping Matrix I'm not using Matrix for anything anymore, and it seems to have gone offline. I haven't fully decommissioned it yet, but the Blackbox scrape is failing, so I'll just disable that bit for now.	2024-08-17 10:57:22 -05:00
Dustin C. Hatch	c7e4baa466	v-m: scrape: Remove nvr2.p.b Zincati scrape target I've redeployed nvr2.pyrocufflink.blue as Fedora Linux, so it does not run Zincati anymore.	2024-08-17 10:56:06 -05:00
Dustin C. Hatch	1a631bf366	v-m: scrape: Remove serial1.p.b This machine never worked correctly; the USB-RS232 adapters would stop working randomly (and of course it would be whenever I needed to actually use them). I thought it was something wrong with the server itself (a Raspberry Pi 3), but the same thing happened when I tried using a Pi 4. The new backup server has a plethora of on-board RS-232 ports, so I'm going to use it as the serial console server, too.	2024-08-17 10:54:21 -05:00
Dustin C. Hatch	6f7f09de85	v-m: scrape: Update Unifi server target I've rebuilt the Unifi Network controller machine (again); unifi3.pyrocufflink.blue has replaced unifi2.p.b. The `unifi_exporter` no longer works with the latest version of Unifi Network, so it's not deployed on the new machine.	2024-08-17 10:52:51 -05:00
Dustin C. Hatch	809676f691	v-m: alerts: Add Longhorn alerts	2024-08-17 10:51:13 -05:00
Dustin C. Hatch	9977bb3de4	Merge remote-tracking branch 'refs/remotes/origin/master'	2024-08-06 08:03:42 -05:00
Dustin C. Hatch	dcd3f898c7	xactmon: Deploy Invoice Ninja importer for HLC Bank notifications sent to Tabitha's mailbox are now processed by `xactmon` and imported into Invoice Ninja as expenses for Hatch Learning Center.	2024-08-03 13:39:17 -05:00
Dustin C. Hatch	5b34547730	h-a: Config Zigbee2MQTT w/ env vars Zigbee2MQTT commits the cardinal sin of storing state in its configuration file. This means the file has to be writable and thus stored in persistent storage rather than in a ConfigMap. As a consequence, making changes to the configuration when the application is not running is rather difficult. Case in point: when I added the internal alias for _mqtt.pyrocufflink.blue_ pointing to the in-cluster service, Zigbee2MQTT became unable to connect to the broker because it was using the node port instead of the internal port. Since it could not connect to the broker, it refused to start, and thus the container would not stay running long enough to fix the configuration to point to the correct port. Fortunately, Zigbee2MQTT also allows configuring settings via environment variables, which can be managed with a ConfigMap. Luckily, the values read from environment variables override those from the configuration file, so pointing to the correct broker port with the environment variable was sufficient to allow the application to start.	2024-08-01 09:27:52 -05:00
Dustin C. Hatch	b366532c88	cert-manager, step-ca: Bypass cluster DNS Having name overrides for in-cluster services breaks ACME challenges, because the server tries to connect to the Service instead of the Ingress. To fix this, we need to configure both _cert-manager_ and _step-ca_ to only resolve names using the network-wide DNS server.	2024-07-29 20:58:18 -05:00
Dustin C. Hatch	a785fcec73	sshca: Allow Jenkins jobs to restart the Deployment The Jenkins job for the SSHCA Server restarts the Deployment after building a new container image.	2024-07-27 13:10:20 -05:00
Dustin C. Hatch	a26857819a	step-ca: Add Ingress resource It turns out, `step ca renew` _can_ renew certificates without mTLS; it has a `--mtls=false` command-line argument that configures it to use a JWT signed by the certificate, instead of using the certificate at the transport layer. This allows clients to renew their certificates without needing another authentication mechanism, even with the TLS-terminating proxy.	2024-07-27 13:07:26 -05:00
Dustin C. Hatch	079c3871b9	invoice-ninja: Fix document upload feature Invoice Ninja allows attaching documents to invoices, payments, expenses, etc. Tabitha wants to use this feature to attach receipts for her expenses, but the photos her phone takes of them are too large for the default nginx client body limit. We can raise this limit on the ingress, but we also need to raise it on the "inner" nginx.	2024-07-27 13:04:02 -05:00
Dustin C. Hatch	e74a6b3142	invoice-ninja: Run in a mutable container The Invoice Ninja container is not designed to be immutable at all; it makes a bunch of changes to its own contents when it starts up. Notably, it copies the contents of the `public` and `storage` directories from the container image to the persistent volume _and then deletes the source_. Additionally, being a Laravel application, it needs write access to its own code for caching, etc. Previously, the `init.sh` script copied the entire `app` directory to a temporary directory, and then the runtime container mounted that volume over the top of the original location. This allowed the root filesystem of the container to be read-only, while the `app` directory was still mutable. Unfortunately, this makes the startup process incredibly slow, as it takes a couple of minutes to copy the whole application. It's also pretty pointless, because the application runs as an unprivileged process, so it wouldn't have write access to the rest of the filesystem anyway. As such, I've decided to remove the `readOnlyRootFilesytem` restriction, and allow the container to run as upstream intends, albeit begrudgingly.	2024-07-27 12:57:02 -05:00
Dustin C. Hatch	78cd26c827	v-m: Scrape metrics from RabbitMQ	2024-07-26 20:59:00 -05:00
Dustin C. Hatch	e56a38c034	cert-manager: Add dch-ca issuer In-cluster services can now get certificates signed by the DCH CA via `step-ca`. This issuer uses ACME with the HTTP-01 challenge, so it can only issue certificates for names in the _pyrocufflink.blue_ zone that point to the ingress controllers.	2024-07-26 20:59:00 -05:00
Dustin C. Hatch	54187176ba	ingress: Proxy AMQP Passing port 5671 through the ingress-nginx proxy to the `rabbitmq` service will allow clients outside the cluster to connect to it. While we're at it, we'll move the definition of the `tcp-services` ConfigMap to its own file to make it easier to maintain.	2024-07-26 20:59:00 -05:00
Dustin C. Hatch	1a1d8ff27d	rabbitmq: Deploy RabbitMQ Server RabbitMQ is an AMQP message broker. It will be used by `xactmon` to pass messages between the components. Although RabbitMQ can be deployed in a high-availability cluster, we don't really need that level of robustness for `xactmon`, so we will just run a single instance. Deploying a single-host RabbitMQ server is pretty straightforward. We're using mTLS authentication; clients need to have a certificate issued by the RabbitMQ CA in order to connect to the message broker. The `rabbitmq-ca` _cert-manager_ ClusterIssuer issues these certificates for in-cluster services like `xactmon`.	2024-07-26 20:59:00 -05:00
Dustin C. Hatch	a04a2b5334	xactmon: Deploy xactmon `xactmon` is a new tool I developed to parse transaction notifications from banks and automatically import them into my personal finance tracker. It is designed in a modular fashion, composed of three main components: * Receiver * Processor * Importer Components communicate with one another using an AMQP exchange. Hypothetically, there could be multipel implementations of the receiver and importer components. Right now, there is only a JMAP receiver, which fetches email messages (from Fastmail), and a Firefly III importer. The processor is a singleton, handling notifications from the receiver, parsing them into a normalized format, and passing them on to the importer. It uses a set of rules to decide how to parse the messages, and supports using either a regular expression with named capture groups or an Awk script to extract the relevant information.	2024-07-26 20:53:19 -05:00
Dustin C. Hatch	ccc46288c2	Merge remote-tracking branch 'refs/remotes/origin/master'	2024-07-22 08:12:11 -05:00
Dustin C. Hatch	f4d41c0ec7	invoice-ninja: Add Ingress for HLC client portal Tabitha wants to use the Invoice Ninja Client Portal and Stripe integration for customer payments.	2024-07-14 15:41:14 -05:00
Dustin C. Hatch	989556d458	cert-manager: Update to v1.14.5	2024-07-14 15:14:44 -05:00
Dustin C. Hatch	74fa9264df	xactfetch: Configure secretsocket The `xactfetch` script now uses a helper tool, `secretsocket` to handle looking up secrets. This tool supports various secret source types, including files, environment variables, and external commands. Separating this functionality out of the main script makes it a lot more flexible and pluggable. It's main purpose, though, was actually to allow `xactfetch` to run in a container while communicating with `rbw` outside that container, specifically for development puposes. The `secretsocket` tool reads its configuration from a TOML document. This document defines the secrets the tool handles, and how to look them up. Note that the `xactfetch` container image no longer defines the `XDG_CONFIG_HOME` environment variable, as it uses Chromium instead of Firefox now, and the former does not work with a read-only config directory. As such, we have to mount the `rbw` configuration in the default location.	2024-07-11 22:49:07 -05:00
Dustin C. Hatch	71ca910ef7	home-assistant: Add Tabitha's HLC calendar	2024-07-11 22:15:56 -05:00
Dustin C. Hatch	ee00412bf6	xactfetch: Use separate CronJobs per bank Usually, `xactfetch` will only fail for one bank or the other. Rarely do we want to redownload the data from both banks just because one failed. The latest version of `xactfetch` supports specifying a bank name as a CLI argument, so now we can define separate jobs for each bank. Then, when one Job fails, only that one will be retried later. It's kind of a bummer that it's so repetitive to define two CronJobs that differ by only a single command-line argument. I suppose that's a good argument for using one of the preprocessor tools like Jsonnet or KCL.	2024-07-11 22:09:27 -05:00
Dustin C. Hatch	c741d04d54	xactfetch: Skip wait for manual runs When the `xactfetch` CronJob is triggered manually, it will now skip the `sleep` step. Presumably, whoever triggered it wants the script to run _right now_, probably to diagnose a problem.	2024-07-11 22:07:54 -05:00
Dustin C. Hatch	8cb292a4b2	v-m: alerts: Add alert for temperatures After the incident this week with the CPU overheating on _vmhost1_, I want to make sure I know as soon as possible when anything is starting to get too hot.	2024-07-11 22:07:27 -05:00
Dustin C. Hatch	8113e5a47f	v-m: Fix syntax in AlertManager config The `group_by` field takes a list of label names, rather than a single string.	2024-07-06 07:13:27 -05:00
Dustin C. Hatch	952ab9f264	v-m: alertmanager: Group camera notifications When Frigate is down, multiple alerts are generated for each camera, as Home Assistant creates camera entities for each tracked object. This is extremely annoying, not to mention unnecessary. To address this, we'll configure AlertManager to send a single notification for alerts in the group.	2024-07-05 07:30:30 -05:00
Dustin C. Hatch	9b26753e73	v-m: alerts: Add durations to spammy alerts Let's avoid sending alerts immediately when something is unavailable, because the issue might be transient and will resolve itself shortly.	2024-07-05 07:23:38 -05:00
Dustin C. Hatch	fa80b15a71	jenkins: Remove Argo CD sync hook Since Jenkins no longer uses a Longhorn volume, this sync hook is not useful.	2024-07-04 06:53:58 -05:00
Dustin C. Hatch	248a9a5ae9	v-m: Scrape PostgreSQL exporter The [postgres exporter][0] exposes metrics about the operation and performance of a PostgreSQL server. It's currently deployed on _db0.pyrocufflink.blue_, the primary server of the main PostgreSQL cluster. [0]: https://github.com/prometheus-community/postgres_exporter	2024-07-02 18:16:05 -05:00
Dustin C. Hatch	215b2c6975	home-assistant: Use external PostgreSQL server Home Assistant uses PostgreSQL for recording the history of entity states. Since we had been using the in-cluster database server for this, the data were migrated to the new external PostgreSQL server automatically when the backup from the former was restored on the latter. It follows, then, that we can point Home Assistant to the new server as well. Home Assistant uses SQLAlchemy, which in turn uses _libpq_ via _psycopg_, as a client for PostgreSQL. It doesn't expose any configuration parameters beyond the "database URL" directly, but we can use the standard environment variables to specify the certificate and private key for authentication. In fact, the empty `postgresql://` URL is sufficient, and indicates that _all_ of the connection parameters should be taken from environment variables. This makes specifying the parameters for both the `wait-for-db` init container and the main container take the exact same environment variables, so we can use YAML anchors to share their definitions.	2024-07-02 18:16:05 -05:00
Dustin C. Hatch	a269f8a1ae	firefly-iii: Connect to external PostgreSQL Since the new database server outside the Kubernetes cluster, created for Authelia, was seeded from a backup of the in-cluster server, it already contained the data from Firefly-III as well. Thus, we can switch Firefly-III to using it, too. The documentation for Firefly-III does not mention anything about how to configure it to use certificate-based authentication for PostgreSQL, as is required by the new server. Fortunately, it ultimately uses _libpq_, so the standard `PG...` environment variables work fine. We just need a certificate issued by the _postgresql-ca_ ClusterIssuer and the _DCH Root CA_ certificate mounted in the Firefly-III container.	2024-07-02 18:16:05 -05:00

... 2 3 4 5 6 ...

464 Commits