Dustin C. Hatch dc835ddc9d v-m/alerts: Fix PostgreSQL WAL archive failed alert
The `pg_stat_archiver_failed_count` metric is a counter, so once a WAL
archival has failed, it will increase and never return to `0`.  To
ensure the alert is resolved once the WAL archival process recovers, we
need to use the `increase` function to turn it into a gauge.  Finally,
we aggregate that gauge with `max_over_time` to keep the alert from
flapping if the WAL archive occurs less frequently than the scrape
interval.
2025-02-05 10:42:35 -06:00
2024-11-17 18:43:00 +00:00
2025-02-01 18:16:10 -06:00
2025-02-01 18:16:10 -06:00
2024-12-21 12:32:08 +00:00
2022-12-18 16:26:07 -06:00
2024-11-22 22:26:48 -06:00
2024-05-26 11:48:12 -05:00
2024-06-06 11:03:42 -05:00
2024-09-22 12:00:28 -05:00
2023-10-19 07:12:16 -05:00
2024-02-22 07:10:01 -06:00
2023-07-08 07:06:10 -05:00
2024-11-21 22:59:14 -06:00
2022-07-31 01:38:46 -05:00

Dustin's Kubernetes Cluster

This repository contains resources for deploying and managing my on-premises Kubernetes cluster

Cluster Setup

The cluster primarily consists of libvirt/QEMU+KVM virtual machines. The Control Plane nodes are VMs, as are the x86_64 worker nodes. Eventually, I would like to add Raspberry Pi or Pine64 machines as aarch64 nodes.

All machines run Fedora, using only Fedora builds of the Kubernetes components (kubeadm, kubectl, and kubeadm).

See Cluster Setup for details.

Jenkins Agents

One of the main use cases for the Kubernetes cluster is to provide dynamic agents for Jenkins. Using the Kubernetes Plugin, Jenkins will automatically launch worker nodes as Kubernetes pods.

See Jenkins Kubernetes Integration for details.

Persistent Storage

Persistent storage for pods is provided by Longhorn. Longhorn runs within the cluster and provisions storage on worker nodes to make available to pods over iSCSI.

See Persistent Storage Using Longorn for details.

Description
Resources for deploying and managing my personal Kubernetes cluster
Readme 3.9 MiB
Languages
Shell 80.5%
JavaScript 17.2%
Awk 2.3%