Reliability: - Add swap role (2GB, swappiness=10, idempotent via /etc/fstab) - Add mem_limit to plane-worker (512m) and plane-beat (256m) - Add health checks to all services (traefik, vaultwarden, forgejo, plane-*, syncthing, prometheus, grafana, loki) Code quality: - Remove Traefik Docker labels (file provider used, labels were dead code) - Add comment explaining file provider architecture Observability: - Add AlertManager with Telegram notifications - Add Prometheus alert rules: CPU, RAM, disk, swap, container health - Add Loki + Promtail for centralized log aggregation - Add Loki datasource to Grafana - Enable Traefik /ping endpoint for health checks Backups: - Add backup role: pg_dump for forgejo + plane DBs, tar for vaultwarden and forgejo data - 7-day retention, daily cron at 03:00 - Backup script at /usr/local/bin/backup-services Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
25 lines
600 B
YAML
25 lines
600 B
YAML
---
|
|
- name: Create backup directory
|
|
ansible.builtin.file:
|
|
path: "{{ backup_dir }}"
|
|
state: directory
|
|
owner: "{{ backup_user }}"
|
|
group: "{{ backup_user }}"
|
|
mode: "0750"
|
|
|
|
- name: Deploy backup script
|
|
ansible.builtin.template:
|
|
src: backup.sh.j2
|
|
dest: /usr/local/bin/backup-services
|
|
owner: root
|
|
group: root
|
|
mode: "0750"
|
|
|
|
- name: Schedule daily backup at 03:00
|
|
ansible.builtin.cron:
|
|
name: "Daily services backup"
|
|
minute: "0"
|
|
hour: "3"
|
|
job: "/usr/local/bin/backup-services >> /var/log/backup-services.log 2>&1"
|
|
user: root
|
|
state: present
|