SafetyWing Runbooks

      • KafkaConsumerGroupLagHigh
      • KafkaNoActiveController
      • KafkaOfflinePartitions
      • KafkaUnderReplicatedPartitions
      • KafkaConnectFailedTasks
      • KafkaConnectNoConnectors
      • KafkaConnectWorkersDown
      • MysqlConnectionsSaturated
      • MysqlDiskFillingUp
      • MysqlInstanceDown
      • MysqlReplicationLagHigh
      • RabbitmqDeadLetterMessages
      • RabbitmqDiskAlarm
      • RabbitmqMemoryAlarm
      • RabbitmqNodeDown
      • RabbitmqQueueBacklog
      • RabbitmqQueueNoConsumers
      • CephClusterNearFull
      • CephHealthError
      • CephHealthWarning
      • CephMonOutOfQuorum
      • CephOSDDown
      • ElasticsearchClusterRed
      • ElasticsearchClusterYellow
      • ElasticsearchDiskWatermark
      • ElasticsearchHeapHigh
      • NodeFilesystemAlmostFull
      • TraefikDown
      • TraefikHigh5xxRate
      • EnvironmentHigh5xxRate
    • Alert Catalog

    Node

    • Node

    Node#

    Supplemental node (Talos) platform alerts beyond the kube-prometheus-stack node mixin.

    • NodeFilesystemAlmostFull
    Backward ElasticsearchHeapHigh NodeFilesystemAlmostFull Forward
    • Node