SafetyWing Runbooks

      • KafkaConsumerGroupLagHigh
      • KafkaNoActiveController
      • KafkaOfflinePartitions
      • KafkaUnderReplicatedPartitions
      • KafkaConnectFailedTasks
      • KafkaConnectNoConnectors
      • KafkaConnectWorkersDown
      • MysqlConnectionsSaturated
      • MysqlDiskFillingUp
      • MysqlInstanceDown
      • MysqlReplicationLagHigh
      • RabbitmqDeadLetterMessages
      • RabbitmqDiskAlarm
      • RabbitmqMemoryAlarm
      • RabbitmqNodeDown
      • RabbitmqQueueBacklog
      • RabbitmqQueueNoConsumers
      • CephClusterNearFull
      • CephHealthError
      • CephHealthWarning
      • CephMonOutOfQuorum
      • CephOSDDown
      • ElasticsearchClusterRed
      • ElasticsearchClusterYellow
      • ElasticsearchDiskWatermark
      • ElasticsearchHeapHigh
      • NodeFilesystemAlmostFull
      • TraefikDown
      • TraefikHigh5xxRate
      • EnvironmentHigh5xxRate
    • Alert Catalog

    Ceph

    • Ceph

    Ceph#

    Rook-Ceph storage alerts (platform tier, cluster-wide). Namespace rook-ceph.

    • CephHealthError
    • CephMonOutOfQuorum
    • CephHealthWarning
    • CephOSDDown
    • CephClusterNearFull
    Backward RabbitmqQueueNoConsumers CephClusterNearFull Forward
    • Ceph