1
0
Fork 0
mirror of https://github.com/monitoring-mixins/website.git synced 2024-12-14 11:37:31 +00:00
monitoring-mixins-website/site/content/windows-active-directory/_index.md
2024-11-07 03:31:28 +00:00

6.2 KiB

title
windows-active-directory

Overview

{{< panel style="danger" >}} Jsonnet source code is available at github.com/grafana/jsonnet-libs {{< /panel >}}

Alerts

{{< panel style="warning" >}} Complete list of pregenerated alerts is available here. {{< /panel >}}

windows-alerts-active-directory

WindowsCPUHighUsage

{{< code lang="yaml" >}} alert: WindowsCPUHighUsage annotations: description: | CPU usage on host {{ $labels.instance }} is above 90%. The current value is {{ $value | printf "%.2f" }}%. summary: High CPU usage on Windows host. expr: | 100 - (avg without (mode, core) (rate(windows_cpu_time_total{job=~".windows.", mode="idle"}[2m])) * 100) > 90 for: 15m keep_firing_for: 5m labels: severity: warning {{< /code >}}

WindowsMemoryHighUtilization

{{< code lang="yaml" >}} alert: WindowsMemoryHighUtilization annotations: description: | Memory usage on host {{ $labels.instance }} is above 90%. The current value is {{ $value | printf "%.2f" }}%. summary: High memory usage on Windows host. expr: | 100 - ((windows_os_physical_memory_free_bytes{job=".windows."} / windows_cs_physical_memory_bytes{job=".windows."}) * 100) > 90 for: 15m keep_firing_for: 5m labels: severity: critical {{< /code >}}

WindowsDiskAlmostOutOfSpace

{{< code lang="yaml" >}} alert: WindowsDiskAlmostOutOfSpace annotations: description: | Volume {{ $labels.volume }} is almost full on host {{ $labels.instance }}, more than 90% of space is used. The current volume utilization is {{ $value | printf "%.2f" }}%. summary: Disk is almost full on Windows host. expr: | 100 - ((windows_logical_disk_free_bytes{job=".windows."} ) / (windows_logical_disk_size_bytes{job=".windows."})) * 100 > 90 for: 15m keep_firing_for: 5m labels: severity: critical {{< /code >}}

WindowsServiceNotHealthy

{{< code lang="yaml" >}} alert: WindowsServiceNotHealthy annotations: description: | Windows service {{ $labels.name }} is not in healthy state, currently in '{{ $labels.status }}'. summary: Windows service is not healthy. expr: | windows_service_status{job=".windows.", status!"starting|stopping|ok"} > 0 for: 5m labels: severity: critical {{< /code >}}

WindowsDiskDriveNotHealthy

{{< code lang="yaml" >}} alert: WindowsDiskDriveNotHealthy annotations: description: | Windows disk {{ $labels.name }} is not in healthy state, currently in '{{ $labels.status }}' status. summary: Windows physical disk is not healthy. expr: | windows_disk_drive_status{job=~".windows.", status="OK"} != 1 for: 5m labels: severity: critical {{< /code >}}

WindowsNTPClientDelay

{{< code lang="yaml" >}} alert: WindowsNTPClientDelay annotations: description: | Round-trip time of NTP client on instance {{ $labels.instance }} is greater than 1 second. Delay is {{ $value }} sec. summary: NTP client delay. expr: | windows_time_ntp_round_trip_delay_seconds{job=~".windows."} > 1 for: 5m keep_firing_for: 5m labels: severity: warning {{< /code >}}

WindowsNTPTimeOffset

{{< code lang="yaml" >}} alert: WindowsNTPTimeOffset annotations: description: | NTP time offset for instance {{ $labels.instance }} is greater than 1 second. Offset is {{ $value }} sec. summary: NTP time offset is too large. expr: | windows_time_computed_time_offset_seconds{job=~".windows."} > 1 for: 5m keep_firing_for: 5m labels: severity: warning {{< /code >}}

WindowsActiveDirectoryHighPendingReplicationOperations

{{< code lang="yaml" >}} alert: WindowsActiveDirectoryHighPendingReplicationOperations annotations: description: The number of pending replication operations on {{$labels.instance}} is {{ printf "%.0f" $value }} which is above the threshold of 50. summary: There is a high number of pending replication operations in Active Directory. A high number of pending operations sustained over a period of time can indicate a problem with replication. expr: "windows_ad_replication_pending_operations{job=~".windows."} >= 50 " for: 10m keep_firing_for: 5m labels: severity: warning {{< /code >}}

WindowsActiveDirectoryHighReplicationSyncRequestFailures

{{< code lang="yaml" >}} alert: WindowsActiveDirectoryHighReplicationSyncRequestFailures annotations: description: The number of replication sync request failures on {{$labels.instance}} is {{ printf "%.0f" $value }} which is above the threshold of 0. summary: There are a number of replication synchronization request failures. These can cause authentication failures, outdated information being propagated across domain controllers, and potentially data loss or inconsistencies. expr: "increase(windows_ad_replication_sync_requests_schema_mismatch_failure_total{job=~".windows."}[5m])

0 " for: 5m keep_firing_for: 5m labels: severity: critical {{< /code >}}

WindowsActiveDirectoryHighPasswordChanges

{{< code lang="yaml" >}} alert: WindowsActiveDirectoryHighPasswordChanges annotations: description: The number of password changes on {{$labels.instance}} is {{ printf "%.0f" $value }} which is greater than the threshold of 25. This alert would resolve itself if no new anomalies are detected within 24 hours. summary: There is a high number of password changes. This may indicate unauthorized changes or attacks. expr: | increase(windows_ad_sam_password_changes_total{job=~".windows."}[5m]) > 25 for: 5m labels: keep_firing_for: 24h severity: warning {{< /code >}}

WindowsActiveDirectoryMetricsDown

{{< code lang="yaml" >}} alert: WindowsActiveDirectoryMetricsDown annotations: description: There are no available metrics for Windows Active Directory integration from instance {{$labels.instance}}. summary: Windows Active Directory metrics are down. expr: | up{job="integrations/windows_exporter"} == 0 for: 5m labels: severity: critical {{< /code >}}

Dashboards

Following dashboards are generated from mixins and hosted on github: