chore: update perf docs for 1.12 (#10116)

* fix: update kwok installation Signed-off-by: ShutingZhao <shuting@nirmata.com> * feat: create deployment Signed-off-by: ShutingZhao <shuting@nirmata.com> * feat: create pod Signed-off-by: ShutingZhao <shuting@nirmata.com> * chore: update commands Signed-off-by: ShutingZhao <shuting@nirmata.com> * chore: update readme Signed-off-by: ShutingZhao <shuting@nirmata.com> --------- Signed-off-by: ShutingZhao <shuting@nirmata.com>
2025-03-28 18:38:40 +00:00 · 2024-04-29 18:09:44 +08:00 · 2024-04-29 18:09:44 +08:00 · 8929bd72a1
commit 8929bd72a1
parent ac4eeaaf8b
6 changed files with 171 additions and 81 deletions
--- a/docs/perf-testing/README.md
+++ b/docs/perf-testing/README.md
@ -1,4 +1,4 @@
-This document outlines the instructions for performance testing using [Kwok](https://kwok.sigs.k8s.io/) for the Kyverno 1.10 release.
+This document outlines the instructions for performance testing using [Kwok](https://kwok.sigs.k8s.io/) for the Kyverno 1.12 release.

 # Pre-requisite

@ -109,27 +109,69 @@ helm upgrade --install kyverno kyverno/kyverno -n kyverno \
  --set admissionController.serviceMonitor.enabled=true \
  --set admissionController.replicas=3 \
  --set reportsController.serviceMonitor.enabled=true \
-  --set reportsController.resources.limits.memory=10Gi 
+  --set reportsController.resources.limits.memory=10Gi \
+  --set "features.omitEvents.eventTypes={PolicyApplied,PolicySkipped,PolicyViolation,PolicyError}" \
  # --devel \
  # --set features.admissionReports.enabled=false \
 ```

 ## Deploy Kyverno PSS policies
 ```sh
-helm upgrade --install kyverno kyverno/kyverno-policies --set=podSecurityStandard=restricted --set=background=true --set=validationFailureAction=Enforce --devel
+helm upgrade --install kyverno kyverno/kyverno-policies --set=podSecurityStandard=restricted --set=background=true --set=validationFailureAction=Audit --devel
 ```

-# Create workloads
+# Testing the reports controller

-This script creates 1000 pods, with QPS and burst set to 50:
+The following instructions provide steps to create policyreports for installed workloads, measure resource usages of the reports controller and the total objects size in etcd.
+
+## Create workloads
+
+This script creates 100 deployments in namespace `test-1`, each deployment has 10 replicas:
+
+```
+./docs/perf-testing/deployment.sh
+Enter the deployment count:
+100
+Enter the deployment replicas:
+10
+Enter the deployment namespace:
+test-1
+Creating namespace test-1
+...
+```
+
+The total number of policyreports for the 100 deployments with 10 replicas each is 1200. With Kyverno 1.12.0, a policy report is created for one matching resource, therefore 100 deployments, 100 replicasets and 1000 pods will create 1200 policy reports in total.
+
+You can also create pods directly using `./docs/perf-testing/pod.sh`.
+
+Note that these pods will be scheduled to the Kwok nodes, not K3d nodes.
+
+## Objects sizes in etcd
+
+Run the following script to calculate total sizes for the given resource (policyreports in the following example):
+```sh
+$ ./docs/perf-testing/size.sh
+Enter the resource to caclutate the size:
+wgpolicyk8s.io/policyreports
+The total size for wgpolicyk8s.io/policyreports is 401851071 bytes.
+```
+
+You can also check the total etcd size:
+```sh
+$ etcdctl endpoint status -w table
+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
+|        ENDPOINT         |        ID        | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
+| https://172.21.0.2:2379 | c2ed0eb8fc7bc4fc |   3.5.9 |  1.8 GB |      true |      false |         2 |    2428629 |            2428629 |        |
+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
+```
+
+This command returns the resources stored in etcd that have more than 100 objects:

 ```sh
-kubectl create ns test
-go run docs/perf-testing/main.go --count=1000 --kinds=pods --clientRateLimitQPS=50 --clientRateLimitBurst=50 --namespace=test
+kubectl get --raw=/metrics | grep apiserver_storage_objects |awk '$2>100' |sort -g -k 2
 ```

-Note that these pods will be scheduled to the Kwok nodes, not k3s nodes.
-
 # Prometheus Queries

 To view the Prometheus dashboard, you can expose it on your localhost's port at 9090:
@ -142,7 +184,7 @@ kubectl port-forward --address 127.0.0.1 svc/kube-prometheus-stack-prometheus 90
 To get an view of the memory utilization overtime, you can select by the container image for a specific Kyverno controller:

 ```
-container_memory_working_set_bytes{image="ghcr.io/kyverno/kyverno:v1.10.0-rc.1"}
+container_memory_working_set_bytes{image="ghcr.io/kyverno/kyverno:v1.12.0-rc.5"}
 ```

 `container_memory_working_set_bytes` gives you the current working set in bytes, and this is what the OOM killer is watching for.
@ -151,58 +193,7 @@ container_memory_working_set_bytes{image="ghcr.io/kyverno/kyverno:v1.10.0-rc.1"}
 ## CPU utilization

 ```
-rate(container_cpu_usage_seconds_total{image="ghcr.io/kyverno/kyverno:v1.10.0-rc.1"}[1m])
+rate(container_cpu_usage_seconds_total{image="ghcr.io/kyverno/kyverno:v1.12.0-rc.5"}[1m])
 ```

-`container_cpu_usage_seconds_total` is the sum of the total amount of “user” time (i.e. time spent not in the kernel) and the total amount of “system” time (i.e. time spent in the kernel). This query gives the average CPU usage in the last 1 minute.
-
-## Admission Request Rate
-
-It's a bit tricky to get the precise Admission Request rate (ARPS). When using the Prometheus [rate()](https://prometheus.io/docs/prometheus/latest/querying/functions/#rate) function, it always requires a time window to calculate the rate with the given internal. The rate may differ when the window differs.
-
-
-During our test, we calculate the increment in the count of admission requests recorded at the start and end time of a particular duration. Next, we divide this increment by the duration of the time window to derive the average admission request rate during that period.
-
-
-```
-sum(kyverno_admission_requests_total)
-```
-
-## Objects sizes in etcd
-
-Run the following script to calculate total sizes for the given resource (pods in the following example):
-```sh
-$ ./docs/perf-testing/size.sh
-Enter the resource to calculate the size:
-pods
-The total size for pods is 8861737 bytes.
-```
-
-You can also check the total etcd size:
-```sh
-$ etcdctl endpoint status -w table
-+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
-|        ENDPOINT         |        ID        | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
-+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
-| https://172.19.0.2:2379 | d7380397c3ec4b90 |   3.5.3 |   84 MB |      true |      false |         2 |     154449 |             154449 |        |
-+-------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
-```
-
-This command returns the resources stored in etcd that have more than 100 objects:
-
-```sh
-kubectl get --raw=/metrics | grep apiserver_storage_objects |awk '$2>100' |sort -g -k 2
-```
-
-
-## Admission review latency (average)
-
-Kyverno exposes two metrics that can be used to calculate the admission review latency, 
-```
-sum(kyverno_admission_review_duration_seconds_sum{resource_request_operation=~"create|update"})/sum(kyverno_admission_review_duration_seconds_count{resource_request_operation=~"create|update"})
-```
-
-The following metrics exposed by Prometheus should give you the same result if you follow the same setup on this page:
-```
-sum(apiserver_admission_webhook_admission_duration_seconds_sum{name="validate.kyverno.svc-fail",operation="CREATE"}) / sum(apiserver_admission_webhook_admission_duration_seconds_count{name="validate.kyverno.svc-fail",operation="CREATE"})
-```
+`container_cpu_usage_seconds_total` is the sum of the total amount of “user” time (i.e. time spent not in the kernel) and the total amount of “system” time (i.e. time spent in the kernel). This query gives the average CPU usage in the last 1 minute.
--- a/docs/perf-testing/deployment.sh
+++ b/docs/perf-testing/deployment.sh
@ -0,0 +1,61 @@
+#!/bin/bash
+
+export KUBECONFIG=/etc/rancher/k3s/k3s.yaml
+
+# read user input for count
+echo "Enter the deployment count:"
+read count
+
+echo "Enter the deployment replicas:"
+read replicas
+
+echo "Enter the deployment namespace:"
+read namespace
+
+echo "Creating namespace $namespace:"
+kubectl create namespace $namespace
+
+# iterate $count number of times
+for (( i=1; i<=$count; i++ ))
+do
+  # generate YAML configuration using heredoc with COUNT variable substitution
+  yaml=$(cat <<-END
+apiVersion: apps/v1
+kind: Deployment
+metadata:
+  name: fake-pod-$i
+  namespace: $namespace
+spec:
+  replicas: $replicas
+  selector:
+    matchLabels:
+      app: fake-pod
+  template:
+    metadata:
+      labels:
+        app: fake-pod
+    spec:
+      affinity:
+        nodeAffinity:
+          requiredDuringSchedulingIgnoredDuringExecution:
+            nodeSelectorTerms:
+            - matchExpressions:
+              - key: type
+                operator: In
+                values:
+                - kwok
+      # A taints was added to an automatically created Node.
+      # You can remove taints of Node or add this tolerations.
+      tolerations:
+      - key: "kwok.x-k8s.io/node"
+        operator: "Exists"
+        effect: "NoSchedule"
+      containers:
+      - name: fake-container
+        image: fake-image
+END
+)
+
+  # apply the generated configuration to Kubernetes cluster
+  echo "$yaml" | kubectl apply -f -
+done
--- a/docs/perf-testing/kwok.sh
+++ b/docs/perf-testing/kwok.sh
@ -2,23 +2,10 @@

 export KUBECONFIG=/etc/rancher/k3s/k3s.yaml

-# Variables preparation
-KWOK_WORK_DIR=$(mktemp -d)
+# KWOK repository
 KWOK_REPO=kubernetes-sigs/kwok
+# Get latest
 KWOK_LATEST_RELEASE=$(curl "https://api.github.com/repos/${KWOK_REPO}/releases/latest" | jq -r '.tag_name')

-# Render kustomization yaml
-cat <<EOF > "${KWOK_WORK_DIR}/kustomization.yaml"
-apiVersion: kustomize.config.k8s.io/v1beta1
-kind: Kustomization
-images:
-  - name: registry.k8s.io/kwok/kwok
-    newTag: "${KWOK_LATEST_RELEASE}"
-resources:
-  - "https://github.com/${KWOK_REPO}/kustomize/kwok?ref=${KWOK_LATEST_RELEASE}"
-EOF
-
-kubectl kustomize "${KWOK_WORK_DIR}" > "${KWOK_WORK_DIR}/kwok.yaml"
-
-# create `kwok` deployment 
-kubectl apply -f "${KWOK_WORK_DIR}/kwok.yaml"
+kubectl apply -f "https://github.com/${KWOK_REPO}/releases/download/${KWOK_LATEST_RELEASE}/kwok.yaml"
+kubectl apply -f "https://github.com/${KWOK_REPO}/releases/download/${KWOK_LATEST_RELEASE}/stage-fast.yaml"
--- a/docs/perf-testing/node.sh
+++ b/docs/perf-testing/node.sh
@ -28,7 +28,7 @@ do
        type: kwok
      name: kwok-node-$i
    spec:
-      taints:
+      taints: # Avoid scheduling actual running pods to fake Node
        - effect: NoSchedule
          key: kwok.x-k8s.io/node
          value: fake
--- a/docs/perf-testing/pod.sh
+++ b/docs/perf-testing/pod.sh
@ -0,0 +1,49 @@
+#!/bin/bash
+
+export KUBECONFIG=/etc/rancher/k3s/k3s.yaml
+
+# read user input for count
+echo "Enter the pod count:"
+read count
+
+echo "Enter the pod namespace:"
+read namespace
+
+echo "Creating namespace $namespace:"
+kubectl create namespace $namespace
+
+# iterate $count number of times
+for (( i=1; i<=$count; i++ ))
+do
+  # generate YAML configuration using heredoc with COUNT variable substitution
+  yaml=$(cat <<-END
+apiVersion: v1
+kind: Pod
+metadata:
+  name: fake-pod-$i
+  namespace: $namespace
+spec:
+    affinity:
+      nodeAffinity:
+        requiredDuringSchedulingIgnoredDuringExecution:
+            nodeSelectorTerms:
+            - matchExpressions:
+              - key: type
+                operator: In
+                values:
+                - kwok
+      # A taints was added to an automatically created Node.
+      # You can remove taints of Node or add this tolerations.
+    tolerations:
+      - key: "kwok.x-k8s.io/node"
+        operator: "Exists"
+        effect: "NoSchedule"
+    containers:
+      - name: fake-container
+        image: fake-image
+END
+)
+
+  # apply the generated configuration to Kubernetes cluster
+  echo "$yaml" | kubectl apply -f -
+done
--- a/docs/perf-testing/size.sh
+++ b/docs/perf-testing/size.sh
@ -6,6 +6,8 @@
 echo "Enter the resource to caclutate the size:"
 read resource

+# /registry/reports.kyverno.io/ephemeralreports/
+# /registry/wgpolicyk8s.io/policyreports/
 sum=0
 for key in `etcdctl get --prefix --keys-only /registry/$resource`
 do