mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2025-03-28 10:47:23 +00:00
node-feature-discovery/pkg
Markus Lehtonen a2068f7ce3 nfd-master: tweak list options for NodeFeature informer
Fix cache syncing problems on big clusters with thousands of NodeFeature
objects.

On the initial list (sync), the client-go cache reflector sets
ResourceVersion to "0" (instead of leaving it empty). This causes
problems in the API server, which logs errors like:

E writers.go:122] apiserver was unable to write a JSON response: http:
                  Handler timeout
E status.go:71] apiserver received an error that is not an
                metav1.Status: &errors.errorString{s:"http: Handler timeout"}:
                http: Handler timeout

On the nfd-master side we see corresponding log snippets like:

W reflector.go:547] failed to list *v1alpha1.NodeFeature: stream error
                    when reading response body, may be caused by closed
                    connection. Please retry. Original error: stream
                    error: stream ID 1521; INTERNAL_ERROR; received from
                    peer
I trace.go:236] "Reflector ListAndWatch" name:*** (***) (total time:
                61126ms): ---"Objects listed" error:stream error when
                reading response body, may be caused by closed
                connection. Please retry. Original error: stream
                error: stream ID 1521; INTERNAL_ERROR; received from
                peer 61126ms (***)

Decreasing the page size (opts.Limit) has no effect on the timeouts.
However, setting ResourceVersion to an empty value seems to get the
paging back on track, eliminating the timeouts.

TODO: investigate in Kubernetes upstream the root cause of the timeouts
with ResourceVersion="0".
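
The workaround described above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: ListOptions here is a hypothetical, trimmed-down stand-in for metav1.ListOptions (so the sketch builds without any Kubernetes modules), and the name tweakListOptions is an assumption.

```go
package main

import "fmt"

// ListOptions is a hypothetical stand-in for metav1.ListOptions that keeps
// only the two fields discussed in the commit message.
type ListOptions struct {
	ResourceVersion string
	Limit           int64
}

// tweakListOptions (assumed name) resets ResourceVersion to the empty string
// when it carries the "0" that the reflector sets on its initial list.
// Every other value, and the page size, is left untouched.
func tweakListOptions(opts *ListOptions) {
	if opts.ResourceVersion == "0" {
		opts.ResourceVersion = ""
	}
}

func main() {
	// Initial list (sync): ResourceVersion "0" is cleared.
	// The Limit value is arbitrary and only shows that it is preserved.
	initial := ListOptions{ResourceVersion: "0", Limit: 500}
	tweakListOptions(&initial)
	fmt.Printf("initial list: ResourceVersion=%q Limit=%d\n",
		initial.ResourceVersion, initial.Limit)

	// A list at a specific resource version passes through unchanged.
	later := ListOptions{ResourceVersion: "12345"}
	tweakListOptions(&later)
	fmt.Printf("later list:   ResourceVersion=%q\n", later.ResourceVersion)
}
```

In a real controller, a function of this shape taking *metav1.ListOptions would presumably be handed to the informer as its tweak-list-options hook (generated informer factories usually expose a WithTweakListOptions option for this); how nfd-master wires it up is not shown here.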
2024-07-25 16:29:05 +03:00
Name                  Last commit                                                Date
..
apis/nfd              Simplify code                                              2024-07-09 09:05:33 +03:00
cpuid                 pkg/cpuid: lint fixes                                      2020-05-20 21:48:06 +03:00
features              feature-gates: mark NodeFeatureAPI as GA                   2024-07-16 13:53:31 +03:00
kubectl-nfd           Move NFD api to a separate go mod                          2024-04-05 16:35:47 +02:00
labeler               Fix update_codegen                                         2024-04-09 18:28:04 +02:00
nfd-gc                Merge pull request #1600 from ArangoGutierrez/e2e-not-k8s  2024-04-09 02:06:06 -07:00
nfd-master            nfd-master: tweak list options for NodeFeature informer    2024-07-25 16:29:05 +03:00
nfd-topology-updater  Tidy up usage of channels for signaling                    2024-04-05 14:39:58 +03:00
nfd-worker            feature-gates: mark NodeFeatureAPI as GA                   2024-07-16 13:53:31 +03:00
podres                Update auto-generated code                                 2024-03-11 12:18:32 +02:00
resourcemonitor       topology-updater: ditch apihelper                          2024-01-25 22:15:20 +02:00
topologypolicy        nrt-api: move from v1alpha1 to v1alpha2                    2023-02-09 12:29:54 +01:00
utils                 nfd-master: prevent crash on empty config struct           2024-04-05 14:19:44 +03:00
version               version: allow undefined version                           2020-03-20 07:21:43 +02:00