mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2024-12-14 11:57:51 +00:00
Commit graph

2658 commits

Author SHA1 Message Date
Kubernetes Prow Robot
8c3cb31114
Merge pull request #1853 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/ginkgo/v2-2.20.1
build(deps): bump github.com/onsi/ginkgo/v2 from 2.20.0 to 2.20.1
2024-08-26 06:48:20 +01:00
dependabot[bot]
8c62586592
build(deps): bump github.com/onsi/ginkgo/v2 from 2.20.0 to 2.20.1
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo) from 2.20.0 to 2.20.1.
- [Release notes](https://github.com/onsi/ginkgo/releases)
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md)
- [Commits](https://github.com/onsi/ginkgo/compare/v2.20.0...v2.20.1)

---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-08-25 09:58:24 +00:00
Markus Lehtonen
02b6b7395c Drop dynamic run-time reconfiguration
Simplify the code and reduce possible error scenarios by dropping
fsnotify-based reconfiguration from nfd-master and nfd-worker. Also
eliminates repeated re-configuration in scenarios where kubelet
continuously touches (every minute) the mounted file (configmap) on the
filesystem.

Also modifies the Helm and kustomize deployments so that nfd-master,
nfd-worker and nfd-topology-updater pods are restarted on configmap
updates. In kustomize, the slight downside of this is that the name of the
config map(s) depends on the content, so every time a user customizes
the config data, the old unused configmap will be left and must be
garbage-collected manually.
2024-08-21 12:46:36 +03:00
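The kustomize behaviour described in commit 02b6b7395c (the configmap name depends on its content) can be illustrated with a minimal Go sketch. The `hashedName` helper, the digest scheme, and the `nfd-worker-conf` base name are assumptions made for illustration only; this is not kustomize's actual hashing algorithm.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// hashedName appends a short digest of the config data to a base name.
// Illustrative scheme only; kustomize computes its suffix differently.
func hashedName(base, data string) string {
	sum := sha256.Sum256([]byte(data))
	return base + "-" + hex.EncodeToString(sum[:])[:10]
}

func main() {
	// Hypothetical config contents before and after a user customization.
	before := hashedName("nfd-worker-conf", "core:\n  sleepInterval: 60s\n")
	after := hashedName("nfd-worker-conf", "core:\n  sleepInterval: 30s\n")

	fmt.Println(before)
	fmt.Println(after)
	// A changed name changes the pod template that references it, which
	// triggers a restart; the configmap with the old name is left behind.
	fmt.Println("name changed:", before != after)
}
```

Because every content change yields a new name, the object with the previous name is never overwritten, which is why the commit message notes that unused configmaps must be garbage-collected manually.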
Kubernetes Prow Robot
4db3216a80
Merge pull request #1848 from marquiz/devel/leader-elect
nfd-master: proper shutdown of nfd api informers
2024-08-20 03:02:44 -07:00
Markus Lehtonen
2bb8a72532 nfd-master: proper shutdown of nfd api informers
Stop blocking on event channels when the api controller is stopped.
Ensures that the nfd API informer factory is properly shut down and all
resources released when stop() is called. This eliminates a memory leak
on re-configure events when leader election is enabled.
2024-08-20 12:44:08 +03:00
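The idea behind commit 2bb8a72532 (not blocking on an event channel once the controller is stopped) can be sketched in Go as follows. The `notify` function and the channel names are hypothetical and are not taken from the nfd-master source; the sketch only shows the general `select` pattern.

```go
package main

import "fmt"

// notify tries to deliver an event on an unbuffered channel, but gives up
// as soon as the stop channel is closed instead of blocking forever.
func notify(events chan<- string, stop <-chan struct{}, ev string) bool {
	select {
	case events <- ev:
		return true
	case <-stop:
		return false
	}
}

func main() {
	events := make(chan string) // unbuffered, and nobody is receiving
	stop := make(chan struct{})
	close(stop) // simulate stop() having been called

	// Without the stop case this send would block forever and keep the
	// sending goroutine (and everything it references) alive.
	fmt.Println("delivered:", notify(events, stop, "update-all"))
}
```

A goroutine stuck on such a send can never exit, which is one way resources end up leaking across repeated start/stop cycles.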
Kubernetes Prow Robot
5a5b9e3c19
Merge pull request #1843 from marquiz/devel/master-chan
nfd-master: use only unbuffered chans in the nfd api-controller
2024-08-19 07:23:12 -07:00
Kubernetes Prow Robot
df7f65c9b9
Merge pull request #1844 from marquiz/devel/updater-pool-started
nfd-master: explicit state variable for the node updater pool
2024-08-19 06:36:20 -07:00
Markus Lehtonen
bf6ffadf36 nfd-master: use only unbuffered chans in the nfd api-controller
There's no reason why the "update all" chans should be buffered (while
the others are not).
2024-08-19 14:02:13 +03:00
Markus Lehtonen
0d3c1ac75b nfd-master: explicit state variable for the node updater pool 2024-08-19 13:27:56 +03:00
Markus Lehtonen
8a6853f138 test/e2e: use ptr.To to get pointer to bool 2024-08-14 13:16:03 +03:00
Kubernetes Prow Robot
a851aae914
Merge pull request #1835 from marquiz/devel/e2e
test/e2e: simplify TestMain
2024-08-14 02:32:49 -07:00
Kubernetes Prow Robot
f63e2b1673
Merge pull request #1834 from marquiz/devel/readme
README: update to v0.16.4
2024-08-14 02:10:19 -07:00
Markus Lehtonen
3e1c43dc6f test/e2e: simplify TestMain
Drop unneeded bits.
2024-08-12 14:17:27 +03:00
Markus Lehtonen
378fe8f86b README: update to v0.16.4 2024-08-12 13:22:03 +03:00
Kubernetes Prow Robot
6af374f700
Merge pull request #1832 from kubernetes-sigs/dependabot/go_modules/master/golang.org/x/time-0.6.0
build(deps): bump golang.org/x/time from 0.5.0 to 0.6.0
2024-08-11 23:30:34 -07:00
dependabot[bot]
e85b80602b
build(deps): bump golang.org/x/time from 0.5.0 to 0.6.0
Bumps [golang.org/x/time](https://github.com/golang/time) from 0.5.0 to 0.6.0.
- [Commits](https://github.com/golang/time/compare/v0.5.0...v0.6.0)

---
updated-dependencies:
- dependency-name: golang.org/x/time
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-08-12 06:12:45 +00:00
Kubernetes Prow Robot
396540239f
Merge pull request #1831 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/ginkgo/v2-2.20.0
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0
2024-08-11 23:11:53 -07:00
dependabot[bot]
9f73aabef1
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo) from 2.19.1 to 2.20.0.
- [Release notes](https://github.com/onsi/ginkgo/releases)
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md)
- [Commits](https://github.com/onsi/ginkgo/compare/v2.19.1...v2.20.0)

---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-08-11 09:45:04 +00:00
AhmedGrati
925a071595 docs: add CR restrictions to the master configuration reference
Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2024-08-10 22:39:14 +02:00
AhmedGrati
7bad0d583c feat/nfd-master: support CR restrictions
Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2024-08-10 22:39:10 +02:00
Kubernetes Prow Robot
c1db298de8
Merge pull request #1827 from fmuyassarov/fix-go-mod
tilt: sync up builder go version with project go.mod
2024-08-08 14:22:37 -07:00
Feruzjon Muyassarov
a7d8448eb6
tilt: sync up builder go version with project go.mod
Signed-off-by: Feruzjon Muyassarov <feruzjon.muyassarov@intel.com>
2024-08-09 00:00:47 +03:00
Kubernetes Prow Robot
d38e116667
Merge pull request #1829 from marquiz/devel/docs-theme
docs: use jekyll-rtd-theme from a ruby gem
2024-08-08 13:53:58 -07:00
Markus Lehtonen
b2bc18f5a5 docs: use jekyll-rtd-theme from a ruby gem
The upstream repo (and the release downloads)
github.com/rundocs/jekyll-rtd-theme has been deleted. This broke our
docs generation as the remote theme configuration depended on
downloading the release artefact.

This patch changes the docs building to use a Ruby gem instead of the
remote theme setting. To complicate matters, the gem has a seemingly
incorrect (too strict) version dependency. To mitigate this, we now
install bundler-override plugin to ignore this particular dependency.

The netlify conf is a hack, but I wasn't able to figure out how to
install the bundler-override plugin without doing all ruby
initialization in the build command.
2024-08-08 23:33:37 +03:00
Kubernetes Prow Robot
8ffe9f9997
Merge pull request #1807 from ArangoGutierrez/upgrade
Add helm migration guide
2024-08-05 06:44:59 -07:00
Kubernetes Prow Robot
6929e03933
Merge pull request #1824 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/gomega-1.34.1
build(deps): bump github.com/onsi/gomega from 1.34.0 to 1.34.1
2024-08-04 22:40:38 -07:00
dependabot[bot]
e622bdb216
build(deps): bump github.com/onsi/gomega from 1.34.0 to 1.34.1
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega) from 1.34.0 to 1.34.1.
- [Release notes](https://github.com/onsi/gomega/releases)
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md)
- [Commits](https://github.com/onsi/gomega/compare/v1.34.0...v1.34.1)

---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-08-04 09:22:11 +00:00
Kubernetes Prow Robot
13d5a4f81e
Merge pull request #1815 from marquiz/devel/gc-paging
nfd-gc: use paging when listing CRs
2024-08-02 05:59:48 -07:00
Markus Lehtonen
d6c1a7e44f tests: better assertion message in nfd-gc unit tests 2024-08-02 08:23:54 +03:00
Markus Lehtonen
45164f580a nfd-gc: use paging when listing CRs
List NodeFeature and NodeResourceTopology objects in pages of 200 items.
This reduces memory consumption and eliminates timeouts (on the
apiserver side) in big clusters of thousands of nodes.
2024-08-02 08:20:17 +03:00
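The paging approach of commit 45164f580a can be sketched with a small self-contained Go program. The `listPage` function is a hypothetical stand-in for a paged list call against the apiserver, and the integer offset stands in for an opaque continue token; only the page size of 200 comes from the commit message.

```go
package main

import "fmt"

const pageSize = 200 // page size mentioned in the commit message

// page is one chunk of a paged list response.
type page struct {
	items []string
	next  int  // offset of the next page (stand-in for a continue token)
	more  bool // false once the last page has been returned
}

// listPage is a hypothetical stand-in for a paged list request.
func listPage(store []string, limit, offset int) page {
	end := offset + limit
	if end >= len(store) {
		return page{items: store[offset:]}
	}
	return page{items: store[offset:end], next: end, more: true}
}

// countAll walks all pages, holding only one page in memory at a time.
func countAll(store []string, limit int) (total, calls int) {
	offset := 0
	for {
		p := listPage(store, limit, offset)
		calls++
		total += len(p.items)
		if !p.more {
			return total, calls
		}
		offset = p.next
	}
}

func main() {
	store := make([]string, 450)
	for i := range store {
		store[i] = fmt.Sprintf("node-%d", i)
	}
	total, calls := countAll(store, pageSize)
	fmt.Printf("%d objects in %d calls\n", total, calls) // prints "450 objects in 3 calls"
}
```

Each request returns a bounded amount of data, so neither side has to build one very large response for the whole cluster.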
Kubernetes Prow Robot
57f1b79856
Merge pull request #1813 from marquiz/devel/gc-metalister
nfd-gc: only fetch object metadata
2024-08-01 12:53:33 -07:00
Kubernetes Prow Robot
b2f2888fd4
Merge pull request #1821 from ChaoyiHuang/fixlink
Docs: Fix the link to feature gates documentation
2024-08-01 06:57:32 -07:00
joehuang
a442749f89 Docs: Fix the link to feature gates documentation
The link to the feature gates documentation points to feature-gates.md
in master-commandline-reference.html and
worker-commandline-reference.html; it should link to the html file
instead.

Signed-off-by: joehuang <joehuang.sweden@gmail.com>
2024-08-01 09:37:10 +00:00
joehuang
efd2bac490 Fix the link to feature gates documentation
The link to the feature gates documentation points to the parent
folder in master-commandline-reference.md; it should link to the
file in the same folder.

Signed-off-by: joehuang <joehuang.sweden@gmail.com>
2024-08-01 01:15:03 +00:00
Markus Lehtonen
54befffa94 nfd-gc: only fetch object metadata
Significantly reduce the apiserver and network load by only
listing/getting the object metadata.
2024-07-30 16:01:04 +03:00
Kubernetes Prow Robot
2d24a4bee4
Merge pull request #1811 from marquiz/devel/informer-listopts
nfd-master: tweak list options for NodeFeature informer
2024-07-30 03:56:04 -07:00
Kubernetes Prow Robot
83779243ad
Merge pull request #1822 from omerap12/issue_1820
Docs: Fixed feature-gates reference
2024-07-30 01:24:38 -07:00
Omer Aplatony
b7c18b949d Docs: Fixed feature-gates reference
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2024-07-29 17:34:03 +03:00
Kubernetes Prow Robot
f58b09833e
Merge pull request #1818 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/ginkgo/v2-2.19.1
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1
2024-07-29 00:00:29 -07:00
dependabot[bot]
67aadf7b63
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo) from 2.19.0 to 2.19.1.
- [Release notes](https://github.com/onsi/ginkgo/releases)
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md)
- [Commits](https://github.com/onsi/ginkgo/compare/v2.19.0...v2.19.1)

---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-07-29 06:41:14 +00:00
Kubernetes Prow Robot
4564b0e25c
Merge pull request #1819 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/gomega-1.34.0
build(deps): bump github.com/onsi/gomega from 1.33.1 to 1.34.0
2024-07-28 23:40:09 -07:00
dependabot[bot]
07e7eb9b09
build(deps): bump github.com/onsi/gomega from 1.33.1 to 1.34.0
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega) from 1.33.1 to 1.34.0.
- [Release notes](https://github.com/onsi/gomega/releases)
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md)
- [Commits](https://github.com/onsi/gomega/compare/v1.33.1...v1.34.0)

---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-07-28 09:56:54 +00:00
Kubernetes Prow Robot
094e801b6d
Merge pull request #1812 from marquiz/devel/gc-cache-sync
nfd-gc: check that node informer cache sync succeeded
2024-07-26 07:34:14 -07:00
Markus Lehtonen
454d443b72 nfd-gc: check that node informer cache sync succeeded 2024-07-26 10:29:15 +03:00
Markus Lehtonen
a2068f7ce3 nfd-master: tweak list options for NodeFeature informer
Fix cache syncing problems on big clusters with thousands of NodeFeature
objects.

On the initial list (sync) the client-go cache reflector sets the
ResourceVersion to "0" (instead of leaving it empty). This causes
problems in the api server with (apiserver) logs like:

E writers.go:122] apiserver was unable to write a JSON response: http:
                  Handler timeout
E status.go:71] apiserver received an error that is not an
                metav1.Status: &errors.errorString{s:"http: Handler timeout"}:
                http: Handler timeout

On the nfd-master side we see corresponding log snippets like:

W reflector.go:547] failed to list *v1alpha1.NodeFeature: stream error
                    when reading response body, may be caused by closed
                    connection. Please retry. Original error: stream
                    error: stream ID 1521; INTERNAL_ERROR; received from
                    peer
I trace.go:236] "Reflector ListAndWatch" name:*** (***) (total time:
                61126ms): ---"Objects listed" error:stream error when
                reading response body, may be caused by closed
                connection. Please retry. Original error: stream
                error: stream ID 1521; INTERNAL_ERROR; received from
                peer 61126ms (***)

Decreasing the page size (opts.Limits) does not have any effect on the
timeouts. However, setting ResourceVersion to an empty value seems to
get the paging back on track, eliminating the timeouts.

TODO: investigate in Kubernetes upstream the root cause of the timeouts
with ResourceVersion="0".
2024-07-25 16:29:05 +03:00
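The workaround in commit a2068f7ce3 amounts to adjusting the list options before each list request is sent. Below is a minimal Go sketch of that idea, assuming a hook that receives the options by pointer. The `listOptions` struct and `tweakListOptions` function are hypothetical stand-ins and are not copied from nfd-master or client-go; the `Limit` value is arbitrary.

```go
package main

import "fmt"

// listOptions is a hypothetical stand-in for the options sent with a list
// request to the apiserver.
type listOptions struct {
	ResourceVersion string
	Limit           int64
}

// tweakListOptions clears ResourceVersion so that the request is not sent
// with "0". The page size is left untouched, since the commit message
// reports that changing it had no effect on the timeouts.
func tweakListOptions(opts *listOptions) {
	opts.ResourceVersion = ""
}

func main() {
	// What the cache reflector would set on the initial list (sync).
	opts := listOptions{ResourceVersion: "0", Limit: 200}
	tweakListOptions(&opts)
	fmt.Printf("ResourceVersion=%q Limit=%d\n", opts.ResourceVersion, opts.Limit)
}
```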
Kubernetes Prow Robot
bd8d74d6f2
Merge pull request #1809 from marquiz/devel/informer-cache-sync
nfd-master: check nfd api informer cache sync result
2024-07-25 01:22:14 -07:00
Markus Lehtonen
ea3243fb00 nfd-master: check nfd api informer cache sync result
Bail out if there were errors in syncing the cache of any resource.
2024-07-25 09:58:40 +03:00
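The check described in commit ea3243fb00 can be sketched as follows, assuming the cache sync step reports one boolean per resource type. The `checkSynced` helper and the string keys are hypothetical; the real code works with informer types rather than plain strings.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// checkSynced returns an error naming every resource whose cache failed
// to sync, or nil if all of them succeeded.
func checkSynced(results map[string]bool) error {
	var failed []string
	for resource, ok := range results {
		if !ok {
			failed = append(failed, resource)
		}
	}
	if len(failed) == 0 {
		return nil
	}
	sort.Strings(failed) // map iteration order is random; keep output stable
	return fmt.Errorf("cache sync failed for: %s", strings.Join(failed, ", "))
}

func main() {
	// Hypothetical sync results for two resource types.
	results := map[string]bool{
		"NodeFeature":     false,
		"NodeFeatureRule": true,
	}
	if err := checkSynced(results); err != nil {
		fmt.Println("bailing out:", err)
	}
}
```

Failing early on a partial sync avoids running the controller against an incomplete view of the cluster.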
Carlos Eduardo Arango Gutierrez
cb53f9f3c2
Add helm migration guide
Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>
2024-07-23 16:20:45 +02:00
Kubernetes Prow Robot
493aa0c80e
Merge pull request #1801 from omerap12/issue_1730
helm: add configurable liveness&readiness probes for master topology-updater and worker
2024-07-23 05:50:22 -07:00
Omer Aplatony
b2222e2c8c helm: add configurable liveness&readiness probes for master topology-updater and worker
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2024-07-22 21:54:25 +03:00