AhmedGrati
7bad0d583c
feat/nfd-master: support CR restrictions
...
Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2024-08-10 22:39:10 +02:00
Kubernetes Prow Robot
c1db298de8
Merge pull request #1827 from fmuyassarov/fix-go-mod
...
tilt: sync up builder go version with project go.mod
2024-08-08 14:22:37 -07:00
Feruzjon Muyassarov
a7d8448eb6
tilt: sync up builder go version with project go.mod
...
Signed-off-by: Feruzjon Muyassarov <feruzjon.muyassarov@intel.com>
2024-08-09 00:00:47 +03:00
Kubernetes Prow Robot
d38e116667
Merge pull request #1829 from marquiz/devel/docs-theme
...
docs: use jekyll-rtd-theme from a ruby gem
2024-08-08 13:53:58 -07:00
Markus Lehtonen
b2bc18f5a5
docs: use jekyll-rtd-theme from a ruby gem
...
The upstream repo (and the release downloads)
github.com/rundocs/jekyll-rtd-theme has been deleted. This broke our
docs generation as the remote theme configuration depended on
downloading the release artefact.
This patch changes the docs building to use a Ruby gem instead of the
remote theme setting. To complicate matters, the gem has an seemingly
incorrect (too strict) version dependency. To mitigate this, we now
install bundler-override plugin to ignore this particular dependency.
The netlify conf is a hack, but I wasn't able to figure out a way how to
install the bundler-override plugin without doing all ruby
initialization in the build command.
2024-08-08 23:33:37 +03:00
Kubernetes Prow Robot
8ffe9f9997
Merge pull request #1807 from ArangoGutierrez/upgrade
...
Add helm migration guide
2024-08-05 06:44:59 -07:00
Kubernetes Prow Robot
6929e03933
Merge pull request #1824 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/gomega-1.34.1
...
build(deps): bump github.com/onsi/gomega from 1.34.0 to 1.34.1
2024-08-04 22:40:38 -07:00
dependabot[bot]
e622bdb216
build(deps): bump github.com/onsi/gomega from 1.34.0 to 1.34.1
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.34.0 to 1.34.1.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.34.0...v1.34.1 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-08-04 09:22:11 +00:00
Kubernetes Prow Robot
13d5a4f81e
Merge pull request #1815 from marquiz/devel/gc-paging
...
nfd-gc: use paging when listing CRs
2024-08-02 05:59:48 -07:00
Markus Lehtonen
45164f580a
nfd-gc: use paging when listing CRs
...
List NodeFeature and NodeResourceTopology objects in pages of 200 items.
This reduces memory consumption and eliminates timeouts (on the
apiserver side) in big clusters of thousands of nodes.
2024-08-02 08:20:17 +03:00
Kubernetes Prow Robot
57f1b79856
Merge pull request #1813 from marquiz/devel/gc-metalister
...
nfd-gc: only fetch object metadata
2024-08-01 12:53:33 -07:00
Kubernetes Prow Robot
b2f2888fd4
Merge pull request #1821 from ChaoyiHuang/fixlink
...
Docs: Fix the link to feature gates documentation
2024-08-01 06:57:32 -07:00
joehuang
a442749f89
Docs: Fix the link to feature gates documentation
...
The link to feature gates documentation is pointing to the
feature-gates.md in master-commandline-reference.html and
worker-commandline-reference.html, it should be updated to
linking html file.
Signed-off-by: joehuang <joehuang.sweden@gmail.com>
2024-08-01 09:37:10 +00:00
joehuang
efd2bac490
Fix the link to feature gates documentation
...
The link to feature gates documentation is pointing to the
upward folder in master-commandline-reference.md, it should
be updated to linking file in the same folder.
Signed-off-by: joehuang <joehuang.sweden@gmail.com>
2024-08-01 01:15:03 +00:00
Markus Lehtonen
54befffa94
nfd-gc: only fetch object metadata
...
Significantly reduce the apiserver and network load by only
listing/getting the object metadata.
2024-07-30 16:01:04 +03:00
Kubernetes Prow Robot
2d24a4bee4
Merge pull request #1811 from marquiz/devel/informer-listopts
...
nfd-master: tweak list options for NodeFeature informer
2024-07-30 03:56:04 -07:00
Kubernetes Prow Robot
83779243ad
Merge pull request #1822 from omerap12/issue_1820
...
Docs: Fixed feature-gates reference
2024-07-30 01:24:38 -07:00
Omer Aplatony
b7c18b949d
Docs: Fixed featue-gates reference
...
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2024-07-29 17:34:03 +03:00
Kubernetes Prow Robot
f58b09833e
Merge pull request #1818 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/ginkgo/v2-2.19.1
...
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1
2024-07-29 00:00:29 -07:00
dependabot[bot]
67aadf7b63
build(deps): bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.19.0 to 2.19.1.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.19.0...v2.19.1 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-29 06:41:14 +00:00
Kubernetes Prow Robot
4564b0e25c
Merge pull request #1819 from kubernetes-sigs/dependabot/go_modules/master/github.com/onsi/gomega-1.34.0
...
build(deps): bump github.com/onsi/gomega from 1.33.1 to 1.34.0
2024-07-28 23:40:09 -07:00
dependabot[bot]
07e7eb9b09
build(deps): bump github.com/onsi/gomega from 1.33.1 to 1.34.0
...
Bumps [github.com/onsi/gomega](https://github.com/onsi/gomega ) from 1.33.1 to 1.34.0.
- [Release notes](https://github.com/onsi/gomega/releases )
- [Changelog](https://github.com/onsi/gomega/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/gomega/compare/v1.33.1...v1.34.0 )
---
updated-dependencies:
- dependency-name: github.com/onsi/gomega
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-28 09:56:54 +00:00
Kubernetes Prow Robot
094e801b6d
Merge pull request #1812 from marquiz/devel/gc-cache-sync
...
nfd-gc: check that node informer cache sync succeeded
2024-07-26 07:34:14 -07:00
Markus Lehtonen
454d443b72
nfd-gc: check that node informer cache sync succeeded
2024-07-26 10:29:15 +03:00
Markus Lehtonen
a2068f7ce3
nfd-master: tweak list options for NodeFeature informer
...
Fix cache syncing problems on big clusters with thousands of NodeFeature
objects.
On the initial list (sync) the client-go cache reflector sets the
ResourceVersion to "0" (instead of leaving it empty). This causes
problems in the api server with (apiserver) logs like:
E writers.go:122] apiserver was unable to write a JSON response: http:
Handler timeout
E status.go:71] apiserver received an error that is not an
metav1.Status: &errors.errorString{s:"http: Handler timeout"}:
http: Handler timeout
On the nfd-master side we see corresponding log snippets like:
W reflector.go:547] failed to list *v1alpha1.NodeFeature: stream error
when reading response body, may be caused by closed
connection. Please retry. Original error: stream
error: stream ID 1521; INTERNAL_ERROR; received from
peer
I trace.go:236] "Reflector ListAndWatch" name:*** (***) (total time:
61126ms): ---"Objects listed" error:stream error when
reading response body, may be caused by closed
connection. Please retry. Original error: stream
error: stream ID 1521; INTERNAL_ERROR; received from
peer 61126ms (***)
Decreasing the page size (opts.Limits) does not have any effect on the
timeouts. However, setting ResourceVersion to an empty value seems to
get the paging on its tracks, eliminating the timeouts.
TODO: investigate in Kubernetes upstream the root cause of the timeouts
with ResourceVersion="0".
2024-07-25 16:29:05 +03:00
Kubernetes Prow Robot
bd8d74d6f2
Merge pull request #1809 from marquiz/devel/informer-cache-sync
...
nfd-master: check nfd api informer cache sync result
2024-07-25 01:22:14 -07:00
Markus Lehtonen
ea3243fb00
nfd-master: check nfd api informer cache sync result
...
Bail out if there were errors in syncing the cache of any resource.
2024-07-25 09:58:40 +03:00
Carlos Eduardo Arango Gutierrez
cb53f9f3c2
Add helm migration guide
...
Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>
2024-07-23 16:20:45 +02:00
Kubernetes Prow Robot
493aa0c80e
Merge pull request #1801 from omerap12/issue_1730
...
helm: add configurable liveness&readiness probes for master topology-updater and worker
2024-07-23 05:50:22 -07:00
Omer Aplatony
b2222e2c8c
helm: add configurable liveness&readiness probes for master topology-updater and worker
...
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2024-07-22 21:54:25 +03:00
Kubernetes Prow Robot
4e4ccf8818
Merge pull request #1804 from kubernetes-sigs/dependabot/go_modules/master/k8sio-6d3858f6bd
...
build(deps): bump k8s.io/kubernetes from 1.30.2 to 1.30.3 in the k8sio group
2024-07-22 05:35:16 -07:00
dependabot[bot]
0a63965648
build(deps): bump k8s.io/kubernetes in the k8sio group
...
Bumps the k8sio group with 1 update: [k8s.io/kubernetes](https://github.com/kubernetes/kubernetes ).
Updates `k8s.io/kubernetes` from 1.30.2 to 1.30.3
- [Release notes](https://github.com/kubernetes/kubernetes/releases )
- [Commits](https://github.com/kubernetes/kubernetes/compare/v1.30.2...v1.30.3 )
---
updated-dependencies:
- dependency-name: k8s.io/kubernetes
dependency-type: direct:production
update-type: version-update:semver-patch
dependency-group: k8sio
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-22 09:41:19 +03:00
Kubernetes Prow Robot
51f2794d4f
Merge pull request #1798 from marquiz/devel/docker-cache
...
Dockerfile: cache go modules on build
2024-07-18 09:14:03 -07:00
Markus Lehtonen
2f52f35720
Dockerfile: cache go modules on build
2024-07-18 15:58:16 +03:00
Rouke Broersma
1230d607ac
Helm: Add revision history limit for worker daemonset ( #1797 )
...
* Helm: Add revision history limit for worker daemonset
Signed-off-by: Rouke Broersma <mobrockers@gmail.com>
* Helm: Add revision history limit for topology updater daemonset
Signed-off-by: Rouke Broersma <mobrockers@gmail.com>
* chore: tidy table columns
---------
Signed-off-by: Rouke Broersma <mobrockers@gmail.com>
2024-07-18 05:31:49 -07:00
Kubernetes Prow Robot
876eea4374
Merge pull request #1796 from marquiz/devel/helm-lint
...
scripts/test-infra: bump helm to v3.15.3
2024-07-18 01:16:13 -07:00
Markus Lehtonen
5caaa88d51
scripts/test-infra: bump helm to v3.15.3
...
Helm is used for linting the chart in the CI.
2024-07-18 08:51:38 +03:00
Kubernetes Prow Robot
a4f41476b0
Merge pull request #1795 from marquiz/devel/golangci-lint
...
scripts/test-infra: bump golangci-lint to v1.59.1
2024-07-17 12:00:01 -07:00
Markus Lehtonen
4f60f4d8bf
scripts/test-infra: bump golangci-lint to v1.59.1
2024-07-17 18:17:03 +03:00
Kubernetes Prow Robot
e30cf85f61
Merge pull request #1778 from marquiz/devel/nf-api-ga
...
feature-gates: mark NodeFeatureAPI as GA
2024-07-16 06:22:59 -07:00
Kubernetes Prow Robot
f21e694967
Merge pull request #1794 from marquiz/devel/readme
...
README: update to v0.16.3
2024-07-16 05:26:59 -07:00
Markus Lehtonen
8bef70c470
README: update to v0.16.3
2024-07-16 15:21:10 +03:00
Markus Lehtonen
25e827a4c8
feature-gates: mark NodeFeatureAPI as GA
...
The feature gate is locked to true. That is, it is not possible to revert
back to the gPRC-based communication which makes the gRPC API ready for
removal.
2024-07-16 13:53:31 +03:00
Kubernetes Prow Robot
57fc83f8c0
Merge pull request #1793 from marquiz/devel/e2e-topology-updater
...
test/e2e: specify -sleep-interval in topology-updater exclude-memory test
2024-07-16 03:08:48 -07:00
Markus Lehtonen
5aeea28957
test/e2e: specify -sleep-interval in topology-updater exclude-memory test
...
Make the test finish considerably faster.
2024-07-16 12:47:08 +03:00
Kubernetes Prow Robot
81320d1864
Merge pull request #1791 from marquiz/devel/docs-helm
...
docs: reformat tables of helm parameters
2024-07-16 01:38:57 -07:00
Kubernetes Prow Robot
8e0951385b
Merge pull request #1790 from marquiz/devel/helm-values
...
helm: drop trailing whitespace from values.yaml
2024-07-16 01:18:33 -07:00
Kubernetes Prow Robot
ea199c7e37
Merge pull request #1792 from marquiz/devel/e2e-timeout
...
test/e2e: set topology-updater sleep-interval in podfingerprint test
2024-07-16 01:01:12 -07:00
Markus Lehtonen
5a81f748bf
test/e2e: set topology-updater sleep-interval in podfingerprint test
...
Run topology-updater with short sleep-interval to try to eliminate
flakiness in CI.
2024-07-16 10:22:46 +03:00
Markus Lehtonen
efdf1b8bd9
docs: reformat tables of helm parameters
...
Also correct the description of default value of master.tolerations.
2024-07-16 09:56:12 +03:00