1
0
Fork 0
mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2024-12-14 11:57:51 +00:00
Commit graph

1774 commits

Author SHA1 Message Date
Carlos Eduardo Arango Gutierrez
f5df7b658c
nfd-master: reject malformed extended resource dynamic capacity assignment
Reject malformed extended resource dynamic capacity assignment
capacity should be in the form of domain.feature.element,
add logic at func filterExtendedResources to check if true or ignore
ExtendedResource, logging as an error.

Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>
2023-04-22 08:43:50 +02:00
Kubernetes Prow Robot
d5bccda7c5
Merge pull request #1171 from ArangoGutierrez/foundon_typo
pkg/nfd-master/nfd-master.go: Fix typo
2023-04-21 12:21:11 -07:00
Kubernetes Prow Robot
c2c1e18908
Merge pull request #1173 from marquiz/devel/fix-master
nfd-master: fix a crash when processing NodeFeatureRules
2023-04-21 09:49:11 -07:00
Markus Lehtonen
9523f1e411 nfd-master: fix a crash when processing NodeFeatureRules
Fix a a bug where nfd-master with NodeFeature API enabled would crash
when NodeFeatureRule objects were processed in the case where no
NodeFeature objects existed. This was caused by trying to insert values
into a non-initialized NodeFeatureSpec in the code.

This patch adds two safety measures to prevent that from happening in
the future. First, add a constructor function for the NodeFeatureSpec
type, and second, check for uninitialized object in the function
inserting new functions.

TODO: add unit tests for the API helper functions.
2023-04-21 19:24:08 +03:00
Kubernetes Prow Robot
a425e30bad
Merge pull request #1166 from marquiz/devel/master-annotations
nfd-master: don't create emtpy annotations
2023-04-21 08:53:12 -07:00
Carlos Eduardo Arango Gutierrez
ae22031547
pkg/nfd-master/nfd-master.go: Fix typo
Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>
2023-04-21 16:17:11 +02:00
Markus Lehtonen
37306662fe nfd-master: don't create emtpy annotations
Make the nfd.node.kubernetes.io/feature-labels and
nfd.node.kubernetes.io/extended-resources annotations behave similary to
the taints annotation: only create the annotations if some labels or
extended resources are created.
2023-04-21 16:14:17 +03:00
Kubernetes Prow Robot
e0b076e08b
Merge pull request #1160 from marquiz/fixes/prune
nfd-master: fix -prune
2023-04-21 05:11:11 -07:00
Markus Lehtonen
f4de7ed8ee deployment/kustomize: add master config to prune overlay
Otherwise pods error out with failed mount of nfd-master-conf ConfigMap.
2023-04-20 20:38:36 +03:00
Markus Lehtonen
f0f6bbcf36 nfd-master: configure before prune
Otherwise prune will crash because of uninitialized configuration.
2023-04-20 20:38:11 +03:00
Kubernetes Prow Robot
5d9907ae86
Merge pull request #1161 from marquiz/fixes/prune-no-publish
nfd-master: support noPublish with -prune
2023-04-19 10:01:11 -07:00
Markus Lehtonen
32db081f3a nfd-master: support noPublish with -prune
Better this way than to crash which is what currently happens with this
combination.
2023-04-19 15:58:06 +03:00
Kubernetes Prow Robot
322574385f
Merge pull request #1159 from marquiz/devel/generate-mockery
generate: update mockery to v2.25.1
2023-04-19 04:09:11 -07:00
Markus Lehtonen
18f7bfa8e8 generate: update mockery to v2.25.1
Bump the vektra/mockery tool to the latest release.
2023-04-19 13:33:42 +03:00
Kubernetes Prow Robot
a09a646102
Merge pull request #1158 from marquiz/devel/generate-controller-gen
generate: update controller-gen to v0.11.3
2023-04-19 03:23:11 -07:00
Markus Lehtonen
a5ec646c48 generate: update controller-gen to v0.11.3
Update controller-gen tool from sigs.k8s.io/controller-tools to the
latest release.

Also, bump goimports from golang.org/x/tools to the latest version.
2023-04-19 12:48:12 +03:00
Kubernetes Prow Robot
6b85d9d66b
Merge pull request #1157 from marquiz/devel/generate-protobuf
generate: update protoc to v22.3
2023-04-19 02:42:45 -07:00
Markus Lehtonen
117baac1a6 generate: update protoc to v22.3 2023-04-19 10:44:55 +03:00
Kubernetes Prow Robot
e0af9c429e
Merge pull request #1156 from marquiz/devel/codegen
generate: update k8s code-generator to v0.27.1
2023-04-19 00:18:46 -07:00
Markus Lehtonen
ca7ed04a34 generate: update auto-generated code
Re-run "make generate".
2023-04-19 09:49:17 +03:00
Markus Lehtonen
bcd6ded4c0 generate: update k8s code-generator to v0.27.1
Also, don't pre-install all k8s code-generator tools in the image. The
generate-groups.sh of code-generator will do "go install" for the tools
it needs which fails with permission denied error in some environments
as it tries to overwrite the existing files owned by root.
2023-04-19 09:49:17 +03:00
Kubernetes Prow Robot
e8183499d3
Merge pull request #1155 from marquiz/devel/deps
deps: Update kubernetes to v1.27.1
2023-04-18 23:02:45 -07:00
Markus Lehtonen
87371e2df0 test/e2e: adapt tests to updates in k8s e2e-framework
Add context to functions that now require it. Also, replace the
deprecated wait.Poll* calls with wait.PollUntilContextTimeout.
2023-04-18 23:04:34 +03:00
Markus Lehtonen
e2d5ba1a2b pkg/podres: update mocked PodResourcesListerClient
Update mocked implementation of
k8s.io/kubelet/pkg/apis/podresources/v1.PodResourcesListerClient. The
mocked implementation is moved to a separate "mocks" subpackage as it's
for an external interface.

This patch also adds code for auto-generation for the mocked interface.
2023-04-18 20:51:51 +03:00
Markus Lehtonen
ba4b9b3432 go.mod: update kubernetes to v1.27.1 2023-04-18 20:51:51 +03:00
Kubernetes Prow Robot
a6bed7d0cf
Merge pull request #1154 from marquiz/devel/e2e-ctx
test/e2e: use proper context
2023-04-18 10:48:58 -07:00
Kubernetes Prow Robot
82a423b223
Merge pull request #1153 from marquiz/devel/readme
README: update for release v0.13.0
2023-04-18 08:26:59 -07:00
Markus Lehtonen
b53461c09b README: update for release v0.13.0 2023-04-18 14:57:23 +03:00
Markus Lehtonen
ad8bd057b7 test/e2e: use proper context
Eliminate all context.TODO() from the e2e tests and use ginkgo context
instead. This ensures that calls involving context are properly
cancelled and return fast in case the tests get aborted.
2023-04-18 14:55:09 +03:00
Kubernetes Prow Robot
8592f3ea8d
Merge pull request #1151 from marquiz/devel/hack
hack/prepare-release.sh: fix name of one e2e test file
2023-04-17 21:58:57 -07:00
Markus Lehtonen
e5d83d031b hack/prepare-release.sh: fix name of one e2e test file 2023-04-17 23:43:49 +03:00
Kubernetes Prow Robot
b0c52fe28f
Merge pull request #1149 from ArangoGutierrez/sev_capacity
cpu: expose the total number of AMD SEV ASID and ES
2023-04-17 13:22:58 -07:00
Carlos Eduardo Arango Gutierrez
05ef5d4e9d
cpu: expose the total number of AMD SEV ASID and ES
This patch add SEV ASIDs and the related (but distinct) SEV Encrypted State
(SEV-ES) IDs as two quantities to be exposed via extended resources.
In a kernel built with CONFIG_CGROUP_MISC on a suitably equipped AMD CPU, the
root control group will have a misc.capacity file that shows the number of
available IDs in each category.

The added extended resources are:
- sev.asids
- sev.encrypted_state_ids

Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>
2023-04-17 19:34:39 +02:00
Kubernetes Prow Robot
df584e03ed
Merge pull request #1145 from marquiz/devel/grpc-probe
Dockerfile: bump grpc-health-probe to v0.4.18
2023-04-17 05:28:43 -07:00
Markus Lehtonen
ecc242d78a Dockerfile: bump grpc-health-probe to v0.4.18
A new version that was just released.
2023-04-17 14:30:08 +03:00
Kubernetes Prow Robot
ca59fc0594
Merge pull request #1140 from marquiz/devel/owners
OWNERS: add PiotrProkop as a reviewer
2023-04-17 03:22:43 -07:00
Markus Lehtonen
57e21969d0 OWNERS: add PiotrProkop as a reviewer 2023-04-17 12:58:16 +03:00
Kubernetes Prow Robot
018cd33306
Merge pull request #1095 from fmuyassarov/codecov-uploader
e2e: add codecov uploader configuration
2023-04-14 14:30:41 -07:00
Kubernetes Prow Robot
fef5e56051
Merge pull request #1129 from mythi/sgx-epc
cpu: Expose SGX EPC resource
2023-04-14 10:42:41 -07:00
Mikko Ylinen
de1b69a8bf cpu: make SGX EPC resource available to NodeFeatureRules
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-04-14 15:31:54 +03:00
Kubernetes Prow Robot
cb604b877c
Merge pull request #1130 from marquiz/devel/tdx
source/cpu: don't create cpu-security.tdx.total_keys label
2023-04-14 04:18:41 -07:00
Markus Lehtonen
3320c74472 source/cpu: don't create cpu-security.tdx.total_keys label
Just have that as a feature for NodeFeatureRules to consume.
2023-04-14 13:33:13 +03:00
Kubernetes Prow Robot
84c348b69f
Merge pull request #1126 from marquiz/devel/er-deprecation
nfd-master: deprecate the -resource-labels flag
2023-04-13 10:52:39 -07:00
Kubernetes Prow Robot
8d71ed6755
Merge pull request #1086 from AhmedGrati/feat-support-builtin-kernel-mods
feat: support builtin kernel mods
2023-04-13 10:30:40 -07:00
Kubernetes Prow Robot
47acda75c3
Merge pull request #1128 from marquiz/devel/test-timeout
Makefile: set e2e test timeout to 1 hour
2023-04-13 09:24:38 -07:00
Markus Lehtonen
3a1a8d4c6f Makefile: set e2e test timeout to 1 hour
Previously we were using the default, which even if equal to 0, still
means 10 minute timout in practice (with the way we run the tests with
invoking go test directly). With the addition of latest e2e tests we
hit the limit and got bitten by it. Set the timeout to 1 hour which
should be enough for anyone...
2023-04-13 18:57:19 +03:00
Kubernetes Prow Robot
f9cc798057
Merge pull request #1127 from marquiz/devel/nfd-master-retry
nfd-master: re-try on node update failures
2023-04-13 07:14:39 -07:00
Markus Lehtonen
6b2d10753f nfd-master: re-try on node update failures
Change the NFD API handler to re-try on node update failures. Will work
around transient failures, making sure that failed nodes (i.e. nodes
that we failed to update) don't need to wait for the 1 hour resync
period before being tried again.
2023-04-13 16:30:31 +03:00
AhmedGrati
109caa1f28 feat: support builtin kernel mods
This PR adds the combination of dynamic and builtin kernel modules into
one feature called `kernel.enabledmodule`. It's a superset of the
`kernel.loadedmodule` feature.

Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2023-04-13 10:19:24 +01:00
Markus Lehtonen
8511980bf4 nfd-master: deprecate the -resource-labels flag
Mark the -resource-labels flag (and the corresponding resourceLabels
config option) as deprecated. We now support managing extended resources
via NodeFeatureRule objects. This kludge deserves to go, eventually.
2023-04-13 11:30:58 +03:00