node-feature-discovery

mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2024-12-14 11:57:51 +00:00

Author	SHA1	Message	Date
Kubernetes Prow Robot	c29ab3bb35	Merge pull request #652 from k8stopologyawareschedwg/consume-cluster-scoped-api NFD-Topology-Updater: Bump NRT API to version v0.0.12	2021-11-16 07:13:27 -08:00
Swati Sehgal	b444ef95a8	NFD-Topology-Updater: Bump NRT API to version v0.0.12 The NodeResourceTopology API has been made cluster scoped as in the current context a CR corresponds to a Node and since Node is a cluster scoped resource it makes sense to make NRT cluster scoped as well. Ref: https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/18 Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-11-16 13:28:23 +00:00
Markus Lehtonen	6cbed379df	source/custom: implement matchAny directive Implement a new 'matchAny' directive in the new rule format, building on top of the previously implemented 'matchFeatures' matcher. MatchAny applies a logical OR over multiple matchFeatures directives. That is, it allows specifying multiple alternative matchers (at least one of which must match) in a single label rule. The configuration format for the new matchers is matchAny: - matchFeatures: - feature: <domain>.<feature> matchExpressions: <attribute>: op: <operator> value: - <list-of-values> - matchFeatures: ... A configuration example. In order to require a cpu feature, kernel module and one of two specific PCI devices (taking use of the shortform notation): - name: multi-device-test labels: multi-device-feature: "true" matchFeatures: - feature: kernel.loadedmodule matchExpressions: [driver-module] - feature: cpu.cpuid matchExpressions: [AVX512F] matchAny: - matchFeatures: - feature; pci.device matchExpressions: vendor: "8086" device: "1234" - matchFeatures: - feature: pci.device matchExpressions: vendor: "8086" device: "abcd"	2021-11-12 16:51:30 +02:00
Markus Lehtonen	e206f0b86b	source/custom: implement generic feature matching Implement generic feature matchers that cover all feature sources (that implement the FeatureSource interface). The implementation relies on the unified data model provided by the FeatureSource interface as well as the generic expression-based rule processing framework that was added to the source/custom/expression package. With this patch any new features added will be automatically available for custom rules, without any additional work. Rule hierarchy follows the source/feature hierarchy by design. This patch introduces a new format for custom rule specifications, dropping the 'value' field and introducing new 'labels' field which makes it possible to specify multiple labels per rule. Also, in the new format the 'name' field is just for reference and no matching label is created. The new generic rules are available in this new rule format under a 'matchFeatures. MatchFeatures implements a logical AND over an array of per-feature matchers - i.e. a match for all of the matchers is required. The goal of the new rule format is to make it better follow K8s API design guidelines and make it extensible for future enhancements (e.g. addition of templating, taints, annotations, extended resources etc). The old rule format (with cpuID, kConfig, loadedKMod, nodename, pciID, usbID rules) is still supported. The rule format (new vs. old) is determined at config parsing time based on the existence of the 'matchOn' field. The new rule format and the configuration format for the new matchFeatures field is - name: <rule-name> labels: <key>: <value> ... matchFeatures: - feature: <domain>.<feature> matchExpressions: <attribute>: op: <operator> value: - <list-of-values> - feature: <domain>.<feature> ... Currently, "cpu", "kernel", "pci", "system", "usb" and "local" sources are covered by the matshers/feature selectors. Thus, the following features are available for matching with this patch: - cpu.cpuid: <cpuid-flag>: <exists/does-not-exist> - cpu.cstate: enabled: <bool> - cpu.pstate: status: <string> turbo: <bool> scaling_governor: <string> - cpu.rdt: <rdt-feature>: <exists/does-not-exist> - cpu.sst: bf.enabled: <bool> - cpu.topology: hardware_multithreading: <bool> - kernel.config: <flag-name>: <string> - kernel.loadedmodule: <module-name>: <exists/does-not-exist> - kernel.selinux: enabled: <bool> - kernel.version: major: <int> minor: <int> revision: <int> full: <string> - system.osrelease: <key-name>: <string> VERSION_ID.major: <int> VERSION_ID.minor: <int> - system.name: nodename: <string> - pci.device: <device-instance>: class: <string> vendor: <string> device: <string> subsystem_vendor: <string> susbystem_device: <string> sriov_totalvfs: <int> - usb.device: <device-instance>: class: <string> vendor: <string> device: <string> serial: <string> - local.label: <label-name>: <string> The configuration also supports some "shortforms" for convenience: matchExpressions: [<attr-1>, <attr-2>=<val-2>] --- matchExpressions: <attr-3>: <attr-4>: <val-4> is equal to: matchExpressions: <attr-1>: {op: Exists} <attr-2>: {op: In, value: [<val-2>]} --- matchExpressions: <attr-3>: {op: Exists} <attr-4>: {op: In, value: [<val-4>]} In other words: - feature: kernel.config matchExpressions: ["X86", "INIT_ENV_ARG_LIMIT=32"] - feature: pci.device matchExpressions: vendor: "8086" is the same as: - feature: kernel.config matchExpressions: X86: {op: Exists} INIT_ENV_ARG_LIMIT: {op: In, values: ["32"]} - feature: pci.device matchExpressions: vendor: {op: In, value: ["8086"] Some configuration examples below. In order to match a CPUID feature the following snippet can be used: - name: cpu-test-1 labels: cpu-custom-feature: "true" matchFeatures: - feature: cpu.cpuid matchExpressions: AESNI: {op: Exists} AVX: {op: Exists} In order to match against a loaded kernel module and OS version: - name: kernel-test-1 labels: kernel-custom-feature: "true" matchFeatures: - feature: kernel.loadedmodule matchExpressions: e1000: {op: Exists} - feature: system.osrelease matchExpressions: NAME: {op: InRegexp, values: ["^openSUSE"]} VERSION_ID.major: {op: Gt, values: ["14"]} In order to require a kernel module and both of two specific PCI devices: - name: multi-device-test labels: multi-device-feature: "true" matchFeatures: - feature: kernel.loadedmodule matchExpressions: driver-module: {op: Exists} - pci.device: vendor: "8086" device: "1234" - pci.device: vendor: "8086" device: "abcd"	2021-11-12 16:51:13 +02:00
Elias Koromilas	e22b937391	Implicitly generate the worker ConfigMap name Signed-off-by: Elias Koromilas <elias.koromilas@gmail.com>	2021-11-03 11:21:58 +02:00
Wei Zhang	158a5590ab	deployment: add topology updater helm chart Signed-off-by: Wei Zhang <kweizh@gmail.com>	2021-10-26 10:52:40 +08:00
Elias Koromilas	c17a898c4c	deployment: Simplify NFD worker configuration in Helm (#627 ) * Simplify NFD worker service configuration in Helm Signed-off-by: Elias Koromilas <elias.koromilas@gmail.com> * Update docs/get-started/deployment-and-usage.md Co-authored-by: Markus Lehtonen <markus.lehtonen@intel.com> Co-authored-by: Markus Lehtonen <markus.lehtonen@intel.com>	2021-10-25 09:34:23 -07:00
Markus Lehtonen	890d9455f1	deployment/helm: don't force sleep-interval in worker cmdline flags Drop --sleep-interval from the template. We really don't want to do that as. First, it's the default value so no use repeating that in the template. And more importantly, the commandline flag will override anything that will be provided in the worker config file, making it impossible for users to specify the sleep interval (other than by editing the template directly).	2021-10-21 11:33:19 +03:00
Markus Lehtonen	e342076a5e	deployment: clean up base/topologyupdater-daemonset The base should really have the very bare minimum. Remove all redundant (at default-value) args and move the others to the specific topologyupdater kustomize component. This also makes these settings re-usable in user-specific overlays (that are not based on topologyupdater-daemonset).	2021-10-06 21:42:31 +03:00
Wei Zhang	4b1e9d7211	deployment: drop the topology updater job	2021-10-06 10:28:37 +08:00
Markus Lehtonen	1e85001a5f	deployment: align topologyupdater overlays Align "topologyupdater" overlay with "topologyupdater-job". Both should deploy topologyupdater as a standalone application. Previously the topologyupdater overlay did not deploy nfd-master at all (but deployed nfd-worker instead) causing the pods to end up in crashloopbackoff as there was no master to communicate with.	2021-09-30 10:22:24 +03:00
Markus Lehtonen	d2751102ef	deployment: fix typo in overlay name Rename topologupdater-job to topologyupdater-job.	2021-09-24 22:04:35 +03:00
Swati Sehgal	a2c066dc0d	topologyupdater: manifests: topologyupdater deployment files - create an overlay for deployment of all components - create an overlay for just topologyupdater deployment (to be deployed in conjunction with the default overlay) - create a separate overlay for deployment of master and topologyupdater-job Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:48:10 +01:00
Markus Lehtonen	3706de9308	deployment: fix formatting of the worker conf sample	2021-09-17 14:25:48 +03:00
Jorik Jonker	501ff37592	deployment: optional mount of /usr/src This commit makes the mount of /usr/src optional in the Helm chart, and removes it from the kustomization. Reason is that some systems do not have a /usr/src (such as Talos) and have a R/O filesystem. Since /usr/src is optional per FHS 3.0, NFD should not assume its presence. Signed-off-by: Jorik Jonker <jorik@kippendief.biz>	2021-08-26 10:52:26 +02:00
Markus Lehtonen	a3b2d97513	kustomize: fix broken master-worker-combined base Got broken unnoticed with the addition of liveness and readiness probes.	2021-08-19 23:22:28 +03:00
Carlos Eduardo Arango Gutierrez	dece85b394	Add livenessProbe via grpc to nfd-master Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>	2021-08-18 10:23:10 -05:00
Markus Lehtonen	1f8a6d7819	kustomize: add standard-combined overlay Replicates nfd-daemonset-combined.yaml.template. In addition to the overlay we need to add a separate set of patches under components/common in order to handle the double-container pod.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	b38cf997d5	kustomize: add prune overlay Add an overlay for deploying "nfd-master --prune". Replaces nfd-prune.yaml.template.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	787ebfe441	kustomize: add Job example deployment Add a new base kustomization for worker Job and an overlay stitching up the complete deployment. Replaces nfd-worker-job.yaml.template.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	3737e0f6a3	kustomize: add an example custom rules configmap Add an example kustomize overlay for deploying a configmap specifying extra rules for the custom feature source.	2021-08-18 14:05:57 +03:00
Markus Lehtonen	03b67f8d6a	kustomize: add support for cert-manager Add an example kustomize overlay for enabling cert-manager in an NFD deployment.	2021-08-18 14:05:57 +03:00
Markus Lehtonen	8117c099a3	deployment: add kustomize base Implement functionality virtually replicating deployment templates for nfd-master and nfd-worker daemonset (nfd-master.yaml.template and nfd-worker-daemonset.yaml.template) by adding a kustomize overlay named "default". We split the resources into multiple bases (rbac, master and worker-daemonset) so that relevant parts are re-usable in other deployment scenarios added later (e.g. "one-shot job", and "combined daemonset"). This patch adds one component (components/common) doing the required kustomization for the example deployment.	2021-08-18 14:05:57 +03:00
Markus Lehtonen	0f2554abf1	helm: move files under deployment/helm	2021-08-16 14:44:26 +03:00
Markus Lehtonen	5fe3f7163f	helm: add readme Add minimal readme to the helm chart and update scripts/prepare-release.sh to handle references in the readme file.	2021-08-11 11:41:27 +03:00
Kubernetes Prow Robot	42e8fff4af	Merge pull request #537 from morremeyer/chore/control-plane chore: update tolerations and affinities to control-plane	2021-07-07 03:54:43 -07:00
Morre	d11edaa056	chore: update tolerations and affinities to control-plane This updates the tolerations and affinities to also respect the term „control-plane“ which will eventually replace „master“	2021-07-07 10:39:10 +02:00
Markus Lehtonen	31bd91988f	cpuid: correct the name of SSE4* cpuid flags The naming was changed in when with cpuid v2 (github.com/klauspost/cpuid/v2) and we didn't catch this in NFD. No issue reports of the inadvertent naming change so let's just adapt to the updated naming in NFD configuration. The SSE4* labels are disabled by default so they're not widely used, if at all.	2021-07-06 11:54:55 +03:00
Kubernetes Prow Robot	0cad925ee0	Merge pull request #519 from jschintag/s390x-mount-usr Mount /usr inside the Pod	2021-04-27 00:29:37 -07:00
Jan Schintag	5871207588	Mount /usr/lib and /usr/src inside the Pod Mount /usr/lib and /usr/src as /host-usr/lib and /host-usr/src inside the pod to allow NFD to search for the kernel configuration file inside /usr. This solves the problem of the kernel config file not being present in /boot on s390x RHCOS. Signed-off-by: Jan Schintag <jan.schintag@de.ibm.com>	2021-04-26 16:47:37 +02:00
Jordan Jacobelli	630e97a52c	helm: add extraLabelNs master flag Signed-off-by: Jordan Jacobelli <jordanjacobelli04@gmail.com>	2021-04-20 16:25:12 +02:00
robertdavidsmith	77bd4e4cf6	Accept client certs based on SAN, not just CN (#514 ) * first attempt at SAN-based VerifyNodeName * Update docs on verify-node-name	2021-04-20 01:44:32 -07:00
Bᴇʀɴᴅ Sᴄʜᴏʀɢᴇʀs	33a6425d05	Helm chart: Fix configMap indenting (#496 ) * Fix configMap indenting * Update Chart.yaml * Update nfd-worker-conf.yaml	2021-03-31 00:26:58 -07:00
Moshe Levi	57575231ab	[helm] fix nfd worker tolerations value the tolerations value is array therefore we need to pass [] and not {} Signed-off-by: Moshe Levi <moshele@nvidia.com>	2021-03-19 00:42:01 +02:00
Markus Lehtonen	3f18e880b4	nfd-worker: dynamic configuration of klog Make it possible to dynamically (at run-time) alter most of the logging configuration from the config file.	2021-02-25 16:10:43 +02:00
Kubernetes Prow Robot	d36500789e	Merge pull request #429 from slintes/configmap-hostname-labels Added nodename rule to custom source	2021-02-24 09:20:50 -08:00
Marc Sluiter	7038e49d02	source/custom: Add nodename rule There are cases when the only available metadata for discovering features is the node's name. The "nodename" rule extends the custom source and matches when the node's name matches one of the given nodename regexp patterns. It is also possible now to set an optional "value" on custom rules, which overrides the default "true" label value in case the rule matches. In order to allow more dynamic configurations without having to modify the complete worker configuration, custom rules are additionally read from a "custom.d" directory now. Typically that directory will be filled by mounting one or more ConfigMaps. Signed-off-by: Marc Sluiter <msluiter@redhat.com>	2021-02-24 16:26:35 +01:00
Ivan Kolodyazhny	c13af4d235	Fix NFD master chart template Commit `88bd80d415` introduces error in template due to the bad formatting. This commit fixes it.	2021-02-23 12:59:49 +02:00
Ivan Kolodyazhny	88bd80d415	Add NFD master '--instance' flag support to Helm chart	2021-02-21 09:45:20 +02:00
Adrian Chiris	e80900b8ee	Add helm chart for NFD This commit adds Helm chart for node-feature-discovery Signed-off-by: Adrian Chiris <adrianc@nvidia.com> Signed-off-by: Ivan Kolodiazhnyi <ikolodiazhny@nvidia.com>	2021-02-18 17:19:09 +02:00

1 2 3 4 5

240 commits