node-feature-discovery

mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2025-03-05 16:27:05 +00:00

Author	SHA1	Message	Date
Markus Lehtonen	3765ae24d6	pkg/apis/nfd: specify a dedicated type for regexp cache Having a dedicated type makes it possible to specify deepcopy functions for it. We need to do this manually as deepcopy-gen doesn't know how to create copies of regexps.	2021-11-17 13:40:43 +02:00
Markus Lehtonen	f3cc109f99	pkg/apis/nfd: work around issues with k8s deepcopy-gen Without this hack the generated code does not compile.	2021-11-17 13:40:43 +02:00
Markus Lehtonen	c3e2315834	pkg/apis/nfd: specify CRD for custom labeling rules Add a cluster-scoped Custom Resource Definition for specifying labeling rules. Nodes (node features, node objects) are cluster-level objects and thus the natural and encouraged setup is to only have one NFD deployment per cluster - the set of underlying features of the node stays the same independent of how many parallel NFD deployments you have. Our extension points (hooks, feature files and now CRs) can be be used by multiple actors (depending on us) simultaneously. Having the CRD cluster-scoped hopefully drives deployments in this direction. It also should make deployment of vendor-specific labeling rules easy as there is no need to worry about the namespace. This patch virtually replicates the source.custom.FeatureSpec in a CRD API (located in the pkg/apis/nfd/v1alpha1 package) with the notable exception that "MatchOn" legacy rules are not supported. Legacy rules are left out in order to keep the CRD simple and clean. The duplicate functionality in source/custom will be dropped by upcoming patches. This patch utilizes controller-gen (from sigs.k8s.io/controller-tools) for generating the CRD and deepcopy methods. Code can be (re-)generated with "make generate". Install controller-gen with: go install sigs.k8s.io/controller-tools/cmd/controller-gen@v0.7.0 Update kustomize and helm deployments to deploy the CRD.	2021-11-17 13:40:23 +02:00
Markus Lehtonen	0757248055	source/custom: move rule expressions to pkg/apis/nfd/v1alpha1 Create a new package pkg/apis/nfd/v1alpha1 and migrate the custom rule expressions over there. This is the first step in creating a new CRD API for custom rules.	2021-11-16 18:12:16 +02:00
Markus Lehtonen	47e7c47594	Send raw features over gRPC Enable transfer of raw features between nfd-worker and nfd-master.	2021-11-16 17:32:28 +02:00
Markus Lehtonen	d4d9a03732	grpc: extend the API to send raw features Enable transmitting the discovered "raw" features over the gRPC API. Extend pkg/api/feature with protobuf and gRPC code. In this, utilize go-to-protobuf from k8s code-generator for auto-generating the gRPC interface from golang code. The tool can be Installed with: go install k8s.io/code-generator/cmd/go-to-protobuf@v0.20.7 The auto-generated code is (re-)generated/updated with "make apigen".	2021-11-16 17:32:28 +02:00
Swati Sehgal	b444ef95a8	NFD-Topology-Updater: Bump NRT API to version v0.0.12 The NodeResourceTopology API has been made cluster scoped as in the current context a CR corresponds to a Node and since Node is a cluster scoped resource it makes sense to make NRT cluster scoped as well. Ref: https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/18 Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-11-16 13:28:23 +00:00
Markus Lehtonen	dd92c9a9ce	pkg/api/feature: revert back to structs instead of pointers Less error prone, as no chance for a nil pointer dereference.	2021-11-11 17:56:55 +02:00
Markus Lehtonen	9bff4b3185	pkg/api/feature: generator functions with initial values Flavor the generator helper functions with arguments for specifying the set of features to put into the generated objects.	2021-11-09 13:40:35 +02:00
Markus Lehtonen	5de4d8857c	pkg/api/feature: use pointers of structs Make it easier to mutate the feature sets.	2021-11-09 12:15:38 +02:00
Markus Lehtonen	25711799f3	pkg/resourcemonitor: fix typo in comment	2021-11-05 16:42:49 +02:00
Artyom Lukianov	45062754fd	resourcemonitor: aggregate and provide the memory and hugepages information The Kuberenetes pod resource API now exposing the memory and hugepages information for guaranteed pods. We can use this information to update NodeResourceTopology resource with memory and hugepages data. Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-11-04 10:17:10 +02:00
Artyom Lukianov	a93b660f7c	utils: add methods to fetch NUMA nodes hugepages and memory capacity The methods are used during calculation of reserved memory for system workloads. The calcualation is `resourceCapacity - resourceAllocatable`. Signed-off-by: Artyom Lukianov <alukiano@redhat.com>	2021-11-04 10:14:51 +02:00
Markus Lehtonen	0b386981a6	pkg/nfd-master: fix linter errors in tests	2021-10-04 09:51:38 +03:00
Kubernetes Prow Robot	9cf732b64e	Merge pull request #602 from marquiz/devel/go-generate Utilize go generate	2021-09-21 06:16:24 -07:00
Kubernetes Prow Robot	064391f310	Merge pull request #601 from marquiz/devel/feature-source-interface source: introduce FeatureSource interface	2021-09-21 05:48:25 -07:00
Markus Lehtonen	51c0d70383	Update auto-generated code Generated by running "make generate".	2021-09-21 13:37:36 +03:00
Markus Lehtonen	9487fbeb18	Utilize go generate Use 'go generate' for auto-generating code. Drop the old 'mock' and 'apigen' makefile targets. Those are replaced with a single make generate which (re-)generates everything.	2021-09-21 13:36:37 +03:00
Swati Sehgal	a311719d1e	topologyupdater: Updates based on latest changes made to CRD API There have been recent changes made to the noderesourcetopology API storing the proto file generated using go-to-protobuf tool and this code inports the proto generated in the API in the topology-updater.proto The PRs corresponding to the changes are as follows: https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/9 https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/13 Commands used to generate topology-updater.pb.go file: go install github.com/golang/protobuf/protoc-gen-go@v1.4.3 go mod vendor protoc --go_opt=paths=source_relative --go_out=plugins=grpc:. pkg/topologyupdater/topology-updater.proto -I. -Ivendor As part of implmentation of this patch, reserved (non-allocatable) CPUs are evaluated by performing a difference between all the CPUs on a system (determined by using ghw) and allocatable CPUs (determined by querying GetAllocatableResources podResource API endpoint). When aggregator creates the NUMA zones, it will skip the zone creation if there are no allocatable resources. In this update we creates those missing zone with zero allocatable/available resources so we won't have holes in the array of reported zones. Co-Authored-by: Talor Itzhak <titzhak@redhat.com> Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:48:10 +01:00
Swati Sehgal	832f82baaa	topologyupdater: Handle pods with devices and integral CPU requests For accounting we should consider all guaranteed pods with integral CPU requests and all the pods with device requests This patch ensures that pods are only considered for accounting disregarding non-guranteed pods without any device request. Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:48:10 +01:00
Swati Sehgal	aa7ae9265c	topologyupdater: watch/consider only guaranteed pods for accounting - Files obtained after running make mock - Run `go get github.com/vektra/mockery` and make sure that mockery is in your $PATH - run `make mock` Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:48:10 +01:00
Francesco Romani	b4c92e4eed	topologyupdater: Bootstrap nfd-topology-updater in NFD - This patch allows to expose Resource Hardware Topology information through CRDs in Node Feature Discovery. - In order to do this we introduce another software component called nfd-topology-updater in addition to the already existing software components nfd-master and nfd-worker. - nfd-master was enhanced to communicate with nfd-topology-updater over gRPC followed by creation of CRs corresponding to the nodes in the cluster exposing resource hardware topology information of that node. - Pin kubernetes dependency to one that include pod resource implementation - This code is responsible for obtaining hardware information from the system as well as pod resource information from the Pod Resource API in order to determine the allocatable resource information for each NUMA zone. This information along with Costs for NUMA zones (obtained by reading NUMA distances) is gathered by nfd-topology-updater running on all the nodes of the cluster and propagate NUMA zone costs to master in order to populate that information in the CRs corresponding to the nodes. - We use GHW facilities for obtaining system information like CPUs, topology, NUMA distances etc. - This also includes updates made to Makefile and Dockerfile and Manifests for deploying nfd-topology-updater. - This patch includes unit tests - As part of the Topology Aware Scheduling work, this patch captures the configured Topology manager scope in addition to the Topology manager policy. Based on the value of both attribues a single string will be populated to the CRD. The string value will be on of the following {SingleNUMANodeContainerLevel, SingleNUMANodePodLevel, BestEffort, Restricted, None} Co-Authored-by: Artyom Lukianov <alukiano@redhat.com> Co-Authored-by: Francesco Romani <fromani@redhat.com> Co-Authored-by: Talor Itzhak <titzhak@redhat.com> Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:47:39 +01:00
Francesco Romani	00cc07da76	topologyupdater: gRPC API definition Setup the topologyupdater API for gRPC communication of nfd-topology-updater with master We generate pb.go file to reflect latest dependency changes using github.com/golang/protobuf/protoc-gen-go and generate grpc files via: `protoc pkg/topologyupdater/topology-updater.proto --go_out=plugins=grpc:.` Please refer to: https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/blob/master/pkg/apis/topology/v1alpha1/types.go Co-Authored-by: Artyom Lukianov <alukiano@redhat.com> Co-Authored-by: Francesco Romani <fromani@redhat.com> Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:47:39 +01:00
Markus Lehtonen	852cf4b61d	source: introduce FeatureSource interface Specify a new interface for managing "raw" feature data. This is the first step to separate raw feature data from node labels. None of the feature sources implement this interface, yet. This patch unifies the data format of "raw" features by dividing them into three different basic types. - keys, a set of names without any associated values, e.g. CPUID flags or loaded kernel modules - values, a map of key-value pairs, for features with a single value, e.g. kernel config flags or os version - instances, a list of instances each of which has multiple attributes (key-value pairs of their own), e.g. PCI or USB devices The new feature data types are defined in a new "pkg/api/feature" package, catering decoupling and re-usability of code e.g. within future extentions of the NFD gRPC API. Rename the Discover() method of LabelSource interface to GetLabels().	2021-09-20 09:58:07 +03:00
Markus Lehtonen	81378a3235	source: make sources register themselves Implement new registration infrastructure under the "source" package. This change loosens the coupling between label sources and the nfd-worker, making it easier to refactor and move the code around. Also, create a separate interface (ConfigurableSource) for configurable feature sources in order to eliminate boilerplate code. Add safety checks to the sources that they actually implement the interfaces they should. In sake of consistency and predictability (of behavior) change all methods of the sources to use pointer receivers. Add simple unit tests for the new functionality and include source/... into make test target.	2021-09-15 18:41:37 +03:00
Markus Lehtonen	befa7e9796	source: rename FeatureSource to LabelSource Prepare for separating feature detection from label creation.	2021-09-13 22:48:33 +03:00
Kubernetes Prow Robot	189f86bec8	Merge pull request #548 from marquiz/devel/profile-ns nfd-master: allow profile.node.kubernetes.io label ns	2021-08-27 07:24:04 -07:00
Markus Lehtonen	112744bc50	nfd-worker: split out gRPC connection handling Refactor the worker code and split out gRPC client connection handling into a separate base type. The intent is to promote re-usability of code for other NFD clients, too.	2021-08-20 15:29:27 +03:00
Carlos Eduardo Arango Gutierrez	dece85b394	Add livenessProbe via grpc to nfd-master Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>	2021-08-18 10:23:10 -05:00
Markus Lehtonen	55bd633425	nfd-master: allow profile.node.kubernetes.io label ns Add a separate label namespace for profile labels, intended for user-specified higher level "meta features". Also sub-namespaces of this (i.e. <sub-ns>.profile.node.kubernetes.io) are allowed.	2021-08-10 19:39:59 +03:00
Markus Lehtonen	c3760fbbab	nfd-master: rename LabelNs to FeatureLabelNs	2021-08-10 19:13:08 +03:00
Kubernetes Prow Robot	4a22a39928	Merge pull request #536 from marquiz/devel/label-sub-ns nfd-master: allow sub-namespaces of the default label ns	2021-08-10 04:19:18 -07:00
Markus Lehtonen	eb666f521d	nfd-master: allow sub-namespaces of the default label ns Allow <sub-ns>.feature.node.kubernetes.io label namespaces. Makes it possible to have e.g. vendor specific label ns without the need to user -extra-label-ns.	2021-08-10 11:41:52 +03:00
Markus Lehtonen	d12e62b1fe	Makefile: add apigen target For auto-generating api(s). Also, re-generate/refresh the gRPC with `make apigen` (with protoc v3.17.3 and protoc-gen-go from github.com/golang/protobuf v1.5.2) to sync up things.	2021-07-07 16:01:10 +03:00
Markus Lehtonen	a55783d533	Straighten wrinkles in lint fixes Fix small mistakes that slipped through with lint fixes (in `1230945564`).	2021-07-07 14:32:11 +03:00
Carlos Eduardo Arango Gutierrez	1230945564	make golint happy Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>	2021-06-14 12:27:58 -05:00
Carlos Eduardo Arango Gutierrez	894b7901ff	make gofmt happy by running gofmt -s Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>	2021-06-14 12:24:44 -05:00
Markus Lehtonen	99d223b029	utils/dump: do not print empty header line Makes log output cleaner.	2021-06-11 09:29:49 +03:00
robertdavidsmith	77bd4e4cf6	Accept client certs based on SAN, not just CN (#514 ) * first attempt at SAN-based VerifyNodeName * Update docs on verify-node-name	2021-04-20 01:44:32 -07:00
Kubernetes Prow Robot	c0e1000a7d	Merge pull request #474 from marquiz/devel/worker-log-verbosity nfd-worker: don't log labels returned by sources by default	2021-03-15 12:52:34 -07:00
Markus Lehtonen	6c6249a599	nfd-worker: don't log labels returned by sources by default Reduce default log verbosity. Only print out labels if log verbosity is 1 or higher ('core.klog.v: 1' config file option or '-v 1' on command line). Also, dump the labels in a reproducible (sorted) format.	2021-03-15 21:42:33 +02:00
Kubernetes Prow Robot	03f53d85e9	Merge pull request #475 from marquiz/devel/grpc-klog pkg/utils: show correct source file in gRPC logs	2021-03-11 06:20:24 -08:00
Markus Lehtonen	fb67a5027b	pkg/utils: show correct source file in gRPC logs Unwind two call frames so that the source (file:line) of the log message is correctly displayed.	2021-03-11 11:36:55 +02:00
Markus Lehtonen	8d67fc1122	pkg/utils: add dump functions A simple functions for pretty-printing and logging json-marshallable objects.	2021-03-11 07:12:22 +02:00
Markus Lehtonen	2d20a2ff7c	nfd-worker: support certificate rotation Watch for changes in TLS files and re-connect to nfd-master in the event of changes.	2021-03-09 14:40:51 +02:00
Markus Lehtonen	e771a35a21	nfd-master: support certificate rotation Add a helper/wrapper in pkg/utils to handle gRPC server-side certificate rotation.	2021-03-09 14:40:04 +02:00
Markus Lehtonen	dfc2596a22	pkg/utils: generalize file watcher Add the capability to watch multiple files. Move it to a separate package in order to make it reusable.	2021-03-09 14:20:34 +02:00
Markus Lehtonen	8af3a40ca7	logging: set grpc to use klog for logging	2021-03-05 14:44:44 +02:00
Markus Lehtonen	38d493aa67	pkg/utils: fix possible segfault in RegexpVal.Set	2021-03-02 22:46:34 +02:00
Markus Lehtonen	dd7691c486	nfd-worker: improve log messages of config handling	2021-03-02 18:49:58 +02:00

1 2 3

120 commits