Drop the gRPC communication to nfd-master and connect to the Kubernetes
API server directly when updating NodeResourceTopology objects.
Topology-updater already has a connection to the API server for listing
Pods, so this is not that dramatic a change. It also simplifies the code
a lot, as there is no need for the NFD gRPC client and no need to manage
TLS certs/keys.
This change aligns nfd-topology-updater with the future direction of
nfd-worker where the gRPC API is being dropped and replaced by a
CRD-based API.
This patch also updates the deployment files and documentation to reflect
this change.
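As a rough illustration, the direct create-or-update against the API
server could look like the sketch below. The clientset import path, the
namespaced v1alpha1 API and the helper name are assumptions for this
example, not code lifted from the patch:

    package main

    import (
        "context"

        apierrors "k8s.io/apimachinery/pkg/api/errors"
        metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"

        v1alpha1 "github.com/k8stopologyawareschedwg/noderesourcetopology-api/pkg/apis/topology/v1alpha1"
        topologyclientset "github.com/k8stopologyawareschedwg/noderesourcetopology-api/pkg/generated/clientset/versioned"
    )

    // updateNRT creates or updates a NodeResourceTopology object directly
    // via the API server instead of sending it to nfd-master over gRPC.
    func updateNRT(ctx context.Context, cli topologyclientset.Interface, ns string, nrt *v1alpha1.NodeResourceTopology) error {
        cur, err := cli.TopologyV1alpha1().NodeResourceTopologies(ns).Get(ctx, nrt.Name, metav1.GetOptions{})
        if apierrors.IsNotFound(err) {
            _, err = cli.TopologyV1alpha1().NodeResourceTopologies(ns).Create(ctx, nrt, metav1.CreateOptions{})
            return err
        }
        if err != nil {
            return err
        }
        // Carry over the resource version for a conflict-checked update.
        nrt.ResourceVersion = cur.ResourceVersion
        _, err = cli.TopologyV1alpha1().NodeResourceTopologies(ns).Update(ctx, nrt, metav1.UpdateOptions{})
        return err
    }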
Implement detection of the Kubernetes namespace by reading the file
/var/run/secrets/kubernetes.io/serviceaccount/namespace
As a fallback (if the file is not accessible) we take the namespace from
the KUBERNETES_NAMESPACE environment variable. This is useful e.g. for
testing and development, where you might run nfd-worker directly from the
command line on a host system.
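A minimal sketch of the lookup order described above (the helper name is
illustrative):

    package main

    import (
        "os"
        "strings"
    )

    // detectKubernetesNamespace implements the lookup order described
    // above: serviceaccount file first, environment variable as fallback.
    func detectKubernetesNamespace() string {
        if data, err := os.ReadFile("/var/run/secrets/kubernetes.io/serviceaccount/namespace"); err == nil {
            if ns := strings.TrimSpace(string(data)); ns != "" {
                return ns
            }
        }
        // Fallback, e.g. when running directly on a host during development.
        return os.Getenv("KUBERNETES_NAMESPACE")
    }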
Drop the following flags, which were already deprecated in v0.8.0:
-sleep-interval (replaced by core.sleepInterval config file option)
-label-whitelist (replaced by core.labelWhiteList config file option)
-sources (replaced by -label-sources flag)
The exclude-list allows filtering specific resource accounting
from NRT objects on a per-node basis.
The CRs created by the topology-updater are used by the scheduler-plugin
as a source of truth for making scheduling decisions.
As such, this feature allows hiding specific information
from the scheduler, which in turn
affects its scheduling decisions.
A common use case is when a user would like to make scheduling
decisions based on a specific resource.
In that case, we can exclude all the other resources
that we don't want the scheduler to examine.
The exclude-list is provided to the topology-updater via a ConfigMap.
Resource type names specified in the list should match the names
as shown here: https://pkg.go.dev/k8s.io/api/core/v1#ResourceName
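A minimal sketch of how such per-node filtering could be applied before
zone resources are reported; the excludeList type, the helper names and
the use of plain int64 quantities are all simplifying assumptions for
this example:

    package main

    // excludeList maps a node name to the resource names that should be
    // hidden from that node's NodeResourceTopology object.
    type excludeList map[string][]string

    func (e excludeList) isExcluded(node, resource string) bool {
        for _, r := range e[node] {
            if r == resource {
                return true
            }
        }
        return false
    }

    // filterResources drops excluded resources before they are reported.
    func filterResources(node string, resources map[string]int64, e excludeList) map[string]int64 {
        out := make(map[string]int64, len(resources))
        for name, qty := range resources {
            if e.isExcluded(node, name) {
                continue // hidden from the scheduler
            }
            out[name] = qty
        }
        return out
    }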
This is a resurrection of old work started here:
https://github.com/kubernetes-sigs/node-feature-discovery/pull/545
Signed-off-by: Talor Itzhak <titzhak@redhat.com>
Scanning podresources can temporarily fail; the previous code was
mistakenly not rearming the loop condition when this occurred,
effectively stopping the monitoring.
Instead, we should always keep polling, and bail out only on an
unrecoverable error or when asked to stop.
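The intended loop shape, sketched with hypothetical scan and
error-classification helpers:

    package main

    import (
        "errors"
        "log"
        "time"
    )

    var errUnrecoverable = errors.New("unrecoverable") // hypothetical sentinel

    // pollLoop keeps polling on transient failures and bails out only on
    // an unrecoverable error or when asked to stop.
    func pollLoop(stop <-chan struct{}, scan func() error) error {
        ticker := time.NewTicker(10 * time.Second)
        defer ticker.Stop()
        for {
            select {
            case <-stop:
                return nil
            case <-ticker.C:
                if err := scan(); err != nil {
                    if errors.Is(err, errUnrecoverable) {
                        return err
                    }
                    // Transient failure: log and re-arm the loop instead
                    // of silently stopping the monitoring.
                    log.Printf("podresources scan failed, retrying: %v", err)
                }
            }
        }
    }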
Signed-off-by: Francesco Romani <fromani@redhat.com>
Flatten the data structure that stores features, dropping the "domain"
level from the data model. That extra level of hierarchy brought little
benefit but just caused some extra complexity, instead. The new
structure nicely matches what we have in the NodeFeatureRule object (the
matchFeatures field uses the same flat structure, with the "feature"
field having a value of <domain>.<feature>, e.g. "kernel.version").
This is pre-work for introducing a new "node feature" CRD that contains
the raw feature data. It makes the life of both users and developers
easier when both CRDs, plus our internal code, handle feature data in a
similar flat structure.
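To illustrate the flattening (the Feature type here is a stand-in, not
the real data type):

    package main

    // Feature is a stand-in for the real feature data type.
    type Feature struct {
        Values map[string]string
    }

    // Old shape: two levels, domain -> feature name -> data.
    var nested = map[string]map[string]Feature{
        "kernel": {"version": {Values: map[string]string{"full": "5.4.0"}}},
    }

    // New shape: one flat level, keyed "<domain>.<feature>", matching the
    // "feature" field of NodeFeatureRule's matchFeatures.
    var flat = map[string]Feature{
        "kernel.version": {Values: map[string]string{"full": "5.4.0"}},
    }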
Move the previously-protobuf-only internal "feature api" over to the
public "nfd api" package. This is in preparation for introducing a new
CRD API for communicating features.
This patch carries no functional changes. Just moving code around.
Make the NoPublish config flag a more direct control point for
whether to publish features. This patch is pre-work for adding
support for other clients (upcoming new CRD API) in nfd-worker.
Replace deprecated grpc.WithInsecure() with
grpc.WithTransportCredentials and insecure.NewCredentials(). Makes
golangci-lint pass muster.
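For reference, the replacement wiring looks like this; the dial helper is
just an illustration:

    package main

    import (
        "google.golang.org/grpc"
        "google.golang.org/grpc/credentials/insecure"
    )

    // dial shows the option change; only the credentials wiring matters.
    func dial(addr string) (*grpc.ClientConn, error) {
        // Before: grpc.Dial(addr, grpc.WithInsecure())
        return grpc.Dial(addr, grpc.WithTransportCredentials(insecure.NewCredentials()))
    }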
* fix linter issues for a few files
* fix linter issue of exported const Name should have comment or be unexported
* fix name lint issue and resolve lints
* add changes to comments
Do not prefix label names from the new matchFeatures/matchAny custom
rules with "custom-". We want to have the same result (set of labels)
from a rule, independent of whether it has been specified in the worker
config or in a NodeFeatureRule CR. Legacy matchOn rules (not available
in NodeFeatureRule CRs) are left intact, i.e. still prefixed, in order to
retain backwards compatibility.
Add a configuration option for controlling the enabled "raw" feature
sources. This is useful e.g. in testing and development, plus it also
allows fully shutting down discovery of features that are not needed in
a deployment. Supplements core.labelSources which controls the
enablement of label sources.
Provide backwards compatibility via a deprecated 'core.sources' config
file option. This will override 'core.labelSources'. A warning is
printed in the log if this option is detected.
The goal is to make the name more descriptive. Also keeping in mind a
possible future addition of a 'featureSources' option (or similar) for
controlling feature discovery.
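A sketch of the backwards-compatibility handling; the struct is
illustrative, and only the override-and-warn behavior follows the
description above:

    package main

    import "log"

    // coreConfig is an illustrative stand-in for the real config struct.
    type coreConfig struct {
        Sources      *[]string // deprecated: core.sources
        LabelSources []string  // core.labelSources
    }

    func (c *coreConfig) sanitize() {
        if c.Sources != nil {
            log.Print("WARNING: core.sources is deprecated, use core.labelSources instead")
            // The deprecated option overrides core.labelSources.
            c.LabelSources = *c.Sources
        }
    }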
Use the single-dash (i.e. '-option' instead of '--option') format
consistently across log messages and documentation. This is the format
that was already mostly used, and the one shown by the command line help
of the binaries, for example.
There have been recent changes made to the noderesourcetopology API,
storing the proto file generated using the go-to-protobuf tool, and this
code imports the proto generated in the API into topology-updater.proto.
The PRs corresponding to the changes are as follows:
https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/9
https://github.com/k8stopologyawareschedwg/noderesourcetopology-api/pull/13
Commands used to generate topology-updater.pb.go file:
go install github.com/golang/protobuf/protoc-gen-go@v1.4.3
go mod vendor
protoc --go_opt=paths=source_relative --go_out=plugins=grpc:. pkg/topologyupdater/topology-updater.proto -I. -Ivendor
As part of the implementation of this patch, reserved (non-allocatable) CPUs
are evaluated by performing a difference between all the CPUs on a system
(determined by using ghw) and allocatable CPUs (determined by querying
GetAllocatableResources podResource API endpoint).
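A minimal sketch of that set difference, with plain CPU ID lists as
inputs and using the k8s.io/utils/cpuset helpers; the actual patch may
use a different cpuset implementation:

    package main

    import "k8s.io/utils/cpuset"

    // reservedCPUs returns the CPUs present on the system but not
    // reported as allocatable, i.e. the reserved set.
    func reservedCPUs(all, allocatable []int) cpuset.CPUSet {
        return cpuset.New(all...).Difference(cpuset.New(allocatable...))
    }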
When the aggregator creates the NUMA zones, it will skip zone creation
if there are no allocatable resources. With this update we create those
missing zones with zero allocatable/available resources, so that we won't
have holes in the array of reported zones.
Co-Authored-by: Talor Itzhak <titzhak@redhat.com>
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
- Files obtained after running make mock
- Run `go get github.com/vektra/mockery` and make sure that
mockery is in your $PATH
- run `make mock`
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
- This patch allows exposing Resource Hardware Topology information
through CRDs in Node Feature Discovery.
- In order to do this we introduce another software component called
nfd-topology-updater in addition to the already existing software
components nfd-master and nfd-worker.
- nfd-master was enhanced to communicate with nfd-topology-updater
over gRPC, followed by the creation of CRs corresponding to the nodes
in the cluster, exposing the resource hardware topology information
of each node.
- Pin the kubernetes dependency to one that includes the pod resources implementation
- This code is responsible for obtaining hardware information from the system
as well as pod resource information from the Pod Resource API in order to
determine the allocatable resource information for each NUMA zone. This
information, along with costs for NUMA zones (obtained by reading NUMA
distances), is gathered by nfd-topology-updater running on all the nodes
of the cluster and propagated to the master in order to populate
that information in the CRs corresponding to the nodes.
- We use GHW facilities for obtaining system information like CPUs, topology,
NUMA distances etc.
- This also includes updates made to Makefile and Dockerfile and Manifests for
deploying nfd-topology-updater.
- This patch includes unit tests
- As part of the Topology Aware Scheduling work, this patch captures
the configured Topology manager scope in addition to the Topology manager policy.
Based on the value of both attributes, a single string will be populated
in the CRD. The string value will be one of the following:
{SingleNUMANodeContainerLevel, SingleNUMANodePodLevel, BestEffort,
Restricted, None}.
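A sketch of how the policy and scope could collapse into that single
string; the function name is illustrative, while the kubelet config
values ("single-numa-node", "best-effort", "restricted", "container",
"pod") are the standard ones:

    package main

    // topologyPolicyName collapses the kubelet topology manager policy
    // and scope into the single string stored in the CR.
    func topologyPolicyName(policy, scope string) string {
        switch policy {
        case "single-numa-node":
            if scope == "pod" {
                return "SingleNUMANodePodLevel"
            }
            return "SingleNUMANodeContainerLevel"
        case "best-effort":
            return "BestEffort"
        case "restricted":
            return "Restricted"
        default:
            return "None"
        }
    }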
Co-Authored-by: Artyom Lukianov <alukiano@redhat.com>
Co-Authored-by: Francesco Romani <fromani@redhat.com>
Co-Authored-by: Talor Itzhak <titzhak@redhat.com>
Signed-off-by: Swati Sehgal <swsehgal@redhat.com>
Specify a new interface for managing "raw" feature data. This is the
first step to separate raw feature data from node labels. None of the
feature sources implement this interface, yet.
This patch unifies the data format of "raw" features by dividing them
into three different basic types.
- keys, a set of names without any associated values, e.g. CPUID flags
or loaded kernel modules
- values, a map of key-value pairs, for features with a single value,
e.g. kernel config flags or os version
- instances, a list of instances each of which has multiple attributes
(key-value pairs of their own), e.g. PCI or USB devices
The new feature data types are defined in a new "pkg/api/feature"
package, catering for decoupling and re-usability of code, e.g. within
future extensions of the NFD gRPC API.
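An approximate sketch of the three basic types; the names and shapes
approximate the pkg/api/feature package rather than reproduce it exactly:

    package feature

    // KeyFeatureSet: a set of names without values, e.g. CPUID flags or
    // loaded kernel modules.
    type KeyFeatureSet struct {
        Elements map[string]struct{}
    }

    // ValueFeatureSet: key-value pairs, e.g. kernel config flags or os
    // version.
    type ValueFeatureSet struct {
        Elements map[string]string
    }

    // InstanceFeatureSet: a list of instances, each carrying its own
    // attributes, e.g. PCI or USB devices.
    type InstanceFeatureSet struct {
        Elements []InstanceFeature
    }

    type InstanceFeature struct {
        Attributes map[string]string
    }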
Rename the Discover() method of LabelSource interface to GetLabels().
Implement new registration infrastructure under the "source" package.
This change loosens the coupling between label sources and the
nfd-worker, making it easier to refactor and move the code around.
Also, create a separate interface (ConfigurableSource) for configurable
feature sources in order to eliminate boilerplate code.
Add safety checks to the sources that they actually implement the
interfaces they should.
For the sake of consistency and predictability of behavior, change all
methods of the sources to use pointer receivers.
Add simple unit tests for the new functionality and include source/...
in the make test target.
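A simplified sketch of the renamed interface and the compile-time safety
check; the real interfaces carry more detail:

    package source

    // LabelSource is a simplified sketch of the interface after the rename.
    type LabelSource interface {
        Name() string
        GetLabels() (map[string]string, error) // was Discover()
    }

    // Compile-time safety check that a source actually implements the
    // interface, as mentioned above.
    var _ LabelSource = &fakeSource{}

    type fakeSource struct{}

    func (s *fakeSource) Name() string { return "fake" }
    func (s *fakeSource) GetLabels() (map[string]string, error) {
        return map[string]string{"example": "true"}, nil
    }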
Refactor the worker code and split out gRPC client connection handling
into a separate base type. The intent is to promote re-usability of code
for other NFD clients, too.