node-feature-discovery

mirror of https://github.com/kubernetes-sigs/node-feature-discovery.git synced 2025-03-31 04:04:51 +00:00

Author	SHA1	Message	Date
Carlos Eduardo Arango Gutierrez	e3aedd33e2	Enable metrics via prometheus operator Expose metrics via prometheus.monitoring.coreos.com/v1 The exposed metrics are \| Metric \| Type \| Meaning \| \| --------------- \| ---------------- \| ---------------- \| \| `nfd_master_build_info` \| Gauge \| Version from which nfd-master was built. \| \| `nfd_worker_build_info` \| Gauge \| Version from which nfd-worker was built. \| \| `nfd_updated_nodes` \| Counter \| Time taken to label a node \| \| `nfd_crd_processing_time` \| Gauge \| Time taken to process a NodeFeatureRule CRD \| \| `nfd_feature_discovery_duration_seconds` \| HistogramVec \| Time taken to discover features on a node \| Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com> Co-authored-by: Markus Lehtonen <markus.lehtonen@intel.com>	2023-07-21 10:59:52 +02:00
Dipankar Das	ebac4a25e7	Removal of the bases field as it is deprecated by kustomize Signed-off-by: Dipankar Das <dipankardas0115@gmail.com>	2023-06-09 12:49:24 +05:30
Markus Lehtonen	f4de7ed8ee	deployment/kustomize: add master config to prune overlay Otherwise pods error out with failed mount of nfd-master-conf ConfigMap.	2023-04-20 20:38:36 +03:00
Kubernetes Prow Robot	193c552b33	Merge pull request #1084 from AhmedGrati/feat-add-master-config-file feat: add master config file	2023-04-04 10:41:40 -07:00
AhmedGrati	3fff409f6d	Add master config file Similar to the nfd-worker, in this PR we want to support the dynamic run-time configurability through a config file for the nfd-master. We'll use a json or yaml configuration file along with the fsnotify in order to watch for changes in the config file. As a result, we're allowing dynamic control of logging params, allowed namespaces, extended resources, label whitelisting, and denied namespaces. Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>	2023-04-03 09:52:09 +01:00
Carlos Eduardo Arango Gutierrez	355807f98c	kustomize: trim prune overlay Signed-off-by: Carlos Eduardo Arango Gutierrez <eduardoa@nvidia.com>	2023-03-15 20:36:45 +01:00
PiotrProkop	59afae50ba	Add NodeResourceTopology garbage collector NodeResourceTopology(aka NRT) custom resource is used to enable NUMA aware Scheduling in Kubernetes. As of now node-feature-discovery daemons are used to advertise those resources but there is no service responsible for removing obsolete objects(without corresponding Kubernetes node). This patch adds new daemon called nfd-topology-gc which removes old NRTs. Signed-off-by: PiotrProkop <pprokop@nvidia.com>	2023-01-11 10:15:21 +01:00
Markus Lehtonen	59a2757115	Use single-dash format for nfd cmdline flags Use the "single-dash" version of nfd command line flags in deployment files and e2e-tests. No impact in functionality, just aligns with documentation and other parts of the codebase.	2022-12-21 15:00:49 +02:00
Markus Lehtonen	f13ed2d91c	nfd-topology-updater: update NodeResourceTopology objects directly Drop the gRPC communication to nfd-master and connect to the Kubernetes API server directly when updating NodeResourceTopology objects. Topology-updater already has connection to the API server for listing Pods so this is not that dramatic change. It also simplifies the code a lot as there is no need for the NFD gRPC client and no need for managing TLS certs/keys. This change aligns nfd-topology-updater with the future direction of nfd-worker where the gRPC API is being dropped and replaced by a CRD-based API. This patch also update deployment files and documentation to reflect this change.	2022-12-08 11:03:22 +02:00
Talor Itzhak	8b5918a2e9	kustomize: topology-updater: enable the configuration via kustomization Add a kustomization file with a config example for the exclude-list. Signed-off-by: Talor Itzhak <titzhak@redhat.com>	2022-11-21 21:31:14 +02:00
Kubernetes Prow Robot	885a061f12	Merge pull request #701 from marquiz/devel/deployment-custom-rule deployment: use new custom rule format in sample configs	2022-01-05 09:53:48 -08:00
Dave Baker	3e6ae535c7	Fix kustomization template to work with cert-manager	2022-01-04 13:19:09 +00:00
Markus Lehtonen	468fa2b817	deployment: use new rule format in sample custom rule overlay	2021-12-22 09:21:26 +02:00
Markus Lehtonen	c3e2315834	pkg/apis/nfd: specify CRD for custom labeling rules Add a cluster-scoped Custom Resource Definition for specifying labeling rules. Nodes (node features, node objects) are cluster-level objects and thus the natural and encouraged setup is to only have one NFD deployment per cluster - the set of underlying features of the node stays the same independent of how many parallel NFD deployments you have. Our extension points (hooks, feature files and now CRs) can be be used by multiple actors (depending on us) simultaneously. Having the CRD cluster-scoped hopefully drives deployments in this direction. It also should make deployment of vendor-specific labeling rules easy as there is no need to worry about the namespace. This patch virtually replicates the source.custom.FeatureSpec in a CRD API (located in the pkg/apis/nfd/v1alpha1 package) with the notable exception that "MatchOn" legacy rules are not supported. Legacy rules are left out in order to keep the CRD simple and clean. The duplicate functionality in source/custom will be dropped by upcoming patches. This patch utilizes controller-gen (from sigs.k8s.io/controller-tools) for generating the CRD and deepcopy methods. Code can be (re-)generated with "make generate". Install controller-gen with: go install sigs.k8s.io/controller-tools/cmd/controller-gen@v0.7.0 Update kustomize and helm deployments to deploy the CRD.	2021-11-17 13:40:23 +02:00
Wei Zhang	4b1e9d7211	deployment: drop the topology updater job	2021-10-06 10:28:37 +08:00
Markus Lehtonen	1e85001a5f	deployment: align topologyupdater overlays Align "topologyupdater" overlay with "topologyupdater-job". Both should deploy topologyupdater as a standalone application. Previously the topologyupdater overlay did not deploy nfd-master at all (but deployed nfd-worker instead) causing the pods to end up in crashloopbackoff as there was no master to communicate with.	2021-09-30 10:22:24 +03:00
Markus Lehtonen	d2751102ef	deployment: fix typo in overlay name Rename topologupdater-job to topologyupdater-job.	2021-09-24 22:04:35 +03:00
Swati Sehgal	a2c066dc0d	topologyupdater: manifests: topologyupdater deployment files - create an overlay for deployment of all components - create an overlay for just topologyupdater deployment (to be deployed in conjunction with the default overlay) - create a separate overlay for deployment of master and topologyupdater-job Signed-off-by: Swati Sehgal <swsehgal@redhat.com>	2021-09-21 10:48:10 +01:00
Markus Lehtonen	1f8a6d7819	kustomize: add standard-combined overlay Replicates nfd-daemonset-combined.yaml.template. In addition to the overlay we need to add a separate set of patches under components/common in order to handle the double-container pod.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	b38cf997d5	kustomize: add prune overlay Add an overlay for deploying "nfd-master --prune". Replaces nfd-prune.yaml.template.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	787ebfe441	kustomize: add Job example deployment Add a new base kustomization for worker Job and an overlay stitching up the complete deployment. Replaces nfd-worker-job.yaml.template.	2021-08-18 15:10:25 +03:00
Markus Lehtonen	3737e0f6a3	kustomize: add an example custom rules configmap Add an example kustomize overlay for deploying a configmap specifying extra rules for the custom feature source.	2021-08-18 14:05:57 +03:00
Markus Lehtonen	03b67f8d6a	kustomize: add support for cert-manager Add an example kustomize overlay for enabling cert-manager in an NFD deployment.	2021-08-18 14:05:57 +03:00
Markus Lehtonen	8117c099a3	deployment: add kustomize base Implement functionality virtually replicating deployment templates for nfd-master and nfd-worker daemonset (nfd-master.yaml.template and nfd-worker-daemonset.yaml.template) by adding a kustomize overlay named "default". We split the resources into multiple bases (rbac, master and worker-daemonset) so that relevant parts are re-usable in other deployment scenarios added later (e.g. "one-shot job", and "combined daemonset"). This patch adds one component (components/common) doing the required kustomization for the example deployment.	2021-08-18 14:05:57 +03:00

24 commits