Kubernetes Prow Robot
f4cf4877f2
Merge pull request #1309 from marquiz/devel/refactor-gc-3
...
topology-gc: rename run()
2023-08-18 11:26:28 -07:00
Kubernetes Prow Robot
b47667fc0c
Merge pull request #1306 from marquiz/devel/gc-fix-stop
...
topology-gc: fix Stop
2023-08-18 10:34:29 -07:00
Markus Lehtonen
ec51b29b3c
topology-gc: rename runGC to garbageCollect()
...
One less function named run.
2023-08-18 17:57:05 +03:00
Markus Lehtonen
98b0b36b87
topology-gc: rename run()
...
Too many run methods here.
2023-08-18 17:52:11 +03:00
Markus Lehtonen
108d603bdc
topology-gc: fix Stop
...
The stop channel has multiple readers to we need to close it so that all
of the readers get notified.
2023-08-18 17:46:54 +03:00
Kubernetes Prow Robot
fe0763eccb
Merge pull request #1303 from marquiz/devel/docs-deps
...
docs: update github-pages gem to v228
2023-08-16 09:40:27 -07:00
Markus Lehtonen
b64ba37377
docs: update github-pages gem to v228
...
Also update other dependencies.
2023-08-16 13:51:09 +03:00
Kubernetes Prow Robot
198eb2b5db
Merge pull request #1302 from marquiz/devel/deps
...
Update kubernetes to v1.28.0
2023-08-16 03:12:27 -07:00
Markus Lehtonen
2e79a015f5
test/e2e: align with latest kubernetes code base
2023-08-16 12:43:52 +03:00
Markus Lehtonen
5d5f133eff
go.mod: update kubernetes to v1.28.0
...
Also sync (update) other dependencies with what kubernetes v1.28 has.
2023-08-16 11:00:51 +03:00
Kubernetes Prow Robot
0bbf5f3f1e
Merge pull request #1300 from marquiz/devel/ci-lint
...
scripts/test-infra: bump golangci-lint to v1.54.0
2023-08-11 05:43:27 -07:00
Kubernetes Prow Robot
95069b410b
Merge pull request #1299 from marquiz/devel/logcheck
...
scripts/test-infra: update logcheck tool to v0.6.0
2023-08-11 05:15:29 -07:00
Markus Lehtonen
972374af0e
scripts/test-infra: bump golangci-lint to v1.54.0
...
Brings e.g. support for Go v1.21.
2023-08-11 11:42:23 +03:00
Markus Lehtonen
7e2a549db2
scripts/test-infra: update logcheck tool to v0.6.0
...
Update logcheck to the latest version. Fixes the flakiness we've been
experiencing.
2023-08-09 08:23:42 +03:00
Kubernetes Prow Robot
9d61b19454
Merge pull request #1287 from freelizhun/fix-empty-hugepages
...
fix empty hugepages in some numa nodes caused no such file or directory errors
2023-08-08 02:50:16 -07:00
lizhun
a4ad3d4411
fix empty hugepages in some numa nodes caused no such file or directory error
...
Signed-off-by: lizhun <lizhun@kylinos.cn>
2023-08-08 15:14:44 +08:00
Markus Lehtonen
5ba8d14b86
topology-updater: make -version always runnable
...
Make it possible to run -version in an environment whithout the
NODE_ADDRESS environment variable set.
2023-08-07 11:56:58 +03:00
Markus Lehtonen
5ad2294c14
metrics: add nfd_node_update_requests_total counter
...
Add a counter for total number of node update/sync requests. In
practice, this counts the number of gRPC requests received if the gRPC
API is in use. If the NodeFeature API is enabled, this counts the
requests initiated by the NFD API controller, i.e. updates triggered by
changes in NodeFeature or NodeFeatureRule objects plus updates initiated
by the controller resync period.
2023-08-07 09:37:29 +03:00
Markus Lehtonen
4b24cc1afa
metrics: counters for rejected labels, extended resources and taints
...
Add counters for labels, extended resources and taints rejected/filtered
out by nfd-master.
2023-08-07 09:37:29 +03:00
Markus Lehtonen
a8a29e6df2
metrics: add nfd_nodefeaturerule_processing_errors_total counter
...
Add a counter for errors encountered when processing NodeFeatureRules.
Another simple counter without any additional prometheus labels -
nfd-master logs can provide further details.
2023-08-07 09:37:29 +03:00
Markus Lehtonen
b90f2c318e
metrics: add nfd_node_update_failures_total counter
...
Add a new counter for tracking node update failures from nfd-master.
This tracks both normal feature updates and the --prune sub-command.
This is a simple counter without any additional labels - nfd-master logs
can be used for further diagnostics.
2023-08-07 09:37:27 +03:00
AhmedGrati
f0edc6532a
docs: add the support of the exipration date in the input format of the feature files
...
Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2023-08-05 20:39:09 +01:00
AhmedGrati
bd3ccf1e33
feat: add support for feature files expiration
...
Signed-off-by: AhmedGrati <ahmedgrati1999@gmail.com>
2023-08-05 20:38:44 +01:00
Kubernetes Prow Robot
9ed191808d
Merge pull request #1296 from marquiz/docs/metrics
...
docs: document -metrics flag in command line reference
2023-08-05 03:06:30 -07:00
Kubernetes Prow Robot
6caf554b4c
Merge pull request #1291 from marquiz/devel/master-renaming
...
nfd-master: use term node update instead of labeling
2023-08-04 09:22:24 -07:00
Kubernetes Prow Robot
35fbbaae99
Merge pull request #1294 from marquiz/devel/feature-file-comments
...
source/local: support comments in input
2023-08-04 07:58:21 -07:00
Markus Lehtonen
4b7ee47e5f
docs: document -metrics flag in command line reference
...
Document the -metrics command line flag in the command line reference of
nfd-master and nfd-worker.
2023-08-04 16:49:03 +03:00
Markus Lehtonen
06b333db1e
nfd-topology-updater: add metrics support
...
For now, add only one metric, a counter for the errors occurring while
scanning pod resources on the node.
2023-08-04 16:48:37 +03:00
Markus Lehtonen
4aa7a8f8f8
source/local: support comments in input
...
Lines starting with '#' are treated as comments and ignored when parsing
feature files and hook output.
2023-08-04 16:46:22 +03:00
Kubernetes Prow Robot
6d30ca9660
Merge pull request #1293 from marquiz/devel/feature-file-whitespace
...
source/local: trim whitespace from input
2023-08-04 06:38:24 -07:00
Kubernetes Prow Robot
1fa4178798
Merge pull request #1292 from marquiz/docs/notes
...
docs: unify formatting of NOTEs
2023-08-04 06:10:22 -07:00
Markus Lehtonen
181b4e0168
source/local: trim whitespace from input
...
Trim leading and trailing whitespace from the input (from feature files
and hooks). Makes it a bit more relaxed on the expected input format.
2023-08-04 15:24:46 +03:00
Markus Lehtonen
0a8b514d67
docs: unify formatting of NOTEs
2023-08-03 15:36:56 +03:00
Markus Lehtonen
039378c725
nfd-master: use term node update instead of labeling
...
Rename symbols and reword log messages to correlate with the
functionality (we may do other updates than just modify labels
nowadays).
2023-08-01 16:42:34 +03:00
Markus Lehtonen
d8f167d8a9
nfd-master: remove one stale empty line
2023-08-01 16:38:32 +03:00
Kubernetes Prow Robot
45dc46ab81
Merge pull request #1289 from marquiz/devel/metrics
...
docs: align metrics documentation with latest changes on naming
2023-08-01 06:20:39 -07:00
Markus Lehtonen
a1406767a9
docs: align metrics documentation with latest changes on naming
...
Also change table formatting and fix one incorrect description.
2023-08-01 15:53:06 +03:00
Kubernetes Prow Robot
c1cb63243b
Merge pull request #1288 from marquiz/devel/metrics
...
Improve metrics
2023-07-31 10:38:39 -07:00
Markus Lehtonen
5091fef84b
metrics: improve feature discovery duration metric
...
Rename the "NodeName" prometheus label to "node", aligning with
common prometheus/kubernetes conventions. Also reconfigure the
prometheus histogram buckets (now 10ms to 1s) to better match the
expected sample range.
2023-07-31 19:45:22 +03:00
Markus Lehtonen
47f621d970
metrics: improve the node updates gauge
...
Rename the metric, better describe what we're measuring and better
comply with prometheus naming conventions. Also change it to represent
actual updates of the node object on the Kubernetes apiserver.
2023-07-31 19:45:22 +03:00
Markus Lehtonen
945e7fcb3f
metrics: improve nfr processing time metric
...
Change the metric from a simple gauge (that basically was a single value
for the whole cluster) into a HistogramVec, aligning with the feature
discovery duration metric in nfd-worker. This improved metric now has
prometheus labels for the NFR name and node name, i.e. it is tracking
per-NFR metric for each node being processed. Also, change the naming to
better comply with prometheus suggested conventions.
2023-07-31 19:45:22 +03:00
Kubernetes Prow Robot
01ca8cb91d
Merge pull request #1284 from marquiz/devel/generator-deps
...
generate: bump tools to their latest versions
2023-07-31 06:32:39 -07:00
Kubernetes Prow Robot
e0f10a81de
Merge pull request #1256 from PiotrProkop/fix-topo-updater-policy-and-scope-advertisment
...
Fix Topology Manager policy and scope not being updated after NRT creation
2023-07-28 00:25:54 -07:00
Markus Lehtonen
7e375ad1f0
generate: bump tools to their latest versions
...
Bump tools versions and re-auto-generate files.
2023-07-27 14:29:48 +03:00
Kubernetes Prow Robot
65b7216313
Merge pull request #1283 from marquiz/docs/deprecation-policy
...
docs: deprecation policy for Helm chart params
2023-07-25 10:46:06 -07:00
Kubernetes Prow Robot
463a737b82
Merge pull request #1277 from marquiz/docs/k8s-compat
...
docs: describe supported Kubernetes versions
2023-07-25 08:54:06 -07:00
Markus Lehtonen
b1328b3166
docs: describe supported Kubernetes versions
2023-07-25 17:40:06 +03:00
Markus Lehtonen
b72b537261
docs: deprecation policy for Helm chart params
2023-07-24 14:06:30 +03:00
Kubernetes Prow Robot
73bdaa2e89
Merge pull request #1282 from jcpunk/podmon-labels
...
Add optional labels to the podmonitor
2023-07-24 03:40:12 -07:00
Pat Riehecky
0523257d1a
Add optional labels to the podmonitor
...
Signed-off-by: Pat Riehecky <riehecky@fnal.gov>
2023-07-21 10:03:50 -05:00