kubernetes

mirror of https://github.com/kubernetes/kubernetes.git synced 2026-03-23 19:04:33 -04:00

Author	SHA1	Message	Date
Kubernetes Prow Robot	4a1558c545	Merge pull request #133967 from pohly/dra-allocator-selection DRA: allocator selection	2025-09-30 08:24:18 -07:00
Patrick Ohly	60eeaa6ebd	DRA scheduler: add unit test for allocator selection This prevents the mistake from 1.34 where the default-on DRAResourceClaimDeviceStatus feature caused the use of the experimental allocator implementation. The test fails without a fix for that.	2025-09-30 16:53:38 +02:00
Patrick Ohly	7f57730ba4	DRA scheduler: fix selection of "incubating" allocator implementation In 1.34, the default feature gate selection picked the "experimental" allocator implementation when it should have used the "incubating" allocator. No harm came from that because the experimental allocator has all the necessary if checks to disable the extra code and no bugs were introduced when implementing it, but it means that our safety net wasn't there when we expected it to be. The reason is that the "DRAResourceClaimDeviceStatus" feature gate is on by default and was only listed as supported by the experimental implementation. This could be fixed by listing it as supported also by the other implementation, but that would be a bit odd because there is nothing to support for it (the reason why this was missed in 1.34!). Instead, the allocator features are now only indirectly related to feature gates, with a single boolean controlling the implementation of binding conditions.	2025-09-30 16:53:38 +02:00
Patrick Ohly	b5bcac998d	DRA scheduler: clean up feature gate handling Copying from feature.Features to new fields in the plugin got a bit silly with the long list of features that we have now. Embedding feature.Features is simpler. Two fields in feature.Features weren't named according to the feature gate, now they are named consistently and the fields are sorted.	2025-09-30 16:53:38 +02:00
hojinchoi	7028ba09db	fix: duplicated 'the' in comment	2025-09-18 18:11:44 +09:00
yliao	74cf1db218	sort the device requests in the extended resource claim spec. removed the sortClaim in the unit test.	2025-09-11 16:55:58 +00:00
yliao	79f8d1b1c5	fixed bug such that implicit extended resource name can always be used, no matter the explicit extendedResourceName field in device class is set or not.	2025-09-10 14:10:40 +00:00
Ania Borowiec	fadb40199f	Move interfaces: Handle and Plugin and related types from kubernetes/kubernetes to staging repo kube-scheduler	2025-09-02 09:42:53 +00:00
yliao	bf13cd1b81	added resourceClaimModified to bindClaim to decide whether to update assume cache	2025-08-29 16:12:55 +00:00
Abu Kashem	747a295cac	fix flake in dra test 'TestPlugin' TestPlugin/multi-claims-binding-conditions-all-success/PreEnqueue flakes due to the assumed cache not having been synced with the initial store. The test waits until the registered handler used by the assumed cache has synced before proceeding with the test	2025-08-18 15:54:03 -04:00
Abu Kashem	c8ab780edb	dra plugin: assume claim after api call in bindClaim	2025-08-13 16:35:35 -04:00
yliao	2a026f6d65	1/ added retries to AssumeClaimAfterAPICall for the object which is not present in the cache (dynamicresources.go) 2/ modified the assume cache verification to not error out as long as the expected claim is in the cache, no matter its latest and api object are different or not. (dynamicresources_test.go). 3/ fixed nil panic as seen from https://prow.k8s.io/view/gs/kubernetes-ci-logs/pr-logs/pull/133321/pull-kubernetes-integration/1952472629470302208	2025-08-06 07:08:58 +00:00
yliao	0a12f00e9d	fix nil panic in hasBindingConditions, it cannot assume claim has allocations	2025-07-30 14:44:41 +09:00
Sunyanan Choochotkaew	7f052afaef	KEP 5075: implement scheduler Signed-off-by: Sunyanan Choochotkaew <sunyanan.choochotkaew1@ibm.com>	2025-07-30 09:52:49 +09:00
yliao	34a64db2c7	extended resource backed by DRA: implementation	2025-07-29 18:55:21 +00:00
Kobayashi,Daisuke	e8c3af1f5c	KEP-5007 DRA Device Binding Conditions: Implement scheduler logic	2025-07-29 11:34:30 +00:00
Kubernetes Prow Robot	a11bc701e8	Merge pull request #132457 from ania-borowiec/depends_on_cluster_move_podinfo Moving Scheduler interfaces to staging: Move PodInfo and NodeInfo interfaces (together with related types) to staging repo, leaving internal implementation in kubernetes/kubernetes/pkg/scheduler	2025-07-24 09:38:27 -07:00
Ania Borowiec	aecd37e6fb	Moving Scheduler interfaces to staging: Move PodInfo and NodeInfo interfaces (together with related types) to staging repo, leaving internal implementation in kubernetes/kubernetes/pkg/scheduler	2025-07-24 12:10:58 +00:00
Kubernetes Prow Robot	89a01ec72a	Merge pull request #133019 from pohly/dra-scheduler-plugin-owners DRA scheduler plugin: add pohly as approver	2025-07-24 03:42:33 -07:00
Patrick Ohly	5c4f81743c	DRA: use v1 API As before when adding v1beta2, DRA drivers built using the k8s.io/dynamic-resource-allocation helper packages remain compatible with all Kubernetes releases >= 1.32. The helper code picks whatever API version is enabled from v1beta1/v1beta2/v1. However, the control plane now depends on v1, so a cluster configuration where only v1beta1 or v1beta2 are enabled without v1 won't work.	2025-07-24 08:33:45 +02:00
Ed Bartosh	c2a06e7912	DRA: skip flaky test case on Windows Added a skipOnWindows flag to DynamicResources scheduler test case to skip test that relies on nanosecond timer precision. Windows timer granularity is much coarser than Linux, which causes the test to fail often.	2025-07-23 11:06:11 +03:00
Patrick Ohly	bc338e7505	DRA scheduler: implement filter timeout and cancellation The intent is to catch abnormal runtimes with the generously large default timeout of 10 seconds. We have to set up a context with the configured timeout (optional!), then ensure that both CEL evaluation and the allocation logic itself properly returns the context error. The scheduler plugin then can convert that into "unschedulable". The allocator and thus Filter now also check for context cancellation by the scheduler. This happens when enough nodes have been found.	2025-07-17 21:18:28 +02:00
Patrick Ohly	025c606e39	DRA scheduler: add plugin configuration The only option is the filter timeout. The implementation of it follows in a separate commit.	2025-07-17 16:47:47 +02:00
Patrick Ohly	a2a3839a8e	DRA scheduler: add pohly as approver This is meant for simple changes, like code cleanup or API changes of the allocator code. For more complex changes and new features, SIG Scheduling approvers will be required to approve, as before.	2025-07-17 09:43:44 +02:00
yliao	dd3691b169	refactor allocator, removed claimsToAllocate from NewAllocator(), instead, passed it through Allocate()	2025-07-16 15:11:11 +00:00
Kubernetes Prow Robot	ab685237f0	Merge pull request #132391 from sanposhiho/pre-bind-pre-flight feat: add PreBindPreFlight and implement in in-tree plugins	2025-07-15 04:06:23 -07:00
Patrick Ohly	5caf7bca15	DRA allocator: refactor code The goal is to maintain different versions of the allocator logic. We already had one incident where adding an alpha feature caused a regression also when it was disabled. Not everything can be implemented within obviously correct if branches. This also opens the door for implementing different alternatives. The code just gets moved around for now.	2025-07-10 17:34:21 +02:00
Kensei Nakada	ebae419337	feat: add PreBindPreFlight and implement in in-tree plugins	2025-07-05 17:14:21 -07:00
Ania Borowiec	ee8c265d35	Move Code and Status from pkg/scheduler/framework to k8s.io/kube-scheduler/framework	2025-06-30 10:06:22 +00:00
Ania Borowiec	00d3750503	Move ClusterEvent type to staging repo, leaving some functions (that contain logic internal to scheduler) in kubernetes/kubernetes (#132190 ) * Move ClusterEvent type to staging repo, leaving some functions (that contain logic internal to scheduler) in kubernetes/kubernetes apply review comment and fix linter warning * update-vendor.sh * update doc comments * run update-vendor.sh	2025-06-26 08:06:29 -07:00
Davanum Srinivas	03afe6471b	Add a replacement for cmp.Diff using json+go-difflib Co-authored-by: Jordan Liggitt <jordan@liggitt.net> Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2025-06-16 17:10:42 -04:00
Ania Borowiec	d75af825fb	Extract interface CycleState and move it to staging repo. CycleState implementation remains in k/k/pkg/scheduler/framework	2025-05-29 16:18:36 +00:00
Kubernetes Prow Robot	8a6b916765	Merge pull request #130720 from saintube/scheduler-expose-nodeinfo-in-prefilter Expose NodeInfo to PreFilter plugins	2025-04-23 13:31:29 -07:00
saintube	8dc6806d26	Expose NodeInfo to PreFilter plugins and Framework Co-authored-by: Zhan Sheng <49895476+AxeZhan@users.noreply.github.com> Co-authored-by: shenxin <rougang.hrg@alibaba-inc.com> Signed-off-by: saintube <saintube@foxmail.com>	2025-03-21 14:55:25 +08:00
Cici Huang	f04cfdf6e7	Update gofmt.	2025-03-19 23:21:30 +00:00
Cici Huang	6d7f11689d	Complete feature impl, fix issues, add perDeviceNodeSelection support, add tests, address comments, etc.	2025-03-19 22:10:48 +00:00
Morten Torkildsen	ecba6cde1d	Allocator updates	2025-03-19 22:10:48 +00:00
Kubernetes Prow Robot	ab3cec0701	Merge pull request #130447 from pohly/dra-device-taints device taints and tolerations (KEP 5055)	2025-03-19 13:00:32 -07:00
Jon Huhn	5760a4f282	DRA scheduler: device taints and tolerations Thanks to the tracker, the plugin sees all taints directly in the device definition and can compare them against the tolerations of a request while trying to find a device for the request. When the feature is turned off, taints are ignored during scheduling.	2025-03-19 09:18:38 +01:00
Patrick Ohly	a027b439e5	DRA: add device taint eviction controller The controller is derived from the node taint eviction controller. In contrast to that controller it tracks the UID of pods to prevent deleting the wrong pod when it got replaced.	2025-03-19 09:18:38 +01:00
Patrick Ohly	d95d6ba526	DRA scheduler: fix potential panic during unit test verification If there was an unexpected status, the code extracting the expected error message crashed with a panic. Happened once so far, for unknown reasons because the unexpected status then didn't get logged.	2025-03-18 15:07:51 +01:00
Patrick Ohly	dfb8ab6521	DRA scheduler: fail in PreFilter when DRAPrioritizedList is disabled and used This was previously caught during Filter by the allocator check. Doing it sooner avoids wasting resources on a pod which ultimately cannot get scheduled. While at it, be a bit more clear about which feature is disabled. The user might not know that.	2025-03-07 08:45:32 +01:00
Morten Torkildsen	2229a78dfe	DRA: Update allocator for Prioritized Alternatives in Device Requests	2025-02-28 19:30:10 +00:00
Kubernetes Prow Robot	fc268ecd09	Merge pull request #129823 from googs1025/chore/log_improve fix(dra plugin): when there is no resourceclaim, return directly	2025-02-02 16:28:56 -08:00
googs1025	ed826dddfe	fix(dra plugin): when there is no resourceclaim, return directly	2025-01-29 08:47:52 +08:00
Davanum Srinivas	4e05bc20db	Linter to ensure go-cmp/cmp is used ONLY in tests Signed-off-by: Davanum Srinivas <davanum@gmail.com>	2025-01-24 20:49:14 -05:00
Paco Xu	2653caa248	fix dra test lint	2025-01-09 10:42:40 +08:00
googs1025	77eae7c34f	feature(scheduler): remove dra plugin resourceslice QueueingHintFn	2025-01-08 16:24:28 +08:00
Patrick Ohly	33ea278c51	DRA: use v1beta1 API No code is left which depends on the v1alpha3, except of course the code implementing that version.	2024-11-06 13:03:19 +01:00
Kuba Tużnik	8d489425aa	scheduler/dynamicresources: extract obtaining and tracking in-memory modifications of DRA objects All logic related to obtaining DRA objects and tracking modifications to ResourceClaims in-memory is extracted to DefaultDRAManager, which implements framework.SharedDRAManager. This is intended to be a no-op in terms of the DRA plugin behavior.	2024-11-05 14:11:04 +01:00

155 commits