Sam Dowell
1c1f00a5f4
fix: add RV check on GC delete calls
...
It was possible that the object was changed between the live Get and
Delete calls while processing an attempt to delete, causing incorrect
deletion of objects by the garbage collector. A defensive
resourceVersion precondition is added to the delete call to ensure that
the object was properly classified for deletion.
2025-07-02 11:01:56 -07:00
Kubernetes Prow Robot
4186edc4d1
Merge pull request #132615 from mimowo/commonize-pod-indexing
...
Commonize filtering of Pods by Owner with all orphans in namespace
2025-07-02 02:03:32 -07:00
Kubernetes Prow Robot
a735818b7a
Merge pull request #132533 from nojnhuh/dra-orphan-claim
...
DRA: fix deleting orphaned ResourceClaim on startup
2025-07-02 02:03:25 -07:00
Michal Wozniak
6d5e0bf2a2
review remarks
2025-07-01 16:59:19 +02:00
Michal Wozniak
ac86e67b7d
Commonize filtering of Pods by Owner with all orphans in namespace
2025-06-30 08:07:21 +02:00
Huy Pham
b2f27c0649
fix: Truncate too long Deployment name in RS name (#132560)
...
* fix: Truncate too long Deployment name in RS name
* fix: lint & adjust unit tests
* fix: use const for "-" & unit tests
* Add test case for very long hash
* Explicitly define expected deployment name portion
2025-06-27 16:32:29 -07:00
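A rough sketch of the truncation idea follows. It is an illustration under stated assumptions, not the code from this change: `replicaSetName` and its `maxLen` parameter are hypothetical, and the actual length limit and helper names used upstream are not given in this log. The sketch keeps the hash suffix intact and shortens only the Deployment name portion; it slices bytes, which assumes ASCII names.

```go
package main

import "fmt"

// sep joins the Deployment name portion and the hash.
const sep = "-"

// replicaSetName builds "<deployment>-<hash>" and trims the deployment
// portion so that the result fits in maxLen bytes where possible.
// maxLen is an assumed parameter; the real limit is not stated here.
func replicaSetName(deployment, hash string, maxLen int) string {
	suffix := sep + hash
	room := maxLen - len(suffix)
	if room < 0 {
		// The hash alone exceeds the limit; no deployment portion is kept.
		room = 0
	}
	if len(deployment) > room {
		deployment = deployment[:room]
	}
	return deployment + suffix
}

func main() {
	fmt.Println(replicaSetName("web", "123", 8))
	fmt.Println(replicaSetName("abcdefghij", "123", 8))
}
```

Keeping the hash whole matters because it is the part that distinguishes one ReplicaSet of a Deployment from another.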
Jon Huhn
f1845218e2
fixup! DRA: fix deleting orphaned ResourceClaim on startup
2025-06-26 23:21:18 -05:00
Kubernetes Prow Robot
efd2a0d1f5
Merge pull request #132351 from googs1025/fix/hpa_memory
...
bugfix(hpa): introduce buildQuantity helper for consistent resource quantity
2025-06-26 11:02:35 -07:00
Jon Huhn
ef117edf35
DRA: fix deleting orphaned ResourceClaim on startup
2025-06-25 11:11:43 -05:00
googs1025
b50d508176
bugfix(hpa): introduce buildQuantity helper for consistent resource quantity creation
...
Signed-off-by: googs1025 <googs1025@gmail.com>
2025-06-25 08:23:53 +08:00
Kubernetes Prow Robot
5b1af0c8c2
Merge pull request #127655 from guozheng-shen/remove-usage
...
remove 'endpointsleases' and 'configmapsleases' from usage
2025-06-24 09:54:28 -07:00
Kubernetes Prow Robot
49c20d6f44
Merge pull request #132173 from dejanzele/feat/promote-job-pod-replacement-policy-ga
...
KEP-3939: Job Pod Replacement Policy; promote to GA
2025-06-24 07:04:28 -07:00
xigang
66c611125c
Add namespace-aware orphan pod indexing
...
Signed-off-by: xigang <wangxigang2014@gmail.com>
2025-06-19 16:32:20 +08:00
Kubernetes Prow Robot
f407bd6d24
Merge pull request #132254 from carlory/cleanup-MountContainers
...
Cleanup after Alpha feature MountContainers was removed
2025-06-18 17:24:50 -07:00
Kubernetes Prow Robot
8f1f17a04f
Merge pull request #132305 from xigang/job_index
...
Job controller optimization: reduce work duration time & minimize cache locking
2025-06-18 05:27:01 -07:00
xigang
91b4816c23
Optimize job controller performance: reduce work duration time & minimize cache locking
...
Signed-off-by: xigang <wangxigang2014@gmail.com>
2025-06-18 15:28:12 +08:00
Kubernetes Prow Robot
17e20ec9d4
Merge pull request #131281 from googs1025/add_miss_shutdown
...
chore: add missing Shutdown call for selinux_warning controller
2025-06-17 06:18:59 -07:00
Kubernetes Prow Robot
3e39d1074f
Merge pull request #132221 from dims/new-cmp-diff-impl
...
New implementation for `Diff` (drop in replacement for `cmp.Diff`)
2025-06-16 18:02:58 -07:00
Davanum Srinivas
03afe6471b
Add a replacement for cmp.Diff using json+go-difflib
...
Co-authored-by: Jordan Liggitt <jordan@liggitt.net>
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
2025-06-16 17:10:42 -04:00
Dejan Zele Pejchev
bccc9fe470
KEP-3939: Job Pod Replacement Policy; promote to GA
...
Signed-off-by: Dejan Zele Pejchev <pejcev.dejan@gmail.com>
2025-06-16 16:26:03 +02:00
Filip Křepinský
bdfa8839be
calculateStatus should use the same now time point for each pod
...
make IsPodAvailable time check inclusive
2025-06-14 18:39:15 +02:00
carlory
85bc3cb096
Remove GetExec method from VolumeHost
...
Signed-off-by: carlory <baofa.fan@daocloud.io>
2025-06-13 10:58:37 +08:00
aumpatel
db2555628c
Fix: HPA suppresses FailedRescale event on successful conflict retry
...
This change modifies the HPA controller to use retry.RetryOnConflict when updating a scale subresource. This prevents the controller from emitting a FailedRescale event on transient API conflicts if a subsequent retry succeeds. If the retry is successful, a SuccessfulRescale event is emitted. If all retries are exhausted and the conflict persists, the original FailedRescale event is emitted. This reduces event noise caused by race conditions where the scale subresource is updated by another process.
2025-06-12 07:35:07 -04:00
carlory
f0dde38234
Remove pluginName param from GetMounter and GetExec
...
Signed-off-by: carlory <baofa.fan@daocloud.io>
2025-06-12 17:29:17 +08:00
Kubernetes Prow Robot
089849ac22
Merge pull request #131822 from atiratree/replicationcontroller-terminating-replicas
...
disable terminatingReplicas reconciliation in ReplicationController
2025-06-10 15:17:01 -07:00
Kubernetes Prow Robot
a26f3fd5c6
Merge pull request #132109 from linxiulei/jobdelay
...
Clean backoff record earlier
2025-06-06 13:38:38 -07:00
Eric Lin
1f46b3fdbf
Clean backoff record earlier
...
On receiving a job deletion event, the controller cleans the backoff
records for that job before enqueueing it, which avoids a race condition
in which syncJob() may incorrectly use stale backoff records for a newly
created job with the same key.
Co-authored-by: Michal Wozniak <michalwozniak@google.com>
2025-06-06 18:31:38 +00:00
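The ordering this commit relies on can be shown with a small sketch. All names here (`controller`, `onJobDelete`, `failuresFor`) are hypothetical and much simpler than the job controller itself. The point is only that the backoff record is dropped in the delete handler, before the key is queued, so a later sync of a new job with the same key starts from a clean record.

```go
package main

import "fmt"

// controller is a hypothetical, heavily simplified job controller.
type controller struct {
	backoff map[string]int // failure counts keyed by "namespace/name"
	queue   []string       // keys waiting to be synced
}

// onJobDelete handles a deletion event: the backoff record is removed
// first, and only then is the key enqueued.
func (c *controller) onJobDelete(key string) {
	delete(c.backoff, key)
	c.queue = append(c.queue, key)
}

// failuresFor reports the failure count a sync would see for the key.
func (c *controller) failuresFor(key string) int {
	return c.backoff[key]
}

func main() {
	c := &controller{backoff: map[string]int{"default/job-a": 3}}

	// The old job is deleted; its backoff record goes away immediately.
	c.onJobDelete("default/job-a")

	// A new job with the same key is synced and sees no stale failures.
	fmt.Println(c.failuresFor("default/job-a")) // prints 0
}
```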
Kubernetes Prow Robot
a883be6e36
Merge pull request #132031 from atiratree/update-getRSPods
...
add orphanedPods parameter to getRSPods and improve code flow in syncReplicaSet
2025-06-03 12:10:39 -07:00
Kubernetes Prow Robot
62f72addf2
Merge pull request #120816 from tnqn/fix-unreachable-taint-delay
...
NoExecute taint should be added when a Node's ready condition becomes Unknown
2025-06-03 00:54:44 -07:00
Filip Křepinský
b7d16fea7f
disable terminatingReplicas reconciliation in ReplicationController
2025-05-30 21:08:12 +02:00
Filip Křepinský
aac00c1f0e
add orphanedPods parameter to getRSPods
...
and improve code flow in syncReplicaSet
2025-05-29 10:50:32 +02:00
Antonio Ojea
b9fec8bf4f
fix scheme import
...
Change-Id: I9a94c06b931031a1c2391184342fd5ffa79e3128
2025-05-15 13:46:48 +00:00
Kubernetes Prow Robot
b587977f7c
Merge pull request #131445 from natasha41575/renameObservedGenHelperFns
...
update godoc for and rename observedGeneration helpers
2025-05-14 11:39:19 -07:00
Kubernetes Prow Robot
1325262b5f
Merge pull request #130961 from hakuna-matatah/rs
...
Optimize RS Controller Performance: Reduce Work Duration Time & Minimize Cache Locking
2025-05-13 08:43:15 -07:00
Kubernetes Prow Robot
b8d9c12d1b
Merge pull request #131330 from aojea/servicecidr_fixes
...
servicecidr: only patch status if necessary
2025-05-12 17:53:16 -07:00
Harish Kuna
e42aba6c0c
Optimize RS Controller Performance: Reduce Work Duration Time & Minimize Cache Locking
2025-05-12 19:56:46 +00:00
Quan Tian
f718096b74
NoExecute taint should be added when a Node's ready condition becomes Unknown
...
After a Node has stopped posting heartbeats for nodeMonitorGracePeriod,
it is considered unreachable: its ready condition is set to Unknown, the
NoSchedule taint is added, and all Pods on it are set to NotReady.
However, there is always a delay of 5s before the NoExecute taint is
added to the Node, which adds 5s to the recovery time of Pods that are
supposed to be evicted by the taint and recreated on other Nodes.
The delay occurs because processTaintBaseEviction() uses the last observed
ready condition of the Node instead of the current one to determine
whether it should add the Node to the taint queue. When a Node is set to
unreachable due to missing heartbeats, the last observed ready condition
is still True while the current ready condition is Unknown, so the
current one should be used for processTaintBaseEviction().
Signed-off-by: Quan Tian <qtian@vmware.com>
2025-05-10 17:22:11 +08:00
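The effect of choosing the wrong condition can be sketched in a few lines. The `conditionStatus` type and `shouldQueueForNoExecute` function are hypothetical simplifications, not the node lifecycle controller's code. They only show that the last observed condition (True) misses the transition that the current condition (Unknown) catches.

```go
package main

import "fmt"

// conditionStatus is a hypothetical stand-in for a Node ready condition status.
type conditionStatus string

const (
	condTrue    conditionStatus = "True"
	condFalse   conditionStatus = "False"
	condUnknown conditionStatus = "Unknown"
)

// shouldQueueForNoExecute reports whether a Node with the given ready
// condition should be added to the taint queue.
func shouldQueueForNoExecute(ready conditionStatus) bool {
	return ready == condFalse || ready == condUnknown
}

func main() {
	// Heartbeats stopped: the controller just set the condition to Unknown,
	// but the last observed condition from the Node is still True.
	lastObserved := condTrue
	current := condUnknown

	fmt.Println(shouldQueueForNoExecute(lastObserved)) // prints false: taint delayed
	fmt.Println(shouldQueueForNoExecute(current))      // prints true: taint queued now
}
```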
Kubernetes Prow Robot
fa10ea63a6
Merge pull request #127050 from omerap12/podautoscaler-ExternalPerpodMetricReplicas-intmax
...
HPA: Fix int overflow in GetExternalPerPodMetricReplicas
2025-05-09 13:37:14 -07:00
Omer Aplatony
af1d60f30b
Add hpa reviewers
...
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2025-05-07 18:16:15 +00:00
Omer Aplatony
0acc7bd4dc
HPA: Fix int overflow in GetExternalPerPodMetricReplicas
...
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2025-05-07 16:26:27 +00:00
Kubernetes Prow Robot
d2507bb01a
Merge pull request #130806 from hakuna-matatah/master
...
Optimize Statefulset Controller Performance: Reduce Work Duration Time & Minimize Cache Locking.
2025-05-06 06:03:13 -07:00
Kubernetes Prow Robot
0b8133816b
Merge pull request #131477 from pohly/golangci-lint@v2
...
golangci-lint v2
2025-05-02 23:03:55 -07:00
Jordan Liggitt
6bb6c99342
Drop null creationTimestamp from test fixtures
2025-05-02 15:38:40 -04:00
Matthieu MOREL
4adb58565c
chore: bump golangci-lint to v2
...
Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>
2025-05-02 12:51:02 +02:00
Antonio Ojea
56e533f4a0
servicecidr: only patch status if necessary
...
Change-Id: I1fadec3e48bd3cb734658b8bfca58bb80ab911b9
2025-05-02 08:26:17 +00:00
Kubernetes Prow Robot
fe5afa919b
Merge pull request #130333 from kmala/job
...
handle job complete update delayed event
2025-04-25 17:55:22 -07:00
Natasha Sarkar
92359cdc69
update godoc for and rename observedGeneration helpers
2025-04-24 16:05:01 +00:00
Kubernetes Prow Robot
c59203e051
Merge pull request #121967 from torredil/update-logging
...
Update log verbosity for node health and taint checks
2025-04-24 06:22:34 -07:00
googs1025
e8dbfc0b6f
add missing Shutdown call for selinux_warning controller
2025-04-14 09:07:51 +08:00
Keerthan Reddy Mala
d4fd41285b
update the log message to reflect success and failed jobs
2025-04-08 10:21:02 -07:00