Commit Graph

8396 Commits

Author SHA1 Message Date
k8s-infra-cherrypick-robot 58fb98db0b [release-2.28] Deduplicate GraphQL node IDs in update-hashes to fix 502 err (#13069)
* Deduplicate GraphQL node IDs in update-hashes to fix 502

* Bump component_hash_update version to 1.0.1

Avoids stale pip/uv installation cache in CI pipelines
after the GraphQL deduplication fix.

---------

Co-authored-by: Hamza <12420351+0xMH@users.noreply.github.com>
2026-03-16 12:57:36 +05:30
Max Gautier f11cdac50b Patch versions updates (#13049)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-02-25 06:09:34 +05:30
k8s-infra-cherrypick-robot 00429b3d49 kubeadm_patches: remove old patches on inventory change (#13022)
Currently, if changing the inventory variable `kubeadm_patches`, new
patches will be created, but the existing ones will also be left on the
filesystem, and applied by kubeadm ; this means that removed or changed
configuration can linger.

Cleanup old patches (which are the difference between existing patches
on filesystem and the one created for the current runs).

Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-02-17 14:27:04 +05:30
Max Gautier 8d1174b26d Patch versions updates (#12996)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-02-12 15:10:01 +05:30
Max Gautier 0280234bd3 Patch versions updates (#12976)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-02-09 21:13:25 +05:30
k8s-infra-cherrypick-robot d4eefa6912 Make etcd node removal idempotent (#12961)
Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-02-05 14:52:32 +05:30
k8s-infra-cherrypick-robot 0d5bde6c54 Remove nifcloud terraform provider support (it is no longer available) (#12963)
The nifcloud terraform provider has been deleted, so remove support and
CI.

Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-02-05 12:28:31 +05:30
k8s-infra-cherrypick-robot 46a1b887cd wait for control plane node to become ready after joining (#12923)
When joining a control plane node and "upgrading" the cluster setup (for
example, to update etcd addresses after adding a new etcd) in the same
playbook run, the node can take a bit of time to become ready after
joining.
This triggers a kubeadm preflight check (ControlPlaneNodesReady) in
kubeadm upgrade, which is run directly after the join tasks.

Add a configurable wait for the control plane node to become Ready to
fix this race condition.

Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-01-29 15:49:50 +05:30
k8s-infra-cherrypick-robot 5ee67297bc etcd-certs: only change necessary permissions (#12915)
We currently **recursively** set the permissions of /etc/ssl/etcd/ssl
(default path) to 700. But this removes group permission from the files
under it, and certain composents (like calio with etcd datastore) rely
on it ; thus, the upgrade of a cluster can fail because the
calico-kube-controller can't access the certs, and thus the etcd.

This works in other case because as far as I can tell, the apiserver
which do access the etcd run as root (the owner of the files, not just
the "group owner")

We also for some reasons do this twice.

Only create the etcd cert directory with the correct permissions once,
not recursively.

Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-01-27 20:31:52 +05:30
Max Gautier 41d4ae9894 Patch versions updates (#12896)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-01-23 21:35:31 +05:30
k8s-infra-cherrypick-robot 5979110c2c cri-o: fix duplicate top-level "auths" keys in registry config template (#12885)
The config.json.j2 template was generating invalid JSON when multiple
crio_registry_auth entries were defined, resulting in multiple top-level
"auths" objects being rendered, e.g.:

{
  "auths": { "registry1": { "auth": "xxxx" } },
  "auths": { "registry2": { "auth": "yyyy" } }
}

This change moves the loop inside the "auths" object so that all registries
are rendered as siblings under a single "auths" key, producing valid JSON:

{
  "auths": {
    "registry1": { "auth": "xxxx" },
    "registry2": { "auth": "yyyy" }
  }
}

Co-authored-by: Martin Cahill <martin.cahill@gmail.com>
2026-01-20 19:58:51 +05:30
Max Gautier 183330f93d Patch versions updates (#12856)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2026-01-14 13:05:36 +05:30
k8s-infra-cherrypick-robot 7bc37640d7 Fix ansible-lint config error (#12864)
Co-authored-by: Max Gautier <mg@max.gautier.name>
2026-01-13 20:27:38 +05:30
Max Gautier 9ccd606ac1 Patch versions updates (#12802)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-12-20 21:08:33 -08:00
k8s-infra-cherrypick-robot 84dbb84386 Let containerd create storage / state dir (#12807)
Containerd manages by itself, so there is no need to override it and
change permissions.

Co-authored-by: Max Gautier <mg@max.gautier.name>
2025-12-20 06:44:30 -08:00
Max Gautier 360ea8238e Patch versions updates (#12745)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-11-25 01:08:37 -08:00
k8s-infra-cherrypick-robot c6050f4b10 [release-2.28] CI: enable unsafe_show_logs == true by default (#12727)
* CI: enable unsafe_show_logs == true by default

* Deduplicate defaults vars (unsafe_show_logs)

---------

Co-authored-by: Max Gautier <mg@max.gautier.name>
2025-11-19 23:50:00 -08:00
Max Gautier 0140fda589 Patch versions updates (#12720)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-11-18 02:13:39 -08:00
Max Gautier ebf9fedf9b Remove etcd member by peerURLs (#12690)
The way to obtain the IP of a particular member is convoluted and depend
on multiple variables. The match is also textual and it's not clear
against what we're matching

It's also broken for etcd member which are not also Kubernetes nodes,
because the "Lookup node IP in kubernetes" task will fail and abort the
play.

Instead, match against 'peerURLs', which does not need new variable, and
use json output.

- Add testcase for etcd removal on external etcd
2025-11-17 02:57:40 -08:00
Max Gautier f15f58b5b9 Update pre-commit hooks (#12699) 2025-11-15 21:49:36 -08:00
k8s-infra-cherrypick-robot 5505943ddc [release-2.28] CI: Try a full ssh connection on hosts instead of only checking the port (#12710)
* CI: Try a full ssh connection on hosts instead of only checking the port

If we only try the port, we can try to connect in the playbook which is
executed next even though the managed node has not yet completed it's
boot-up sequence ("System is booting up. Unprivileged users are not
permitted to log in yet. Please come back later. For technical details,
see pam_nologin(8).")

This does not account for python-less hosts, but we don't use those in
CI anyway (for now, at least).

* CI: Remove connection method override when creating VMs

This prevented wait_for_connection to work correctly by hijacking the
connection to localhost, thus bypassing the connection check.

---------

Co-authored-by: Max Gautier <mg@max.gautier.name>
2025-11-15 12:39:36 -08:00
k8s-infra-cherrypick-robot f9e82eb6f5 fix(cilium):correct loadBalancer.mode rendering in values.yaml (#12704)
Co-authored-by: Anurag Ojha <aojharaj2004@gmail.com>
2025-11-14 07:03:41 -08:00
Ali Afsharzadeh 3320c329e5 Upgrade cilium from 1.17.8 to 1.17.9 (#12648) 2025-10-29 19:54:01 -07:00
Ali Afsharzadeh 8f5e599cac [release-2.28] Upgrade cilium from 1.17.7 to 1.17.8 (#12570)
Signed-off-by: Ali Afsharzadeh <afsharzadeh8@gmail.com>
2025-10-23 23:07:35 -07:00
k8s-infra-cherrypick-robot bb91baf1cb Fix breakage when ignoring all kubeadm preflight errors (#12617)
kubeadm errors out if 'all' is specified with specific checks, so check
that case when we add hardcoded checks.

Add a test to catch regression.

Co-authored-by: Max Gautier <mg@max.gautier.name>
2025-10-13 21:45:32 -07:00
k8s-infra-cherrypick-robot 40e015f98f Fix cilium_policy_audit_mode variable (#12612)
Signed-off-by: Ali Afsharzadeh <afsharzadeh8@gmail.com>
Co-authored-by: Ali Afsharzadeh <afsharzadeh8@gmail.com>
2025-10-12 03:42:52 -07:00
Max Gautier 0a975f6447 Patch versions updates (#12607)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-10-09 09:57:01 -07:00
k8s-infra-cherrypick-robot 4471512232 Make main_ip cacheable in facts (#12601)
Co-authored-by: fox0430 <shuu.n@protonmail.com>
2025-10-09 02:23:03 -07:00
k8s-infra-cherrypick-robot 2336d54088 Fix calico etcd mode networkpolicy RBAC (#12587)
Co-authored-by: Chad Swenson <chadswen@gmail.com>
2025-09-29 10:36:17 -07:00
ChengHao Yang e52f788942 Add proxy_env to cilium install task for proxy (#12530)
Signed-off-by: ChengHao Yang <17496418+tico88612@users.noreply.github.com>
2025-09-09 03:53:30 -07:00
Chad Swenson ad7ab2cd37 [release-2.28] Fix automated cherrypick failure in UpgradeConfiguration.node (#12524) 2025-09-03 08:25:16 -07:00
Max Gautier 4fd9e6d7a0 Patch versions updates (#12519)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-09-03 01:07:15 -07:00
ChengHao Yang ba4044b90d Bump: Galaxy version to 2.28.2 (#12525)
Signed-off-by: ChengHao Yang <17496418+tico88612@users.noreply.github.com>
2025-09-02 23:15:14 -07:00
k8s-infra-cherrypick-robot a20891ab67 Fix SAN address collection from ansible_default_ipv{4,6} (#12505)
Signed-off-by: Hyeonki Hong <hhk7734@gmail.com>
Co-authored-by: Hyeonki Hong <hhk7734@gmail.com>
v2.28.1
2025-08-26 03:02:11 -07:00
Max Gautier 0858e46dc6 Patch versions updates (#12504)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-08-26 02:36:11 -07:00
ChengHao Yang d695115061 Fix: constant etcd_supported_version to dynamic (#12499)
Signed-off-by: ChengHao Yang <17496418+tico88612@users.noreply.github.com>
2025-08-25 07:15:10 -07:00
k8s-infra-cherrypick-robot 9a09ac5a40 Fix: Change "empty" definition for PodSecurity Admission configuration (#12478)
Fixes a bug where `kube-apiserver` fails to start if the PodSecurity
configuration file doesn't have the `apiVersion` and `kind` keys.

Signed-off-by: Alejandro Macedo <alex.macedopereira@gmail.com>
Co-authored-by: Alejandro Macedo <alex.macedopereira@gmail.com>
2025-08-19 04:51:36 -07:00
k8s-infra-cherrypick-robot 3a1a2bd4f4 Argo CD : checksum support for the install url (#12467)
Fixes https://github.com/kubernetes-sigs/kubespray/issues/12223

Co-authored-by: Romain Lalaut <rlalaut@proton.me>
2025-08-17 20:09:07 -07:00
Ali Afsharzadeh 842e352767 [release-2.28] Upgrade cilium from 1.17.3 to 1.17.7 (#12471)
Signed-off-by: Ali Afsharzadeh <afsharzadeh8@gmail.com>
2025-08-17 19:11:07 -07:00
ChengHao Yang 4bb24b253c Patch versions updates (#12462)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-08-15 13:23:07 -07:00
k8s-infra-cherrypick-robot f815c80139 bug: fix missing cilium_enable_bgp_control_plane config (#12432)
Co-authored-by: XuhuiSun95 <ericsun1995@gmail.com>
2025-07-26 22:10:27 -07:00
Max Gautier 86fcc2ba59 Patch versions updates (#12431)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-07-25 01:18:28 -07:00
k8s-infra-cherrypick-robot e7cf546f37 Fix#12385 cilium typo (#12404)
Signed-off-by: wangsifei99 <wangsifei@kylinos.cn>
Co-authored-by: wangsifei99 <wangsifei@kylinos.cn>
2025-07-15 08:14:24 -07:00
k8s-infra-cherrypick-robot a59585b6dc Fixed syntax error in _bgp_config dict (#12394)
Co-authored-by: mathgaming <mathgaming@outlook.com>
2025-07-11 00:31:28 -07:00
k8s-infra-cherrypick-robot 2d0cc2b4ea Fix kubeadm upgrade node skipPhases with multiple CP nodes (#12384)
Add 1.32 conditional defaults

Restore support for kubeadm upgrade node --skip-phases < 1.32, apply still needs to be restricted

Co-authored-by: Chad Swenson <chadswen@gmail.com>
2025-07-07 23:35:26 -07:00
k8s-infra-cherrypick-robot 9a08afa3f9 [release-2.28] Cilium: Pass cluster DNS to hubble.peerService in values.yaml.j2 (#12374)
* cilium: pass cluster DNS to hubble.peerService in values.yaml.j2

* Add dedicated Hubble variable defaulting to inventory cluster domain

---------

Co-authored-by: Mustafa Mertcan CAM <mertcancam@gmail.com>
2025-07-04 01:23:25 -07:00
k8s-infra-cherrypick-robot 22e9335484 fix(kubeadm): Conditionally add --skip-phases flag for v1.32.0+ (#12354)
Signed-off-by: bo.jiang <bo.jiang@daocloud.io>
Co-authored-by: bo.jiang <bo.jiang@daocloud.io>
2025-06-28 05:44:29 -07:00
k8s-infra-cherrypick-robot 4789e9dd89 fix ETCD_INITIAL_CLUSTER config in etcd.env and etcd-events.env (#12352)
Co-authored-by: liuxu <liuxu623@gmail.com>
2025-06-27 23:54:29 -07:00
k8s-infra-cherrypick-robot 9a86253beb fix: add cilium extraConfig values (#12338)
Co-authored-by: atobaum <atobaum@gmail.com>
2025-06-24 00:00:31 -07:00
ChengHao Yang 1e76d9113b Patch versions updates (#12330)
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
2025-06-23 19:22:28 -07:00