NimTechnology

Presenting CLOUD technologies in an easy-to-understand way.

[Rancher/EKS] Rancher v2.12.x and later does not work on an EKS cluster.

Posted on April 15, 2026 by nim

Rancher v2.12 introduces a new /ext endpoint served by an internal Extension API Server that listens only on localhost:6666 on the management/local cluster. The port was chosen to match the Imperative API RFC defined by the Rancher engineering team. The endpoint is registered in Kubernetes as an aggregated API service (v1.ext.cattle.io) through the standard Kubernetes API Aggregation Layer, which means it is fully accessible through kubectl.

You may run into the case where, after upgrading Rancher to v2.12.x or later, Rancher stops working and you can no longer access it.

Root cause chain

  1. EKS security group missing port 6666. The worker node SG (sg-03da3b5b8c78bffcf) allowed inbound traffic
    from the EKS control plane SG (sg-0d32d906a18ed9f0c) on ports 443, 4443, 6443, 8443, 9443, 10250 and 10251,
    but not on port 6666. The kube-apiserver uses this port to call back to Rancher's extension APIService
    (v1.ext.cattle.io), so it could never probe it successfully → FailedDiscoveryCheck with exponential
    backoff.
  2. Crash loop from backoff. When Rancher starts, it waits up to ~5 minutes for the kube-apiserver to call its
    imperative API on port 6666. With the backoff already longer than 5 minutes, the kube-apiserver never called
    during startup → FATAL: kube-apiserver did not contact the rancher imperative api in time
  3. Stuck namespace. cattle-provisioning-capi-system had been stuck in Terminating since April 13 because the GC
    controller's discovery of ext.cattle.io/v1 kept failing, generating hundreds of failing helm-operation pods.
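
The first link in this chain can be checked from the cluster itself. The following is a minimal sketch, assuming kubectl is already pointed at the Rancher local cluster. It reads the APIService named above and reports the reason on its Available condition. The function names availability and fetch_apiservice, and the fallback reason string, are hypothetical and only for illustration.

```python
from __future__ import annotations

import json
import subprocess


def availability(apiservice: dict) -> tuple[bool, str]:
    """Return (is_available, reason) from an APIService object."""
    for cond in apiservice.get("status", {}).get("conditions", []):
        if cond.get("type") == "Available":
            return cond.get("status") == "True", cond.get("reason", "")
    # No Available condition found; this reason string is a placeholder of ours.
    return False, "NoAvailableCondition"


def fetch_apiservice(name: str = "v1.ext.cattle.io") -> dict:
    """Fetch the APIService as JSON via kubectl (must be on PATH)."""
    out = subprocess.run(
        ["kubectl", "get", "apiservice", name, "-o", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)


# Usage, against a live cluster:
#   ok, reason = availability(fetch_apiservice())
#   print("available" if ok else f"unavailable: {reason}")
```

On a cluster hit by this problem, the reason would be expected to read FailedDiscoveryCheck, matching item 1 above.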

How to fix:

At this point you need to find the Instance Security Group, that is, the security group attached to the Auto Scaling Group or to the instances.

Then add an inbound rule for TCP 6666.
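
If you prefer to script this step instead of using the console, here is a minimal sketch with boto3 (a third-party package that must be installed, with AWS credentials configured). It assumes the source of the rule is the control plane security group, as in the diagram further down. The function names and the rule description text are hypothetical.

```python
def build_ingress_rule(source_sg: str, port: int = 6666) -> dict:
    """Build one IpPermissions entry allowing TCP `port` from `source_sg`."""
    return {
        "IpProtocol": "tcp",
        "FromPort": port,
        "ToPort": port,
        "UserIdGroupPairs": [
            # The description is free text chosen for this example.
            {"GroupId": source_sg, "Description": "kube-apiserver to Rancher ext API"}
        ],
    }


def allow_port(node_sg: str, source_sg: str, port: int = 6666) -> None:
    """Add the inbound rule to the node security group."""
    import boto3  # imported lazily so build_ingress_rule works without boto3

    ec2 = boto3.client("ec2")
    ec2.authorize_security_group_ingress(
        GroupId=node_sg,
        IpPermissions=[build_ingress_rule(source_sg, port)],
    )


# Usage with the IDs from this post (replace them with your own):
#   allow_port("sg-03da3b5b8c78bffcf", "sg-0d32d906a18ed9f0c")
```

Referencing the control plane SG as the source, rather than a CIDR range, keeps the rule as narrow as the existing ones on the node SG.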

EKS Console (Control Plane SGs)
  ├── Cluster security group:    sg-0cac9f294862929b4  ← applied to control plane ENIs
  └── Additional security groups: sg-0d32d906a18ed9f0c  ← also on control plane ENIs
  EC2 / Node Group level (Worker Node SGs)
  └── Instance security group:  sg-03da3b5b8c78bffcf  ← attached to the EC2 instances

  You can verify sg-03da3b5b8c78bffcf in the AWS console at:
  • EC2 → Instances → click any worker node → Security tab → you'll see it there
  • OR: EKS → Clusters → devsecops-mdaas → Compute → Node Groups → click the node group → Launch template → the
    SG is set in the launch template

  The reason port 6666 had to be added to the worker node SG is that the control plane SG
  (sg-0d32d906a18ed9f0c) was already allowed inbound on specific ports (443, 4443, 6443, 8443, 9443, 10250,
  10251) but NOT on port 6666. The fix added that missing rule:

  Node SG (sg-03da3b5b8c78bffcf)  ←  allow TCP 6666  ←  Control Plane SG (sg-0d32d906a18ed9f0c)

  If the Cluster security group (sg-0cac9f294862929b4) had been applied to the worker nodes as well, it would have
  allowed all traffic in both directions between the control plane and the nodes automatically, and this port
  6666 issue would never have occurred. That is the standard EKS recommendation, but this cluster uses a custom
  node SG instead.
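
To confirm whether a node security group already lets the control plane reach port 6666, the rule check can be sketched as below. It is an illustration under assumptions: the function name allows_port is hypothetical, the input is one entry of the SecurityGroups list returned by boto3's describe_security_groups, and only rules that reference a source security group are considered (CIDR rules are ignored).

```python
def allows_port(group: dict, source_sg: str, port: int = 6666) -> bool:
    """True if `group` allows inbound TCP `port` from `source_sg`."""
    for perm in group.get("IpPermissions", []):
        proto = perm.get("IpProtocol")
        if proto == "-1":
            # "-1" means all protocols and all ports.
            in_range = True
        elif proto == "tcp":
            in_range = perm.get("FromPort", 0) <= port <= perm.get("ToPort", 65535)
        else:
            continue
        sources = {pair.get("GroupId") for pair in perm.get("UserIdGroupPairs", [])}
        if in_range and source_sg in sources:
            return True
    return False


# Usage against the live account (requires boto3 and credentials):
#   import boto3
#   sg = boto3.client("ec2").describe_security_groups(
#       GroupIds=["sg-03da3b5b8c78bffcf"])["SecurityGroups"][0]
#   print(allows_port(sg, "sg-0d32d906a18ed9f0c"))
```

An all-traffic rule also passes this check, which is the situation described above for nodes that share the Cluster security group.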

Copyright © 2026 NimTechnology.