Skip to content

Conversation

@joshdk
Copy link

@joshdk joshdk commented Jan 23, 2026

Implements an additional healthcheck for HTTPRoute resources which have a failed status for the "Reconciled" condition type.

Also includes a minor refactor to make it easier to manage the list of condition types to check against.

Checklist:

  • Either (a) I've created an enhancement proposal and discussed it with the community, (b) this is a bug fix, or (c) this does not need to be in the release notes.
  • The title of the PR states what changed and the related issues number (used for the release note).
  • The title of the PR conforms to the Title of the PR
  • I've included "Closes [ISSUE #]" or "Fixes [ISSUE #]" in the description to automatically close the associated issue.
  • I've updated both the CLI and UI to expose my feature, or I plan to submit a second PR with them.
  • Does this PR require documentation updates?
  • I've updated documentation as required by this PR.
  • I have signed off all my commits as required by DCO
  • I have written unit and/or e2e tests for my change. PRs without these are unlikely to be merged.
  • My build is green (troubleshooting builds).
  • My new feature complies with the feature status guidelines.
  • I have added a brief description of why this PR is necessary and/or what this PR solves.
  • Optional. My organization is added to USERS.md.
  • Optional. For bug fixes, I've indicated what older releases this fix should be cherry-picked into (this may or may not happen depending on risk/complexity).

@joshdk joshdk requested a review from a team as a code owner January 23, 2026 20:18
@bunnyshell
Copy link

bunnyshell bot commented Jan 23, 2026

🔴 Preview Environment stopped on Bunnyshell

See: Environment Details | Pipeline Logs

Available commands (reply to this comment):

  • 🔵 /bns:start to start the environment
  • 🚀 /bns:deploy to redeploy the environment
  • /bns:delete to remove the environment

@joshdk
Copy link
Author

joshdk commented Jan 23, 2026

// For some additional context which doesn't necessarily belong in the git history, but will hopefully help anyone searching the same error...


We are using the GKE Gateway Controller. Some of the relevant Kubernetes resources are backed by GCP resources, which have their own restrictions and limits.

One specific example is a 64 KB limit on the size of a URL map (see https://docs.cloud.google.com/load-balancing/docs/quotas#:~:text=Size%20of%20URL%20maps). This limit can be hit if enough HTTPRoute resources (with their associated spec/rules) are admitted to the cluster.

When this happens the following error message can be observed in the HTTPRoute status/parents/conditions:

apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute

status:
  parents:
    - conditions:
        - lastTransitionTime: "2026-01-23T18:42:18Z"
          observedGeneration: 1
          reason: ReconciliationFailed
          status: "False"
          type: Reconciled
          message: |
            error cause: gceSync: Update: generic::invalid_argument: Invalid resource usage: ''Size of the url map exceeds the maximum permitted size''.

An example fix for this is to delete a sufficient number of HTTPRoute resources, then wait for the Gateway Controller to re-reconcile after ~5 minutes...


ArgoCD currently marks this resource as healthy. This change properly marks these resources as degraded. 💔

Thank you!

Implements an additional healthcheck for HTTPRoute resources which have
a failed status for the "Reconciled" condition type.

Also includes a minor refactor to make it easier to manage the list of
condition types to check against.

Signed-off-by: Josh Komoroske <jdkomo@gmail.com>
@joshdk
Copy link
Author

joshdk commented Jan 26, 2026

@chansuke @lindeskar (authors who have also touched these files) in case you have opinions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant