chore(deps): update aks/aks-gpu-cuda docker tag to v595#8439

Open
renovate[bot] wants to merge 1 commit into main from renovate/major-nvidia-gpu-cuda
@renovate renovate Bot commented Apr 30, 2026

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| aks/aks-gpu-cuda | major | 580.126.09-20260126030251 → 595.71.05-20260623180420 |
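
As a rough illustration of why this change counts as a major update, the sketch below splits each tag and compares the leading driver number. It assumes the tag layout is `<driver version>-<build timestamp>`, which is inferred only from the two tags above; `splitTag`, `driverMajor`, and `isMajorBump` are hypothetical helpers, not Renovate's own versioning logic.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// splitTag splits a tag of the ASSUMED form "<driver>-<timestamp>".
func splitTag(tag string) (string, string, error) {
	driver, stamp, ok := strings.Cut(tag, "-")
	if !ok || driver == "" || stamp == "" {
		return "", "", fmt.Errorf("tag %q is not of the form <driver>-<timestamp>", tag)
	}
	return driver, stamp, nil
}

// driverMajor returns the first dot-separated component of the driver version.
func driverMajor(tag string) (int, error) {
	driver, _, err := splitTag(tag)
	if err != nil {
		return 0, err
	}
	majorStr, _, _ := strings.Cut(driver, ".")
	return strconv.Atoi(majorStr)
}

// isMajorBump reports whether newTag has a higher driver major than oldTag.
func isMajorBump(oldTag, newTag string) (bool, error) {
	oldMajor, err := driverMajor(oldTag)
	if err != nil {
		return false, err
	}
	newMajor, err := driverMajor(newTag)
	if err != nil {
		return false, err
	}
	return newMajor > oldMajor, nil
}

func main() {
	// Both tags are copied from the update table in this PR.
	bump, err := isMajorBump("580.126.09-20260126030251", "595.71.05-20260623180420")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("major bump:", bump)
}
```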

Configuration

📅 Schedule: (UTC)

  • Branch creation
    • At any time (no schedule defined)
  • Automerge
    • At any time (no schedule defined)

🚦 Automerge: Enabled.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

Copilot AI review requested due to automatic review settings April 30, 2026 21:19
@renovate renovate Bot added the renovate This pull request was created by renovate label Apr 30, 2026
@renovate renovate Bot removed the request for review from Copilot April 30, 2026 21:19
@renovate renovate Bot enabled auto-merge (squash) April 30, 2026 21:19
@github-actions github-actions Bot added the components This pull request updates cached components on Linux or Windows VHDs label Apr 30, 2026
Copilot AI review requested due to automatic review settings May 14, 2026 17:03
@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from 76d0af9 to cdf19ae Compare May 14, 2026 17:03

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from cdf19ae to ff06fd8 Compare May 14, 2026 18:13
@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from ff06fd8 to 9f8dfa3 Compare May 19, 2026 17:38
Copilot AI review requested due to automatic review settings May 19, 2026 17:38

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from 9f8dfa3 to fcf7a49 Compare May 20, 2026 01:17
Copilot AI review requested due to automatic review settings May 22, 2026 23:40
@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from fcf7a49 to 274446a Compare May 22, 2026 23:40

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from 274446a to 2b31591 Compare May 29, 2026 16:34
@renovate renovate Bot changed the title from "chore(deps): update aks/aks-gpu-cuda docker tag to v595" to "chore(deps): update nvidia-gpu-cuda to v595" Jun 2, 2026
@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from 2b31591 to 8298b25 Compare June 9, 2026 17:56
Copilot AI review requested due to automatic review settings June 9, 2026 17:56

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@aks-node-assistant

AgentBaker Linux PR gate — Single NetworkIsolated localdns-exporter assertion failure (test-assertion issue, not PR-caused)

  • Run: 167348479 (failed)
  • Failed task: Run AgentBaker E2E
  • Test summary: DONE 457 tests, 95 skipped, 2 failures in 1837.481s (notably NO fwupd hits in this run)
  • Failing scenario: Test_Ubuntu2204_ArtifactStreaming_NetworkIsolatedCluster/default

Exact failure signature:

validation.go:82: 🔴 FAIL:
  validate_localdns_exporter_metrics.go:71
  Error: Not equal: expected: "0"  actual: "1"
  Test:   Test_Ubuntu2204_ArtifactStreaming_NetworkIsolatedCluster/default
  Messages: localdns exporter metrics validation failed

Node provisions cleanly, kubelet ready, CSE OK, ethtool/waagent/node-exporter all pass; the failing assertion is inside validate_localdns_exporter_metrics.go:71 comparing an expected metric value of "0" to an actual "1" after the test sends ~1000 DNS queries through localdns. The localdns exporter on port 9353 is up, the metrics endpoint responds with a populated payload (localdns_memory_usage_bytes, localdns_cpu_usage_seconds_total, localdns_vnetdns_forward_info{...} 1, etc.), so the issue is the assertion's expected value, not a missing/unreachable exporter.
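
A minimal Go sketch of how a payload like the one described above could be read into per-series values, which helps confirm which series actually carries the "1". It assumes the exporter serves Prometheus-style text lines of the form `name{labels} value` with no trailing timestamp. `parseSamples`, the `sample` type, the numeric values, and the `placeholder` label are all hypothetical: the log elides the real labels, and this thread does not show the validator's own parsing code.

```go
package main

import (
	"bufio"
	"fmt"
	"strconv"
	"strings"
)

// sample is one parsed line of a Prometheus-style text payload.
type sample struct {
	Name   string  // metric name without labels
	Labels string  // raw label block such as `{k="v"}`, or "" if absent
	Value  float64 // numeric sample value
}

// parseSamples reads "name{labels} value" lines, skipping blanks and comments.
// Simplification: lines with a trailing timestamp are not supported.
func parseSamples(payload string) ([]sample, error) {
	var out []sample
	sc := bufio.NewScanner(strings.NewReader(payload))
	for sc.Scan() {
		line := strings.TrimSpace(sc.Text())
		if line == "" || strings.HasPrefix(line, "#") {
			continue
		}
		idx := strings.LastIndex(line, " ")
		if idx < 0 {
			return nil, fmt.Errorf("malformed sample line %q", line)
		}
		series, raw := strings.TrimSpace(line[:idx]), line[idx+1:]
		v, err := strconv.ParseFloat(raw, 64)
		if err != nil {
			return nil, fmt.Errorf("bad value in %q: %w", line, err)
		}
		name, labels := series, ""
		if i := strings.Index(series, "{"); i >= 0 {
			name, labels = series[:i], series[i:]
		}
		out = append(out, sample{Name: name, Labels: labels, Value: v})
	}
	return out, sc.Err()
}

func main() {
	// Hypothetical payload: metric names come from the report, values and
	// the label are placeholders.
	payload := `# comment lines are skipped
localdns_memory_usage_bytes 1024
localdns_cpu_usage_seconds_total 0.5
localdns_vnetdns_forward_info{placeholder="hypothetical"} 1
`
	samples, err := parseSamples(payload)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	for _, s := range samples {
		fmt.Printf("%s%s = %g\n", s.Name, s.Labels, s.Value)
	}
}
```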

Three-level analysis:

  1. L1: assertion expects metric 0, gets 1, on a NetworkIsolated 22.04 ArtifactStreaming scenario.
  2. L2 corroboration: all other validators in this scenario pass, and the rest of the run is green (only 2 failures out of 457). The other failure is a single companion sub-scenario of the same test. No fwupd, no CSE error, no infra timeout. PR #8439 changes nvidia-gpu-cuda to v595, which touches the GPU driver path only: not the localdns exporter, not NetworkIsolated, not the 22.04 base.
  3. L3 challenge, alternatives considered: (a) PR-caused: refuted, since nvidia-gpu-cuda has zero overlap with localdns exporter assertions on a non-GPU 22.04 scenario; (b) infra flake: weaker than usual, since the node and exporter are healthy. This looks like test-assertion drift: the expected value was probably tightened recently and no longer matches real-world counters after the load-priming step generates legitimate traffic. The strongest alternative is a recently tightened assertion in validate_localdns_exporter_metrics.go, i.e. a test-code regression in main, not in this PR.

Build-vs-test: test-code/assertion (not PR-caused, not product regression).
Confidence: HIGH that PR #8439 is not the cause; MEDIUM-HIGH that this is a recently-introduced assertion mismatch in e2e/validate_localdns_exporter_metrics.go:71.

Recommended next action / owner: NodeSIG-dev / E2E owner of validate_localdns_exporter_metrics.go — review the line 71 assertion (which counter is expected to be exactly 0 after sending 1000 priming queries?). The exporter is reporting localdns_kubedns_forward_info{...} 1 and localdns_vnetdns_forward_info{...} 1 for several blocks; either the assertion should expect ≥0 / >0, or the priming step should not target the metric under test. PR author: do NOT block merge on this; this is the only failing scenario and it's outside your PR's scope.
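
To make the two options concrete, here is a small Go sketch contrasting a strict "exactly 0" comparison with a relaxed "greater than 0" check against the observed value "1". The real assertion at line 71 is not shown in this thread, so `checkExactZero` and `checkPositive` are hypothetical stand-ins that only mirror the shape of the reported failure (string values, expected "0", actual "1").

```go
package main

import (
	"fmt"
	"strconv"
)

// checkExactZero mirrors the shape of the failing comparison:
// the metric's string value must be exactly "0".
func checkExactZero(actual string) error {
	if actual != "0" {
		return fmt.Errorf("not equal: expected %q, actual %q", "0", actual)
	}
	return nil
}

// checkPositive is one possible relaxed alternative: any numeric value
// strictly greater than zero passes.
func checkPositive(actual string) error {
	v, err := strconv.ParseFloat(actual, 64)
	if err != nil {
		return fmt.Errorf("value %q is not numeric: %w", actual, err)
	}
	if v <= 0 {
		return fmt.Errorf("expected value > 0, actual %q", actual)
	}
	return nil
}

func main() {
	observed := "1" // the value reported in the failure signature
	fmt.Println("exact-zero check:", checkExactZero(observed))
	fmt.Println("positive check:", checkPositive(observed))
}
```

Which check is appropriate depends on what the metric at line 71 is meant to represent; if it counts something that legitimate priming traffic increments, the strict form will keep failing on healthy nodes.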

Posted by Clawpilot AgentBaker gate detective.

@renovate renovate Bot changed the title from "chore(deps): update nvidia-gpu-cuda to v595" to "chore(deps): update nvidia-gpu-cuda (major)" Jun 22, 2026
@renovate renovate Bot force-pushed the renovate/major-nvidia-gpu-cuda branch from 8298b25 to 942ce87 Compare June 23, 2026 18:53
@renovate renovate Bot changed the title from "chore(deps): update nvidia-gpu-cuda (major)" to "chore(deps): update aks/aks-gpu-cuda docker tag to v595" Jun 25, 2026
Labels

  • components: This pull request updates cached components on Linux or Windows VHDs
  • renovate: This pull request was created by renovate
