Document why cuda-samples is pinned to v12.5 rather than the latest
tag: it has to match the CUDA 12.5 toolkit in the base image and the
cuda-demo-suite-12-5 apt package used on x86_64. v13+ cuda-samples
also requires CUDA Toolkit 13.x and switched from make to CMake, so
bumping is a coordinated change across base image, apt package, git
tag, and build commands.
Signed-off-by: Davanum Srinivas <davanum@gmail.com>
The [Feature:GPUDevicePlugin] Sanity test embeds
`apt-get install -y cuda-demo-suite-12-5` under `set -e`. NVIDIA's CUDA
apt repo publishes cuda-demo-suite-* for x86_64 but NOT for sbsa
(confirmed against the public Packages index on
developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/{sbsa,x86_64}/).
On arm64 the install fails, the container exits 1, pod.Status.Phase
becomes Failed, and the subsequent `gomega.Expect(... .Equal(Succeeded))`
assertion trips.
Split the demo phase on architecture. On x86_64 keep the existing apt
path unchanged. On anything else, build deviceQuery / vectorAdd /
bandwidthTest from the public NVIDIA/cuda-samples repo instead.
busGrind is exclusive to cuda-demo-suite (no source equivalent in
cuda-samples) and is skipped on non-x86_64.
The pattern is the one already in production use by
sigs.k8s.io/dra-driver-nvidia-gpu in tests/bats/specs/gpu-cuda-demo-suite.yaml,
which has been green on Lambda gpu_1x_gh200.
Signed-off-by: Davanum Srinivas <davanum@gmail.com>