Skip to content

Add docs for 26.3.0 release#353

Open
a-mccarthy wants to merge 4 commits intoNVIDIA:mainfrom
a-mccarthy:dev-26.3.0
Open

Add docs for 26.3.0 release#353
a-mccarthy wants to merge 4 commits intoNVIDIA:mainfrom
a-mccarthy:dev-26.3.0

Conversation

@a-mccarthy
Copy link
Collaborator

No description provided.

@github-actions
Copy link

Documentation preview

https://nvidia.github.io/cloud-native-docs/review/pr-353

Comment on lines 38 to 39
You must uninstall an existing installation and then install the Operator again.
Uninstalling the Operator interrupts services and applications that require access to NVIDIA GPUs.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
You must uninstall an existing installation and then install the Operator again.
Uninstalling the Operator interrupts services and applications that require access to NVIDIA GPUs.


#. Create a file, such as ``nvd-precomiled-some.yaml``, with contents like the following:

.. literalinclude:: ./manifests/input/nvd-precompiled-some.yaml
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we also change the driver version specified in these files?

@@ -397,4 +393,4 @@
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets use 580 here.

.. code-block:: console

$ kubectl patch nvidiadriver/demo-silver --type='json' \
-p='[{"op": "replace", "path": "/spec/version", "value": "525.125.06"}]'
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets use updated version here.

@a-mccarthy a-mccarthy force-pushed the dev-26.3.0 branch 3 times, most recently from 3c58318 to 7003594 Compare March 17, 2026 18:22
Copy link
Contributor

@cdesiniotis cdesiniotis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Made a first pass. Will review again tomorrow.

Comment on lines +78 to +81
* Added full support for the NVIDIA Driver Custom Resource Definition (CRD).
Previously available in Technology Preview, the NVIDIA Driver CRD is now generally available.
Use this feature to configure multiple driver types and versions on different nodes or multiple operating system versions on nodes.
Refer to the :doc:`NVIDIA Driver Custom Resource Definition documentation <gpu-driver-configuration>` for more information.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Question -- do we want to call out the limitations regarding migration here?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated.


* Added support for KubeVirt GPU passthrough with Ubuntu 24.04 LTS and the VFIO framework.

* Added support for vGPU precompiled driver container for Azure Linux.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rajathagasthya were we actually planning to call this out in the release notes? My gut says no...

* Added PodSecurityContext support for DaemonSets (`PR #2120 <https://github.com/NVIDIA/gpu-operator/pull/2120>`_).
In ClusterPolicy, set ``spec.daemonsets.podSecurityContext``; in NVIDIADriver, set ``spec.podSecurityContext``.

* See `PR #2014 <https://github.com/NVIDIA/gpu-operator/pull/2014>`_ for related changes.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This bullet lacks context / feels out of place. Was this meant to be a sub-bullet for another bullet?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated!

* Improved the NVIDIA Kubernetes Device Plugin to avoid unnecessary GPU unbind/rebind operations during rolling updates of the vfio-manager DaemonSet.
This improves the stability of GPU passthrough workloads (KubeVirt, Kata Containers).
* Improved the Upgrade Controller to decrease unnecessary reconciliation in environments with Node Feature Discovery (NFD) enabled.
* Improved performance (`PR #2113 <https://github.com/NVIDIA/gpu-operator/pull/2113>`_).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What perf improvements? We should be more specific.

A suggestion (but someone might have a better suggestion 😄 ):

Suggested change
* Improved performance (`PR #2113 <https://github.com/NVIDIA/gpu-operator/pull/2113>`_).
* Improved performance of the clusterpolicy controller by reducing the number of API calls made (`PR #2113 <https://github.com/NVIDIA/gpu-operator/pull/2113>`_).

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i updated this bullet, let me know if it make sense


* Marked unused field ``defaultRuntime`` as optional in the ClusterPolicy. (`PR #2000 <https://github.com/NVIDIA/gpu-operator/pull/2000>`_)
* The NVIDIA Kata Manager for Kubernetes is now deprecated.
To enable Kata Containers for GPUs, install the upstream kata-deploy Helm chart, which deploys all Kata runtime classes, including the NVIDIA-specific runtime classes.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we link to the kata procedure (which you are adding in #365) here?

Signed-off-by: Abigail McCarthy <20771501+a-mccarthy@users.noreply.github.com>

Co-authored-by: Rajath Agasthya <rajathagasthya@gmail.com>
Signed-off-by: Abigail McCarthy <20771501+a-mccarthy@users.noreply.github.com>
Signed-off-by: Abigail McCarthy <20771501+a-mccarthy@users.noreply.github.com>
When post-release testing confirms support for newer versions of operands, these updates are identified as *recommended updates* to a GPU Operator version.
Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.

.. note::
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@cdesiniotis is this accurate? In this release i feel like we can remove the separate components table in the gov-ready page, https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/install-gpu-operator-gov-ready.html#supported-gpu-operator-components. What components are still not gov ready? I want to make sure i note them correctly here.

Signed-off-by: Abigail McCarthy <20771501+a-mccarthy@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants