Skip to content

Latest commit

 

History

History
148 lines (98 loc) · 6.73 KB

RELEASE.md

File metadata and controls

148 lines (98 loc) · 6.73 KB

ROCm 6.1.1 release notes

ROCm™ 6.1.1 introduces minor fixes and improvements to some tools and libraries.

OS support

ROCm 6.1.1 has been tested against a pre-release version of Ubuntu 22.04.5 (kernel 6.8).

AMD SMI

AMD SMI for ROCm 6.1.1

Additions

  • Added deferred error correctable counts to amd-smi metric -ecc -ecc-blocks.

Changes

  • Updated the output of amd-smi metric --ecc-blocks to show counters available from blocks.
  • Updated the output of amd-smi metric --clock to reflect each engine.
  • Updated the output of amd-smi topology --json to align with output reported by host and guest systems.

Fixes

  • Fixed amd-smi metric --clock's clock lock and deep sleep status.
  • Fixed an issue that would cause an error when resetting non-AMD GPUs.
  • Fixed amd-smi metric --pcie and amdsmi_get_pcie_info() when using RDNA3 (Navi 32 and Navi 31) hardware to prevent "UNKNOWN" reports.
  • Fixed the output results of amd-smi process when getting processes running on a device.

Removals

  • Removed the amdsmi_get_gpu_process_info API from the Python library. It was removed from the C library in an earlier release.

Known issues

  • amd-smi bad-pages can result in a ValueError: Null pointer access error when using some PMU firmware versions.
See the [detailed changelog](https://github.com/ROCm/amdsmi/blob/docs/6.1.1/CHANGELOG.md) with code samples for more information.

HIPCC

HIPCC for ROCm 6.1.1

Changes

  • Upcoming: a future release will enable use of compiled binaries hipcc.bin and hipconfig.bin by default. No action is needed by users. You can continue calling high-level Perl scripts hipcc and hipconfig. hipcc.bin and hipconfig.bin will be invoked by the high-level Perl scripts. To revert to the previous behavior and invoke hipcc.pl and hipconfig.pl, set the HIP_USE_PERL_SCRIPTS environment variable to 1.
  • Upcoming: a subsequent release will remove high-level Perl scripts hipcc and hipconfig. This release will remove the HIP_USE_PERL_SCRIPTS environment variable. It will rename hipcc.bin and hipconfig.bin to hipcc and hipconfig respectively. No action is needed by the users. To revert to the previous behavior, invoke hipcc.pl and hipconfig.pl explicitly.
  • Upcoming: a subsequent release will remove hipcc.pl and hipconfig.pl.

HIPIFY

HIPIFY for ROCm 6.1.1

Additions

  • Added support for LLVM 18.1.2.
  • Added support for cuDNN 9.0.0.
  • Added a new option: --clang-resource-directory to specify the clang resource path (the path to the parent folder for the include folder that contains __clang_cuda_runtime_wrapper.h and other header files used during the hipification process).

ROCm SMI

ROCm SMI for ROCm 6.1.1

Known issues

  • ROCm SMI reports GPU utilization incorrectly for RDNA3 GPUs in some situations.

Library changes in ROCm 6.1.1

Library Version
AMDMIGraphX 2.9
hipBLAS 2.1.0
hipBLASLt 0.7.0
hipCUB 3.1.0
hipFFT 1.0.14
hipRAND 2.10.17
hipSOLVER 2.1.0 ⇒ 2.1.1
hipSPARSE 3.0.1
hipSPARSELt 0.2.0
hipTensor 1.2.0
MIOpen 3.1.0
MIVisionX 2.5.0
rccl 2.18.6
rocALUTION 3.1.1
rocBLAS 4.1.0
rocDecode 0.5.0
rocFFT 1.0.26 ⇒ 1.0.27
rocm-cmake 0.12.0
rocPRIM 3.1.0
rocRAND 3.0.1
rocSOLVER 3.25.0
rocSPARSE 3.1.2
rocThrust 3.0.1
rocWMMA 1.4.0
rpp 1.5.0
Tensile 4.40.0

hipBLASLt 0.7.0

hipBLASLt 0.7.0 for ROCm 6.1.1

Additions
  • Added hipblasltExtSoftmax extension API.
  • Added hipblasltExtLayerNorm extension API.
  • Added hipblasltExtAMax extension API.
  • Added GemmTuning extension parameter to set split-k by user.
  • Added support for mixed precision datatype: fp16/fp8 in with fp16 outk.
Deprecations
  • Upcoming: algoGetHeuristic() ext API for GroupGemm will be deprecated in a future release of hipBLASLt.

hipSOLVER 2.1.1

hipSOLVER 2.1.1 for ROCm 6.1.1

Changes

  • By default, BUILD_WITH_SPARSE is now set to OFF on Microsoft Windows.

Fixes

  • Fixed benchmark client build when BUILD_WITH_SPARSE is OFF.

rocFFT 1.0.27

rocFFT 1.0.27 for ROCm 6.1.1

Additions

  • Enable multi-GPU testing on systems without direct GPU-interconnects.

Fixes

  • Fixed kernel launch failure on execute of very large odd-length real-complex transforms.