Support for architecture-accelerated features #2102

maleadt · 2023-09-28T19:40:58Z

i.e. sm_90a

In general, PTX code generated for one target architecture can be run on future architectures (i.e., it is forward compatible). However, CUDA 12.0 introduced the concept of "architecture-accelerated features" whose PTX does not have forward compatibility guarantees. Several Hopper PTX instructions fall under this category of architecture-accelerated features, and thus require a sm_90a target architecture (note the "a" appended). For more details on this and other architecture-accelerated instructions, please refer to the CUDA Documentation.

The text was updated successfully, but these errors were encountered:

maleadt added enhancement New feature or request cuda kernels Stuff about writing CUDA kernels. labels Sep 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for architecture-accelerated features #2102

Support for architecture-accelerated features #2102

maleadt commented Sep 28, 2023

Support for architecture-accelerated features #2102

Support for architecture-accelerated features #2102

Comments

maleadt commented Sep 28, 2023