This repository has been archived by the owner on Jun 5, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 93
/
Copy pathInstallation-Guide.rst
1360 lines (782 loc) · 47.4 KB
/
Installation-Guide.rst
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
.. image:: /Installation_Guide/amdblack.jpg
|
==============================================
AMD ROCm QuickStart Installation Guide v3.3.0
==============================================
- `Deploying ROCm`_
- `Ubuntu`_
- `Centos RHEL v7.7`_
- `SLES 15 Service Pack 1`_
- `ROCm Installation Known Issues and Workarounds`_
- `Getting the ROCm Source Code`_
.. _Deploying ROCm:
Deploying ROCm
~~~~~~~~~~~~~~~~
AMD hosts both Debian and RPM repositories for the ROCm v3.x packages.
The following directions show how to install ROCm on supported Debian-based systems such as Ubuntu 18.04.x
**Note**: These directions may not work as written on unsupported Debian-based distributions. For example, newer versions of Ubuntu may not be compatible with the rock-dkms kernel driver. In this case, you can exclude the rocm-dkms and rock-dkms packages.
.. _Ubuntu:
Ubuntu
^^^^^^^^
Installing a ROCm Package from a Debian Repository
'''''''''''''''''''''''''''''''''''''''''''''''''''''
To install from a Debian Repository:
1. Run the following code to ensure that your system is up to date:
::
sudo apt update
sudo apt dist-upgrade
sudo apt install libnuma-dev
sudo reboot
2. Add the ROCm apt repository.
For Debian-based systems like Ubuntu, configure the Debian ROCm repository as follows:
::
wget -q -O - http://repo.radeon.com/rocm/apt/debian/rocm.gpg.key | sudo apt-key add -
echo 'deb [arch=amd64] http://repo.radeon.com/rocm/apt/debian/ xenial main' | sudo tee /etc/apt/sources.list.d/rocm.list
The gpg key may change; ensure it is updated when installing a new release. If the key signature verification fails while updating, re-add the key from the ROCm apt repository.
The current rocm.gpg.key is not available in a standard key ring distribution, but has the following sha1sum hash:
::
e85a40d1a43453fe37d63aa6899bc96e08f2817a rocm.gpg.key
3. Install the ROCm meta-package. Update the appropriate repository list and install the rocm-dkms meta-package:
::
sudo apt update
sudo apt install rocm-dkms
4. Set permissions. To access the GPU, you must be a user in the video group. Ensure your user account is a member of the video group prior to using ROCm. To identify the groups you are a member of, use the following command:
::
groups
5. To add your user to the video group, use the following command for the sudo password:
::
sudo usermod -a -G video $LOGNAME
6. By default, add any future users to the video group. Run the following command to add users to the video group:
::
echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf
7. Restart the system.
8. Test the basic ROCm installation.
9. After restarting the system, run the following commands to verify that the ROCm installation is successful. If you see your GPUs listed by both commands, the installation is considered successful.
::
/opt/rocm/bin/rocminfo
/opt/rocm/opencl/bin/x86_64/clinfo
Note: To run the ROCm programs more efficiently, add the ROCm binaries in your PATH.
::
echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' |
sudo tee -a /etc/profile.d/rocm.sh
Uninstalling ROCm Packages from Ubuntu
''''''''''''''''''''''''''''''''''''''''
To uninstall the ROCm packages from Ubuntu 16.04.6 or Ubuntu 18.04.4, run the following command:
::
sudo apt autoremove rocm-opencl rocm-dkms rocm-dev rocm-utils
Installing Development Packages for Cross Compilation
''''''''''''''''''''''''''''''''''''''''''''''''''''''''
It is recommended that you develop and test development packages on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you must avoid installing the ROCk kernel driver on the development system.
Instead, install the following development subset of packages:
::
sudo apt update
sudo apt install rocm-dev
Note: To execute ROCm enabled applications, you must install the full ROCm driver stack on your system.
Using Debian-based ROCm with Upstream Kernel Drivers
''''''''''''''''''''''''''''''''''''''''''''''''''''''
You can install the ROCm user-level software without installing the AMD's custom ROCk kernel driver. To use the upstream kernels, run the following commands instead of installing rocm-dkms:
::
sudo apt update
sudo apt install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"'
sudo tee /etc/udev/rules.d/70-kfd.rules
.. _CentOS RHEL:
CentOS RHEL v7.7
^^^^^^^^^^^^^^
This section describes how to install ROCm on supported RPM-based systems such as CentOS v7.7.
Preparing RHEL v7 (7.7) for Installation
'''''''''''''''''''''''''''''''''''''''''''
RHEL is a subscription-based operating system. You must enable the external repositories to install on the devtoolset-7 environment and the dkms support files.
Note: The following steps do not apply to the CentOS installation.
1. The subscription for RHEL must be enabled and attached to a pool ID. See the Obtaining an RHEL image and license page for instructions on registering your system with the RHEL subscription server and attaching to a pool id.
2. Enable the following repositories:
::
sudo subscription-manager repos --enable rhel-server-rhscl-7-rpms
sudo subscription-manager repos --enable rhel-7-server-optional-rpms
sudo subscription-manager repos --enable rhel-7-server-extras-rpms
3. Enable additional repositories by downloading and installing the epel-release-latest-7 repository RPM:
::
sudo rpm -ivh <repo>
For more details, see https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm
4. Install and set up Devtoolset-7.
To setup the Devtoolset-7 environment, follow the instructions on this page: https://www.softwarecollections.org/en/scls/rhscl/devtoolset-7/
Note: devtoolset-7 is a software collections package and is not supported by AMD.
Installing CentOS/RHEL (v7.7) for DKMS
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Use the dkms tool to install the kernel drivers on CentOS/RHEL v7.7:
::
sudo yum install -y epel-release
sudo yum install -y dkms kernel-headers-`uname -r` kernel-devel-`uname -r`
.. _ROCM install:
Installing ROCm
'''''''''''''''''
To install ROCm on your system, follow the instructions below:
1. Delete the previous versions of ROCm before installing the latest version.
2. Create a /etc/yum.repos.d/rocm.repo file with the following contents:
::
[ROCm]
name=ROCm
baseurl=http://repo.radeon.com/rocm/yum/rpm
enabled=1
gpgcheck=0
Note: The URL of the repository must point to the location of the repositories’ repodata database.
3. Install ROCm components using the following command:
::
sudo yum install rocm-dkms
4. Restart the system. The rock-dkms component is installed and the /dev/kfd device is now available.
5. Set permissions. To access the GPU, you must be a user in the video group. Ensure your user account is a member of the video group prior to using ROCm. To identify the groups you are a member of, use the following command:
::
groups
6. To add your user to the video group, use the following command for the sudo password:
::
sudo usermod -a -G video $LOGNAME
7. By default, add any future users to the video group. Run the following command to add users to the video group:
::
echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf
Note: The current release supports CentOS/RHEL v7.7. Before updating to the latest version of the operating system, delete the ROCm packages to avoid DKMS-related issues.
8. Restart the system.
9. Test the ROCm installation.
Testing the ROCm Installation
'''''''''''''''''''''''''''''''
After restarting the system, run the following commands to verify that the ROCm installation is successful. If you see your GPUs listed, you are good to go!
::
/opt/rocm/bin/rocminfo
/opt/rocm/opencl/bin/x86_64/clinfo
**Note**: Add the ROCm binaries in your PATH for easy implementation of the ROCm programs.
::
echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64' |
sudo tee -a /etc/profile.d/rocm.sh
Compiling Applications Using HCC, HIP, and Other ROCm Software
''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''''
To compile applications or samples, run the following command to use gcc-7.2 provided by the devtoolset-7 environment:
::
scl enable devtoolset-7 bash
Uninstalling ROCm from CentOS/RHEL v7.7
''''''''''''''''''''''''''''''''''''''''
To uninstall the ROCm packages, run the following command:
::
sudo yum autoremove rocm-opencl rocm-dkms rock-dkms
Installing Development Packages for Cross Compilation
'''''''''''''''''''''''''''''''''''''''''''''''''''''''
You can develop and test ROCm packages on different systems. For example, some development or build systems may not have an AMD GPU installed. In this scenario, you can avoid installing the ROCm kernel driver on your development system. Instead, install the following development subset of packages:
::
sudo yum install rocm-dev
Note: To execute ROCm-enabled applications, you will require a system installed with the full ROCm driver stack.
Using ROCm with Upstream Kernel Drivers
'''''''''''''''''''''''''''''''''''''''''
You can install ROCm user-level software without installing AMD's custom ROCk kernel driver. To use the upstream kernel drivers, run the following commands
::
sudo yum install rocm-dev
echo 'SUBSYSTEM=="kfd", KERNEL=="kfd", TAG+="uaccess", GROUP="video"'
sudo tee /etc/udev/rules.d/70-kfd.rules
**Note**: You can use this command instead of installing rocm-dkms.
.. _SLES 15 Service Pack 1:
SLES 15 Service Pack 1
^^^^^^^^^^^^^^^^^^^^^^^
The following section tells you how to perform an install and uninstall ROCm on SLES 15 SP 1.
**Installation**
1. Install the "dkms" package.
::
sudo SUSEConnect --product PackageHub/15.1/x86_64
sudo zypper install dkms
2. Add the ROCm repo.
::
sudo zypper clean –all
sudo zypper addrepo --no-gpgcheck http://repo.radeon.com/rocm/zyp/zypper/ rocm
sudo zypper ref
zypper install rocm-dkms
sudo zypper install rocm-dkms
sudo reboot
3. Run the following command once
::
cat <<EOF | sudo tee /etc/modprobe.d/10-unsupported-modules.conf
allow_unsupported_modules 1
EOF
sudo modprobe amdgpu
4. Verify the ROCm installation.
5. Run /opt/rocm/bin/rocminfo and /opt/rocm/opencl/bin/x86_64/clinfo commands to list the GPUs and verify that the ROCm installation is successful.
6. Set permissions.
To access the GPU, you must be a user in the video group. Ensure your user account is a member of the video group prior to using ROCm. To identify the groups you are a member of, use the following command:
::
groups
7. To add your user to the video group, use the following command for the sudo password:
::
sudo usermod -a -G video $LOGNAME
8. By default, add any future users to the video group. Run the following command to add users to the video group:
::
echo 'ADD_EXTRA_GROUPS=1' | sudo tee -a /etc/adduser.conf
echo 'EXTRA_GROUPS=video' | sudo tee -a /etc/adduser.conf
9. Restart the system.
10. Test the basic ROCm installation.
11. After restarting the system, run the following commands to verify that the ROCm installation is successful. If you see your GPUs listed by both commands, the installation is considered successful.
::
/opt/rocm/bin/rocminfo
/opt/rocm/opencl/bin/x86_64/clinfo
Note: To run the ROCm programs more efficiently, add the ROCm binaries in your PATH.
::
echo 'export PATH=$PATH:/opt/rocm/bin:/opt/rocm/profiler/bin:/opt/rocm/opencl/bin/x86_64'|sudo tee -a /etc/profile.d/rocm.sh
**Uninstallation**
To uninstall, use the following command:
::
sudo zypper remove rocm-opencl rocm-dkms rock-dkms
Note: Ensure all other installed packages/components are removed.
Note: Ensure all the content in the /opt/rocm directory is completely removed. If the command does not remove all the ROCm components/packages, ensure you remove them individually.
Performing an OpenCL-only Installation of ROCm
''''''''''''''''''''''''''''''''''''''''''''''''
Some users may want to install a subset of the full ROCm installation. If you are trying to install on a system with a limited amount of storage space, or which will only run a small collection of known applications, you may want to install only the packages that are required to run OpenCL applications. To do that, you can run the following installation command instead of the command to install rocm-dkms.
::
sudo yum install rock-dkms rocm-opencl-devel
ROCm Installation Known Issues and Workarounds
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Closed source components
''''''''''''''''''''''''''
The ROCm platform relies on some closed source components to provide functionalities like HSA image support. These components are only available through the ROCm repositories, and they may be deprecated or become open source components in the future. These components are made available in the following packages:
• hsa-ext-rocr-dev
Getting the ROCm Source Code
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
AMD ROCm is built from open source software. It is, therefore, possible to modify the various components of ROCm by downloading the source code and rebuilding the components. The source code for ROCm components can be cloned from each of the GitHub repositories using git. For easy access to download the correct versions of each of these tools, the ROCm repository contains a repo manifest file called default.xml. You can use this manifest file to download the source code for ROCm software.
Installing the Repo
^^^^^^^^^^^^^^^^^^^^^
The repo tool from Google® allows you to manage multiple git repositories simultaneously. Run the following commands to install the repo:
::
mkdir -p ~/bin/
curl https://storage.googleapis.com/git-repo-downloads/repo > ~/bin/repo
chmod a+x ~/bin/repo
Note: You can choose a different folder to install the repo into if you desire. ~/bin/ is used as an example.
Downloading the ROCm Source Code
''''''''''''''''''''''''''''''''''
The following example shows how to use the repo binary to download the ROCm source code. If you choose a directory other than ~/bin/ to install the repo, you must use that chosen directory in the code as shown below:
::
mkdir -p ~/ROCm/
cd ~/ROCm/
~/bin/repo init -u https://github.com/RadeonOpenCompute/ROCm.git -b roc-3.0.0
repo sync
Note: Using this sample code will cause the repo to download the open source code associated with this ROCm release. Ensure that you have ssh-keys configured on your machine for your GitHub ID prior to the download.
Building the ROCm Source Code
'''''''''''''''''''''''''''''''
Each ROCm component repository contains directions for building that component. You can access the desired component for instructions to build the repository.
.. _Machine Learning and High Performance Computing Software Stack for AMD GPU:
===================================================================================
Machine Learning and High Performance Computing Software Stack for AMD GPU v3.3.0
===================================================================================
Machine Learning and High Performance Computing Software Stack for AMD GPU v3.3.0
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
**ROCm Version 3.3.0**
.. _ROCm Binary Package Structure:
ROCm Binary Package Structure
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ROCm is a collection of software ranging from drivers and runtimes to libraries and developer tools. In AMD's package distributions, these software projects are provided as a separate packages. This allows users to install only the packages they need, if they do not wish to install all of ROCm. These packages will install most of the ROCm software into ``/opt/rocm/`` by default.
The packages for each of the major ROCm components are:
ROCm Core Components
=====================
- ROCk Kernel Driver: ``rock-dkms``
- ROCr Runtime: ``hsa-rocr-dev``, ``hsa-ext-rocr-dev``
- ROCt Thunk Interface: ``hsakmt-roct``, ``hsakmt-roct-dev``
ROCm Support Software
======================
- ROCm SMI: ``rocm-smi``
- ROCm cmake: ``rocm-cmake``
- rocminfo: ``rocminfo``
- ROCm Bandwidth Test: ``rocm_bandwidth_test``
ROCm Development ToolChain
===========================
- HCC compiler: ``hcc``
- HIP: ``hip_base``, ``hip_doc``, ``hip_hcc``, ``hip_samples``
- ROCm Device Libraries: ``rocm-device-libs``
- ROCm OpenCL: ``rocm-opencl``, ``rocm-opencl-devel`` (on RHEL/CentOS), ``rocm-opencl-dev`` (on Ubuntu)
- ROCM Clang-OCL Kernel Compiler: ``rocm-clang-ocl``
- Asynchronous Task and Memory Interface (ATMI): ``atmi``
- ROCm Debug Agent: ``rocm_debug_agent``
- ROCm Code Object Manager: ``comgr``
- ROC Profiler: ``rocprofiler-dev``
- ROC Tracer: ``roctracer-dev``
- Radeon Compute Profiler: ``rocm-profiler``
ROCm Libraries
==============
- rocALUTION: ``rocalution``
- rocBLAS: ``rocblas``
- hipBLAS: ``hipblas``
- hipCUB: ``hipCUB``
- rocFFT: ``rocfft``
- rocRAND: ``rocrand``
- rocSPARSE: ``rocsparse``
- hipSPARSE: ``hipsparse``
- ROCm SMI Lib: ``rocm-smi-lib64``
- rocThrust: ``rocThrust``
- MIOpen: ``MIOpen-HIP`` (for the HIP version), ``MIOpen-OpenCL`` (for the OpenCL version)
- MIOpenGEMM: ``miopengemm``
- MIVisionX: ``mivisionx``
- RCCL: ``rccl``
To make it easier to install ROCm, the AMD binary repositories provide a number of meta-packages that will automatically install multiple other packages. For example, ``rocm-dkms`` is the primary meta-package that is
used to install most of the base technology needed for ROCm to operate.
It will install the ``rock-dkms`` kernel driver, and another meta-package
(``rocm-dev``) which installs most of the user-land ROCm core components, support software, and development tools.
The *rocm-utils* meta-package will install useful utilities that, while not required for ROCm to operate, may still be beneficial to have. Finally, the *rocm-libs* meta-package will install some (but not all) of the libraries that are part of ROCm.
The chain of software installed by these meta-packages is illustrated below:
::
rocm-dkms
|--rock-dkms
\--rocm-dev
|--comgr
|--hcc
|--hip-base
|--hip-doc
|--hip-hcc
|--hip-samples
|--hsakmt-roct
|--hsakmt-roct-dev
|--hsa-amd-aqlprofile
|--hsa-ext-rocr-dev
|--hsa-rocr-dev
|--rocm-cmake
|--rocm-device-libs
|--rocm-smi-lib64
|--rocprofiler-dev
|--rocm-debug-agent
\--rocm-utils
|--rocminfo
\--rocm-clang-ocl # This will cause OpenCL to be installed
rocm-libs
|--hipblas
|--hipcub
|--hipsparse
|--rocalution
|--rocblas
|--rocfft
|--rocprim
|--rocrand
|--rocsparse
\--rocthrust
These meta-packages are not required but may be useful to make it easier to install ROCm on most systems.
Note: Some users may want to skip certain packages. For instance, a user that wants to use the upstream kernel drivers (rather than those supplied by AMD) may want to skip the rocm-dkms and rock-dkms packages. Instead, they could directly install rocm-dev.
Similarly, a user that only wants to install OpenCL support instead of HCC and HIP may want to skip the rocm-dkms and rocm-dev packages. Instead, they could directly install rock-dkms, rocm-opencl, and rocm-opencl-dev and their dependencies.
.. _ROCm Platform Packages:
ROCm Platform Packages
^^^^^^^^^^^^^^^^^^^^^^^
The following platform packages are for ROCm v3.3.0:
Drivers, ToolChains, Libraries, and Source Code
The latest supported version of the drivers, tools, libraries and source code for the ROCm platform have been released and are available from the following GitHub repositories:
**ROCm Core Components**
- `ROCk Kernel Driver`_
- `ROCr Runtime`_
- `ROCt Thunk Interface`_
**ROCm Support Software**
- `ROCm SMI`_
- `ROCm cmake`_
- `rocminfo`_
- `ROCm Bandwidth Test`_
**ROCm Development ToolChain**
- `HCC compiler`_
- `HIP`_
- `ROCm Device Libraries`_
- `ROCm OpenCL Runtime`_
- `ROCm LLVM OCL`_
- `ROCm Device Libraries OCL`_
- `ROCM Clang-OCL Kernel Compiler`_
- `Asynchronous Task and Memory Interface`_
- `ROCr Debug Agent`_
- `ROCm Code Object Manager`_
- `ROC Profiler`_
- `ROC Tracer`_
- `AOMP`_
- `Radeon Compute Profiler`_
- `ROCm Validation Suite`_
Example Applications:
- `HCC Examples`_
- `HIP Examples`_
**ROCm Libraries**
- `rocBLAS`_
- `hipBLAS`_
- `rocFFT`_
- `rocRAND`_
- `rocSPARSE`_
- `hipSPARSE`_
- `rocALUTION`_
- `MIOpenGEMM`_
- `mi open`_
- `rocThrust`_
- `ROCm SMI Lib`_
- `RCCL`_
- `MIVisionX`_
- `hipCUB`_
- `AMDMIGraphX`_
ROCm Core Components
=====================
.. _ROCk Kernel Driver: https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver/tree/roc-3.3.0
.. _ROCr Runtime: https://github.com/RadeonOpenCompute/ROCR-Runtime/tree/rocm-3.3.0
.. _ROCt Thunk Interface: https://github.com/RadeonOpenCompute/ROCT-Thunk-Interface/tree/roc-3.3.0
ROCm Support Software
======================
.. _ROCm SMI: https://github.com/RadeonOpenCompute/ROC-smi/tree/roc-3.3.0
.. _ROCm cmake: https://github.com/RadeonOpenCompute/rocm-cmake/tree/rocm-3.3.0
.. _rocminfo: https://github.com/RadeonOpenCompute/rocminfo/tree/rocm-3.3.0
.. _ROCm Bandwidth Test: https://github.com/RadeonOpenCompute/rocm_bandwidth_test/tree/rocm-3.3.0
ROCm Development ToolChain
============================
.. _HCC compiler: https://github.com/RadeonOpenCompute/hcc/tree/rocm-3.3.0
.. _HIP: https://github.com/ROCm-Developer-Tools/HIP/tree/rocm-3.3.0
.. _ROCm Device Libraries: https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/rocm-ocl-3.3.0
.. _ROCm OpenCL Runtime: http://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/tree/roc-3.3.0
.. _ROCm LLVM OCL: https://github.com/RadeonOpenCompute/llvm-project/tree/rocm-ocl-3.3.0
.. _ROCm Device Libraries OCL: https://github.com/RadeonOpenCompute/ROCm-Device-Libs/tree/rocm-ocl-3.3.0
.. _ROCM Clang-OCL Kernel Compiler: https://github.com/RadeonOpenCompute/clang-ocl/tree/rocm-3.3.0
.. _Asynchronous Task and Memory Interface: https://github.com/RadeonOpenCompute/atmi/tree/rocm-3.3.0
.. _ROCr Debug Agent: https://github.com/ROCm-Developer-Tools/rocr_debug_agent/tree/roc-3.3.0
.. _ROCm Code Object Manager: https://github.com/RadeonOpenCompute/ROCm-CompilerSupport/tree/rocm-3.3.0
.. _ROC Profiler: https://github.com/ROCm-Developer-Tools/rocprofiler/tree/roc-3.3.0
.. _ROC Tracer: https://github.com/ROCm-Developer-Tools/roctracer/tree/roc-3.3.0
.. _AOMP: https://github.com/ROCm-Developer-Tools/aomp/tree/roc-3.3.0
.. _Radeon Compute Profiler: https://github.com/GPUOpen-Tools/RCP/tree/3a49405
.. _ROCm Validation Suite: https://github.com/ROCm-Developer-Tools/ROCmValidationSuite/tree/roc-3.3.0
.. _HCC Examples: https://github.com/ROCm-Developer-Tools/HCC-Example-Application/tree/ffd65333
.. _HIP Examples: https://github.com/ROCm-Developer-Tools/HIP-Examples/tree/rocm-3.3.0
ROCm Libraries
===============
.. _rocBLAS: https://github.com/ROCmSoftwarePlatform/rocBLAS/tree/rocm-3.3.0
.. _hipBLAS: https://github.com/ROCmSoftwarePlatform/hipBLAS/tree/rocm-3.3.0
.. _rocFFT: https://github.com/ROCmSoftwarePlatform/rocFFT/tree/rocm-3.3
.. _rocRAND: https://github.com/ROCmSoftwarePlatform/rocRAND/tree/rocm-3.3.0
.. _rocSPARSE: https://github.com/ROCmSoftwarePlatform/rocSPARSE/tree/rocm-3.3.0
.. _hipSPARSE: https://github.com/ROCmSoftwarePlatform/hipSPARSE/tree/rocm-3.3.0
.. _rocALUTION: https://github.com/ROCmSoftwarePlatform/rocALUTION/tree/rocm-3.3.0
.. _MIOpenGEMM: https://github.com/ROCmSoftwarePlatform/MIOpenGEMM/tree/6275a879
.. _mi open: https://github.com/ROCmSoftwarePlatform/MIOpen/tree/roc-3.3.0
.. _rocThrust: https://github.com/ROCmSoftwarePlatform/rocThrust/tree/rocm-3.3.0
.. _ROCm SMI Lib: https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/rocm-3.3.0
.. _RCCL: https://github.com/ROCmSoftwarePlatform/rccl/tree/rocm-3.3.0
.. _hipCUB: https://github.com/ROCmSoftwarePlatform/hipCUB/tree/rocm-3.3.0
.. _MIVisionX: https://github.com/GPUOpen-ProfessionalCompute-Libraries/MIVisionX/tree/1.7
.. _AMDMIGraphX: https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/commit/d1e945dabce0078d44c78de67b00232b856e18bc
Features and enhancements introduced in previous versions of ROCm can be found in :ref:`Current-Release-Notes`.
=========================
AMD ROCm Version History
=========================
This file contains archived version history information for the ROCm project
New features and enhancements in ROCm v3.2
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This release was not productized.
New features and enhancements in ROCm v3.1
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
'Release Notes: https://github.com/RadeonOpenCompute/ROCm/tree/roc-3.1.0
**Change in ROCm Installation Directory Structure**
A fresh installation of the ROCm toolkit installs the packages in the /opt/rocm-<version> folder.
Previously, ROCm toolkit packages were installed in the /opt/rocm folder.
**Reliability, Accessibility, and Serviceability Support for Vega 7nm**
The Reliability, Accessibility, and Serviceability (RAS) support for Vega7nm is now available.
**SLURM Support for AMD GPU**
SLURM (Simple Linux Utility for Resource Management) is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.
New features and enhancements in ROCm v3.0
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Release Notes: https://github.com/RadeonOpenCompute/ROCm/tree/roc-3.0.0
* Support for CentOS RHEL v7.7
* Support is extended for CentOS/RHEL v7.7 in the ROCm v3.0 release. For more information about the CentOS/RHEL v7.7 release, see:
* CentOS/RHEL
* Initial distribution of AOMP 0.7-5 in ROCm v3.0
The code base for this release of AOMP is the Clang/LLVM 9.0 sources as of October 8th, 2019. The LLVM-project branch used to build this release is AOMP-191008. It is now locked. With this release, an artifact tarball of the entire source tree is created. This tree includes a Makefile in the root directory used to build AOMP from the release tarball. You can use Spack to build AOMP from this source tarball or build manually without Spack.
* Fast Fourier Transform Updates
The Fast Fourier Transform (FFT) is an efficient algorithm for computing the Discrete Fourier Transform. Fast Fourier transforms are used in signal processing, image processing, and many other areas. The following real FFT performance change is made in the ROCm v3.0 release:
* Implement efficient real/complex 2D transforms for even lengths.
Other improvements:
• More 2D test coverage sizes.
• Fix buffer allocation error for large 1D transforms.
• C++ compatibility improvements.
MemCopy Enhancement for rocProf
In the v3.0 release, the rocProf tool is enhanced with an additional capability to dump asynchronous GPU memcopy information into a .csv file. You can use the '-hsa-trace' option to create the results_mcopy.csv file. Future enhancements will include column labels.
New features and enhancements in ROCm v2.10
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
rocBLAS Support for Complex GEMM
The rocBLAS library is a gpu-accelerated implementation of the standard Basic Linear Algebra Subroutines (BLAS). rocBLAS is designed to enable you to develop algorithms, including high performance computing, image analysis, and machine learning.
In the AMD ROCm release v2.10, support is extended to the General Matrix Multiply (GEMM) routine for multiple small matrices processed simultaneously for rocBLAS in AMD Radeon Instinct MI50. Both single and double precision, CGEMM and ZGEMM, are now supported in rocBLAS.
Support for SLES 15 SP1
In the AMD ROCm v2.10 release, support is added for SUSE Linux® Enterprise Server (SLES) 15 SP1. SLES is a modular operating system for both multimodal and traditional IT.
Code Marker Support for rocProfiler and rocTracer Libraries
Code markers provide the external correlation ID for the calling thread. This function indicates that the calling thread is entering and leaving an external API region.
New features and enhancements in ROCm 2.9
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Initial release for Radeon Augmentation Library(RALI)
The AMD Radeon Augmentation Library (RALI) is designed to efficiently decode and process images from a variety of storage formats and modify them through a processing graph programmable by the user. RALI currently provides C API.
Quantization in MIGraphX v0.4
MIGraphX 0.4 introduces support for fp16 and int8 quantization. For additional details, as well as other new MIGraphX features, see MIGraphX documentation.
rocSparse csrgemm
csrgemm enables the user to perform matrix-matrix multiplication with two sparse matrices in CSR format.
Singularity Support
ROCm 2.9 adds support for Singularity container version 2.5.2.
Initial release of rocTX
ROCm 2.9 introduces rocTX, which provides a C API for code markup for performance profiling. This initial release of rocTX supports annotation of code ranges and ASCII markers.
* Added support for Ubuntu 18.04.3
* Ubuntu 18.04.3 is now supported in ROCm 2.9.
New features and enhancements in ROCm 2.8
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Support for NCCL2.4.8 API
Implements ncclCommAbort() and ncclCommGetAsyncError() to match the NCCL 2.4.x API
New features and enhancements in ROCm 2.7.2
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This release is a hotfix for ROCm release 2.7.
Issues fixed in ROCm 2.7.2
~~~~~~~~~~~~~~~~~~~~~~~~~~~
* A defect in upgrades from older ROCm releases has been fixed.
* rocprofiler --hiptrace and --hsatrace fails to load roctracer library
* In ROCm 2.7.2, rocprofiler --hiptrace and --hsatrace fails to load roctracer library defect has been fixed.
* To generate traces, please provide directory path also using the parameter: -d <$directoryPath> for example:
/opt/rocm/bin/rocprof --hsa-trace -d $PWD/traces /opt/rocm/hip/samples/0_Intro/bit_extract/bit_extract
All traces and results will be saved under $PWD/traces path
Upgrading from ROCm 2.7 to 2.7.2
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
To upgrade, please remove 2.7 completely as specified for ubuntu or for centos/rhel, and install 2.7.2 as per instructions install instructions
Other notes
To use rocprofiler features, the following steps need to be completed before using rocprofiler:
Step-1: Install roctracer
Ubuntu 16.04 or Ubuntu 18.04:
sudo apt install roctracer-dev
CentOS/RHEL 7.6:
sudo yum install roctracer-dev
Step-2: Add /opt/rocm/roctracer/lib to LD_LIBRARY_PATH
New features and enhancements in ROCm 2.7
[rocFFT] Real FFT Functional
Improved real/complex 1D even-length transforms of unit stride. Performance improvements of up to 4.5x are observed. Large problem sizes should see approximately 2x.
rocRand Enhancements and Optimizations
Added support for new datatypes: uchar, ushort, half.
Improved performance on "Vega 7nm" chips, such as on the Radeon Instinct MI50
mtgp32 uniform double performance changes due generation algorithm standardization. Better quality random numbers now generated with 30% decrease in performance
Up to 5% performance improvements for other algorithms
RAS
Added support for RAS on Radeon Instinct MI50, including:
* Memory error detection
* Memory error detection counter
* ROCm-SMI enhancements
* Added ROCm-SMI CLI and LIB support for FW version, compute running processes, utilization rates, utilization counter, link error counter, and unique ID.
New features and enhancements in ROCm 2.6
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ROCmInfo enhancements
ROCmInfo was extended to do the following: For ROCr API call errors including initialization determine if the error could be explained by:
ROCk (driver) is not loaded / available
User does not have membership in appropriate group - "video"
If not above print the error string that is mapped to the returned error code
If no error string is available, print the error code in hex
Thrust - Functional Support on Vega20
ROCm2.6 contains the first official release of rocThrust and hipCUB. rocThrust is a port of thrust, a parallel algorithm library. hipCUB is a port of CUB, a reusable software component library. Thrust/CUB has been ported to the HIP/ROCm platform to use the rocPRIM library. The HIP ported library works on HIP/ROCm platforms.
Note: rocThrust and hipCUB library replaces https://github.com/ROCmSoftwarePlatform/thrust (hip-thrust), i.e. hip-thrust has been separated into two libraries, rocThrust and hipCUB. Existing hip-thrust users are encouraged to port their code to rocThrust and/or hipCUB. Hip-thrust will be removed from official distribution later this year.
MIGraphX v0.3
MIGraphX optimizer adds support to read models frozen from Tensorflow framework. Further details and an example usage at https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/wiki/Getting-started:-using-the-new-features-of-MIGraphX-0.3
MIOpen 2.0
This release contains several new features including an immediate mode for selecting convolutions, bfloat16 support, new layers, modes, and algorithms.
MIOpenDriver, a tool for benchmarking and developing kernels is now shipped with MIOpen. BFloat16 now supported in HIP requires an updated rocBLAS as a GEMM backend.
Immediate mode API now provides the ability to quickly obtain a convolution kernel.
MIOpen now contains HIP source kernels and implements the ImplicitGEMM kernels. This is a new feature and is currently disabled by default. Use the environmental variable "MIOPEN_DEBUG_CONV_IMPLICIT_GEMM=1" to activation this feature. ImplicitGEMM requires an up to date HIP version of at least 1.5.9211.
A new "loss" catagory of layers has been added, of which, CTC loss is the first. See the API reference for more details. 2.0 is the last release of active support for gfx803 architectures. In future releases, MIOpen will not actively debug and develop new features specifically for gfx803.
System Find-Db in memory cache is disabled by default. Please see build instructions to enable this feature. Additional documentation can be found here: https://rocmsoftwareplatform.github.io/MIOpen/doc/html/
Bloat16 software support in rocBLAS/Tensile
Added mixed precision bfloat16/IEEE f32 to gemm_ex. The input and output matrices are bfloat16. All arithmetic is in IEEE f32.
AMD Infinity Fabric™ Link enablement
The ability to connect four Radeon Instinct MI60 or Radeon Instinct MI50 boards in two hives or two Radeon Instinct MI60 or Radeon Instinct MI50 boards in four hives via AMD Infinity Fabric™ Link GPU interconnect technology has been added.
ROCm-smi features and bug fixes
mGPU & Vendor check