Skip to content

Releases: NVIDIA/dcgm-exporter

3.3.9-3.6.1

19 Nov 20:33
b97b763
Compare
Choose a tag to compare
  • Update to DCGM 3.3.9
  • Allow selecting the service's ClusterIP - Remi
  • Configurable service monitor API value

3.3.8-3.6.0

19 Sep 22:11
402a10f
Compare
Choose a tag to compare
  • Update DCGM to 3.3.8
  • [Helm] Enable custom metrics, mount ConfigMap by default (Chip Zoller)

3.3.7-3.5.0

25 Jul 17:51
6d499c6
Compare
Choose a tag to compare

Changes:

  • Make nvidia resource names configurable (lx1036)
  • Update default PCIe metrics name (koshieguchi)
  • Correct metric help text (pintohutch)
  • Add pci_bus_id label for metric (fungaren)
  • Update to DCGM 3.3.7

3.3.6-3.4.2

20 May 20:06
dd3001a
Compare
Choose a tag to compare
  • Enable HPC job ID as label with --hpc-job-mapping-dir
  • Add err_msg label for XID errors
  • Bug fixes, bump base container, etc

3.3.5-3.4.1

03 Apr 20:55
5121ded
Compare
Choose a tag to compare
  • Fix for duplicate DCGM_FI_DEV_XID_ERRORS
  • Make kubelet pod-resources socket directory configurable
  • Allow setting runtimeClassName
  • use Linux container CPU quota
  • Update Go dependencies
  • Added DCGM logging options
  • many tests added

3.3.5-3.4.0

26 Feb 23:05
9547688
Compare
Choose a tag to compare
  • New DCGM_EXP_CLOCK_EVENTS_COUNT and DCGM_EXP_XID_ERRORS_COUNT metrics
  • Graceful handling of panics
  • Various lint fixes
  • Control modelName format

3.3.3-3.3.1

02 Feb 22:38
0518edc
Compare
Choose a tag to compare
  • Fix for crash in dcgmGetCpuHierarchy

3.3.3-3.3.0

29 Jan 21:56
7e8e4cb
Compare
Choose a tag to compare
  • Update DCGM to 3.3.3
  • Enable Grace CPU support
  • Update Go to 1.21
  • TLS and auth support
  • Community fixes and enhancements (Thank you!)

3.3.0-3.2.0

08 Nov 20:56
05b85eb
Compare
Choose a tag to compare
  • Update DCGM to 3.3.0
  • Various community fixes and enhancements

3.2.5-3.1.7

31 Aug 14:42
fdcc02d
Compare
Choose a tag to compare
  • Update DCGM to 3.2.5
  • Grafana fixes