Releases: NVIDIA/dcgm-exporter
3.3.9-3.6.1
- Update to DCGM 3.3.9
- Allow selecting the service's ClusterIP (Remi)
- Configurable service monitor API value
3.3.8-3.6.0
- Update DCGM to 3.3.8
- [Helm] Enable custom metrics, mount ConfigMap by default (Chip Zoller)
3.3.7-3.5.0
Changes:
- Make nvidia resource names configurable (lx1036)
- Update default PCIe metrics name (koshieguchi)
- Correct metric help text (pintohutch)
- Add pci_bus_id label for metric (fungaren)
- Update to DCGM 3.3.7
3.3.6-3.4.2
- Enable HPC job ID as label with --hpc-job-mapping-dir
- Add err_msg label for XID errors
- Bug fixes, base container bump, etc.
3.3.5-3.4.1
- Fix for duplicate DCGM_FI_DEV_XID_ERRORS
- Make kubelet pod-resources socket directory configurable
- Allow setting runtimeClassName
- Use Linux container CPU quota
- Update Go dependencies
- Add DCGM logging options
- Add many tests
3.3.5-3.4.0
- New DCGM_EXP_CLOCK_EVENTS_COUNT and DCGM_EXP_XID_ERRORS_COUNT metrics
- Graceful handling of panics
- Various lint fixes
- Control modelName format
3.3.3-3.3.1
- Fix for crash in dcgmGetCpuHierarchy
3.3.3-3.3.0
- Update DCGM to 3.3.3
- Enable Grace CPU support
- Update Go to 1.21
- TLS and auth support
- Community fixes and enhancements (Thank you!)
3.3.0-3.2.0
- Update DCGM to 3.3.0
- Various community fixes and enhancements
3.2.5-3.1.7
- Update DCGM to 3.2.5
- Grafana fixes
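
The release tags above appear to follow a `<DCGM version>-<exporter version>` pattern: the first component matches the "Update to DCGM ..." notes in the entries, while reading the second component as the exporter's own version is an assumption. A minimal sketch of splitting and ordering such tags might look like the following. The `ReleaseTag` type and `parse_release_tag` helper are hypothetical names for illustration and are not part of dcgm-exporter.

```python
from typing import NamedTuple, Tuple

Version = Tuple[int, int, int]


class ReleaseTag(NamedTuple):
    """A release tag split into its two dotted version components."""
    dcgm: Version      # first component, e.g. 3.3.9
    exporter: Version  # second component, e.g. 3.6.1 (assumed exporter version)


def _to_triple(text: str) -> Version:
    # Expect exactly three dot-separated integers, e.g. "3.3.9".
    parts = text.split(".")
    if len(parts) != 3:
        raise ValueError(f"expected MAJOR.MINOR.PATCH, got {text!r}")
    major, minor, patch = (int(p) for p in parts)
    return (major, minor, patch)


def parse_release_tag(tag: str) -> ReleaseTag:
    """Split a tag such as '3.3.9-3.6.1' into two comparable version triples."""
    dcgm_part, sep, exporter_part = tag.partition("-")
    if not sep:
        raise ValueError(f"expected '<dcgm>-<exporter>', got {tag!r}")
    return ReleaseTag(_to_triple(dcgm_part), _to_triple(exporter_part))


tags = ["3.3.5-3.4.1", "3.3.9-3.6.1", "3.3.5-3.4.0"]
# Sort newest first: compare by DCGM version, then by the second component.
newest_first = sorted(tags, key=parse_release_tag, reverse=True)
print(newest_first)  # ['3.3.9-3.6.1', '3.3.5-3.4.1', '3.3.5-3.4.0']
```

Comparing integer triples rather than raw strings avoids misordering once any component reaches two digits, for example 3.10.0 versus 3.9.0.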