Skip to content

Releases: ovis-hpc/ldms

OVIS-3.4.6

23 May 15:48
Compare
Choose a tag to compare

Changes in 3.4.6 since 3.4.4

FUNCTIONAL CHANGES:

  • Added /usr/bin/ldms-static-test.sh and numerous test examples of ldms configuration in /usr/share/doc/ovis-ldms-3.4.6/examples/static-test. See man ldms-static-test. Includes store, sampler, and multilevel aggregation examples.

  • Added dstat sampler for monitoring ldmsd itself. Expected use is to be
    loaded on aggregator and storage ldmsd instances. See Plugin_dstat man page.

  • Added jobid collection support to lustre2_client sampler.

  • Added opa2 sampler to collect omnipath hfi interface metrics. See Plugin_opa2 man page.

  • Updated libgenders support for managing ports (see man ldms-attributes) in init scripts (see man ldms-attributes):
    ldmsd_use_unix_socket
    ldmsd_sockpath
    ldmsd_use_inet_socket
    ldmsd_config_port
    ldmsd_log
    ldmsd_vg
    ldmsd_vgargfile

  • Added filters to trap and warn about common gender spelling and punctuation errors.

  • Split the build/install of libgenders/boost tool from install of systemd scripts. Systemd scripts can be used without the ldmsctl_args3 tool if the user provides the daemon configuration commands in a named script listed in ldmsd.local.conf.

  • Added missing man pages for samplers ported from LDMS v2: clock, procstat, sysclassib, jobid, lustre2_client, procsensors.

  • New/updates to man, plugins for cray samplers aries_linkstatus, aries_mmr.

  • Changed defaults in systemd scripts to allow more open files at aggregators and syslogid.

  • Fixed overzealous failure condition handling in ldms_jobid.

  • Added debug output of registered memory (mmalloc) in use at exit to better bound -m option value needed for ldmsd instances. New mm_stat call in lib/mmalloc supplies the data.

SECURITY CHANGES:

  • Fixed default insecure (commonly know secret) ldmsauth file. Now it is invalid by default (too short).

RUNTIME CHANGES/BUG FIXES:

  • Fixed C bugs in store related code:

    • idx_delete
    • notification (memory leak)
    • avl (attribute/value list handling of error conditions)
    • thread locking error in store_csv
  • Fixed C bugs in network transports:

    • rdma connection resource leaks in error handling cases.
  • Fixed C bugs in samplers:

    • jobid minor fixes
    • procnfs sampler now accounts for variations in nfs file layout. The procnfs sampler has never supported nfsv4 metrics and does not now.
    • Reduced repetitive logging of the same transient failure conditions.
    • Updated several samplers to run through transient disappearance of /proc.

HOUSEKEEPING CHANGES:

  • Removed LDMS_BUILDTYPE from systemd control scripts (it was preventing relocatability, and is in any case obsolete).

  • Remove most old packaging scripts from ldms source tree packaging/ directory.

  • Change install permissions on pedigree script.

  • Update rpath macro in build (deprecates some old apple os versions).

  • Made rpms fully relocatable without forcing the user to manually set ld and zap related environment variables before invocation. This entails wrapping all the sbin/ldms binaries in .ldms-wrapper. Thanks to cray for assistance in this.

DEVELOPER CHANGES:

  • Updated installed include files and /usr/lib/ovis-[ldms/lib]-configvars.sh so that 3rd party plugins can be built when only the installed ldms binaries and headers are used.

  • Updated .gitignore settings.

OVIS-3.4.5

19 Mar 14:11
Compare
Choose a tag to compare

Fix a return code bug in procdiskstats ldmsd sampler plugin.

OVIS-3.4.4

31 Aug 21:23
Compare
Choose a tag to compare

4th patch of OVIS-3.4

OVIS-3.4.3

28 Jun 16:24
Compare
Choose a tag to compare

Update to genders-based init scripts for TOSS, Cray environments.

OVIS-3.4.2

22 Jun 22:06
Compare
Choose a tag to compare
  • ldms bug fixes
  • ldmsd config file error report with line number
  • ldmsd test_sampler updates
  • ldmsd csv_store rename/chmod/chown ability

OVIS-3.4.1

20 Apr 21:27
Compare
Choose a tag to compare

Bug fixes:

  • Fix ldms set tree corruption in set delete
  • Remove ldmsd debug message
  • Fix ldmsd store_csv crash due to NULL storekeys
  • Fix bug in rbt_find_lub
  • Fix uninitialized variable bug in ldmsd inet control path

OVIS-3.4.0

29 Mar 22:18
Compare
Choose a tag to compare

OVIS 3.4.0 - Initial Release of 3.4 series.

OVIS-3.3.3

16 Dec 21:03
Compare
Choose a tag to compare
  • Add PAPI and perfevent samplers in LDMSD
  • Fix LDMS metric value set operation for double
  • Fix store_sos bug in the case that ldmsd store metrics selectively (not entire set)
  • Cray RPM build update
  • Update GPCD submodule

OVIS-3.3.2

22 Nov 19:31
Compare
Choose a tag to compare
  • bclient paging tty check
  • include git-SHA in '-V' for both ldmsd and balerd
  • ldmsd -c to logs configuration errors
  • ldmsd store_function_csv bug fixes
  • ldmsd cray_system_sampler bug fixes
  • confusing top-level scripts removal
  • ldmsd init script examples

OVIS-3.3.1

03 Nov 21:41
Compare
Choose a tag to compare

The first patch of OVIS-3.3

  • addressing RPM generation bugs for RHEL7
  • addressing various log message nits
  • baler documentation update
  • sos update
  • MSR sampler update