update toplev to TMA 4.8
* toplev updated to TMA 4.8:
  * Bottlenecks View:
    * Renamed Base_Non_Br to Useful_Work and simplified descriptions for all BV metrics.
    * Cache_Memory_Latency now accounts for L1 cache latency as well.
    * Improved Branching_Overhead accuracy for function calling and alignments
    * Cross-reference Bottlenecks w/ TMA tree for tool visualization (VTune request)
  * New Tree Nodes
    * L1_Hit_Latency: estimates fraction of cycles with demand load accesses that hit the L1 cache (relies on Dependent_Loads_Weight SystemParameter today)
  * New Informative Metrics
    * Fetch_LSD (client), Fetch_DSB, Fetch_MITE under Info.Pipeline group [SKL onwards]
    * DSB_Bandwidth under Info.Botlnk.L2
    * L2MPKI_RFO under Info.Memory
  * Key Enhancements & fixes
    * Fixed Ports_Utilization/Ports_Utilized_0
    * Slightly tuned memory (fixed cost) latencies [SPR, EMR]
  * Corrected CPU_Utilization, CPUs_Utilized for Linux perf based tools
Andi Kleen committed Apr 2, 2024
1 parent 8986105 commit 9f7b475
Showing 19 changed files with 3,746 additions and 2,508 deletions.
17 changes: 17 additions & 0 deletions README.md
@@ -120,6 +120,23 @@ on newer Linux kernels.

# Recent new features:

## TMA 4.8 release
* toplev updated to TMA 4.8:
  * Bottlenecks View:
    * Renamed Base_Non_Br to Useful_Work and simplified descriptions for all BV metrics.
    * Cache_Memory_Latency now accounts for L1 cache latency as well.
    * Improved Branching_Overhead accuracy for function calling and alignments
    * Cross-reference Bottlenecks w/ TMA tree for tool visualization (VTune request)
  * New Tree Nodes
    * L1_Hit_Latency: estimates fraction of cycles with demand load accesses that hit the L1 cache (relies on Dependent_Loads_Weight SystemParameter today)
  * New Informative Metrics
    * Fetch_LSD (client), Fetch_DSB, Fetch_MITE under Info.Pipeline group [SKL onwards]
    * DSB_Bandwidth under Info.Botlnk.L2
    * L2MPKI_RFO under Info.Memory
  * Key Enhancements & fixes
    * Fixed Ports_Utilization/Ports_Utilized_0
    * Slightly tuned memory (fixed cost) latencies [SPR, EMR]
  * Corrected CPU_Utilization, CPUs_Utilized for Linux perf based tools
* toplev now supports Meteor Lake systems.
* Add a new genretlat.py tool to tune the toplev model for a workload. The basic tuning needs to be
generated before first toplev use using genretlat -o mtl-retlat.json ./workloads/BC1s (or suitable workload). toplev
568 changes: 371 additions & 197 deletions adl_glc_ratios.py

Large diffs are not rendered by default.

172 changes: 99 additions & 73 deletions bdw_client_ratios.py

Large diffs are not rendered by default.

172 changes: 99 additions & 73 deletions bdx_server_ratios.py

Large diffs are not rendered by default.

591 changes: 368 additions & 223 deletions clx_server_ratios.py

Large diffs are not rendered by default.
