nab is a SPEC CPU(R) benchmark described here and written in C. The workload runs on all logical cores.

Topdown profile shows medium level backend-bound stalls with a ~35% retirement rate. There is period at end with different characteristic.

AMD metrics on 7840 processor show more CPU stalls than memory stalls. Overall memory is only 50 L2 access per 1000 instructions and ~5% miss rate.
elapsed 581.066
on_cpu 0.970 # 15.53 / 16 cores
utime 9013.279
stime 8.575
nvcsw 13853 # 13.67%
nivcsw 87522 # 86.33%
inblock 0 # 0.00/sec
onblock 13480 # 23.20/sec
cpu-clock 9022371301663 # 9022.371 seconds
task-clock 9022438159718 # 9022.438 seconds
page faults 2719530 # 301.419/sec
context switches 100813 # 11.174/sec
cpu migrations 151 # 0.017/sec
major page faults 761 # 0.084/sec
minor page faults 2718769 # 301.334/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 3705536285749 # 83.289 branches per 1000 inst
branch misses 48320502361 # 1.30% branch miss
conditional 3211079075725 # 72.175 conditional branches per 1000 inst
indirect 78933292050 # 1.774 indirect branches per 1000 inst
cpu-cycles 35173554892555 # 3.77 GHz
instructions 44502941930878 # 1.27 IPC
slots 70335260112720 #
retiring 17957481268465 # 25.5% (36.9%)
-- ucode 49152961542 # 0.1%
-- fastpath 17908328306923 # 25.5%
frontend 4243956072338 # 6.0% ( 8.7%)
-- latency 3490353506760 # 5.0%
-- bandwidth 753602565578 # 1.1%
backend 25363097670219 # 36.1% (52.1%)
-- cpu 16028170538565 # 22.8%
-- memory 9334927131654 # 13.3%
speculation 1119054942785 # 1.6% ( 2.3%)
-- branch mispredict 1084377989807 # 1.5%
-- pipeline restart 34676952978 # 0.0%
smt-contention 21651631104051 # 30.8% ( 0.0%)
cpu-cycles 35197471408100 # 3.77 GHz
instructions 44501705968789 # 1.26 IPC
instructions 14829986452623 # 52.578 l2 access per 1000 inst
l2 hit from l1 563992892841 # 4.94% l2 miss
l2 miss from l1 11308778556 #
l2 hit from l2 pf 188511013533 #
l3 hit from l2 pf 10720306269 #
l3 miss from l2 pf 16503721139 #
instructions 14826609038424 # 318.530 float per 1000 inst
float 512 249 # 0.000 AVX-512 per 1000 inst
float 256 24571061449 # 1.657 AVX-256 per 1000 inst
float 128 4698147760247 # 316.873 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
instructions 44489194256996 #
opcache 4756323710148 # 106.910 opcache per 1000 inst
opcache miss 66379189344 # 1.4% opcache miss rate
l1 dTLB miss 41837658678 # 0.940 L1 dTLB per 1000 inst
l2 dTLB miss 1090929870 # 0.025 L2 dTLB per 1000 inst
instructions 44489204211481 #
icache 80661600392 # 1.813 icache per 1000 inst
icache miss 12168515583 # 15.1% icache miss rate
l1 iTLB miss 173686412 # 0.004 L1 iTLB per 1000 inst
l2 iTLB miss 0 # 0.000 L2 iTLB per 1000 inst
tlb flush 182039 # 0.000 TLB flush per 1000 inst
Process overview shows time spent in nab_r_base.mev-
581 processes
48 nab_r_base.mev- 9011.77 4.93
69 specperl 8.59 1.55
1 clang 0.01 0.00
11 ps 0.00 0.02
1 lsb_release 0.00 0.01
173 sh 0.00 0.00
54 specrxp 0.00 0.00
48 bash 0.00 0.00
41 specinvoke 0.00 0.00
21 grep 0.00 0.00
20 cat 0.00 0.00
12 uniq 0.00 0.00
11 sort 0.00 0.00
10 expand 0.00 0.00
6 pwd 0.00 0.00
5 basename 0.00 0.00
5 specmake 0.00 0.00
5 systemctl 0.00 0.00
4 specpp 0.00 0.00
4 uname 0.00 0.00
3 dirname 0.00 0.00
3 dmidecode 0.00 0.00
3 lscpu 0.00 0.00
2 df 0.00 0.00
2 dpkg 0.00 0.00
2 rm 0.00 0.00
2 runcpu 0.00 0.00
2 specsha512sum 0.00 0.00
2 specxz 0.00 0.00
2 who 0.00 0.00
1 cpupower 0.00 0.00
1 head 0.00 0.00
1 logname 0.00 0.00
1 ls 0.00 0.00
1 numactl 0.00 0.00
1 sysctl 0.00 0.00
1 w 0.00 0.00
1 wc 0.00 0.00
1 which 0.00 0.00
0 processes running
53 maximum processes
specinvoke starts separate processes on each logical core.
452134) specinvoke cpu=1 start=3.26 finish=195.57
452136) sh cpu=0 start=3.26 finish=191.47
452143) bash cpu=0 start=3.26 finish=191.47
452167) nab_r_base.mev- cpu=0 start=3.26 finish=191.46
452137) sh cpu=10 start=3.26 finish=191.13
452144) bash cpu=1 start=3.26 finish=191.13
452165) nab_r_base.mev- cpu=1 start=3.26 finish=191.12
452138) sh cpu=10 start=3.26 finish=191.07
452147) bash cpu=2 start=3.26 finish=191.07
452170) nab_r_base.mev- cpu=2 start=3.27 finish=191.06
452139) sh cpu=9 start=3.26 finish=191.56
452150) bash cpu=3 start=3.26 finish=191.56
452174) nab_r_base.mev- cpu=3 start=3.27 finish=191.54
452140) sh cpu=9 start=3.26 finish=195.48
452154) bash cpu=4 start=3.26 finish=195.48
452173) nab_r_base.mev- cpu=4 start=3.27 finish=195.47
452141) sh cpu=5 start=3.26 finish=189.13
452152) bash cpu=5 start=3.26 finish=189.13
452172) nab_r_base.mev- cpu=5 start=3.27 finish=189.11
452142) sh cpu=7 start=3.26 finish=191.44
452149) bash cpu=6 start=3.26 finish=191.44
452171) nab_r_base.mev- cpu=6 start=3.27 finish=191.43
452145) sh cpu=11 start=3.26 finish=191.33
452156) bash cpu=7 start=3.26 finish=191.33
452175) nab_r_base.mev- cpu=7 start=3.27 finish=191.31
452146) sh cpu=2 start=3.26 finish=191.26
452158) bash cpu=8 start=3.26 finish=191.26
452177) nab_r_base.mev- cpu=8 start=3.27 finish=191.25
452148) sh cpu=10 start=3.26 finish=191.28
452160) bash cpu=9 start=3.26 finish=191.28
452179) nab_r_base.mev- cpu=9 start=3.27 finish=191.27
452151) sh cpu=5 start=3.26 finish=190.86
452162) bash cpu=10 start=3.26 finish=190.86
452178) nab_r_base.mev- cpu=10 start=3.27 finish=190.84
452153) sh cpu=8 start=3.26 finish=191.28
452163) bash cpu=11 start=3.26 finish=191.28
452176) nab_r_base.mev- cpu=11 start=3.27 finish=191.26
452155) sh cpu=11 start=3.26 finish=195.57
452164) bash cpu=12 start=3.26 finish=195.57
452180) nab_r_base.mev- cpu=12 start=3.27 finish=195.56
452157) sh cpu=5 start=3.26 finish=189.76
452166) bash cpu=13 start=3.26 finish=189.76
452181) nab_r_base.mev- cpu=13 start=3.27 finish=189.75
452159) sh cpu=13 start=3.26 finish=191.16
452168) bash cpu=14 start=3.26 finish=191.16
452182) nab_r_base.mev- cpu=14 start=3.27 finish=191.14
452161) sh cpu=11 start=3.26 finish=191.66
452169) bash cpu=15 start=3.27 finish=191.66
452183) nab_r_base.mev- cpu=15 start=3.27 finish=191.65
