Shows the latency between various core combinations on the CPU. It runs for a while before concluding a single number (86.5ns on AMD and 146.2ns on Intel). Mostly three threads with a spiky usage profile and moderate level of interrupts.

Topdown profile shows more backend constraints with different phases.

AMD metrics include almost no floating point, many branches, and reasonable backend stalls with 45% memory related stalls.
elapsed 347.624
on_cpu 0.068 # 1.09 / 16 cores
utime 377.052
stime 1.097
nvcsw 16947 # 34.74%
nivcsw 31834 # 65.26%
inblock 0 # 0.00/sec
onblock 12744 # 36.66/sec
cpu-clock 381605191602 # 381.605 seconds
task-clock 381632595433 # 381.633 seconds
page faults 366889 # 961.367/sec
context switches 50347 # 131.925/sec
cpu migrations 15123 # 39.627/sec
major page faults 2 # 0.005/sec
minor page faults 366887 # 961.362/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 1290084247883 # 317.431 branches per 1000 inst
branch misses 2376229821 # 0.18% branch miss
conditional 1262738703916 # 310.702 conditional branches per 1000 inst
indirect 7684465944 # 1.891 indirect branches per 1000 inst
cpu-cycles 1686373241655 # 0.30 GHz
instructions 4047814184336 # 2.40 IPC
slots 3570759217254 #
retiring 992216232198 # 27.8% (27.8%)
-- ucode 862926843 # 0.0%
-- fastpath 991353305355 # 27.8%
frontend 706977537808 # 19.8% (19.8%)
-- latency 398876251902 # 11.2%
-- bandwidth 308101285906 # 8.6%
backend 1798308407946 # 50.4% (50.4%)
-- cpu 187178390007 # 5.2%
-- memory 1611130017939 # 45.1%
speculation 69693310609 # 2.0% ( 2.0%)
-- branch mispredict 48349557225 # 1.4%
-- pipeline restart 21343753384 # 0.6%
smt-contention 3562760504 # 0.1% ( 0.0%)
cpu-cycles 1707892632149 # 0.30 GHz
instructions 4094199228615 # 2.40 IPC
instructions 1350541887660 # 0.776 l2 access per 1000 inst
l2 hit from l1 1004371938 # 67.46% l2 miss
l2 miss from l1 682022950 #
l2 hit from l2 pf 18820367 #
l3 hit from l2 pf 10453741 #
l3 miss from l2 pf 14761656 #
instructions 1353828518705 # 3.712 float per 1000 inst
float 512 46 # 0.000 AVX-512 per 1000 inst
float 256 384 # 0.000 AVX-256 per 1000 inst
float 128 5025323444 # 3.712 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 13 # 0.000 scalar per 1000 inst
instructions 4069008521799 #
opcache 259340881921 # 63.736 opcache per 1000 inst
opcache miss 2342524413 # 0.9% opcache miss rate
l1 dTLB miss 33936466 # 0.008 L1 dTLB per 1000 inst
l2 dTLB miss 6530737 # 0.002 L2 dTLB per 1000 inst
instructions 4089180366000 #
icache 4014925148 # 0.982 icache per 1000 inst
icache miss 330099304 # 8.2% icache miss rate
l1 iTLB miss 8820947 # 0.002 L1 iTLB per 1000 inst
l2 iTLB miss 0 # 0.000 L2 iTLB per 1000 inst
Intel metrics show this fits in L3 and interestingly is more cpu-bound than memory-bound in contrast to the AMD processor.
elapsed 338.064
on_cpu 0.068 # 1.09 / 16 cores
utime 366.068
stime 0.844
nvcsw 16599 # 34.41%
nivcsw 31636 # 65.59%
inblock 8 # 0.02/sec
onblock 1496 # 4.43/sec
cpu-clock 369575801564 # 369.576 seconds
task-clock 369591458164 # 369.591 seconds
page faults 557191 # 1507.586/sec
context switches 49764 # 134.646/sec
cpu migrations 14929 # 40.393/sec
major page faults 0 # 0.000/sec
minor page faults 557191 # 1507.586/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 1488546243170 # 320.204 branches per 1000 inst
branch misses 2105739450 # 0.14% branch miss
conditional 1488546728162 # 320.204 conditional branches per 1000 inst
indirect 170666966807 # 36.713 indirect branches per 1000 inst
slots 5976172192742 #
retiring 3048896025037 # 51.0% (51.0%)
-- ucode 73313850447 # 1.2%
-- fastpath 2975582174590 # 49.8%
frontend 415667801674 # 7.0% ( 7.0%)
-- latency 134077085418 # 2.2%
-- bandwidth 281590716256 # 4.7%
backend 2284255214220 # 38.2% (38.2%)
-- cpu 1853685353998 # 31.0%
-- memory 430569860222 # 7.2%
speculation 229582167842 # 3.8% ( 3.8%)
-- branch mispredict 208351532527 # 3.5%
-- pipeline restart 21230635315 # 0.4%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 999791035793 # 0.18 GHz
instructions 4168346020829 # 4.17 IPC high
l2 access 1082001631 # 0.260 l2 access per 1000 inst
l2 miss 736658180 # 68.08% l2 miss
cpu-cycles 997555573602 # 9.8% memory latency
load stalls 97736959756 # 0.3% l1 bound
l1 miss 94948090517 # 1.4% l2 bound
l2 miss 81224075073 # 8.1% l3 bound
l3 miss 330804862 # 0.0% dram bound
store_stalls 509402552 # 0.1% store bound
Process overview shows ~15k versions of the core-latency process. Each of these runs for a very short time.
15322 processes
14976 core-latency 1535299.29 6920.82
68 clinfo 15.87 6.98
38 vulkaninfo 1.71 0.95
4 vulkani:disk$0 0.18 0.10
6 glxinfo:gdrv0 0.11 0.07
6 glxinfo:gl0 0.11 0.07
2 llvmpipe-0 0.09 0.05
2 llvmpipe-1 0.09 0.05
2 llvmpipe-10 0.09 0.05
2 llvmpipe-11 0.09 0.05
2 llvmpipe-12 0.09 0.05
2 llvmpipe-13 0.09 0.05
2 llvmpipe-14 0.09 0.05
2 llvmpipe-15 0.09 0.05
2 llvmpipe-2 0.09 0.05
2 llvmpipe-3 0.09 0.05
2 llvmpipe-4 0.09 0.05
2 llvmpipe-5 0.09 0.05
2 llvmpipe-6 0.09 0.05
2 llvmpipe-7 0.09 0.05
2 llvmpipe-8 0.09 0.05
2 llvmpipe-9 0.09 0.05
6 php 0.08 0.06
6 clang 0.06 0.06
2 glxinfo 0.05 0.03
2 glxinfo:cs0 0.05 0.03
2 glxinfo:disk$0 0.05 0.03
2 glxinfo:sh0 0.05 0.03
2 glxinfo:shlo0 0.05 0.03
3 rocminfo 0.03 0.00
1 lspci 0.00 0.03
80 sh 0.00 0.00
13 gcc 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
7 gsettings 0.00 0.00
6 llvm-link 0.00 0.00
5 gmain 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 dconf worker 0.00 0.00
2 cc 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
47 maximum processes
