Measuring the time to compile Erlang/OTP. The number of runnable processes jumps high and CPU core usage is variable.

Topdown profile has a higher rate of topdown stalls with frontend stalls and retirement roughly similar.

AMD metrics show on average half the cores. Frontend stalls are the highest. There is not much floating point.
elapsed 374.136
on_cpu 0.520 # 8.33 / 16 cores
utime 2600.996
stime 513.696
nvcsw 7826857 # 26.55%
nivcsw 21651959 # 73.45%
inblock 24 # 0.06/sec
onblock 7827320 # 20921.03/sec
cpu-clock 3122273753608 # 3122.274 seconds
task-clock 3122941278301 # 3122.941 seconds
page faults 88926824 # 28475.343/sec
context switches 29413373 # 9418.484/sec
cpu migrations 517461 # 165.697/sec
major page faults 2910 # 0.932/sec
minor page faults 88923651 # 28474.327/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 2646561802828 # 192.873 branches per 1000 inst
branch misses 91839791917 # 3.47% branch miss
conditional 1924017982101 # 140.216 conditional branches per 1000 inst
indirect 112928353961 # 8.230 indirect branches per 1000 inst
cpu-cycles 11752414130889 # 1.98 GHz
instructions 13299671577609 # 1.13 IPC
slots 24883400890320 #
retiring 4545808476867 # 18.3% (22.4%)
-- ucode 19058779784 # 0.1%
-- fastpath 4526749697083 # 18.2%
frontend 8930881982420 # 35.9% (43.9%)
-- latency 6557542868286 # 26.4%
-- bandwidth 2373339114134 # 9.5%
backend 6017703339481 # 24.2% (29.6%)
-- cpu 853507402202 # 3.4%
-- memory 5164195937279 # 20.8%
speculation 831290685150 # 3.3% ( 4.1%)
-- branch mispredict 815429325807 # 3.3%
-- pipeline restart 15861359343 # 0.1%
smt-contention 4557505158750 # 18.3% ( 0.0%)
cpu-cycles 11730454882014 # 1.97 GHz
instructions 13294674228204 # 1.13 IPC
instructions 4558616129744 # 36.897 l2 access per 1000 inst
l2 hit from l1 144877796872 # 19.30% l2 miss
l2 miss from l1 19728155814 #
l2 hit from l2 pf 10591886123 #
l3 hit from l2 pf 6041907329 #
l3 miss from l2 pf 6689503149 #
instructions 4551855078110 # 31.314 float per 1000 inst
float 512 27337 # 0.000 AVX-512 per 1000 inst
float 256 189660 # 0.000 AVX-256 per 1000 inst
float 128 142535017390 # 31.314 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 5847 # 0.000 scalar per 1000 inst
instructions 2705980 #
opcache 1009880 # 373.203 opcache per 1000 inst
opcache miss 542791 # 53.7% opcache miss rate
l1 dTLB miss 6952 # 2.569 L1 dTLB per 1000 inst
l2 dTLB miss 1329 # 0.491 L2 dTLB per 1000 inst
instructions 2721500 #
icache 1326721 # 487.496 icache per 1000 inst
icache miss 112354 # 8.5% icache miss rate
l1 iTLB miss 9 # 0.003 L1 iTLB per 1000 inst
l2 iTLB miss 0 # 0.000 L2 iTLB per 1000 inst
tlb flush 19 # 0.007 TLB flush per 1000 inst
Intel metrics
elapsed 395.889
on_cpu 0.533 # 8.53 / 16 cores
utime 3054.027
stime 323.352
nvcsw 7022544 # 24.22%
nivcsw 21975293 # 75.78%
inblock 71304 # 180.11/sec
onblock 7816912 # 19745.22/sec
cpu-clock 3380383176672 # 3380.383 seconds
task-clock 3381121509150 # 3381.122 seconds
page faults 88938503 # 26304.439/sec
context switches 28933721 # 8557.433/sec
cpu migrations 542782 # 160.533/sec
major page faults 1469 # 0.434/sec
minor page faults 88936766 # 26303.925/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 2571256188919 # 189.076 branches per 1000 inst
branch misses 61958336618 # 2.41% branch miss
conditional 2571261108567 # 189.076 conditional branches per 1000 inst
indirect 571366916746 # 42.015 indirect branches per 1000 inst
slots 23248461525260 #
retiring 7749937132128 # 33.3% (33.3%)
-- ucode 552996012166 # 2.4%
-- fastpath 7196941119962 # 31.0%
frontend 7269498380742 # 31.3% (31.3%)
-- latency 4040354222203 # 17.4%
-- bandwidth 3229144158539 # 13.9%
backend 5019972335993 # 21.6% (21.6%)
-- cpu 3131106952317 # 13.5%
-- memory 1888865383676 # 8.1%
speculation 3276124690314 # 14.1% (14.1%) high
-- branch mispredict 3156167726549 # 13.6%
-- pipeline restart 119956963765 # 0.5%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 8559345730142 # 1.37 GHz
instructions 11195665887023 # 1.31 IPC
l2 access 285577713652 # 35.058 l2 access per 1000 inst
l2 miss 78765912679 # 27.58% l2 miss
cpu-cycles 6242576212630 # 23.8% memory latency
load stalls 1419074135749 # 6.3% l1 bound
l1 miss 1024870800867 # 6.3% l2 bound
l2 miss 629646656380 # 2.3% l3 bound
l3 miss 486990186219 # 7.8% dram bound
store_stalls 65398513932 # 1.0% store bound
Process overview has a set of processes running to take user time. Over 150,000 processes total and it looks like we missed some.
153290 processes
3948 beam.smp 1349.21 277.09
3948 sys_sig_dispatc 1349.16 277.07
3948 sys_msg_dispatc 1349.11 277.04
3948 1_scheduler 1349.03 277.02
3948 1_dirty_cpu_sch 1348.95 277.00
3948 1_dirty_io_sche 1348.90 276.98
3948 2_dirty_io_sche 1348.84 276.98
3948 3_dirty_io_sche 1348.82 276.97
3948 4_dirty_io_sche 1348.80 276.95
3948 5_dirty_io_sche 1348.73 276.95
3948 6_dirty_io_sche 1348.67 276.91
3948 7_dirty_io_sche 1348.60 276.90
3948 8_dirty_io_sche 1348.56 276.88
3948 9_dirty_io_sche 1348.53 276.88
3948 10_dirty_io_sch 1348.47 276.87
3948 1_aux 1348.44 276.84
3948 0_poller 1348.39 276.83
3948 async_1 1347.00 276.47
186 javac 1089.11 40.06
96 Finalizer 544.81 20.08
96 Common-Cleaner 544.79 20.09
3795 cc1 353.47 31.04
288 cc1plus 157.15 20.61
2937 as 25.94 2.92
68 clinfo 16.54 5.98
30 10_scheduler 7.89 2.01
30 11_scheduler 7.89 2.01
30 2_scheduler 7.89 2.01
30 3_scheduler 7.89 2.01
30 4_scheduler 7.89 2.01
30 5_scheduler 7.89 2.01
30 6_scheduler 7.89 2.01
30 7_scheduler 7.89 2.01
30 8_scheduler 7.89 2.01
30 9_scheduler 7.89 2.01
30 12_scheduler 7.88 2.01
30 13_scheduler 7.88 2.01
30 14_scheduler 7.87 2.01
30 15_scheduler 7.87 2.01
30 16_scheduler 7.87 2.01
30 2_dirty_cpu_sch 7.87 2.01
30 3_dirty_cpu_sch 7.87 2.01
30 4_dirty_cpu_sch 7.87 2.01
30 5_dirty_cpu_sch 7.87 2.01
30 6_dirty_cpu_sch 7.87 2.01
30 10_dirty_cpu_sc 7.87 2.00
30 11_dirty_cpu_sc 7.87 2.00
30 12_dirty_cpu_sc 7.87 2.00
30 13_dirty_cpu_sc 7.87 2.00
30 14_dirty_cpu_sc 7.87 2.00
30 15_dirty_cpu_sc 7.87 2.00
30 16_dirty_cpu_sc 7.87 2.00
30 7_dirty_cpu_sch 7.87 2.00
30 8_dirty_cpu_sch 7.87 2.00
30 9_dirty_cpu_sch 7.87 2.00
9 yielding_c_fun 2.93 0.54
930 ld 2.65 0.90
3 gzip 2.05 0.10
93 G1 Main Marker 2.00 0.00
93 G1 Young RemSet 2.00 0.00
93 VM Periodic Tas 2.00 0.00
6670 bash 1.15 3.70
38 vulkaninfo 1.15 1.33
801 make 0.88 0.44
6 jar 0.54 0.10
72 perl 0.46 0.00
6 php 0.14 0.22
4 vulkani:disk$0 0.13 0.14
6 glxinfo:gdrv0 0.12 0.06
6 glxinfo:gl0 0.12 0.06
3 tar 0.07 1.33
2 llvmpipe-0 0.07 0.07
2 llvmpipe-1 0.07 0.07
2 llvmpipe-10 0.07 0.07
2 llvmpipe-11 0.07 0.07
2 llvmpipe-12 0.07 0.07
2 llvmpipe-13 0.07 0.07
2 llvmpipe-14 0.07 0.07
2 llvmpipe-15 0.07 0.07
2 llvmpipe-2 0.07 0.07
2 llvmpipe-3 0.07 0.07
2 llvmpipe-4 0.07 0.07
2 llvmpipe-5 0.07 0.07
2 llvmpipe-6 0.07 0.07
2 llvmpipe-7 0.07 0.07
2 llvmpipe-8 0.07 0.07
2 llvmpipe-9 0.07 0.07
6 clang 0.07 0.05
2 glxinfo 0.07 0.02
2 glxinfo:cs0 0.06 0.02
2 glxinfo:disk$0 0.06 0.02
2 glxinfo:sh0 0.06 0.02
2 glxinfo:shlo0 0.06 0.02
42 flex 0.06 0.00
5506 rm 0.03 1.00
7645 sh 0.03 0.07
3259 gcc 0.03 0.05
3221 sed 0.03 0.00
3 rocminfo 0.03 0.00
24 ranlib 0.02 0.80
24 ar 0.02 0.78
307 configure 0.02 0.00
12 m4 0.01 0.00
804 C2 CompilerThre 0.00 4156.98
426 C1 CompilerThre 0.00 2211.08
96 GC Thread#0 0.00 544.82
96 G1 Conc#0 0.00 544.81
96 Service Thread 0.00 544.81
96 VM Thread 0.00 544.81
96 Reference Handl 0.00 544.80
96 Sweeper thread 0.00 544.79
96 Signal Dispatch 0.00 544.61
93 G1 Conc#1 0.00 544.52
93 G1 Conc#2 0.00 544.52
93 GC Thread#1 0.00 544.51
93 G1 Refine#0 0.00 539.18
924 install 0.00 0.09
1 lspci 0.00 0.02
11774 dirname 0.00 0.00
5148 basename 0.00 0.00
3948 erl_child_setup 0.00 0.00
3885 dyn_erl 0.00 0.00
3573 cat 0.00 0.00
2475 config.sub 0.00 0.00
1044 mkdir 0.00 0.00
930 collect2 0.00 0.00
817 grep 0.00 0.00
786 awk 0.00 0.00
552 mv 0.00 0.00
453 expr 0.00 0.00
294 uname 0.00 0.00
276 g++ 0.00 0.00
261 cp 0.00 0.00
192 ls 0.00 0.00
119 conftest 0.00 0.00
114 ln 0.00 0.00
92 cc 0.00 0.00
82 sort 0.00 0.00
81 hostname 0.00 0.00
63 chmod 0.00 0.00
61 mktemp 0.00 0.00
60 inet_gethost 0.00 0.00
57 rmdir 0.00 0.00
54 arch 0.00 0.00
45 diff 0.00 0.00
45 find 0.00 0.00
42 tr 0.00 0.00
27 getconf 0.00 0.00
15 config.guess 0.00 0.00
12 gen_git_version 0.00 0.00
12 touch 0.00 0.00
11 gsettings 0.00 0.00
9 echo 0.00 0.00
9 otp_build 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
6 pkg-config 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 date 0.00 0.00
3 build-erlang 0.00 0.00
3 git 0.00 0.00
3 gmain 0.00 0.00
3 snmp-v2tov1 0.00 0.00
2 dconf worker 0.00 0.00
2 lscpu 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
