Another build test, this time for the mplayer media player. A fairly quick build taking less than a minute.

Topdown overview shows a similar overview to other build processes. Generally heavier on front-end stalls and not as much backend stalls. Slight differences towards end at link stage.

AMD metrics show ~12 cores kept busy and 1/5th of the instructions are branches.
elapsed 131.083
on_cpu 0.749 # 11.98 / 16 cores
utime 1434.909
stime 136.095
nvcsw 184881 # 54.88%
nivcsw 151998 # 45.12%
inblock 4840 # 36.92/sec
onblock 815992 # 6225.02/sec
cpu-clock 1569995918518 # 1569.996 seconds
task-clock 1570022200562 # 1570.022 seconds
page faults 33042137 # 21045.650/sec
context switches 311123 # 198.165/sec
cpu migrations 32456 # 20.672/sec
major page faults 704 # 0.448/sec
minor page faults 33041433 # 21045.201/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 1588505172838 # 207.892 branches per 1000 inst
branch misses 43921596872 # 2.76% branch miss
conditional 1230296354350 # 161.012 conditional branches per 1000 inst
indirect 34954151096 # 4.575 indirect branches per 1000 inst
cpu-cycles 5992955401481 # 2.86 GHz
instructions 7521843661027 # 1.26 IPC
slots 12295701158958 #
retiring 2473184787261 # 20.1% (25.0%)
-- ucode 3277908923 # 0.0%
-- fastpath 2469906878338 # 20.1%
frontend 4267191925642 # 34.7% (43.1%)
-- latency 3145065275880 # 25.6%
-- bandwidth 1122126649762 # 9.1%
backend 2704586566512 # 22.0% (27.3%)
-- cpu 373789605992 # 3.0%
-- memory 2330796960520 # 19.0%
speculation 458055998059 # 3.7% ( 4.6%)
-- branch mispredict 453301995104 # 3.7%
-- pipeline restart 4754002955 # 0.0%
smt-contention 2392662345903 # 19.5% ( 0.0%)
cpu-cycles 5999256771224 # 2.86 GHz
instructions 7518589349203 # 1.25 IPC
instructions 2545551521414 # 39.154 l2 access per 1000 inst
l2 hit from l1 86386798530 # 17.25% l2 miss
l2 miss from l1 10772079364 #
l2 hit from l2 pf 6864784791 #
l3 hit from l2 pf 3645118667 #
l3 miss from l2 pf 2771144343 #
instructions 2542523986390 # 24.173 float per 1000 inst
float 512 8375 # 0.000 AVX-512 per 1000 inst
float 256 567122 # 0.000 AVX-256 per 1000 inst
float 128 61459757300 # 24.173 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 1 # 0.000 scalar per 1000 inst
Intel metrics show a larger overall elapsed time, perhaps because of additional variability depending on what cores are used.
elapsed 702.937
on_cpu 0.830 # 13.27 / 16 cores
utime 8850.134
stime 480.073
nvcsw 805769 # 51.24%
nivcsw 766621 # 48.76%
inblock 37360 # 53.15/sec
onblock 2924240 # 4160.03/sec
cpu-clock 9327209381783 # 9327.209 seconds
task-clock 9327273727003 # 9327.274 seconds
page faults 150700356 # 16156.957/sec
context switches 1466331 # 157.209/sec
cpu migrations 144812 # 15.526/sec
major page faults 2783 # 0.298/sec
minor page faults 150697573 # 16156.658/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 7741484482358 # 207.389 branches per 1000 inst
branch misses 191113104298 # 2.47% branch miss
conditional 7741488432022 # 207.389 conditional branches per 1000 inst
indirect 1540448949106 # 41.268 indirect branches per 1000 inst
slots 9270230760800 #
retiring 3740816186972 # 40.4% (40.4%)
-- ucode 254059658497 # 2.7%
-- fastpath 3486756528475 # 37.6%
frontend 3249083635174 # 35.0% (35.0%)
-- latency 1538385778881 # 16.6%
-- bandwidth 1710697856293 # 18.5%
backend 700103408302 # 7.6% ( 7.6%)
-- cpu 352231579537 # 3.8%
-- memory 347871828765 # 3.8%
speculation 1600749496422 # 17.3% (17.3%)
-- branch mispredict 1561048913198 # 16.8%
-- pipeline restart 39700583224 # 0.4%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 3783910784078 # 1.61 GHz
instructions 5236575955741 # 1.38 IPC
l2 access 158201882092 # 39.489 l2 access per 1000 inst
l2 miss 35363291724 # 22.35% l2 miss
Process summary shows more compilation than I expected with ~5900 C complier front end (cc1) dominating the time.
28756 processes
5918 cc1 1292.43 69.61
331 yasm 56.10 0.40
68 clinfo 16.53 5.99
30 make 6.38 2.36
5917 as 3.86 0.09
157 ld 1.67 1.16
1 xz 0.80 0.05
38 vulkaninfo 0.76 1.34
137 awk 0.61 0.01
22 ar 0.33 0.58
6 glxinfo:gdrv0 0.14 0.07
6 php 0.09 0.21
3 codec-cfg 0.09 0.00
4 vulkani:disk$0 0.08 0.14
6 clang 0.07 0.05
2 glxinfo 0.07 0.03
2 glxinfo:cs0 0.07 0.03
2 glxinfo:disk$0 0.07 0.03
2 glxinfo:shlo0 0.07 0.03
2 glxinfo:sh0 0.06 0.03
2351 configure 0.04 0.32
2 llvmpipe-0 0.04 0.07
2 llvmpipe-1 0.04 0.07
2 llvmpipe-10 0.04 0.07
2 llvmpipe-11 0.04 0.07
2 llvmpipe-12 0.04 0.07
2 llvmpipe-13 0.04 0.07
2 llvmpipe-14 0.04 0.07
2 llvmpipe-15 0.04 0.07
2 llvmpipe-2 0.04 0.07
2 llvmpipe-3 0.04 0.07
2 llvmpipe-4 0.04 0.07
2 llvmpipe-5 0.04 0.07
2 llvmpipe-6 0.04 0.07
2 llvmpipe-7 0.04 0.07
2 llvmpipe-8 0.04 0.07
2 llvmpipe-9 0.04 0.07
3 rocminfo 0.03 0.00
1 tar 0.02 0.41
1 lspci 0.01 0.01
431 rm 0.00 0.33
5643 sh 0.00 0.07
1 ps 0.00 0.01
5934 cc 0.00 0.00
536 cat 0.00 0.00
526 tr 0.00 0.00
157 collect2 0.00 0.00
46 sed 0.00 0.00
24 pkg-config 0.00 0.00
21 true 0.00 0.00
14 cut 0.00 0.00
14 tmp 0.00 0.00
13 gcc 0.00 0.00
13 grep 0.00 0.00
12 cmp 0.00 0.00
12 gsettings 0.00 0.00
10 cp 0.00 0.00
10 head 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
8 version.sh 0.00 0.00
6 llvm-link 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 bash 0.00 0.00
4 help_create.sh 0.00 0.00
4 uname 0.00 0.00
3 basename 0.00 0.00
3 mv 0.00 0.00
3 time-compile-mp 0.00 0.00
3 touch 0.00 0.00
2 dconf worker 0.00 0.00
2 git 0.00 0.00
2 gmain 0.00 0.00
2 lscpu 0.00 0.00
2 tail 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mkdir 0.00 0.00
1 mktemp 0.00 0.00
1 nm 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sort 0.00 0.00
1 strings 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
77 processes running
158 maximum processes
