Rust written AV1 video encoder test. There are four workloads at various speed settings. These also seem to vary on how much on-cpu time is spent.

Topdown profile shows a consistent retirement rate with backend stalls higher in the last and frontend stalls higher in the first.

AMD metrics show on average six cores. The is some floating point and not much L2 activity.
elapsed 680.283
on_cpu 0.405 # 6.48 / 16 cores
utime 4380.077
stime 29.750
nvcsw 529699 # 94.26%
nivcsw 32249 # 5.74%
inblock 0 # 0.00/sec
onblock 15808 # 23.24/sec
cpu-clock 4406070896173 # 4406.071 seconds
task-clock 4406708412361 # 4406.708 seconds
page faults 6123612 # 1389.611/sec
context switches 565143 # 128.246/sec
cpu migrations 7223 # 1.639/sec
major page faults 2 # 0.000/sec
minor page faults 6123610 # 1389.611/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 4662711827592 # 111.593 branches per 1000 inst
branch misses 52847581667 # 1.13% branch miss
conditional 3741973573338 # 89.557 conditional branches per 1000 inst
indirect 184899584708 # 4.425 indirect branches per 1000 inst
cpu-cycles 17439787503556 # 1.60 GHz
instructions 41786711974119 # 2.40 IPC
slots 34865080874874 #
retiring 13743579233199 # 39.4% (51.7%)
-- ucode 30766579480 # 0.1%
-- fastpath 13712812653719 # 39.3%
frontend 6301204183817 # 18.1% (23.7%)
-- latency 3991486122378 # 11.4%
-- bandwidth 2309718061439 # 6.6%
backend 5757168999901 # 16.5% (21.6%)
-- cpu 1810773177827 # 5.2%
-- memory 3946395822074 # 11.3%
speculation 804632596029 # 2.3% ( 3.0%)
-- branch mispredict 783402978854 # 2.2%
-- pipeline restart 21229617175 # 0.1%
smt-contention 8258433048876 # 23.7% ( 0.0%)
cpu-cycles 17436364491114 # 1.60 GHz
instructions 41772250512279 # 2.40 IPC
instructions 13931405364622 # 30.075 l2 access per 1000 inst
l2 hit from l1 399942908773 # 2.77% l2 miss
l2 miss from l1 4259235520 #
l2 hit from l2 pf 11699912704 #
l3 hit from l2 pf 4059614458 #
l3 miss from l2 pf 3278519298 #
instructions 13924366621450 # 107.589 float per 1000 inst
float 512 70 # 0.000 AVX-512 per 1000 inst
float 256 534 # 0.000 AVX-256 per 1000 inst
float 128 1498110268831 # 107.589 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
Intel metrics
elapsed 856.175
on_cpu 0.403 # 6.45 / 16 cores
utime 5497.713
stime 22.929
nvcsw 539159 # 89.84%
nivcsw 60943 # 10.16%
inblock 664 # 0.78/sec
onblock 4800 # 5.61/sec
cpu-clock 5513430812677 # 5513.431 seconds
task-clock 5514030198429 # 5514.030 seconds
page faults 6400742 # 1160.810/sec
context switches 604186 # 109.572/sec
cpu migrations 45126 # 8.184/sec
major page faults 5 # 0.001/sec
minor page faults 6400733 # 1160.808/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 4723772752112 # 108.408 branches per 1000 inst
branch misses 55866941113 # 1.18% branch miss
conditional 4723772771216 # 108.408 conditional branches per 1000 inst
indirect 979987448967 # 22.490 indirect branches per 1000 inst
slots 42426994010144 #
retiring 25181587491963 # 59.4% (59.4%)
-- ucode 1276993347564 # 3.0%
-- fastpath 23904594144399 # 56.3%
frontend 9465887818390 # 22.3% (22.3%)
-- latency 3321055045324 # 7.8%
-- bandwidth 6144832773066 # 14.5%
backend 4043327042134 # 9.5% ( 9.5%)
-- cpu 2743315610137 # 6.5%
-- memory 1300011431997 # 3.1%
speculation 3778862180966 # 8.9% ( 8.9%)
-- branch mispredict 3656643627421 # 8.6%
-- pipeline restart 122218553545 # 0.3%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 19850940272366 # 1.44 GHz
instructions 52275803783004 # 2.63 IPC
l2 access 629155938709 # 24.142 l2 access per 1000 inst
l2 miss 42062422802 # 6.69% l2 miss
Process overview shows time spent in rav1e processes.
573 processes
212 rav1e 73556.31 386.72
68 clinfo 15.47 6.98
38 vulkaninfo 1.12 1.14
6 glxinfo:gdrv0 0.17 0.04
4 vulkani:disk$0 0.11 0.12
6 php 0.10 0.14
2 glxinfo 0.08 0.02
2 glxinfo:cs0 0.08 0.02
2 glxinfo:disk$0 0.08 0.02
2 glxinfo:sh0 0.08 0.02
2 glxinfo:shlo0 0.08 0.02
2 llvmpipe-0 0.06 0.06
2 llvmpipe-1 0.06 0.06
2 llvmpipe-10 0.06 0.06
2 llvmpipe-11 0.06 0.06
2 llvmpipe-12 0.06 0.06
2 llvmpipe-13 0.06 0.06
2 llvmpipe-14 0.06 0.06
2 llvmpipe-15 0.06 0.06
2 llvmpipe-2 0.06 0.06
2 llvmpipe-3 0.06 0.06
2 llvmpipe-4 0.06 0.06
2 llvmpipe-5 0.06 0.06
2 llvmpipe-6 0.06 0.06
2 llvmpipe-7 0.06 0.06
2 llvmpipe-8 0.06 0.06
2 llvmpipe-9 0.06 0.06
6 clang 0.03 0.09
3 rocminfo 0.03 0.00
1 lspci 0.00 0.02
87 sh 0.00 0.00
12 gcc 0.00 0.00
12 tr 0.00 0.00
10 gsettings 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 gmain 0.00 0.00
2 dconf worker 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 cc 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
4 processes running
51 maximum processes
These seem to be started on all CPUs
157481) rav1e cpu=11 start=92.26 finish=175.52
157482) rav1e cpu=7 start=92.26 finish=175.46
157483) rav1e cpu=15 start=92.27 finish=175.46
157484) rav1e cpu=12 start=92.27 finish=175.46
157485) rav1e cpu=5 start=92.27 finish=175.47
157486) rav1e cpu=13 start=92.27 finish=175.47
157487) rav1e cpu=0 start=92.27 finish=175.47
157488) rav1e cpu=14 start=92.27 finish=175.47
157489) rav1e cpu=11 start=92.27 finish=175.47
157490) rav1e cpu=2 start=92.27 finish=175.47
157491) rav1e cpu=9 start=92.27 finish=175.46
157492) rav1e cpu=10 start=92.27 finish=175.47
157493) rav1e cpu=12 start=92.27 finish=175.47
157494) rav1e cpu=4 start=92.27 finish=175.47
157495) rav1e cpu=6 start=92.27 finish=175.47
157496) rav1e cpu=3 start=92.27 finish=175.46
157497) rav1e cpu=3 start=92.27 finish=175.47
157498) rav1e cpu=8 start=92.27 finish=175.47
157500) tr cpu=4 start=175.52 finish=175.52
