Test of compressing and decompressing the Linux kernel (through a different one than the compress-gzip benchmark). Interesting that none tests of different compression tools use the same metrics and workload so not easy to compare between tools.

AMD metrics show a lower on-cpu (3.63) but not single threaded and also a lot of output. Not as much branch misprediction as some other codes.
elapsed 159.883
on_cpu 0.227 # 3.63 / 16 cores
utime 373.561
stime 206.226
nvcsw 24109219 # 99.98%
nivcsw 4882 # 0.02%
inblock 344 # 2.15/sec
onblock 9913648 # 62005.75/sec
cpu-clock 573029063163 # 573.029 seconds
task-clock 579433116869 # 579.433 seconds
page faults 471162 # 813.143/sec
context switches 24114697 # 41617.740/sec
cpu migrations 2346204 # 4049.137/sec
major page faults 4 # 0.007/sec
minor page faults 471158 # 813.136/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 381485532457 # 148.151 branches per 1000 inst
branch misses 21732499954 # 5.70% branch miss
conditional 331455035535 # 128.721 conditional branches per 1000 inst
indirect 2131195176 # 0.828 indirect branches per 1000 inst
cpu-cycles 2247254350506 # 0.92 GHz
instructions 2657644056246 # 1.18 IPC
slots 4337089786164 #
retiring 856287818753 # 19.7% (22.6%)
-- ucode 1063178575 # 0.0%
-- fastpath 855224640178 # 19.7%
frontend 1416917998202 # 32.7% (37.4%)
-- latency 990546038496 # 22.8%
-- bandwidth 426371959706 # 9.8%
backend 1317194261591 # 30.4% (34.7%)
-- cpu 380456352651 # 8.8%
-- memory 936737908940 # 21.6%
speculation 201494052691 # 4.6% ( 5.3%)
-- branch mispredict 200227001429 # 4.6%
-- pipeline restart 1267051262 # 0.0%
smt-contention 543796763011 # 12.5% ( 0.0%)
cpu-cycles 2245077480593 # 0.92 GHz
instructions 2667029914647 # 1.19 IPC
instructions 872960594992 # 41.360 l2 access per 1000 inst
l2 hit from l1 27079123490 # 31.89% l2 miss
l2 miss from l1 5990309331 #
l2 hit from l2 pf 3502066051 #
l3 hit from l2 pf 4885619672 #
l3 miss from l2 pf 638696475 #
instructions 870165842389 # 20.284 float per 1000 inst
float 512 77 # 0.000 AVX-512 per 1000 inst
float 256 1157816 # 0.001 AVX-256 per 1000 inst
float 128 17649607359 # 20.283 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
Intel metrics
elapsed 239.855
on_cpu 0.218 # 3.49 / 16 cores
utime 628.377
stime 209.002
nvcsw 25437007 # 92.02%
nivcsw 2205558 # 7.98%
inblock 237344 # 989.53/sec
onblock 9913984 # 41333.30/sec
cpu-clock 833996774060 # 833.997 seconds
task-clock 840839881199 # 840.840 seconds
page faults 454067 # 540.016/sec
context switches 27643630 # 32876.212/sec
cpu migrations 7769626 # 9240.316/sec
major page faults 16 # 0.019/sec
minor page faults 454051 # 539.997/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 378883981060 # 144.229 branches per 1000 inst
branch misses 14411618351 # 3.80% branch miss
conditional 378883995012 # 144.229 conditional branches per 1000 inst
indirect 107206296207 # 40.810 indirect branches per 1000 inst
slots 4884567876338 #
retiring 1388281256735 # 28.4% (28.4%)
-- ucode 80372149297 # 1.6%
-- fastpath 1307909107438 # 26.8%
frontend 864325651934 # 17.7% (17.7%)
-- latency 475762760331 # 9.7%
-- bandwidth 388562891603 # 8.0%
backend 1861932065696 # 38.1% (38.1%)
-- cpu 477788296605 # 9.8%
-- memory 1384143769091 # 28.3%
speculation 880364205059 # 18.0% (18.0%)
-- branch mispredict 866195517000 # 17.7%
-- pipeline restart 14168688059 # 0.3%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 2069940879352 # 0.58 GHz
instructions 3033820737561 # 1.47 IPC
l2 access 58335330439 # 37.463 l2 access per 1000 inst
l2 miss 24403745862 # 41.83% l2 miss
Process overview is straightforward
420 processes
51 rar 6839.31 1699.16
64 clinfo 12.16 2.56
1 xz 4.45 0.27
38 vulkaninfo 0.93 0.95
6 php 0.51 4.54
2 cp 0.32 5.24
1 tar 0.12 2.32
6 glxinfo:gdrv0 0.11 0.06
4 vulkani:disk$0 0.09 0.10
7 rm 0.07 3.77
2 glxinfo 0.06 0.02
2 glxinfo:cs0 0.06 0.02
2 glxinfo:disk$0 0.06 0.02
2 glxinfo:sh0 0.06 0.02
2 glxinfo:shlo0 0.06 0.02
2 llvmpipe-0 0.05 0.05
2 llvmpipe-1 0.05 0.05
2 llvmpipe-10 0.05 0.05
2 llvmpipe-11 0.05 0.05
2 llvmpipe-12 0.05 0.05
2 llvmpipe-13 0.05 0.05
2 llvmpipe-14 0.05 0.05
2 llvmpipe-15 0.05 0.05
2 llvmpipe-2 0.05 0.05
2 llvmpipe-3 0.05 0.05
2 llvmpipe-4 0.05 0.05
2 llvmpipe-5 0.05 0.05
2 llvmpipe-6 0.05 0.05
2 llvmpipe-7 0.05 0.05
2 llvmpipe-8 0.05 0.05
2 llvmpipe-9 0.05 0.05
6 clang 0.03 0.04
1 lspci 0.00 0.03
92 sh 0.00 0.00
12 gcc 0.00 0.00
9 stty 0.00 0.00
8 gsettings 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
5 gmain 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 bash 0.00 0.00
3 compress-rar 0.00 0.00
3 dconf worker 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 cc 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
47 maximum processes
The core computation blocks show we are starting one thread per core, so relatively lower on_cpu likely indicates waiting e.g. for disk.
34369) compress-rar start=17.02 finish=58.27
34370) rar start=17.02 finish=58.27
34371) rar start=18.90 finish=58.19
34372) rar start=18.90 finish=58.19
34373) rar start=18.90 finish=58.19
34374) rar start=18.90 finish=58.19
34375) rar start=18.90 finish=58.19
34376) rar start=18.90 finish=58.19
34377) rar start=18.90 finish=58.19
34378) rar start=18.90 finish=58.19
34379) rar start=18.90 finish=58.19
34380) rar start=18.90 finish=58.19
34381) rar start=18.90 finish=58.19
34382) rar start=18.90 finish=58.19
34383) rar start=18.90 finish=58.19
34384) rar start=18.90 finish=58.19
34385) rar start=18.90 finish=58.19
34386) rar start=18.90 finish=58.19
