Test of compressing and decompressing the Linux kernel (through a different one than the compress-gzip benchmark). Interesting that none tests of different compression tools use the same metrics and workload so not easy to compare between tools.

AMD metrics show a lower on-cpu (3.63) but not single threaded and also a lot of output. Not as much branch misprediction as some other codes.

elapsed              159.883
on_cpu               0.227          # 3.63 / 16 cores
utime                373.561
stime                206.226
nvcsw                24109219       # 99.98%
nivcsw               4882           # 0.02%
inblock              344            # 2.15/sec
onblock              9913648        # 62005.75/sec
cpu-clock            573029063163   # 573.029 seconds
task-clock           579433116869   # 579.433 seconds
page faults          471162         # 813.143/sec
context switches     24114697       # 41617.740/sec
cpu migrations       2346204        # 4049.137/sec
major page faults    4              # 0.007/sec
minor page faults    471158         # 813.136/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             381485532457   # 148.151 branches per 1000 inst
branch misses        21732499954    # 5.70% branch miss
conditional          331455035535   # 128.721 conditional branches per 1000 inst
indirect             2131195176     # 0.828 indirect branches per 1000 inst
cpu-cycles           2247254350506  # 0.92 GHz
instructions         2657644056246  # 1.18 IPC
slots                4337089786164  #
retiring             856287818753   # 19.7% (22.6%)
-- ucode             1063178575     #     0.0%
-- fastpath          855224640178   #    19.7%
frontend             1416917998202  # 32.7% (37.4%)
-- latency           990546038496   #    22.8%
-- bandwidth         426371959706   #     9.8%
backend              1317194261591  # 30.4% (34.7%)
-- cpu               380456352651   #     8.8%
-- memory            936737908940   #    21.6%
speculation          201494052691   #  4.6% ( 5.3%)
-- branch mispredict 200227001429   #     4.6%
-- pipeline restart  1267051262     #     0.0%
smt-contention       543796763011   # 12.5% ( 0.0%)
cpu-cycles           2245077480593  # 0.92 GHz
instructions         2667029914647  # 1.19 IPC
instructions         872960594992   # 41.360 l2 access per 1000 inst
l2 hit from l1       27079123490    # 31.89% l2 miss
l2 miss from l1      5990309331     #
l2 hit from l2 pf    3502066051     #
l3 hit from l2 pf    4885619672     #
l3 miss from l2 pf   638696475      #
instructions         870165842389   # 20.284 float per 1000 inst
float 512            77             # 0.000 AVX-512 per 1000 inst
float 256            1157816        # 0.001 AVX-256 per 1000 inst
float 128            17649607359    # 20.283 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst

Intel metrics

elapsed              239.855
on_cpu               0.218          # 3.49 / 16 cores
utime                628.377
stime                209.002
nvcsw                25437007       # 92.02%
nivcsw               2205558        # 7.98%
inblock              237344         # 989.53/sec
onblock              9913984        # 41333.30/sec
cpu-clock            833996774060   # 833.997 seconds
task-clock           840839881199   # 840.840 seconds
page faults          454067         # 540.016/sec
context switches     27643630       # 32876.212/sec
cpu migrations       7769626        # 9240.316/sec
major page faults    16             # 0.019/sec
minor page faults    454051         # 539.997/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             378883981060   # 144.229 branches per 1000 inst
branch misses        14411618351    # 3.80% branch miss
conditional          378883995012   # 144.229 conditional branches per 1000 inst
indirect             107206296207   # 40.810 indirect branches per 1000 inst
slots                4884567876338  #
retiring             1388281256735  # 28.4% (28.4%)
-- ucode             80372149297    #     1.6%
-- fastpath          1307909107438  #    26.8%
frontend             864325651934   # 17.7% (17.7%)
-- latency           475762760331   #     9.7%
-- bandwidth         388562891603   #     8.0%
backend              1861932065696  # 38.1% (38.1%)
-- cpu               477788296605   #     9.8%
-- memory            1384143769091  #    28.3%
speculation          880364205059   # 18.0% (18.0%)
-- branch mispredict 866195517000   #    17.7%
-- pipeline restart  14168688059    #     0.3%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           2069940879352  # 0.58 GHz
instructions         3033820737561  # 1.47 IPC
l2 access            58335330439    # 37.463 l2 access per 1000 inst
l2 miss              24403745862    # 41.83% l2 miss

Process overview is straightforward

420 processes
	 51 rar                   6839.31  1699.16
	 64 clinfo                  12.16     2.56
	  1 xz                       4.45     0.27
	 38 vulkaninfo               0.93     0.95
	  6 php                      0.51     4.54
	  2 cp                       0.32     5.24
	  1 tar                      0.12     2.32
	  6 glxinfo:gdrv0            0.11     0.06
	  4 vulkani:disk$0           0.09     0.10
	  7 rm                       0.07     3.77
	  2 glxinfo                  0.06     0.02
	  2 glxinfo:cs0              0.06     0.02
	  2 glxinfo:disk$0           0.06     0.02
	  2 glxinfo:sh0              0.06     0.02
	  2 glxinfo:shlo0            0.06     0.02
	  2 llvmpipe-0               0.05     0.05
	  2 llvmpipe-1               0.05     0.05
	  2 llvmpipe-10              0.05     0.05
	  2 llvmpipe-11              0.05     0.05
	  2 llvmpipe-12              0.05     0.05
	  2 llvmpipe-13              0.05     0.05
	  2 llvmpipe-14              0.05     0.05
	  2 llvmpipe-15              0.05     0.05
	  2 llvmpipe-2               0.05     0.05
	  2 llvmpipe-3               0.05     0.05
	  2 llvmpipe-4               0.05     0.05
	  2 llvmpipe-5               0.05     0.05
	  2 llvmpipe-6               0.05     0.05
	  2 llvmpipe-7               0.05     0.05
	  2 llvmpipe-8               0.05     0.05
	  2 llvmpipe-9               0.05     0.05
	  6 clang                    0.03     0.04
	  1 lspci                    0.00     0.03
	 92 sh                       0.00     0.00
	 12 gcc                      0.00     0.00
	  9 stty                     0.00     0.00
	  8 gsettings                0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 gmain                    0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  4 bash                     0.00     0.00
	  3 compress-rar             0.00     0.00
	  3 dconf worker             0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 cc                       0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 ps                       0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

The core computation blocks show we are starting one thread per core, so relatively lower on_cpu likely indicates waiting e.g. for disk.

      34369) compress-rar start=17.02 finish=58.27
        34370) rar start=17.02 finish=58.27
          34371) rar start=18.90 finish=58.19
          34372) rar start=18.90 finish=58.19
          34373) rar start=18.90 finish=58.19
          34374) rar start=18.90 finish=58.19
          34375) rar start=18.90 finish=58.19
          34376) rar start=18.90 finish=58.19
          34377) rar start=18.90 finish=58.19
          34378) rar start=18.90 finish=58.19
          34379) rar start=18.90 finish=58.19
          34380) rar start=18.90 finish=58.19
          34381) rar start=18.90 finish=58.19
          34382) rar start=18.90 finish=58.19
          34383) rar start=18.90 finish=58.19
          34384) rar start=18.90 finish=58.19
          34385) rar start=18.90 finish=58.19
          34386) rar start=18.90 finish=58.19