Testing the xz tool for compressing and decompressing a Linux ISO image. This is the compression where Intel scores best relative to AMD (1.13x slower). Very much a backend bound task. Interesting that none tests of different compression tools use the same metrics and workload so not easy to compare between tools.

AMD metrics show a memory-bound process with low IPC and not much branch misprediction.

elapsed              100.692
on_cpu               0.679          # 10.87 / 16 cores
utime                1082.313
stime                11.855
nvcsw                5507           # 23.18%
nivcsw               18252          # 76.82%
inblock              32             # 0.32/sec
onblock              4784560        # 47516.65/sec
cpu-clock            1094318856774  # 1094.319 seconds
task-clock           1094336999873  # 1094.337 seconds
page faults          4003695        # 3658.558/sec
context switches     24024          # 21.953/sec
cpu migrations       397            # 0.363/sec
major page faults    0              # 0.000/sec
minor page faults    4003695        # 3658.558/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             305826469983   # 140.424 branches per 1000 inst
branch misses        15442205577    # 5.05% branch miss
conditional          207603757938   # 95.324 conditional branches per 1000 inst
indirect             24481367087    # 11.241 indirect branches per 1000 inst
cpu-cycles           5275899989187  # 3.13 GHz
instructions         2174535846634  # 0.41 IPC
slots                10560962539674 #
retiring             752461733550   #  7.1% ( 8.5%)
-- ucode             922767045      #     0.0%
-- fastpath          751538966505   #     7.1%
frontend             958596815267   #  9.1% (10.8%)
-- latency           520474659312   #     4.9%
-- bandwidth         438122155955   #     4.1%
backend              6775989132895  # 64.2% (76.4%)
-- cpu               166640846018   #     1.6%
-- memory            6609348286877  #    62.6%
speculation          384200934302   #  3.6% ( 4.3%)
-- branch mispredict 378018914083   #     3.6%
-- pipeline restart  6182020219     #     0.1%
smt-contention       1689708234394  # 16.0% ( 0.0%)
cpu-cycles           5272870167404  # 3.26 GHz
instructions         2176222575312  # 0.41 IPC
instructions         725787433025   # 24.967 l2 access per 1000 inst
l2 hit from l1       12503036149    # 58.73% l2 miss
l2 miss from l1      5866288583     #
l2 hit from l2 pf    840727641      #
l3 hit from l2 pf    1081438228     #
l3 miss from l2 pf   3695290297     #
instructions         724987198135   # 0.163 float per 1000 inst
float 512            51             # 0.000 AVX-512 per 1000 inst
float 256            336            # 0.000 AVX-256 per 1000 inst
float 128            118149446      # 0.163 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst

Intel metrics

elapsed              111.043
on_cpu               0.701          # 11.21 / 16 cores
utime                1235.188
stime                10.000
nvcsw                6590           # 38.10%
nivcsw               10708          # 61.90%
inblock              1267304        # 11412.76/sec
onblock              4784552        # 43087.47/sec
cpu-clock            1245213949207  # 1245.214 seconds
task-clock           1245224972995  # 1245.225 seconds
page faults          3999376        # 3211.770/sec
context switches     17634          # 14.161/sec
cpu migrations       611            # 0.491/sec
major page faults    166            # 0.133/sec
minor page faults    3999210        # 3211.637/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             303879312864   # 139.664 branches per 1000 inst
branch misses        17380240502    # 5.72% branch miss
conditional          303879327936   # 139.664 conditional branches per 1000 inst
indirect             96960741484    # 44.564 indirect branches per 1000 inst
slots                6462683136008  #
retiring             1298072778471  # 20.1% (20.1%)
-- ucode             95216649660    #     1.5%
-- fastpath          1202856128811  #    18.6%
frontend             749757159532   # 11.6% (11.6%)
-- latency           423443276271   #     6.6%
-- bandwidth         326313883261   #     5.0%
backend              3300833107005  # 51.1% (51.1%)
-- cpu               567361250552   #     8.8%
-- memory            2733471856453  #    42.3%
speculation          1323336368123  # 20.5% (20.5%)
-- branch mispredict 1309834630416  #    20.3%
-- pipeline restart  13501737707    #     0.2%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           13555660857979 # 2.11 GHz
instructions         7882561270813  # 0.58 IPC
l2 access            68241841123    # 16.037 l2 access per 1000 inst
l2 miss              32692385575    # 47.91% l2 miss

Process structure

455 processes
	 78 xz                   18339.60   175.46
	 64 clinfo                  11.52     2.88
	 38 vulkaninfo               0.75     1.32
	  4 vulkani:disk$0           0.08     0.14
	  6 glxinfo:gdrv0            0.05     0.13
	  6 php                      0.05     0.09
	  6 clang                    0.05     0.02
	  2 llvmpipe-0               0.04     0.07
	  2 llvmpipe-1               0.04     0.07
	  2 llvmpipe-10              0.04     0.07
	  2 llvmpipe-11              0.04     0.07
	  2 llvmpipe-12              0.04     0.07
	  2 llvmpipe-13              0.04     0.07
	  2 llvmpipe-14              0.04     0.07
	  2 llvmpipe-15              0.04     0.07
	  2 llvmpipe-2               0.04     0.07
	  2 llvmpipe-3               0.04     0.07
	  2 llvmpipe-4               0.04     0.07
	  2 llvmpipe-5               0.04     0.07
	  2 llvmpipe-6               0.04     0.07
	  2 llvmpipe-7               0.04     0.07
	  2 llvmpipe-8               0.04     0.07
	  2 llvmpipe-9               0.04     0.07
	  2 glxinfo                  0.03     0.06
	  2 glxinfo:cs0              0.03     0.05
	  2 glxinfo:disk$0           0.03     0.05
	  2 glxinfo:sh0              0.03     0.05
	  2 glxinfo:shlo0            0.03     0.05
	  4 rm                       0.00     0.31
	  1 lspci                    0.00     0.03
	 93 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	 10 gsettings                0.00     0.00
	 10 sed                      0.00     0.00
	  9 stty                     0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  4 bash                     0.00     0.00
	  4 gmain                    0.00     0.00
	  3 compress-xz              0.00     0.00
	  3 ls                       0.00     0.00
	  2 cc                       0.00     0.00
	  2 dconf worker             0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 ps                       0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sort                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

Core computation area shows we launch on all cores

      36804) compress-xz start=37.55 finish=65.65
        36805) xz start=37.56 finish=65.40
          36806) xz start=37.56 finish=37.56
            36807) xz start=37.56 finish=37.56
          36808) xz start=37.56 finish=37.56
          36809) xz start=37.56 finish=37.56
            36810) xz start=37.56 finish=37.56
            36811) sed start=37.56 finish=37.56
          36812) xz start=37.56 finish=37.57
            36813) ls start=37.56 finish=37.57
            36814) sed start=37.56 finish=37.57
          36815) xz start=37.57 finish=37.57
          36816) xz start=37.57 finish=37.57
            36817) xz start=37.57 finish=37.57
            36818) sed start=37.57 finish=37.57
          36819) xz start=37.57 finish=65.40
          36820) xz start=37.60 finish=65.40
          36821) xz start=37.62 finish=65.40
          36822) xz start=37.65 finish=65.40
          36823) xz start=37.68 finish=65.40
          36824) xz start=37.71 finish=65.40
          36825) xz start=37.73 finish=65.40
          36826) xz start=37.76 finish=65.40
          36827) xz start=37.79 finish=65.40
          36828) xz start=37.82 finish=65.40
          36829) xz start=37.85 finish=65.40
          36830) xz start=37.88 finish=65.40
          36831) xz start=37.92 finish=65.40
          36832) xz start=37.95 finish=65.40
          36833) xz start=37.98 finish=65.40
          36834) xz start=38.01 finish=65.40
      36835) sh start=65.65 finish=65.77
        36836) bash start=65.65 finish=65.77
          36837) rm start=65.65 finish=65.77