Quicksilver is a proxy application developed by LLNL. The source is here. There are three workloads. The AMD processor gives much more stable results on this benchmark, while the Intel processor needs more iterations to reduce the deviation, particularly on the third workload. The system overview shows full usage of the CPU with a constant number of runnable processes.

The topdown metrics show a reasonable retirement rate that is limited mainly by backend stalls, with low frontend stalls.

The AMD metrics show floating-point code with a low branch miss rate and a small amount of L2 access.

elapsed              2243.218
on_cpu               0.962          # 15.40 / 16 cores
utime                34535.396
stime                4.859
nvcsw                47991          # 14.52%
nivcsw               282447         # 85.48%
inblock              0              # 0.00/sec
onblock              14152          # 6.31/sec
cpu-clock            34541446980854 # 34541.447 seconds
task-clock           34541661660234 # 34541.662 seconds
page faults          928862         # 26.891/sec
context switches     341458         # 9.885/sec
cpu migrations       748            # 0.022/sec
major page faults    2              # 0.000/sec
minor page faults    928860         # 26.891/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             33807731553674 # 139.022 branches per 1000 inst
branch misses        289158326921   # 0.86% branch miss
conditional          24835473487220 # 102.127 conditional branches per 1000 inst
indirect             93366335290    # 0.384 indirect branches per 1000 inst
cpu-cycles           301810730461036 # 3.82 GHz
instructions         557270879640021 # 1.85 IPC
slots                603566322131106 #
retiring             200786255732776 # 33.3% (53.8%)
-- ucode             119011016871   #     0.0%
-- fastpath          200667244715905 #    33.2%
frontend             25741282012149 #  4.3% ( 6.9%)
-- latency           14193596857908 #     2.4%
-- bandwidth         11547685154241 #     1.9%
backend              131803639022099 # 21.8% (35.3%)
-- cpu               65456513788431 #    10.8%
-- memory            66347125233668 #    11.0%
speculation          15093195866873 #  2.5% ( 4.0%)
-- branch mispredict 14790109025347 #     2.5%
-- pipeline restart  303086841526   #     0.1%
smt-contention       230141589715974 # 38.1% ( 0.0%)
cpu-cycles           136527883836719 # 3.79 GHz
instructions         243228511528062 # 1.78 IPC
instructions         81079749463259 # 22.298 l2 access per 1000 inst
l2 hit from l1       1388992535076  # 6.77% l2 miss
l2 miss from l1      58670581772    #
l2 hit from l2 pf    355183403163   #
l3 hit from l2 pf    54030832873    #
l3 miss from l2 pf   9713216835     #
instructions         81035206592148 # 181.052 float per 1000 inst
float 512            63             # 0.000 AVX-512 per 1000 inst
float 256            600            # 0.000 AVX-256 per 1000 inst
float 128            14671559696429 # 181.052 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         5              # 0.000 scalar per 1000 inst
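
The annotations after each `#` in the dump above are ratios of the raw counters. The sketch below recomputes a few of them from the AMD numbers as a sanity check. The formulas are assumptions inferred from the figures, not taken from the profiling tool: on_cpu is treated as (utime + stime) / elapsed / cores, and the parenthesized topdown percentage is treated as the share of slots left after removing smt-contention. The variable names are illustrative only.

```python
# Raw counters copied from the AMD run above.
elapsed = 2243.218            # seconds
utime, stime = 34535.396, 4.859
ncores = 16                   # from the "/ 16 cores" annotation

cycles = 301810730461036
instructions = 557270879640021
branches = 33807731553674
branch_misses = 289158326921

slots = 603566322131106
retiring = 200786255732776
smt_contention = 230141589715974

# Assumed formula: CPU time consumed per second of wall time.
busy_cores = (utime + stime) / elapsed
on_cpu = busy_cores / ncores

ipc = instructions / cycles
branch_miss_pct = 100 * branch_misses / branches

# Retiring as a share of all slots, and (assumed) as a share of the
# slots that were not lost to the sibling SMT thread.
retiring_pct = 100 * retiring / slots
retiring_pct_no_smt = 100 * retiring / (slots - smt_contention)

print(f"busy cores        {busy_cores:.2f} of {ncores}")
print(f"on_cpu            {on_cpu:.3f}")
print(f"IPC               {ipc:.2f}")
print(f"branch miss       {branch_miss_pct:.2f}%")
print(f"retiring          {retiring_pct:.1f}% ({retiring_pct_no_smt:.1f}%)")
```

Under these assumptions the results agree with the annotations in the dump (15.40 cores, 1.85 IPC, 0.86% branch miss, 33.3% and 53.8% retiring), which suggests the parenthesized column is the more useful one to compare against a run with no SMT contention.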

The Intel metrics reflect the longer runtime: 5771 seconds elapsed versus 2243 seconds on AMD.

elapsed              5771.167
on_cpu               0.966          # 15.45 / 16 cores
utime                89151.589
stime                8.509
nvcsw                200322         # 25.08%
nivcsw               598411         # 74.92%
inblock              0              # 0.00/sec
onblock              3976           # 0.69/sec
cpu-clock            89157847905491 # 89157.848 seconds
task-clock           89158226755683 # 89158.227 seconds
page faults          1424428        # 15.976/sec
context switches     827367         # 9.280/sec
cpu migrations       60452          # 0.678/sec
major page faults    0              # 0.000/sec
minor page faults    1424428        # 15.976/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             50932134417907 # 143.312 branches per 1000 inst
branch misses        698619389499   # 1.37% branch miss
conditional          50932134437843 # 143.312 conditional branches per 1000 inst
indirect             12618507409368 # 35.506 indirect branches per 1000 inst
slots                440838563740580 #
retiring             173635425896210 # 39.4% (39.4%)
-- ucode             7860615540594  #     1.8%
-- fastpath          165774810355616 #    37.6%
frontend             60186415101796 # 13.7% (13.7%)
-- latency           36475838824133 #     8.3%
-- bandwidth         23710576277663 #     5.4%
backend              168243321796642 # 38.2% (38.2%)
-- cpu               122576607097747 #    27.8%
-- memory            45666714698895 #    10.4%
speculation          39460472909226 #  9.0% ( 9.0%)
-- branch mispredict 39056742849763 #     8.9%
-- pipeline restart  403730059463   #     0.1%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           285841270645200 # 3.06 GHz
instructions         351599636598954 # 1.23 IPC
l2 access            1962502029154  # 11.053 l2 access per 1000 inst
l2 miss              448311834186   # 22.84% l2 miss
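
To put the two runs side by side, the following sketch recomputes the Intel ratios from the raw counters and the elapsed-time ratio between the two machines. It only uses numbers that appear in the dumps above. The variable names are illustrative, and the formulas are the obvious ratios rather than anything confirmed from the profiling tool.

```python
# Elapsed wall time for each run, in seconds.
amd_elapsed = 2243.218
intel_elapsed = 5771.167

# Raw counters copied from the Intel run above.
intel_cycles = 285841270645200
intel_instructions = 351599636598954
intel_branches = 50932134417907
intel_branch_misses = 698619389499
intel_l2_access = 1962502029154
intel_l2_miss = 448311834186

# How much longer the Intel run took than the AMD run.
slowdown = intel_elapsed / amd_elapsed

intel_ipc = intel_instructions / intel_cycles
intel_branch_miss_pct = 100 * intel_branch_misses / intel_branches
intel_l2_miss_pct = 100 * intel_l2_miss / intel_l2_access

print(f"Intel / AMD elapsed  {slowdown:.2f}x")
print(f"Intel IPC            {intel_ipc:.2f}")
print(f"Intel branch miss    {intel_branch_miss_pct:.2f}%")
print(f"Intel L2 miss        {intel_l2_miss_pct:.2f}%")
```

The instruction counts of the two runs are not compared directly here, since the two processors report different totals for the same benchmark.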

The process overview is straightforward.

499 processes
	144 qs                   552276.80    51.29
	 68 clinfo                  15.87     7.65
	 38 vulkaninfo               0.76     1.52
	  6 php                      0.24     0.12
	  6 glxinfo:gdrv0            0.12     0.12
	  4 vulkani:disk$0           0.08     0.16
	  6 clang                    0.07     0.05
	  2 glxinfo                  0.06     0.06
	  2 glxinfo:cs0              0.06     0.06
	  2 glxinfo:disk$0           0.06     0.05
	  2 glxinfo:sh0              0.06     0.04
	  2 glxinfo:shlo0            0.06     0.04
	  2 llvmpipe-0               0.04     0.08
	  2 llvmpipe-1               0.04     0.08
	  2 llvmpipe-10              0.04     0.08
	  2 llvmpipe-11              0.04     0.08
	  2 llvmpipe-12              0.04     0.08
	  2 llvmpipe-13              0.04     0.08
	  2 llvmpipe-14              0.04     0.08
	  2 llvmpipe-15              0.04     0.08
	  2 llvmpipe-2               0.04     0.08
	  2 llvmpipe-3               0.04     0.08
	  2 llvmpipe-4               0.04     0.08
	  2 llvmpipe-5               0.04     0.08
	  2 llvmpipe-6               0.04     0.08
	  2 llvmpipe-7               0.04     0.08
	  2 llvmpipe-8               0.04     0.08
	  2 llvmpipe-9               0.04     0.08
	  1 lspci                    0.01     0.02
	  3 rocminfo                 0.00     0.03
	  1 ps                       0.00     0.01
	 86 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	 11 gsettings                0.00     0.00
	  9 quicksilver              0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  4 gmain                    0.00     0.00
	  2 cc                       0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dconf worker             0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

The compute structure is as follows.

      105523) quicksilver      cpu=5 start=6.09  finish=441.82
        105524) qs               cpu=6 start=6.10  finish=441.82
          105525) qs               cpu=5 start=6.29  finish=441.82
          105526) qs               cpu=8 start=6.29  finish=441.82
          105527) qs               cpu=3 start=6.29  finish=441.82
          105528) qs               cpu=7 start=6.29  finish=441.82
          105529) qs               cpu=9 start=6.29  finish=441.82
          105530) qs               cpu=4 start=6.29  finish=441.82
          105531) qs               cpu=14 start=6.29  finish=441.82
          105532) qs               cpu=2 start=6.29  finish=441.82
          105533) qs               cpu=15 start=6.29  finish=441.82
          105534) qs               cpu=0 start=6.29  finish=441.82
          105535) qs               cpu=1 start=6.29  finish=441.82
          105536) qs               cpu=10 start=6.29  finish=441.82
          105537) qs               cpu=11 start=6.29  finish=441.82
          105538) qs               cpu=12 start=6.29  finish=441.82
          105539) qs               cpu=13 start=6.29  finish=441.82
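
The tree above can be checked mechanically. The sketch below parses the lines with a regular expression and confirms that the `qs` processes cover every CPU exactly once and run for essentially the whole measured interval. The parser is a hypothetical helper written for this line format, not part of the profiling tool.

```python
import re

# Process tree copied from the listing above (indentation dropped).
tree = """\
105523) quicksilver      cpu=5 start=6.09  finish=441.82
105524) qs               cpu=6 start=6.10  finish=441.82
105525) qs               cpu=5 start=6.29  finish=441.82
105526) qs               cpu=8 start=6.29  finish=441.82
105527) qs               cpu=3 start=6.29  finish=441.82
105528) qs               cpu=7 start=6.29  finish=441.82
105529) qs               cpu=9 start=6.29  finish=441.82
105530) qs               cpu=4 start=6.29  finish=441.82
105531) qs               cpu=14 start=6.29  finish=441.82
105532) qs               cpu=2 start=6.29  finish=441.82
105533) qs               cpu=15 start=6.29  finish=441.82
105534) qs               cpu=0 start=6.29  finish=441.82
105535) qs               cpu=1 start=6.29  finish=441.82
105536) qs               cpu=10 start=6.29  finish=441.82
105537) qs               cpu=11 start=6.29  finish=441.82
105538) qs               cpu=12 start=6.29  finish=441.82
105539) qs               cpu=13 start=6.29  finish=441.82
"""

# One match per line: pid, command name, cpu, start time, finish time.
pattern = re.compile(
    r"(\d+)\)\s+(\S+)\s+cpu=(\d+)\s+start=([\d.]+)\s+finish=([\d.]+)"
)
rows = [
    (int(pid), name, int(cpu), float(start), float(finish))
    for pid, name, cpu, start, finish in pattern.findall(tree)
]

# Keep only the qs entries (the quicksilver wrapper is excluded).
qs = [row for row in rows if row[1] == "qs"]
cpus = sorted(row[2] for row in qs)
durations = {round(row[4] - row[3], 2) for row in qs}

print(f"{len(qs)} qs entries on CPUs {cpus}")
print(f"lifetimes in seconds: {sorted(durations)}")
```

The parent `qs` and its 15 children land on 16 distinct CPUs, which is consistent with the full CPU usage noted in the system overview.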