cp2k is a molecular dynamics package for quantum chemistry and solid state physics. There are three workloads but the second one fails. These run with MPI and appear to run only on one thread per hyper-threaded core.

Topdown profile shows slightly different profiles for workloads but generally with high amounts of backend stalls and moderate retirement rate.

AMD metrics show floating point code and low level of L2 cache access.

elapsed              314.596
on_cpu               0.403          # 6.44 / 16 cores
utime                1950.571
stime                76.618
nvcsw                37942          # 85.96%
nivcsw               6195           # 14.04%
inblock              1393752        # 4430.29/sec
onblock              4130936        # 13130.91/sec
cpu-clock            2250486500161  # 2250.487 seconds
task-clock           2250512015123  # 2250.512 seconds
page faults          17296215       # 7685.458/sec
context switches     59023          # 26.226/sec
cpu migrations       3878           # 1.723/sec
major page faults    16478          # 7.322/sec
minor page faults    17279723       # 7678.130/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             2797017555498  # 118.729 branches per 1000 inst
branch misses        16332501305    # 0.58% branch miss
conditional          2093263099042  # 88.856 conditional branches per 1000 inst
indirect             103483474302   # 4.393 indirect branches per 1000 inst
cpu-cycles           9417383943085  # 1.86 GHz
instructions         24068965945112 # 2.56 IPC
slots                18839421656568 #
retiring             8130515809241  # 43.2% (43.2%)
-- ucode             18296888116    #     0.1%
-- fastpath          8112218921125  #    43.1%
frontend             1702876357108  #  9.0% ( 9.1%)
-- latency           812901365928   #     4.3%
-- bandwidth         889974991180   #     4.7%
backend              8497846096167  # 45.1% (45.2%)
-- cpu               2735878442045  #    14.5%
-- memory            5761967654122  #    30.6%
speculation          483239296612   #  2.6% ( 2.6%)
-- branch mispredict 466874614374   #     2.5%
-- pipeline restart  16364682238    #     0.1%
smt-contention       24935254612    #  0.1% ( 0.0%)
cpu-cycles           9414659187497  # 1.86 GHz
instructions         23998277195786 # 2.55 IPC
instructions         7994669047541  # 14.114 l2 access per 1000 inst
l2 hit from l1       85551514714    # 17.78% l2 miss
l2 miss from l1      8305630626     #
l2 hit from l2 pf    15533727367    #
l3 hit from l2 pf    4669322426     #
l3 miss from l2 pf   7084179657     #
instructions         7998409850474  # 230.289 float per 1000 inst
float 512            92             # 0.000 AVX-512 per 1000 inst
float 256            2409186741     # 0.301 AVX-256 per 1000 inst
float 128            1839534443120  # 229.988 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst
instructions         23840085961713 #
opcache              3733895544700  # 156.623 opcache per 1000 inst
opcache miss         139454228115   #  3.7% opcache miss rate
l1 dTLB miss         18022412303    # 0.756 L1 dTLB per 1000 inst
l2 dTLB miss         1346924796     # 0.056 L2 dTLB per 1000 inst
instructions         23808482127795 #
icache               215006604128   # 9.031 icache per 1000 inst
icache miss          35602086813    # 16.6% icache miss rate
l1 iTLB miss         1346273170     # 0.057 L1 iTLB per 1000 inst
l2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst
tlb flush            509622         # 0.000 TLB flush per 1000 inst

Process overview shows cp2k.popt as the primary wworking process

329 processes
	 72 cp2k.popt             6240.32   291.46
	 38 vulkaninfo               0.95     1.52
	 12 mpirun                   0.49     3.61
	  6 glxinfo:gdrv0            0.15     0.16
	  4 vulkani:disk$0           0.10     0.16
	  2 glxinfo                  0.08     0.06
	  2 glxinfo:cs0              0.08     0.06
	  2 glxinfo:disk$0           0.08     0.06
	  2 glxinfo:shlo0            0.08     0.06
	  2 glxinfo:sh0              0.07     0.06
	  6 php                      0.06     3.52
	  2 llvmpipe-0               0.05     0.08
	  2 llvmpipe-1               0.05     0.08
	  2 llvmpipe-10              0.05     0.08
	  2 llvmpipe-11              0.05     0.08
	  2 llvmpipe-12              0.05     0.08
	  2 llvmpipe-13              0.05     0.08
	  2 llvmpipe-14              0.05     0.08
	  2 llvmpipe-15              0.05     0.08
	  2 llvmpipe-2               0.05     0.08
	  2 llvmpipe-3               0.05     0.08
	  2 llvmpipe-4               0.05     0.08
	  2 llvmpipe-5               0.05     0.08
	  2 llvmpipe-6               0.05     0.08
	  2 llvmpipe-7               0.05     0.08
	  2 llvmpipe-8               0.05     0.08
	  2 llvmpipe-9               0.05     0.08
	  1 lspci                    0.00     0.02
	  1 ps                       0.00     0.01
	 65 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	  9 gsettings                0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  5 gmain                    0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  3 cp2k                     0.00     0.00
	  2 cc                       0.00     0.00
	  2 dconf worker             0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 python3                  0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

Computation blocks look as follows

      461986) cp2k             cpu=5 start=5.29  finish=102.53
        461987) mpirun           cpu=13 start=5.29  finish=102.53
          461988) mpirun           cpu=8 start=5.84  finish=102.53
          461989) mpirun           cpu=7 start=6.33  finish=102.53
          461990) mpirun           cpu=10 start=6.33  finish=102.53
          461991) cp2k.popt        cpu=6 start=6.33  finish=102.50
            461996) cp2k.popt        cpu=0 start=6.35  finish=102.50
            462000) cp2k.popt        cpu=7 start=6.36  finish=102.50
          461992) cp2k.popt        cpu=1 start=6.34  finish=102.50
            461998) cp2k.popt        cpu=4 start=6.35  finish=102.50
            462002) cp2k.popt        cpu=4 start=6.36  finish=102.50
          461993) cp2k.popt        cpu=12 start=6.34  finish=102.50
            462001) cp2k.popt        cpu=5 start=6.36  finish=102.50
            462005) cp2k.popt        cpu=0 start=6.36  finish=102.50
          461994) cp2k.popt        cpu=11 start=6.34  finish=102.50
            462004) cp2k.popt        cpu=3 start=6.36  finish=102.50
            462007) cp2k.popt        cpu=9 start=6.37  finish=102.50
          461995) cp2k.popt        cpu=2 start=6.35  finish=102.50
            462006) cp2k.popt        cpu=12 start=6.37  finish=102.50
            462009) cp2k.popt        cpu=8 start=6.37  finish=102.50
          461997) cp2k.popt        cpu=14 start=6.35  finish=102.50
            462008) cp2k.popt        cpu=8 start=6.37  finish=102.50
            462011) cp2k.popt        cpu=1 start=6.37  finish=102.50
          461999) cp2k.popt        cpu=5 start=6.36  finish=102.50
            462010) cp2k.popt        cpu=0 start=6.37  finish=102.50
            462013) cp2k.popt        cpu=10 start=6.38  finish=102.49
          462003) cp2k.popt        cpu=15 start=6.36  finish=102.50
            462012) cp2k.popt        cpu=8 start=6.38  finish=102.50
            462014) cp2k.popt        cpu=3 start=6.38  finish=102.49