This test looks like a variation of the stream benchmark with copy/scale/triad and average. It uses floating point and integer data types. It looks to be mostly one or two threads.

Topdown profile shows a backend bound program

AMD metrics confirm this as memory bound with high rate of L2 access. Despite moving floating point, there are almost no floating point intructions.

elapsed              1557.486
on_cpu               0.055          # 0.88 / 16 cores
utime                1326.810
stime                41.987
nvcsw                68351          # 92.84%
nivcsw               5271           # 7.16%
inblock              32             # 0.02/sec
onblock              13816          # 8.87/sec
cpu-clock            2742989846478  # 2742.990 seconds
task-clock           2743086882030  # 2743.087 seconds
page faults          49322001       # 17980.474/sec
context switches     105687         # 38.528/sec
cpu migrations       532            # 0.194/sec
major page faults    2              # 0.001/sec
minor page faults    49321999       # 17980.473/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             147713856443   # 93.840 branches per 1000 inst
branch misses        7525003207     # 5.09% branch miss
conditional          114162095786   # 72.525 conditional branches per 1000 inst
indirect             98555933       # 0.063 indirect branches per 1000 inst
cpu-cycles           12641110483498 # 0.51 GHz
instructions         1554483337148  # 0.12 IPC low
slots                25288565605854 #
retiring             618318956718   #  2.4% ( 2.4%) low
-- ucode             775636722      #     0.0%
-- fastpath          617543319996   #     2.4%
frontend             841830393022   #  3.3% ( 3.3%) low
-- latency           497749780566   #     2.0%
-- bandwidth         344080612456   #     1.4%
backend              23818249605363 # 94.2% (94.2%) high
-- cpu               597500871971   #     2.4%
-- memory            23220748733392 #    91.8%
speculation          10018864501    #  0.0% ( 0.0%) low
-- branch mispredict 9753004805     #     0.0%
-- pipeline restart  265859696      #     0.0%
smt-contention       146921617      #  0.0% ( 0.0%)
cpu-cycles           12618813414012 # 0.51 GHz
instructions         1556603033250  # 0.12 IPC low
instructions         517961772956   # 980.903 l2 access per 1000 inst
l2 hit from l1       288657905828   # 40.52% l2 miss
l2 miss from l1      8939564541     #
l2 hit from l2 pf    22494472717    #
l3 hit from l2 pf    210256769      #
l3 miss from l2 pf   196707682284   #
instructions         517796842548   # 0.447 float per 1000 inst
float 512            100            # 0.000 AVX-512 per 1000 inst
float 256            650            # 0.000 AVX-256 per 1000 inst
float 128            231462915      # 0.447 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst
instructions         1554155337938  #
opcache              127266670070   # 81.888 opcache per 1000 inst
opcache miss         53635397087    # 42.1% opcache miss rate
l1 dTLB miss         13222015095    # 8.508 L1 dTLB per 1000 inst
l2 dTLB miss         12953589496    # 8.335 L2 dTLB per 1000 inst
instructions         1554100729697  #
icache               121460160159   # 78.155 icache per 1000 inst
icache miss          3758654716     #  3.1% icache miss rate
l1 iTLB miss         9921047        # 0.006 L1 iTLB per 1000 inst
l2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst
tlb flush            26472          # 0.000 TLB flush per 1000 inst

Intel metrics confirm memory stalls with 38% dram and 16% L3

elapsed              2181.952
on_cpu               0.057          # 0.92 / 16 cores
utime                1973.303
stime                28.338
nvcsw                19700          # 74.29%
nivcsw               6818           # 25.71%
inblock              1776           # 0.81/sec
onblock              2424           # 1.11/sec
cpu-clock            4002738390187  # 4002.738 seconds
task-clock           4002789217081  # 4002.789 seconds
page faults          49256733       # 12305.603/sec
context switches     61697          # 15.414/sec
cpu migrations       3389           # 0.847/sec
major page faults    15             # 0.004/sec
minor page faults    49256718       # 12305.599/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             133960761624   # 40.246 branches per 1000 inst
branch misses        53656019       # 0.04% branch miss
conditional          133960773688   # 40.246 conditional branches per 1000 inst
indirect             115513238      # 0.035 indirect branches per 1000 inst
slots                79291431925904 #
retiring             3147625843228  #  4.0% ( 4.0%) low
-- ucode             63566823852    #     0.1%
-- fastpath          3084059019376  #     3.9%
frontend             425391016890   #  0.5% ( 0.5%) low
-- latency           241218002295   #     0.3%
-- bandwidth         184173014595   #     0.2%
backend              75525111531388 # 95.3% (95.3%) high
-- cpu               6323241151248  #     8.0%
-- memory            69201870380140 #    87.3%
speculation          38772786029    #  0.0% ( 0.0%) low
-- branch mispredict 6942804511     #     0.0%
-- pipeline restart  31829981518    #     0.0%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           13254648321108 # 0.37 GHz
instructions         3310529227804  # 0.25 IPC low
l2 access            1554923758519  # 469.826 l2 access per 1000 inst
l2 miss              1118397500954  # 71.93% l2 miss
cpu-cycles           13269097487447 # 85.4% memory latency
load stalls          9214888159006  #  1.6% l1 bound
l1 miss              9003832128730  # 13.4% l2 bound
l2 miss              7221520942138  # 16.1% l3 bound
l3 miss              5088357775927  # 38.3% dram bound
store_stalls         2114460128716  # 15.9% store bound

Process overview shows ramsp as the execution process

441 processes
	 60 ramsmp                2668.41    76.81
	 68 clinfo                  17.54     5.60
	 38 vulkaninfo               1.14     1.15
	  6 php                      0.16     0.65
	  4 vulkani:disk$0           0.12     0.13
	  2 llvmpipe-0               0.06     0.07
	  2 llvmpipe-1               0.06     0.07
	  2 llvmpipe-10              0.06     0.07
	  2 llvmpipe-11              0.06     0.07
	  2 llvmpipe-12              0.06     0.07
	  2 llvmpipe-13              0.06     0.07
	  2 llvmpipe-14              0.06     0.07
	  2 llvmpipe-15              0.06     0.07
	  2 llvmpipe-2               0.06     0.07
	  2 llvmpipe-3               0.06     0.07
	  2 llvmpipe-4               0.06     0.07
	  2 llvmpipe-5               0.06     0.07
	  2 llvmpipe-6               0.06     0.07
	  2 llvmpipe-7               0.06     0.07
	  2 llvmpipe-8               0.06     0.07
	  2 llvmpipe-9               0.06     0.07
	  6 clang                    0.06     0.06
	  3 rocminfo                 0.03     0.00
	  1 lspci                    0.00     0.02
	102 sh                       0.00     0.00
	 30 ramspeed                 0.00     0.00
	 13 gcc                      0.00     0.00
	 11 gsettings                0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 glxinfo                  0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  3 gmain                    0.00     0.00
	  2 cc                       0.00     0.00
	  2 dconf worker             0.00     0.00
	  2 grep                     0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 setterm                  0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 ps                       0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
0 processes running

Computation blocks show two ramssmp processes

      239720) sh               cpu=11 start=5.59  finish=5.60 
        239721) stty             cpu=6 start=5.59  finish=5.60 
      239722) ramspeed         cpu=0 start=5.60  finish=51.38
        239723) ramsmp           cpu=3 start=5.60  finish=51.38
          239724) ramsmp           cpu=12 start=5.60  finish=51.38
      239727) ramspeed         cpu=11 start=55.38 finish=101.25
        239728) ramsmp           cpu=4 start=55.38 finish=101.25
          239729) ramsmp           cpu=14 start=55.39 finish=101.25
      239730) ramspeed         cpu=11 start=105.26 finish=151.06
        239731) ramsmp           cpu=4 start=105.26 finish=151.06
          239732) ramsmp           cpu=6 start=105.26 finish=151.06
      239733) sh               cpu=7 start=151.06 finish=151.06