An acoustic modeling program with five workloads. It keeps the CPU mostly busy (on_cpu of 0.868 on AMD and 0.715 on Intel) and runs across all 16 cores.

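Topdown metrics. On AMD the largest share of slots goes to backend stalls (39.3%, mostly memory), frontend stalls are few (4.7%), and the retirement rate is moderate (28.2%). On Intel, retiring is the largest share (57.0%), with backend stalls at 35.8%.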

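AMD metrics show floating-point-heavy code (about 289 float operations per 1000 instructions, essentially all 128-bit), few L2 accesses (about 32 per 1000 instructions), and a low branch rate (about 69 branches per 1000 instructions).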

elapsed              3762.238
on_cpu               0.868          # 13.89 / 16 cores
utime                52056.538
stime                185.652
nvcsw                1302099        # 75.38%
nivcsw               425392         # 24.62%
inblock              2760           # 0.73/sec
onblock              54832224       # 14574.36/sec
cpu-clock            52248285770710 # 52248.286 seconds
task-clock           52249743951891 # 52249.744 seconds
page faults          12748154       # 243.985/sec
context switches     1745081        # 33.399/sec
cpu migrations       26810          # 0.513/sec
major page faults    4149           # 0.079/sec
minor page faults    12744005       # 243.906/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             23504693240647 # 68.840 branches per 1000 inst
branch misses        13097079745    # 0.06% branch miss
conditional          17987708794657 # 52.682 conditional branches per 1000 inst
indirect             655636174293   # 1.920 indirect branches per 1000 inst
cpu-cycles           210656764084175 # 3.46 GHz
instructions         342832809611451 # 1.63 IPC
slots                421307158364352 #
retiring             118910637523092 # 28.2% (39.0%)
-- ucode             620443396647   #     0.1%
-- fastpath          118290194126445 #    28.1%
frontend             19634413076632 #  4.7% ( 6.4%)
-- latency           7202348283648  #     1.7%
-- bandwidth         12432064792984 #     3.0%
backend              165406079832537 # 39.3% (54.2%)
-- cpu               50571970115083 #    12.0%
-- memory            114834109717454 #    27.3%
speculation          1024838848568  #  0.2% ( 0.3%)
-- branch mispredict 254966966749   #     0.1%
-- pipeline restart  769871881819   #     0.2%
smt-contention       116330770486487 # 27.6% ( 0.0%)
cpu-cycles           176597097461140 # 3.47 GHz
instructions         285399042943448 # 1.62 IPC
instructions         95129255715027 # 31.569 l2 access per 1000 inst
l2 hit from l1       1993274139610  # 10.04% l2 miss
l2 miss from l1      53313585716    #
l2 hit from l2 pf    761553213145   #
l3 hit from l2 pf    23601262979    #
l3 miss from l2 pf   224661217208   #
instructions         95092350669095 # 288.511 float per 1000 inst
float 512            378            # 0.000 AVX-512 per 1000 inst
float 256            1468           # 0.000 AVX-256 per 1000 inst
float 128            27435143678913 # 288.511 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         82189          # 0.000 scalar per 1000 inst
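
The annotated ratios in the listing above can be checked against the raw counters. The following is a minimal sketch, assuming that on_cpu is (utime + stime) / (elapsed * cores), that IPC is instructions divided by cpu-cycles, and that the branch miss percentage is misses divided by branches. The function names are illustrative and not part of the measurement tool.

```python
# Raw AMD values copied from the listing above.
ELAPSED = 3762.238        # wall-clock seconds
UTIME = 52056.538         # user CPU seconds
STIME = 185.652           # system CPU seconds
CORES = 16                # from the "/ 16 cores" annotation

CYCLES = 210656764084175
INSTRUCTIONS = 342832809611451
BRANCHES = 23504693240647
BRANCH_MISSES = 13097079745


def on_cpu(utime, stime, elapsed, cores):
    """Fraction of available core-seconds spent on CPU (assumed definition)."""
    return (utime + stime) / (elapsed * cores)


def ipc(instructions, cycles):
    """Instructions retired per cycle."""
    return instructions / cycles


def miss_percent(misses, branches):
    """Branch misses as a percentage of all branches."""
    return 100.0 * misses / branches


frac = on_cpu(UTIME, STIME, ELAPSED, CORES)
print(f"on_cpu        {frac:.3f}  # {frac * CORES:.2f} / {CORES} cores")
print(f"instructions  {ipc(INSTRUCTIONS, CYCLES):.2f} IPC")
print(f"branch misses {miss_percent(BRANCH_MISSES, BRANCHES):.2f}% branch miss")
```

Under these assumptions the sketch reproduces the 0.868, 13.89, 1.63, and 0.06% annotations. The same functions applied to the Intel values give 0.715 and 3.42 IPC.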

Intel metrics

elapsed              2703.437
on_cpu               0.715          # 11.43 / 16 cores
utime                30811.972
stime                98.515
nvcsw                479608         # 86.30%
nivcsw               76110          # 13.70%
inblock              5472           # 2.02/sec
onblock              23388688       # 8651.47/sec
cpu-clock            30911106052904 # 30911.106 seconds
task-clock           30911215860555 # 30911.216 seconds
page faults          7715342        # 249.597/sec
context switches     568289         # 18.385/sec
cpu migrations       63222          # 2.045/sec
major page faults    2773           # 0.090/sec
minor page faults    7712569        # 249.507/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             17379440069501 # 80.716 branches per 1000 inst
branch misses        46914036018    # 0.27% branch miss
conditional          17379440133789 # 80.716 conditional branches per 1000 inst
indirect             3085933427959  # 14.332 indirect branches per 1000 inst
slots                192018763461350 #
retiring             109512682772057 # 57.0% (57.0%)
-- ucode             8868767675421  #     4.6%
-- fastpath          100643915096636 #    52.4%
frontend             8584569166548  #  4.5% ( 4.5%)
-- latency           3782079896939  #     2.0%
-- bandwidth         4802489269609  #     2.5%
backend              68661134260813 # 35.8% (35.8%)
-- cpu               37059672708408 #    19.3%
-- memory            31601461552405 #    16.5%
speculation          5165879176000  #  2.7% ( 2.7%)
-- branch mispredict 3677065606214  #     1.9%
-- pipeline restart  1488813569786  #     0.8%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           83175153659116 # 1.92 GHz
instructions         284864692479299 # 3.42 IPC
l2 access            759190674835   # 6.941 l2 access per 1000 inst
l2 miss              298370738806   # 39.30% l2 miss
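
The topdown percentages in both listings can be recomputed from the raw slot counts. In the AMD listing the parenthesized value appears to be the share once smt-contention slots are removed from the denominator. That reading is inferred from the numbers and is not stated by the tool output, so treat the sketch below as an assumption. The name `topdown_shares` is illustrative.

```python
def topdown_shares(slots, smt_contention, categories):
    """Return {name: (raw_pct, pct_excluding_smt)} for top-level categories."""
    usable = slots - smt_contention  # assumed denominator for the parenthesized column
    return {
        name: (100.0 * count / slots, 100.0 * count / usable)
        for name, count in categories.items()
    }


amd = topdown_shares(
    slots=421307158364352,
    smt_contention=116330770486487,
    categories={
        "retiring": 118910637523092,
        "frontend": 19634413076632,
        "backend": 165406079832537,
        "speculation": 1024838848568,
    },
)

intel = topdown_shares(
    slots=192018763461350,
    smt_contention=0,
    categories={
        "retiring": 109512682772057,
        "frontend": 8584569166548,
        "backend": 68661134260813,
        "speculation": 5165879176000,
    },
)

for label, shares in (("AMD", amd), ("Intel", intel)):
    for name, (raw, adjusted) in shares.items():
        print(f"{label:5s} {name:12s} {raw:5.1f}% ({adjusted:5.1f}%)")
```

With smt-contention at zero on Intel, the two columns coincide, which matches the listing. On AMD the adjusted retiring share rises from 28.2% to 39.0%, which is the more comparable figure when setting the two systems side by side.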

Process overview

1805 processes
	480 xspecfem3D           107646.66   215.23
	360 xgenerate_datab        495.93    68.34
	 68 clinfo                  16.53     6.33
	180 mpirun                   7.70    20.65
	 15 xdecompose_mesh          7.14     0.41
	 38 vulkaninfo               0.95     1.34
	  3 awk                      0.23     0.02
	  6 php                      0.20     0.34
	  6 glxinfo:gdrv0            0.13     0.06
	  4 vulkani:disk$0           0.11     0.14
	  2 glxinfo                  0.07     0.03
	  2 glxinfo:cs0              0.07     0.02
	  2 glxinfo:disk$0           0.07     0.02
	  2 glxinfo:sh0              0.07     0.02
	  2 glxinfo:shlo0            0.07     0.02
	  6 clang                    0.06     0.06
	  2 llvmpipe-0               0.05     0.07
	  2 llvmpipe-1               0.05     0.07
	  2 llvmpipe-10              0.05     0.07
	  2 llvmpipe-11              0.05     0.07
	  2 llvmpipe-12              0.05     0.07
	  2 llvmpipe-13              0.05     0.07
	  2 llvmpipe-14              0.05     0.07
	  2 llvmpipe-15              0.05     0.07
	  2 llvmpipe-2               0.05     0.07
	  2 llvmpipe-3               0.05     0.07
	  2 llvmpipe-4               0.05     0.07
	  2 llvmpipe-5               0.05     0.07
	  2 llvmpipe-6               0.05     0.07
	  2 llvmpipe-7               0.05     0.07
	  2 llvmpipe-8               0.05     0.07
	  2 llvmpipe-9               0.05     0.07
	 63 run_this_exampl          0.04     0.02
	  3 rocminfo                 0.03     0.00
	 45 rm                       0.00     2.01
	  1 lspci                    0.00     0.02
	 90 sh                       0.00     0.00
	 51 mkdir                    0.00     0.00
	 49 grep                     0.00     0.00
	 45 cp                       0.00     0.00
	 45 ln                       0.00     0.00
	 33 cut                      0.00     0.00
	 31 date                     0.00     0.00
	 16 sed                      0.00     0.00
	 15 cat                      0.00     0.00
	 15 gsettings                0.00     0.00
	 15 specfem3d                0.00     0.00
	 13 gcc                      0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 create_tomograp          0.00     0.00
	  6 llvm-link                0.00     0.00
	  6 mv                       0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  2 cc                       0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 gmain                    0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 ps                       0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
48 maximum processes
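
To see how strongly one program dominates the table, the rows can be parsed and ranked. This is a rough sketch over an excerpt of the top rows only. The listing does not label its two numeric columns, so the code refers to them as `col_a` and `col_b`; reading them as user and system time would be an assumption.

```python
# Excerpt of the largest rows from the process overview above.
ROWS = """\
480 xspecfem3D           107646.66   215.23
360 xgenerate_datab        495.93    68.34
 68 clinfo                  16.53     6.33
180 mpirun                   7.70    20.65
 15 xdecompose_mesh          7.14     0.41
 38 vulkaninfo               0.95     1.34
"""


def parse_rows(text):
    """Parse 'count name col_a col_b' rows into tuples, skipping other lines."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 4:
            continue
        count, name, col_a, col_b = parts
        rows.append((name, int(count), float(col_a), float(col_b)))
    return rows


rows = parse_rows(ROWS)
total_a = sum(row[2] for row in rows)
top = max(rows, key=lambda row: row[2])
share = top[2] / total_a

print(f"{top[0]} accounts for {100.0 * share:.1f}% of the first numeric column")
```

The share is computed over the excerpt only. The remaining rows in the table are all below 1.0 in the first numeric column, so including them would change the result very little.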

Computation structure

      2601293) specfem3d        cpu=5 start=5.84  finish=77.48
        2601294) rm               cpu=2 start=5.84  finish=5.84 
        2601295) sed              cpu=6 start=5.84  finish=5.85 
        2601296) run_this_exampl  cpu=15 start=5.85  finish=77.48
          2601297) date             cpu=1 start=5.85  finish=5.85 
          2601298) run_this_exampl  cpu=2 start=5.85  finish=5.85 
          2601299) mkdir            cpu=3 start=5.85  finish=5.85 
          2601300) rm               cpu=6 start=5.85  finish=5.92 
          2601301) mkdir            cpu=3 start=5.92  finish=5.92 
          2601302) rm               cpu=9 start=5.92  finish=5.92 
          2601303) ln               cpu=2 start=5.92  finish=5.92 
          2601304) ln               cpu=9 start=5.92  finish=5.93 
          2601305) ln               cpu=2 start=5.93  finish=5.93 
          2601306) cp               cpu=3 start=5.93  finish=5.93 
          2601307) cp               cpu=9 start=5.93  finish=5.93 
          2601308) cp               cpu=2 start=5.93  finish=5.93 
          2601309) run_this_exampl  cpu=3 start=5.93  finish=5.94 
            2601310) grep             cpu=14 start=5.94  finish=5.94 
            2601311) grep             cpu=4 start=5.94  finish=5.94 
            2601312) cut              cpu=0 start=5.94  finish=5.94 
          2601313) run_this_exampl  cpu=2 start=5.94  finish=5.94 
            2601314) grep             cpu=9 start=5.94  finish=5.94 
            2601315) cut              cpu=5 start=5.94  finish=5.94 
          2601316) mkdir            cpu=3 start=5.94  finish=5.94 
          2601317) xdecompose_mesh  cpu=0 start=5.95  finish=6.39 
          2601318) mpirun           cpu=8 start=6.39  finish=8.25 
            2601322) mpirun           cpu=0 start=6.97  finish=8.25 
            2601323) mpirun           cpu=11 start=6.97  finish=6.97 
            2601324) mpirun           cpu=4 start=6.99  finish=8.24 
            2601325) mpirun           cpu=13 start=7.48  finish=8.24 
            2601326) mpirun           cpu=1 start=7.48  finish=8.24 
            2601327) xgenerate_datab  cpu=12 start=7.49  finish=8.23 
              2601329) xgenerate_datab  cpu=13 start=7.49  finish=8.23 
              2601331) xgenerate_datab  cpu=7 start=7.50  finish=8.23 
            2601328) xgenerate_datab  cpu=4 start=7.49  finish=8.23 
              2601332) xgenerate_datab  cpu=8 start=7.50  finish=8.23 
              2601335) xgenerate_datab  cpu=14 start=7.50  finish=8.23 
            2601330) xgenerate_datab  cpu=11 start=7.50  finish=8.23 
              2601334) xgenerate_datab  cpu=1 start=7.50  finish=8.23 
              2601338) xgenerate_datab  cpu=5 start=7.51  finish=8.23 
            2601333) xgenerate_datab  cpu=6 start=7.50  finish=8.23 
              2601337) xgenerate_datab  cpu=3 start=7.51  finish=8.23 
              2601340) xgenerate_datab  cpu=15 start=7.51  finish=8.23 
            2601336) xgenerate_datab  cpu=0 start=7.51  finish=8.23 
              2601341) xgenerate_datab  cpu=13 start=7.51  finish=8.23 
              2601344) xgenerate_datab  cpu=12 start=7.52  finish=8.23 
            2601339) xgenerate_datab  cpu=13 start=7.51  finish=8.23 
              2601343) xgenerate_datab  cpu=15 start=7.52  finish=8.23 
              2601347) xgenerate_datab  cpu=3 start=7.53  finish=8.23 
            2601342) xgenerate_datab  cpu=1 start=7.52  finish=8.23 
              2601346) xgenerate_datab  cpu=2 start=7.52  finish=8.23 
              2601349) xgenerate_datab  cpu=2 start=7.53  finish=8.23 
            2601345) xgenerate_datab  cpu=7 start=7.52  finish=8.23 
              2601348) xgenerate_datab  cpu=4 start=7.53  finish=8.23 
              2601350) xgenerate_datab  cpu=8 start=7.54  finish=8.23 
          2601351) mpirun           cpu=7 start=8.28  finish=77.45
            2601356) mpirun           cpu=11 start=8.84  finish=77.45
            2601357) mpirun           cpu=13 start=8.84  finish=8.84 
            2601358) mpirun           cpu=2 start=8.86  finish=77.44
            2601360) mpirun           cpu=0 start=9.36  finish=77.44
            2601361) mpirun           cpu=9 start=9.36  finish=77.45
            2601362) xspecfem3D       cpu=10 start=9.37  finish=77.44
              2601364) xspecfem3D       cpu=2 start=9.38  finish=77.44
              2601367) xspecfem3D       cpu=2 start=9.38  finish=77.44
              2601389) xspecfem3D       cpu=3 start=9.66  finish=77.44
            2601363) xspecfem3D       cpu=6 start=9.38  finish=77.44
              2601366) xspecfem3D       cpu=3 start=9.38  finish=77.44
              2601370) xspecfem3D       cpu=15 start=9.39  finish=77.44
              2601388) xspecfem3D       cpu=0 start=9.66  finish=77.44
            2601365) xspecfem3D       cpu=13 start=9.38  finish=77.44
              2601369) xspecfem3D       cpu=15 start=9.39  finish=77.44
              2601373) xspecfem3D       cpu=9 start=9.39  finish=77.44
              2601386) xspecfem3D       cpu=15 start=9.66  finish=77.44
            2601368) xspecfem3D       cpu=8 start=9.39  finish=77.44
              2601372) xspecfem3D       cpu=9 start=9.39  finish=77.44
              2601376) xspecfem3D       cpu=3 start=9.40  finish=77.44
              2601392) xspecfem3D       cpu=5 start=9.67  finish=77.44
            2601371) xspecfem3D       cpu=4 start=9.39  finish=77.44
              2601375) xspecfem3D       cpu=14 start=9.40  finish=77.44
              2601380) xspecfem3D       cpu=4 start=9.40  finish=77.44
              2601391) xspecfem3D       cpu=3 start=9.66  finish=77.44
            2601374) xspecfem3D       cpu=11 start=9.39  finish=77.44
              2601378) xspecfem3D       cpu=2 start=9.40  finish=77.44
              2601382) xspecfem3D       cpu=8 start=9.41  finish=77.44
              2601393) xspecfem3D       cpu=0 start=9.67  finish=77.44
            2601377) xspecfem3D       cpu=0 start=9.40  finish=77.44
              2601381) xspecfem3D       cpu=13 start=9.41  finish=77.44
              2601384) xspecfem3D       cpu=5 start=9.41  finish=77.43
              2601390) xspecfem3D       cpu=1 start=9.66  finish=77.44
            2601379) xspecfem3D       cpu=1 start=9.40  finish=77.44
              2601383) xspecfem3D       cpu=5 start=9.41  finish=77.44
              2601385) xspecfem3D       cpu=6 start=9.42  finish=77.44
              2601387) xspecfem3D       cpu=9 start=9.66  finish=77.44
          2601396) date             cpu=6 start=77.47 finish=77.47
        2601397) cat              cpu=1 start=77.48 finish=77.48
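
The tree above can be turned into per-phase durations by parsing each line's start and finish times. The sketch below uses a regular expression written for this listing's layout and runs it on four lines copied from the tree. The pattern and the names `LINE` and `parse_tree` are illustrative, not part of the tool that produced the output.

```python
import re

# Matches lines such as "2601317) xdecompose_mesh  cpu=0 start=5.95  finish=6.39".
LINE = re.compile(
    r"^(?P<indent>\s*)(?P<pid>\d+)\)\s+(?P<name>\S+)\s+"
    r"cpu=(?P<cpu>\d+)\s+start=(?P<start>[\d.]+)\s+finish=(?P<finish>[\d.]+)"
)

# Four lines copied from the listing: the workload root and its three main phases.
EXCERPT = """\
      2601293) specfem3d        cpu=5 start=5.84  finish=77.48
          2601317) xdecompose_mesh  cpu=0 start=5.95  finish=6.39
          2601318) mpirun           cpu=8 start=6.39  finish=8.25
          2601351) mpirun           cpu=7 start=8.28  finish=77.45
"""


def parse_tree(text):
    """Return one dict per matching line with pid, name, indent, and duration."""
    entries = []
    for line in text.splitlines():
        match = LINE.match(line)
        if match is None:
            continue
        start = float(match["start"])
        finish = float(match["finish"])
        entries.append({
            "pid": int(match["pid"]),
            "name": match["name"],
            "indent": len(match["indent"]),
            "duration": round(finish - start, 2),
        })
    return entries


entries = parse_tree(EXCERPT)
root = entries[0]
for entry in entries:
    pct = 100.0 * entry["duration"] / root["duration"]
    print(f'{entry["pid"]} {entry["name"]:16s} {entry["duration"]:6.2f}s {pct:5.1f}%')
```

For this run the second mpirun, which launches the xspecfem3D ranks, covers about 69.17 of the 71.64 seconds, so the mesh decomposition and database generation phases are a small part of the elapsed time.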