An acoustic modeling program with five workloads. Mostly keeps the CPU busy and runs with all cores.

Topdown metrics. Largest share is backend stalls and not many frontend stalls to make a moderate retirement rate.

AMD metrics show floating point code with not many L2 accesses and a low number of branches.
elapsed 3762.238
on_cpu 0.868 # 13.89 / 16 cores
utime 52056.538
stime 185.652
nvcsw 1302099 # 75.38%
nivcsw 425392 # 24.62%
inblock 2760 # 0.73/sec
onblock 54832224 # 14574.36/sec
cpu-clock 52248285770710 # 52248.286 seconds
task-clock 52249743951891 # 52249.744 seconds
page faults 12748154 # 243.985/sec
context switches 1745081 # 33.399/sec
cpu migrations 26810 # 0.513/sec
major page faults 4149 # 0.079/sec
minor page faults 12744005 # 243.906/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 23504693240647 # 68.840 branches per 1000 inst
branch misses 13097079745 # 0.06% branch miss
conditional 17987708794657 # 52.682 conditional branches per 1000 inst
indirect 655636174293 # 1.920 indirect branches per 1000 inst
cpu-cycles 210656764084175 # 3.46 GHz
instructions 342832809611451 # 1.63 IPC
slots 421307158364352 #
retiring 118910637523092 # 28.2% (39.0%)
-- ucode 620443396647 # 0.1%
-- fastpath 118290194126445 # 28.1%
frontend 19634413076632 # 4.7% ( 6.4%)
-- latency 7202348283648 # 1.7%
-- bandwidth 12432064792984 # 3.0%
backend 165406079832537 # 39.3% (54.2%)
-- cpu 50571970115083 # 12.0%
-- memory 114834109717454 # 27.3%
speculation 1024838848568 # 0.2% ( 0.3%)
-- branch mispredict 254966966749 # 0.1%
-- pipeline restart 769871881819 # 0.2%
smt-contention 116330770486487 # 27.6% ( 0.0%)
cpu-cycles 176597097461140 # 3.47 GHz
instructions 285399042943448 # 1.62 IPC
instructions 95129255715027 # 31.569 l2 access per 1000 inst
l2 hit from l1 1993274139610 # 10.04% l2 miss
l2 miss from l1 53313585716 #
l2 hit from l2 pf 761553213145 #
l3 hit from l2 pf 23601262979 #
l3 miss from l2 pf 224661217208 #
instructions 95092350669095 # 288.511 float per 1000 inst
float 512 378 # 0.000 AVX-512 per 1000 inst
float 256 1468 # 0.000 AVX-256 per 1000 inst
float 128 27435143678913 # 288.511 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 82189 # 0.000 scalar per 1000 inst
Intel metrics
elapsed 2703.437
on_cpu 0.715 # 11.43 / 16 cores
utime 30811.972
stime 98.515
nvcsw 479608 # 86.30%
nivcsw 76110 # 13.70%
inblock 5472 # 2.02/sec
onblock 23388688 # 8651.47/sec
cpu-clock 30911106052904 # 30911.106 seconds
task-clock 30911215860555 # 30911.216 seconds
page faults 7715342 # 249.597/sec
context switches 568289 # 18.385/sec
cpu migrations 63222 # 2.045/sec
major page faults 2773 # 0.090/sec
minor page faults 7712569 # 249.507/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 17379440069501 # 80.716 branches per 1000 inst
branch misses 46914036018 # 0.27% branch miss
conditional 17379440133789 # 80.716 conditional branches per 1000 inst
indirect 3085933427959 # 14.332 indirect branches per 1000 inst
slots 192018763461350 #
retiring 109512682772057 # 57.0% (57.0%)
-- ucode 8868767675421 # 4.6%
-- fastpath 100643915096636 # 52.4%
frontend 8584569166548 # 4.5% ( 4.5%)
-- latency 3782079896939 # 2.0%
-- bandwidth 4802489269609 # 2.5%
backend 68661134260813 # 35.8% (35.8%)
-- cpu 37059672708408 # 19.3%
-- memory 31601461552405 # 16.5%
speculation 5165879176000 # 2.7% ( 2.7%)
-- branch mispredict 3677065606214 # 1.9%
-- pipeline restart 1488813569786 # 0.8%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 83175153659116 # 1.92 GHz
instructions 284864692479299 # 3.42 IPC
l2 access 759190674835 # 6.941 l2 access per 1000 inst
l2 miss 298370738806 # 39.30% l2 miss
Process overview
1805 processes
480 xspecfem3D 107646.66 215.23
360 xgenerate_datab 495.93 68.34
68 clinfo 16.53 6.33
180 mpirun 7.70 20.65
15 xdecompose_mesh 7.14 0.41
38 vulkaninfo 0.95 1.34
3 awk 0.23 0.02
6 php 0.20 0.34
6 glxinfo:gdrv0 0.13 0.06
4 vulkani:disk$0 0.11 0.14
2 glxinfo 0.07 0.03
2 glxinfo:cs0 0.07 0.02
2 glxinfo:disk$0 0.07 0.02
2 glxinfo:sh0 0.07 0.02
2 glxinfo:shlo0 0.07 0.02
6 clang 0.06 0.06
2 llvmpipe-0 0.05 0.07
2 llvmpipe-1 0.05 0.07
2 llvmpipe-10 0.05 0.07
2 llvmpipe-11 0.05 0.07
2 llvmpipe-12 0.05 0.07
2 llvmpipe-13 0.05 0.07
2 llvmpipe-14 0.05 0.07
2 llvmpipe-15 0.05 0.07
2 llvmpipe-2 0.05 0.07
2 llvmpipe-3 0.05 0.07
2 llvmpipe-4 0.05 0.07
2 llvmpipe-5 0.05 0.07
2 llvmpipe-6 0.05 0.07
2 llvmpipe-7 0.05 0.07
2 llvmpipe-8 0.05 0.07
2 llvmpipe-9 0.05 0.07
63 run_this_exampl 0.04 0.02
3 rocminfo 0.03 0.00
45 rm 0.00 2.01
1 lspci 0.00 0.02
90 sh 0.00 0.00
51 mkdir 0.00 0.00
49 grep 0.00 0.00
45 cp 0.00 0.00
45 ln 0.00 0.00
33 cut 0.00 0.00
31 date 0.00 0.00
16 sed 0.00 0.00
15 cat 0.00 0.00
15 gsettings 0.00 0.00
15 specfem3d 0.00 0.00
13 gcc 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 create_tomograp 0.00 0.00
6 llvm-link 0.00 0.00
6 mv 0.00 0.00
5 phoronix-test-s 0.00 0.00
2 cc 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 gmain 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
48 maximum processes
Computation structure
2601293) specfem3d cpu=5 start=5.84 finish=77.48
2601294) rm cpu=2 start=5.84 finish=5.84
2601295) sed cpu=6 start=5.84 finish=5.85
2601296) run_this_exampl cpu=15 start=5.85 finish=77.48
2601297) date cpu=1 start=5.85 finish=5.85
2601298) run_this_exampl cpu=2 start=5.85 finish=5.85
2601299) mkdir cpu=3 start=5.85 finish=5.85
2601300) rm cpu=6 start=5.85 finish=5.92
2601301) mkdir cpu=3 start=5.92 finish=5.92
2601302) rm cpu=9 start=5.92 finish=5.92
2601303) ln cpu=2 start=5.92 finish=5.92
2601304) ln cpu=9 start=5.92 finish=5.93
2601305) ln cpu=2 start=5.93 finish=5.93
2601306) cp cpu=3 start=5.93 finish=5.93
2601307) cp cpu=9 start=5.93 finish=5.93
2601308) cp cpu=2 start=5.93 finish=5.93
2601309) run_this_exampl cpu=3 start=5.93 finish=5.94
2601310) grep cpu=14 start=5.94 finish=5.94
2601311) grep cpu=4 start=5.94 finish=5.94
2601312) cut cpu=0 start=5.94 finish=5.94
2601313) run_this_exampl cpu=2 start=5.94 finish=5.94
2601314) grep cpu=9 start=5.94 finish=5.94
2601315) cut cpu=5 start=5.94 finish=5.94
2601316) mkdir cpu=3 start=5.94 finish=5.94
2601317) xdecompose_mesh cpu=0 start=5.95 finish=6.39
2601318) mpirun cpu=8 start=6.39 finish=8.25
2601322) mpirun cpu=0 start=6.97 finish=8.25
2601323) mpirun cpu=11 start=6.97 finish=6.97
2601324) mpirun cpu=4 start=6.99 finish=8.24
2601325) mpirun cpu=13 start=7.48 finish=8.24
2601326) mpirun cpu=1 start=7.48 finish=8.24
2601327) xgenerate_datab cpu=12 start=7.49 finish=8.23
2601329) xgenerate_datab cpu=13 start=7.49 finish=8.23
2601331) xgenerate_datab cpu=7 start=7.50 finish=8.23
2601328) xgenerate_datab cpu=4 start=7.49 finish=8.23
2601332) xgenerate_datab cpu=8 start=7.50 finish=8.23
2601335) xgenerate_datab cpu=14 start=7.50 finish=8.23
2601330) xgenerate_datab cpu=11 start=7.50 finish=8.23
2601334) xgenerate_datab cpu=1 start=7.50 finish=8.23
2601338) xgenerate_datab cpu=5 start=7.51 finish=8.23
2601333) xgenerate_datab cpu=6 start=7.50 finish=8.23
2601337) xgenerate_datab cpu=3 start=7.51 finish=8.23
2601340) xgenerate_datab cpu=15 start=7.51 finish=8.23
2601336) xgenerate_datab cpu=0 start=7.51 finish=8.23
2601341) xgenerate_datab cpu=13 start=7.51 finish=8.23
2601344) xgenerate_datab cpu=12 start=7.52 finish=8.23
2601339) xgenerate_datab cpu=13 start=7.51 finish=8.23
2601343) xgenerate_datab cpu=15 start=7.52 finish=8.23
2601347) xgenerate_datab cpu=3 start=7.53 finish=8.23
2601342) xgenerate_datab cpu=1 start=7.52 finish=8.23
2601346) xgenerate_datab cpu=2 start=7.52 finish=8.23
2601349) xgenerate_datab cpu=2 start=7.53 finish=8.23
2601345) xgenerate_datab cpu=7 start=7.52 finish=8.23
2601348) xgenerate_datab cpu=4 start=7.53 finish=8.23
2601350) xgenerate_datab cpu=8 start=7.54 finish=8.23
2601351) mpirun cpu=7 start=8.28 finish=77.45
2601356) mpirun cpu=11 start=8.84 finish=77.45
2601357) mpirun cpu=13 start=8.84 finish=8.84
2601358) mpirun cpu=2 start=8.86 finish=77.44
2601360) mpirun cpu=0 start=9.36 finish=77.44
2601361) mpirun cpu=9 start=9.36 finish=77.45
2601362) xspecfem3D cpu=10 start=9.37 finish=77.44
2601364) xspecfem3D cpu=2 start=9.38 finish=77.44
2601367) xspecfem3D cpu=2 start=9.38 finish=77.44
2601389) xspecfem3D cpu=3 start=9.66 finish=77.44
2601363) xspecfem3D cpu=6 start=9.38 finish=77.44
2601366) xspecfem3D cpu=3 start=9.38 finish=77.44
2601370) xspecfem3D cpu=15 start=9.39 finish=77.44
2601388) xspecfem3D cpu=0 start=9.66 finish=77.44
2601365) xspecfem3D cpu=13 start=9.38 finish=77.44
2601369) xspecfem3D cpu=15 start=9.39 finish=77.44
2601373) xspecfem3D cpu=9 start=9.39 finish=77.44
2601386) xspecfem3D cpu=15 start=9.66 finish=77.44
2601368) xspecfem3D cpu=8 start=9.39 finish=77.44
2601372) xspecfem3D cpu=9 start=9.39 finish=77.44
2601376) xspecfem3D cpu=3 start=9.40 finish=77.44
2601392) xspecfem3D cpu=5 start=9.67 finish=77.44
2601371) xspecfem3D cpu=4 start=9.39 finish=77.44
2601375) xspecfem3D cpu=14 start=9.40 finish=77.44
2601380) xspecfem3D cpu=4 start=9.40 finish=77.44
2601391) xspecfem3D cpu=3 start=9.66 finish=77.44
2601374) xspecfem3D cpu=11 start=9.39 finish=77.44
2601378) xspecfem3D cpu=2 start=9.40 finish=77.44
2601382) xspecfem3D cpu=8 start=9.41 finish=77.44
2601393) xspecfem3D cpu=0 start=9.67 finish=77.44
2601377) xspecfem3D cpu=0 start=9.40 finish=77.44
2601381) xspecfem3D cpu=13 start=9.41 finish=77.44
2601384) xspecfem3D cpu=5 start=9.41 finish=77.43
2601390) xspecfem3D cpu=1 start=9.66 finish=77.44
2601379) xspecfem3D cpu=1 start=9.40 finish=77.44
2601383) xspecfem3D cpu=5 start=9.41 finish=77.44
2601385) xspecfem3D cpu=6 start=9.42 finish=77.44
2601387) xspecfem3D cpu=9 start=9.66 finish=77.44
2601396) date cpu=6 start=77.47 finish=77.47
2601397) cat cpu=1 start=77.48 finish=77.48
