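OpenRadioss is a finite element solver. It has a high IPC (2.78 on AMD, 3.94 on Intel) and a relatively high retirement rate. On AMD it seems to run on cores without hyperthreading: on_cpu is 0.464, about 7.42 of 16 cores, and smt-contention is close to zero.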

The overall characteristic is a high retirement rate (46.7% of slots on AMD, 64.1% on Intel) with some backend stalls (40.5% on AMD, 21.1% on Intel) and lower-than-average frontend stalls (about 11% on both).

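AMD metrics show this is floating-point code (about 267 float per 1000 instructions, all in the 128-bit category) with few L2 accesses (about 33.5 per 1000 instructions) and predictable branches (0.54% branch miss).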
elapsed 531.485
on_cpu 0.464 # 7.42 / 16 cores
utime 3795.661
stime 147.925
nvcsw 43472 # 79.88%
nivcsw 10952 # 20.12%
inblock 0 # 0.00/sec
onblock 731552 # 1376.43/sec
cpu-clock 3943618364987 # 3943.618 seconds
task-clock 3943649988277 # 3943.650 seconds
page faults 914997 # 232.018/sec
context switches 56878 # 14.423/sec
cpu migrations 6942 # 1.760/sec
major page faults 355 # 0.090/sec
minor page faults 914642 # 231.928/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 5557698857321 # 121.437 branches per 1000 inst
branch misses 29875479009 # 0.54% branch miss
conditional 4141825327173 # 90.500 conditional branches per 1000 inst
indirect 268280512784 # 5.862 indirect branches per 1000 inst
cpu-cycles 16471095334701 # 1.94 GHz
instructions 45771448504477 # 2.78 IPC
slots 32949521879028 #
retiring 15391256553201 # 46.7% (46.7%)
-- ucode 9344807545 # 0.0%
-- fastpath 15381911745656 # 46.7%
frontend 3828359236713 # 11.6% (11.6%)
-- latency 2317004612910 # 7.0%
-- bandwidth 1511354623803 # 4.6%
backend 13351083160137 # 40.5% (40.5%)
-- cpu 4635525039147 # 14.1%
-- memory 8715558120990 # 26.5%
speculation 362246006081 # 1.1% ( 1.1%)
-- branch mispredict 336608096267 # 1.0%
-- pipeline restart 25637909814 # 0.1%
smt-contention 16563806243 # 0.1% ( 0.0%)
cpu-cycles 16546016719780 # 1.94 GHz
instructions 46064185230041 # 2.78 IPC
instructions 15360678440271 # 33.454 l2 access per 1000 inst
l2 hit from l1 413590215669 # 13.24% l2 miss
l2 miss from l1 36696882459 #
l2 hit from l2 pf 68932385772 #
l3 hit from l2 pf 23076349381 #
l3 miss from l2 pf 8277129782 #
instructions 15349691378355 # 267.046 float per 1000 inst
float 512 49 # 0.000 AVX-512 per 1000 inst
float 256 434 # 0.000 AVX-256 per 1000 inst
float 128 4099081048403 # 267.046 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
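The derived figures in the AMD listing can be recomputed from the raw values. The following is a minimal sketch, assuming on_cpu is (utime + stime) / elapsed divided by the core count, IPC is instructions / cpu-cycles, and branch miss is branch misses / branches. These formulas are inferred from the annotations in the listing, not taken from the tool's source, and the variable names are illustrative.

```python
# Raw AMD values copied from the listing above.
elapsed = 531.485            # wall-clock seconds
utime = 3795.661             # user CPU seconds
stime = 147.925              # system CPU seconds
ncores = 16                  # from the "/ 16 cores" annotation

cpu_cycles = 16471095334701
instructions = 45771448504477
branches = 5557698857321
branch_misses = 29875479009

# Assumed formulas (inferred from the annotations, not from the tool's source).
busy_cores = (utime + stime) / elapsed          # average number of busy cores
on_cpu = busy_cores / ncores                    # fraction of the machine in use
ipc = instructions / cpu_cycles                 # instructions per cycle
branch_miss_pct = 100.0 * branch_misses / branches

print(f"busy cores   {busy_cores:.2f} / {ncores}")
print(f"on_cpu       {on_cpu:.3f}")
print(f"IPC          {ipc:.2f}")
print(f"branch miss  {branch_miss_pct:.2f}%")
```

Under these assumptions the results agree with the on_cpu, IPC, and branch miss annotations in the listing. The per-1000-instruction ratios are not reproduced here because several counter groups report their own instruction counts.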
Intel metrics show this runs on all cores, even the efficiency cores (on_cpu is 0.723, about 11.56 of 16 cores).
elapsed 847.283
on_cpu 0.723 # 11.56 / 16 cores
utime 9595.125
stime 199.603
nvcsw 60837 # 73.48%
nivcsw 21957 # 26.52%
inblock 18800 # 22.19/sec
onblock 750528 # 885.81/sec
cpu-clock 9794905769811 # 9794.906 seconds
task-clock 9794944473121 # 9794.944 seconds
page faults 1004870 # 102.591/sec
context switches 86826 # 8.864/sec
cpu migrations 10636 # 1.086/sec
major page faults 834 # 0.085/sec
minor page faults 1004036 # 102.506/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 9104033983434 # 135.404 branches per 1000 inst
branch misses 26726595189 # 0.29% branch miss
conditional 9104033999338 # 135.404 conditional branches per 1000 inst
indirect 2480353523643 # 36.890 indirect branches per 1000 inst
slots 58096258688948 #
retiring 37262987672189 # 64.1% (64.1%)
-- ucode 2394267712513 # 4.1%
-- fastpath 34868719959676 # 60.0%
frontend 6405281429073 # 11.0% (11.0%)
-- latency 2856116536906 # 4.9%
-- bandwidth 3549164892167 # 6.1%
backend 12246441453518 # 21.1% (21.1%)
-- cpu 6987019337033 # 12.0%
-- memory 5259422116485 # 9.1%
speculation 2739914923365 # 4.7% ( 4.7%)
-- branch mispredict 2224112949461 # 3.8%
-- pipeline restart 515801973904 # 0.9%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 24281497297185 # 1.78 GHz
instructions 95579973489733 # 3.94 IPC
l2 access 466273512227 # 12.072 l2 access per 1000 inst
l2 miss 105967016868 # 22.73% l2 miss
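To compare the two machines, the level-1 topdown shares can be recomputed from the slot counts. This is a minimal sketch, assuming each category's percentage is its slot count divided by the total slots line. The dictionary layout and the function name topdown_percent are illustrative, not part of the measurement tool.

```python
# Level-1 topdown slot counts copied from the AMD and Intel listings above.
amd = {
    "slots": 32949521879028,
    "retiring": 15391256553201,
    "frontend": 3828359236713,
    "backend": 13351083160137,
    "speculation": 362246006081,
}
intel = {
    "slots": 58096258688948,
    "retiring": 37262987672189,
    "frontend": 6405281429073,
    "backend": 12246441453518,
    "speculation": 2739914923365,
}


def topdown_percent(counts):
    """Return each category as a percentage of total slots (assumed formula)."""
    slots = counts["slots"]
    return {
        name: 100.0 * value / slots
        for name, value in counts.items()
        if name != "slots"
    }


amd_pct = topdown_percent(amd)
intel_pct = topdown_percent(intel)

# Print the two machines side by side.
for name in amd_pct:
    print(f"{name:12s} AMD {amd_pct[name]:5.1f}%   Intel {intel_pct[name]:5.1f}%")
```

The side-by-side view makes the main difference easy to see: the AMD run loses roughly twice the share of slots to backend stalls, while the Intel run retires a larger share of its slots.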
The process summary shows engine_linux64 as the main process.
444 processes
75 engine_linux64_ 11763.26 451.87
67 clinfo 18.52 5.25
3 starter_linux64 1.26 0.41
38 vulkaninfo 0.95 1.15
18 mpirun 0.80 3.64
6 glxinfo:gdrv0 0.16 0.09
4 vulkani:disk$0 0.10 0.12
2 glxinfo 0.08 0.03
2 glxinfo:cs0 0.08 0.03
2 glxinfo:disk$0 0.08 0.03
2 glxinfo:sh0 0.08 0.03
2 glxinfo:shlo0 0.08 0.03
6 clang 0.07 0.04
6 php 0.06 0.13
2 llvmpipe-0 0.05 0.06
2 llvmpipe-1 0.05 0.06
2 llvmpipe-10 0.05 0.06
2 llvmpipe-11 0.05 0.06
2 llvmpipe-12 0.05 0.06
2 llvmpipe-13 0.05 0.06
2 llvmpipe-14 0.05 0.06
2 llvmpipe-15 0.05 0.06
2 llvmpipe-2 0.05 0.06
2 llvmpipe-3 0.05 0.06
2 llvmpipe-4 0.05 0.06
2 llvmpipe-5 0.05 0.06
2 llvmpipe-6 0.05 0.06
2 llvmpipe-7 0.05 0.06
2 llvmpipe-8 0.05 0.06
2 llvmpipe-9 0.05 0.06
3 rocminfo 0.03 0.00
1 lspci 0.00 0.02
1 ps 0.00 0.01
81 sh 0.00 0.00
12 gcc 0.00 0.00
10 gsettings 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
6 rm 0.00 0.00
5 phoronix-test-s 0.00 0.00
4 gmain 0.00 0.00
3 openradioss 0.00 0.00
2 dconf worker 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 cc 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
47 maximum processes
The program is launched through mpirun, and the engine processes are spread across the cores.
55998) openradioss cpu=14 start=5.72 finish=177.15
55999) starter_linux64 cpu=10 start=5.73 finish=6.30
56000) mpirun cpu=4 start=6.31 finish=177.11
56003) mpirun cpu=15 start=6.89 finish=177.11
56004) mpirun cpu=2 start=6.89 finish=6.89
56005) mpirun cpu=15 start=6.92 finish=177.11
56006) mpirun cpu=1 start=7.40 finish=177.11
56007) mpirun cpu=12 start=7.40 finish=177.11
56008) engine_linux64_ cpu=13 start=7.44 finish=177.10
56013) engine_linux64_ cpu=6 start=7.46 finish=176.47
56019) engine_linux64_ cpu=12 start=7.47 finish=176.47
56032) engine_linux64_ cpu=6 start=7.77 finish=176.98
56009) engine_linux64_ cpu=1 start=7.44 finish=176.47
56015) engine_linux64_ cpu=15 start=7.46 finish=176.47
56021) engine_linux64_ cpu=15 start=7.47 finish=176.47
56010) engine_linux64_ cpu=4 start=7.45 finish=176.47
56017) engine_linux64_ cpu=11 start=7.47 finish=176.47
56022) engine_linux64_ cpu=8 start=7.48 finish=176.47
56011) engine_linux64_ cpu=2 start=7.45 finish=176.47
56020) engine_linux64_ cpu=15 start=7.47 finish=176.47
56024) engine_linux64_ cpu=11 start=7.48 finish=176.47
56012) engine_linux64_ cpu=9 start=7.46 finish=176.47
56023) engine_linux64_ cpu=6 start=7.48 finish=176.47
56027) engine_linux64_ cpu=7 start=7.49 finish=176.47
56014) engine_linux64_ cpu=0 start=7.46 finish=176.47
56025) engine_linux64_ cpu=12 start=7.49 finish=176.47
56028) engine_linux64_ cpu=6 start=7.50 finish=176.47
56016) engine_linux64_ cpu=3 start=7.47 finish=176.47
56026) engine_linux64_ cpu=14 start=7.49 finish=176.47
56030) engine_linux64_ cpu=9 start=7.50 finish=176.47
56018) engine_linux64_ cpu=10 start=7.47 finish=176.47
56029) engine_linux64_ cpu=14 start=7.50 finish=176.47
56031) engine_linux64_ cpu=13 start=7.51 finish=176.47
56037) rm cpu=0 start=177.14 finish=177.15
56038) rm cpu=9 start=177.15 finish=177.15
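A listing in this format can be summarized with a short script. The sketch below is an assumption-laden example: it parses a handful of lines copied from the listing above with a regular expression written for this layout (pid, name, cpu, start, finish), then counts processes per name and reports the longest lifetime. The names LINE_RE and parse_listing are hypothetical helpers, not part of any tool used here.

```python
import re
from collections import Counter

# A few lines copied from the process listing above.
SAMPLE = """\
55998) openradioss cpu=14 start=5.72 finish=177.15
56000) mpirun cpu=4 start=6.31 finish=177.11
56008) engine_linux64_ cpu=13 start=7.44 finish=177.10
56013) engine_linux64_ cpu=6 start=7.46 finish=176.47
56032) engine_linux64_ cpu=6 start=7.77 finish=176.98
"""

# Pattern for "<pid>) <name> cpu=<n> start=<sec> finish=<sec>".
LINE_RE = re.compile(
    r"^(?P<pid>\d+)\)\s+(?P<name>.+?)\s+cpu=(?P<cpu>\d+)"
    r"\s+start=(?P<start>[\d.]+)\s+finish=(?P<finish>[\d.]+)$"
)


def parse_listing(text):
    """Parse listing lines into dictionaries; lines that do not match are skipped."""
    records = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue
        records.append({
            "pid": int(match["pid"]),
            "name": match["name"],
            "cpu": int(match["cpu"]),
            "start": float(match["start"]),
            "finish": float(match["finish"]),
        })
    return records


records = parse_listing(SAMPLE)
counts = Counter(r["name"] for r in records)
engine_lifetimes = [
    r["finish"] - r["start"] for r in records if r["name"] == "engine_linux64_"
]

print(counts)
print(f"longest engine lifetime: {max(engine_lifetimes):.2f} s")
```

Run against the full listing, the same counting step would show how many engine_linux64_ entries each run produces and which cpu each one was recorded on.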
