SPICE circuit simulator (ngspice) run with two different test cases. The program is mostly single-threaded, with up to three threads active at times.

The topdown profile shows a large share of backend stalls (about 82% of slots on both systems) and a low retirement rate (13-14%).

AMD metrics show floating-point code, all of it 128-bit, with a moderate L2 access rate and miss rate. There are not many branches. On-CPU utilization is barely more than one core (1.12 of 16).
elapsed 654.274
on_cpu 0.070 # 1.12 / 16 cores
utime 731.620
stime 1.458
nvcsw 13729 # 81.39%
nivcsw 3140 # 18.61%
inblock 0 # 0.00/sec
onblock 56064 # 85.69/sec
cpu-clock 733047101244 # 733.047 seconds
task-clock 733083455448 # 733.083 seconds
page faults 375005 # 511.545/sec
context switches 19961 # 27.229/sec
cpu migrations 315 # 0.430/sec
major page faults 2 # 0.003/sec
minor page faults 375003 # 511.542/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 227636710334 # 88.502 branches per 1000 inst
branch misses 1487507273 # 0.65% branch miss
conditional 165221763672 # 64.236 conditional branches per 1000 inst
indirect 10647435856 # 4.140 indirect branches per 1000 inst
cpu-cycles 3379846461857 # 0.32 GHz
instructions 2573410274886 # 0.76 IPC
slots 6764786299656 #
retiring 884507274916 # 13.1% (13.1%) low
-- ucode 1115721355 # 0.0%
-- fastpath 883391553561 # 13.1%
frontend 239339399574 # 3.5% ( 3.5%) low
-- latency 118070306598 # 1.7%
-- bandwidth 121269092976 # 1.8%
backend 5535942344017 # 81.8% (81.9%) high
-- cpu 761174160581 # 11.3%
-- memory 4774768183436 # 70.6%
speculation 103593987406 # 1.5% ( 1.5%)
-- branch mispredict 74106925238 # 1.1%
-- pipeline restart 29487062168 # 0.4%
smt-contention 1402399494 # 0.0% ( 0.0%)
cpu-cycles 3389350813164 # 0.32 GHz
instructions 2570015738058 # 0.76 IPC
instructions 858514957618 # 75.488 l2 access per 1000 inst
l2 hit from l1 48217698572 # 39.31% l2 miss
l2 miss from l1 14973906198 #
l2 hit from l2 pf 6090783709 #
l3 hit from l2 pf 7164595567 #
l3 miss from l2 pf 3334366653 #
instructions 856700215185 # 222.357 float per 1000 inst
float 512 46 # 0.000 AVX-512 per 1000 inst
float 256 910 # 0.000 AVX-256 per 1000 inst
float 128 190493618209 # 222.357 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
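The derived figures in the AMD listing can be reproduced from the raw counters. The sketch below is only a cross-check: the variable names are made up for illustration, and the formula for the GHz column (cycles divided by elapsed time times all 16 cores) is an assumption that happens to match the reported 0.32 GHz, not something the tool documents here.

```python
# Raw AMD counters copied from the listing above.
elapsed = 654.274          # seconds of wall time
utime, stime = 731.620, 1.458
cores = 16
cycles = 3379846461857
instructions = 2573410274886
slots = 6764786299656
topdown = {
    "retiring": 884507274916,
    "frontend": 239339399574,
    "backend": 5535942344017,
    "speculation": 103593987406,
}

# Average number of busy cores: CPU time divided by wall time.
busy_cores = (utime + stime) / elapsed
on_cpu = busy_cores / cores

# Instructions per cycle.
ipc = instructions / cycles

# Assumption: the GHz column averages cycles over all cores and the whole run.
ghz_all_cores = cycles / (elapsed * cores) / 1e9

# Each topdown category as a percentage of all issue slots.
shares = {name: 100.0 * count / slots for name, count in topdown.items()}

print(f"busy cores {busy_cores:.2f}, on_cpu {on_cpu:.3f}, IPC {ipc:.2f}")
print(f"clock averaged over all cores {ghz_all_cores:.2f} GHz")
for name, pct in shares.items():
    print(f"{name:12s} {pct:5.1f}%")
```

Read this way, the low GHz figure reflects 15 mostly idle cores rather than a slow clock on the busy one.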
Intel metrics
elapsed 650.632
on_cpu 0.083 # 1.32 / 16 cores
utime 857.839
stime 1.072
nvcsw 13716 # 82.68%
nivcsw 2874 # 17.32%
inblock 1840 # 2.83/sec
onblock 44824 # 68.89/sec
cpu-clock 858768871242 # 858.769 seconds
task-clock 858793135960 # 858.793 seconds
page faults 364203 # 424.087/sec
context switches 19661 # 22.894/sec
cpu migrations 607 # 0.707/sec
major page faults 15 # 0.017/sec
minor page faults 364188 # 424.070/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 226966574551 # 88.287 branches per 1000 inst
branch misses 1541480071 # 0.68% branch miss
conditional 226966586935 # 88.287 conditional branches per 1000 inst
indirect 10685420053 # 4.156 indirect branches per 1000 inst
slots 17759394818246 #
retiring 2473017644334 # 13.9% (13.9%) low
-- ucode 322404419382 # 1.8%
-- fastpath 2150613224952 # 12.1%
frontend 821427631570 # 4.6% ( 4.6%) low
-- latency 358912915741 # 2.0%
-- bandwidth 462514715829 # 2.6%
backend 14581736945460 # 82.1% (82.1%) high
-- cpu 6516868648695 # 36.7%
-- memory 8064868296765 # 45.4%
speculation 273647358689 # 1.5% ( 1.5%)
-- branch mispredict 216235726103 # 1.2%
-- pipeline restart 57411632586 # 0.3%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 2955502961512 # 0.28 GHz
instructions 2570750857075 # 0.87 IPC
l2 access 306554724965 # 119.256 l2 access per 1000 inst
l2 miss 158681184532 # 51.76% l2 miss
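Both systems spend about 82% of slots in backend stalls, but the split between CPU-bound and memory-bound stalls differs. The sketch below recomputes a few Intel ratios from the raw counters and compares the memory share of backend stalls on the two systems. The names are illustrative only, and the two vendors define their topdown events differently, so the comparison should be read as indicative rather than exact.

```python
# Raw Intel counters copied from the listing above.
intel = {
    "elapsed": 650.632, "utime": 857.839, "stime": 1.072,
    "cycles": 2955502961512, "instructions": 2570750857075,
    "backend": 14581736945460, "backend_memory": 8064868296765,
    "l2_access": 306554724965, "l2_miss": 158681184532,
}
# AMD backend counters from the earlier listing, for comparison.
amd = {"backend": 5535942344017, "backend_memory": 4774768183436}

# Average busy cores and instructions per cycle on Intel.
intel_busy_cores = (intel["utime"] + intel["stime"]) / intel["elapsed"]
intel_ipc = intel["instructions"] / intel["cycles"]

# L2 miss rate as a percentage of L2 accesses.
intel_l2_miss_pct = 100.0 * intel["l2_miss"] / intel["l2_access"]

def memory_share_of_backend(counters):
    """Percentage of backend-stall slots attributed to memory."""
    return 100.0 * counters["backend_memory"] / counters["backend"]

intel_mem_share = memory_share_of_backend(intel)
amd_mem_share = memory_share_of_backend(amd)

print(f"Intel busy cores {intel_busy_cores:.2f}, IPC {intel_ipc:.2f}")
print(f"Intel L2 miss rate {intel_l2_miss_pct:.2f}%")
print(f"memory share of backend stalls: Intel {intel_mem_share:.1f}%, "
      f"AMD {amd_mem_share:.1f}%")
```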
The process overview shows simple invocations of ngspice, which accounts for nearly all of the CPU time
368 processes
18 ngspice 1452.74 1.26
68 clinfo 15.88 6.30
38 vulkaninfo 1.49 0.76
4 vulkani:disk$0 0.15 0.08
6 glxinfo:gdrv0 0.12 0.06
6 glxinfo:gl0 0.12 0.06
2 llvmpipe-0 0.08 0.04
2 llvmpipe-1 0.08 0.04
2 llvmpipe-10 0.08 0.04
2 llvmpipe-11 0.08 0.04
2 llvmpipe-12 0.08 0.04
2 llvmpipe-13 0.08 0.04
2 llvmpipe-14 0.08 0.04
2 llvmpipe-15 0.08 0.04
2 llvmpipe-2 0.08 0.04
2 llvmpipe-3 0.08 0.04
2 llvmpipe-4 0.08 0.04
2 llvmpipe-5 0.08 0.04
2 llvmpipe-6 0.08 0.04
2 llvmpipe-7 0.08 0.04
2 llvmpipe-8 0.08 0.04
2 llvmpipe-9 0.08 0.04
6 clang 0.07 0.04
6 php 0.06 0.27
2 glxinfo 0.06 0.02
2 glxinfo:cs0 0.06 0.02
2 glxinfo:disk$0 0.06 0.02
2 glxinfo:sh0 0.06 0.02
2 glxinfo:shlo0 0.06 0.02
3 rocminfo 0.03 0.00
1 lspci 0.00 0.03
84 sh 0.00 0.00
13 gcc 0.00 0.00
12 gsettings 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
5 phoronix-test-s 0.00 0.00
3 gmain 0.00 0.00
2 cc 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 date 0.00 0.00
1 dconf worker 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 ps 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
47 maximum processes
Computation blocks
158451) ngspice cpu=7 start=6.70 finish=118.97
158452) ngspice cpu=5 start=6.70 finish=118.96
158453) ngspice cpu=6 start=8.54 finish=118.96
158458) ngspice cpu=7 start=122.98 finish=232.45
158459) ngspice cpu=0 start=122.99 finish=232.45
158460) ngspice cpu=1 start=124.84 finish=232.45
158467) ngspice cpu=7 start=236.47 finish=347.93
158468) ngspice cpu=0 start=236.47 finish=347.93
158469) ngspice cpu=2 start=238.31 finish=347.93
158473) sh cpu=9 start=347.94 finish=347.94
158474) sh cpu=2 start=347.94 finish=347.94
158475) ngspice cpu=15 start=358.43 finish=452.29
158476) ngspice cpu=8 start=358.44 finish=452.27
158477) ngspice cpu=10 start=371.73 finish=452.27
158478) ngspice cpu=15 start=456.29 finish=551.97
158479) ngspice cpu=0 start=456.29 finish=551.95
158480) ngspice cpu=9 start=469.75 finish=551.95
158513) ngspice cpu=15 start=555.97 finish=649.15
158514) ngspice cpu=0 start=555.97 finish=649.14
158515) ngspice cpu=1 start=569.14 finish=649.14
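The computation blocks can be grouped into runs to see how long each invocation took and how many tasks it used. This is a rough sketch, assuming a new run begins whenever a block starts after the previous run has finished. The regular expression and helper names are hypothetical and simply match the line format shown above.

```python
import re

# ngspice lines copied from the listing above.
BLOCKS = """\
158451) ngspice cpu=7 start=6.70 finish=118.97
158452) ngspice cpu=5 start=6.70 finish=118.96
158453) ngspice cpu=6 start=8.54 finish=118.96
158458) ngspice cpu=7 start=122.98 finish=232.45
158459) ngspice cpu=0 start=122.99 finish=232.45
158460) ngspice cpu=1 start=124.84 finish=232.45
158467) ngspice cpu=7 start=236.47 finish=347.93
158468) ngspice cpu=0 start=236.47 finish=347.93
158469) ngspice cpu=2 start=238.31 finish=347.93
158475) ngspice cpu=15 start=358.43 finish=452.29
158476) ngspice cpu=8 start=358.44 finish=452.27
158477) ngspice cpu=10 start=371.73 finish=452.27
158478) ngspice cpu=15 start=456.29 finish=551.97
158479) ngspice cpu=0 start=456.29 finish=551.95
158480) ngspice cpu=9 start=469.75 finish=551.95
158513) ngspice cpu=15 start=555.97 finish=649.15
158514) ngspice cpu=0 start=555.97 finish=649.14
158515) ngspice cpu=1 start=569.14 finish=649.14
"""

LINE_RE = re.compile(r"(\d+)\) (\S+) cpu=(\d+) start=([\d.]+) finish=([\d.]+)")

def parse(text):
    """Return (id, name, cpu, start, finish) tuples for every matching line."""
    return [(int(i), name, int(cpu), float(start), float(finish))
            for i, name, cpu, start, finish in LINE_RE.findall(text)]

def group_runs(rows):
    """Merge blocks that overlap in time into one run."""
    runs = []
    for _, _, _, start, finish in rows:
        if runs and start < runs[-1]["finish"]:
            runs[-1]["tasks"] += 1
            runs[-1]["finish"] = max(runs[-1]["finish"], finish)
        else:
            runs.append({"start": start, "finish": finish, "tasks": 1})
    return runs

runs = group_runs(parse(BLOCKS))
durations = [run["finish"] - run["start"] for run in runs]
for run, seconds in zip(runs, durations):
    print(f"start {run['start']:7.2f}  tasks {run['tasks']}  {seconds:6.2f} s")
```

With this grouping there are six runs of three tasks each. The first three last roughly 109 to 112 seconds and the last three roughly 93 to 96 seconds, which is consistent with three repetitions of each of the two test cases.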
